java - What is the {L} Unicode category? -
i came across regular expressions contain [^\\p{l}]
. understand using form of unicode category, when checked the documentation, found following "l" categories:
lu uppercase letter uppercase_letter ll lowercase letter lowercase_letter lt titlecase letter titlecase_letter lm modifier letter modifier_letter lo other letter other_letter
what l
in context?
taken link: http://www.regular-expressions.info/unicode.html
check unicode character properties
section.
\p{l} matches single code point in category "letter". if input string à encoded u+0061 u+0300, matches without accent. if input à encoded u+00e0, matches à accent. reason both code points u+0061 (a) , u+00e0 (à) in category "letter", while u+0300 in category "mark".
Comments
Post a Comment