java - What is the {L} Unicode category? -


i came across regular expressions contain [^\\p{l}]. understand using form of unicode category, when checked the documentation, found following "l" categories:

lu  uppercase letter    uppercase_letter ll  lowercase letter    lowercase_letter lt  titlecase letter    titlecase_letter lm  modifier letter     modifier_letter lo  other letter        other_letter 

what l in context?

taken link: http://www.regular-expressions.info/unicode.html

check unicode character properties section.

\p{l} matches single code point in category "letter". if input string à encoded u+0061 u+0300, matches without accent. if input à encoded u+00e0, matches à accent. reason both code points u+0061 (a) , u+00e0 (à) in category "letter", while u+0300 in category "mark".


Comments

Popular posts from this blog

c# - how to write client side events functions for the combobox items -

exception - Python, pyPdf OCR error: pyPdf.utils.PdfReadError: EOF marker not found -