Package net.sf.saxon.regex.charclass
Class Categories
java.lang.Object
net.sf.saxon.regex.charclass.Categories
Singleton class holding data for Regular expression character categories.
The data is read from an XML file derived from the Unicode database (In Saxon 9.6, this is based on Unicode 6.2.0). Since Saxon 9.4, we no longer make use of Java's support for character categories since there are too many differences from Unicode.
Some commonly used categories are hard-coded and made available as static constants.
-
Nested Class Summary
Nested ClassesModifier and TypeClassDescriptionstatic classA Category is a CharacterClass represented in a regular expression as \p{Xx}. -
Field Summary
FieldsModifier and TypeFieldDescriptionstatic final PredicateCharacterClassstatic final CharacterClassstatic final Categories.Categorystatic final CharacterClassstatic final PredicateCharacterClassstatic final CharacterClassstatic final CharacterClassstatic final CharacterClassstatic final PredicateCharacterClassstatic final CharacterClass -
Method Summary
Modifier and TypeMethodDescriptionstatic Categories.CategorygetCategory(String cat) Get a predicate to test characters for membership of one of the Unicode character categories
-
Field Details
-
ESCAPE_s
-
ESCAPE_S
-
ESCAPE_i
-
ESCAPE_I
-
ESCAPE_c
-
ESCAPE_C
-
ESCAPE_d
-
ESCAPE_D
-
ESCAPE_w
-
ESCAPE_W
-
-
Method Details
-
getCategory
Get a predicate to test characters for membership of one of the Unicode character categories- Parameters:
cat- a one-character or two-character category name, for example L or Lu- Returns:
- a predicate that tests whether a given character belongs to the category
-