Package net.sf.saxon.regex.charclass
Class Categories
java.lang.Object
net.sf.saxon.regex.charclass.Categories
Singleton class holding data for Regular expression character categories.
The data is read from an XML file derived from the Unicode database (In Saxon 9.6, this is based on Unicode 6.2.0). Since Saxon 9.4, we no longer make use of Java's support for character categories since there are too many differences from Unicode.
Some commonly used categories are hard-coded and made available as static constants.
-
Nested Class Summary
Nested ClassesModifier and TypeClassDescriptionstatic class
A Category is a CharacterClass represented in a regular expression as \p{Xx}. -
Field Summary
FieldsModifier and TypeFieldDescriptionstatic final PredicateCharacterClass
static final CharacterClass
static final Categories.Category
static final CharacterClass
static final PredicateCharacterClass
static final CharacterClass
static final CharacterClass
static final CharacterClass
static final PredicateCharacterClass
static final CharacterClass
-
Method Summary
Modifier and TypeMethodDescriptionstatic Categories.Category
getCategory
(String cat) Get a predicate to test characters for membership of one of the Unicode character categories