com.ibm.icu.lang
public interface UProperty
Selection constants for Unicode properties.
These constants are used in functions like UCharacter.hasBinaryProperty(int) to select one of the Unicode properties.
The properties APIs are intended to reflect Unicode properties as defined in the Unicode Character Database (UCD) and Unicode Technical Reports (UTR).
For details about the properties see http://www.unicode.org.
For names of Unicode properties see the UCD file PropertyAliases.txt.
Important: If ICU is built with UCD files from Unicode versions below 3.2, then properties marked with "new" are unavailable or only partially available. Check UCharacter.getUnicodeVersion() to be sure.
See Also: UCharacter
UNKNOWN: ICU 2.6
Nested Class Summary | |
---|---|
static interface | UProperty.NameChoice Selector constants for UCharacter.getPropertyName() and UCharacter.getPropertyValueName(). |
Field Summary | |
---|---|
static int | AGE String property Age. |
static int | ALPHABETIC Binary property Alphabetic. |
static int | ASCII_HEX_DIGIT Binary property ASCII_Hex_Digit (0-9 A-F a-f). |
static int | BIDI_CLASS Enumerated property Bidi_Class. |
static int | BIDI_CONTROL Binary property Bidi_Control. Format controls which have specific functions in the Bidi Algorithm. |
static int | BIDI_MIRRORED Binary property Bidi_Mirrored. Characters that may change display in RTL text. Property for UCharacter.isMirrored(). See Bidi Algorithm; UTR 9. |
static int | BIDI_MIRRORING_GLYPH String property Bidi_Mirroring_Glyph. |
static int | BINARY_LIMIT One more than the last constant for binary Unicode properties. |
static int | BINARY_START First constant for binary Unicode properties. |
static int | BLOCK Enumerated property Block. |
static int | CANONICAL_COMBINING_CLASS Enumerated property Canonical_Combining_Class. |
static int | CASE_FOLDING String property Case_Folding. |
static int | CASE_SENSITIVE Binary property Case_Sensitive. Either the source of a case mapping or _in_ the target of a case mapping. |
static int | DASH Binary property Dash. Variations of dashes. |
static int | DECOMPOSITION_TYPE Enumerated property Decomposition_Type. |
static int | DEFAULT_IGNORABLE_CODE_POINT Binary property Default_Ignorable_Code_Point (new). |
static int | DEPRECATED Binary property Deprecated (new). The usage of deprecated characters is strongly discouraged. |
static int | DIACRITIC Binary property Diacritic. Characters that linguistically modify the meaning of another character to which they apply. |
static int | DOUBLE_LIMIT One more than the last constant for double Unicode properties. |
static int | DOUBLE_START First constant for double Unicode properties. |
static int | EAST_ASIAN_WIDTH Enumerated property East_Asian_Width. |
static int | EXTENDER Binary property Extender. Characters that extend the value or shape of a preceding alphabetic character, e.g. length and iteration marks. |
static int | FULL_COMPOSITION_EXCLUSION Binary property Full_Composition_Exclusion. CompositionExclusions.txt + Singleton Decompositions + Non-Starter Decompositions. |
static int | GENERAL_CATEGORY Enumerated property General_Category. |
static int | GENERAL_CATEGORY_MASK Bitmask property General_Category_Mask. |
static int | GRAPHEME_BASE Binary property Grapheme_Base (new). For programmatic determination of grapheme cluster boundaries. |
static int | GRAPHEME_CLUSTER_BREAK Enumerated property Grapheme_Cluster_Break (new in Unicode 4.1). |
static int | GRAPHEME_EXTEND Binary property Grapheme_Extend (new). For programmatic determination of grapheme cluster boundaries. Me+Mn+Mc+Other_Grapheme_Extend-Grapheme_Link-CGJ |
static int | GRAPHEME_LINK Binary property Grapheme_Link (new). For programmatic determination of grapheme cluster boundaries. |
static int | HANGUL_SYLLABLE_TYPE Enumerated property Hangul_Syllable_Type, new in Unicode 4. |
static int | HEX_DIGIT Binary property Hex_Digit. Characters commonly used for hexadecimal numbers. |
static int | HYPHEN Binary property Hyphen. Dashes used to mark connections between pieces of words, plus the Katakana middle dot. |
static int | IDEOGRAPHIC Binary property Ideographic. CJKV ideographs. |
static int | IDS_BINARY_OPERATOR Binary property IDS_Binary_Operator (new). For programmatic determination of Ideographic Description Sequences. |
static int | IDS_TRINARY_OPERATOR Binary property IDS_Trinary_Operator (new). |
static int | ID_CONTINUE Binary property ID_Continue. Characters that can continue an identifier. ID_Start+Mn+Mc+Nd+Pc |
static int | ID_START Binary property ID_Start. Characters that can start an identifier. Lu+Ll+Lt+Lm+Lo+Nl |
static int | INT_LIMIT One more than the last constant for enumerated/integer Unicode properties. |
static int | INT_START First constant for enumerated/integer Unicode properties. |
static int | ISO_COMMENT String property ISO_Comment. |
static int | JOINING_GROUP Enumerated property Joining_Group. |
static int | JOINING_TYPE Enumerated property Joining_Type. |
static int | JOIN_CONTROL Binary property Join_Control. Format controls for cursive joining and ligation. |
static int | LEAD_CANONICAL_COMBINING_CLASS Enumerated property Lead_Canonical_Combining_Class. |
static int | LINE_BREAK Enumerated property Line_Break. |
static int | LOGICAL_ORDER_EXCEPTION Binary property Logical_Order_Exception (new). Characters that do not use logical order and require special handling in most processing. |
static int | LOWERCASE Binary property Lowercase. Same as UCharacter.isULowercase(), different from UCharacter.islower(). Ll+Other_Lowercase |
static int | LOWERCASE_MAPPING String property Lowercase_Mapping. |
static int | MASK_LIMIT One more than the last constant for bit-mask Unicode properties. |
static int | MASK_START First constant for bit-mask Unicode properties. |
static int | MATH Binary property Math. Sm+Other_Math |
static int | NAME String property Name. |
static int | NFC_INERT Binary property NFC_Inert. |
static int | NFC_QUICK_CHECK Enumerated property NFC_Quick_Check. |
static int | NFD_INERT Binary property NFD_Inert. |
static int | NFD_QUICK_CHECK Enumerated property NFD_Quick_Check. |
static int | NFKC_INERT Binary property NFKC_Inert. |
static int | NFKC_QUICK_CHECK Enumerated property NFKC_Quick_Check. |
static int | NFKD_INERT Binary property NFKD_Inert. |
static int | NFKD_QUICK_CHECK Enumerated property NFKD_Quick_Check. |
static int | NONCHARACTER_CODE_POINT Binary property Noncharacter_Code_Point. Code points that are explicitly defined as illegal for the encoding of characters. |
static int | NUMERIC_TYPE Enumerated property Numeric_Type. |
static int | NUMERIC_VALUE Double property Numeric_Value. |
static int | PATTERN_SYNTAX Binary property Pattern_Syntax (new in Unicode 4.1). |
static int | PATTERN_WHITE_SPACE Binary property Pattern_White_Space (new in Unicode 4.1). |
static int | POSIX_ALNUM Binary property alnum (a C/POSIX character class). |
static int | POSIX_BLANK Binary property blank (a C/POSIX character class). |
static int | POSIX_GRAPH Binary property graph (a C/POSIX character class). |
static int | POSIX_PRINT Binary property print (a C/POSIX character class). |
static int | POSIX_XDIGIT Binary property xdigit (a C/POSIX character class). |
static int | QUOTATION_MARK Binary property Quotation_Mark. |
static int | RADICAL Binary property Radical (new). For programmatic determination of Ideographic Description Sequences. |
static int | SCRIPT Enumerated property Script. |
static int | SEGMENT_STARTER Binary property Segment_Starter. |
static int | SENTENCE_BREAK Enumerated property Sentence_Break (new in Unicode 4.1). |
static int | SIMPLE_CASE_FOLDING String property Simple_Case_Folding. |
static int | SIMPLE_LOWERCASE_MAPPING String property Simple_Lowercase_Mapping. |
static int | SIMPLE_TITLECASE_MAPPING String property Simple_Titlecase_Mapping. |
static int | SIMPLE_UPPERCASE_MAPPING String property Simple_Uppercase_Mapping. |
static int | SOFT_DOTTED Binary property Soft_Dotted (new). Characters with a "soft dot", like i or j. An accent placed on these characters causes the dot to disappear. |
static int | STRING_LIMIT One more than the last constant for string Unicode properties. |
static int | STRING_START First constant for string Unicode properties. |
static int | S_TERM Binary property STerm (new in Unicode 4.0.1). |
static int | TERMINAL_PUNCTUATION Binary property Terminal_Punctuation. Punctuation characters that generally mark the end of textual units. |
static int | TITLECASE_MAPPING String property Titlecase_Mapping. |
static int | TRAIL_CANONICAL_COMBINING_CLASS Enumerated property Trail_Canonical_Combining_Class. |
static int | UNICODE_1_NAME String property Unicode_1_Name. |
static int | UNIFIED_IDEOGRAPH Binary property Unified_Ideograph (new). For programmatic determination of Ideographic Description Sequences. |
static int | UPPERCASE Binary property Uppercase. Same as UCharacter.isUUppercase(), different from UCharacter.isUpperCase(). Lu+Other_Uppercase |
static int | UPPERCASE_MAPPING String property Uppercase_Mapping. |
static int | VARIATION_SELECTOR Binary property Variation_Selector (new in Unicode 4.0.1). |
static int | WHITE_SPACE Binary property White_Space. Same as UCharacter.isUWhiteSpace(), different from UCharacter.isSpace() and UCharacter.isWhitespace(). Space characters+TAB+CR+LF-ZWSP-ZWNBSP |
static int | WORD_BREAK Enumerated property Word_Break (new in Unicode 4.1). |
static int | XID_CONTINUE Binary property XID_Continue. ID_Continue modified to allow closure under normalization forms NFKC and NFKD. |
static int | XID_START Binary property XID_Start. ID_Start modified to allow closure under normalization forms NFKC and NFKD. |
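The `*_START` and `*_LIMIT` constants in the field summary describe half-open ranges: a selector belongs to a property family when it is at least the family's start constant and strictly less than its limit. The following is a minimal sketch of that check. It does not use ICU4J; the numeric values, the class name `PropertyRanges`, and the method `kindOf` are hypothetical placeholders for illustration, and real code should read the values from `UProperty` instead.

```java
// Sketch only: classifies a property selector by half-open [START, LIMIT) ranges.
// All numeric values below are hypothetical placeholders, not the ICU4J values.
class PropertyRanges {
    static final int BINARY_START = 0,      BINARY_LIMIT = 50;
    static final int INT_START    = 0x1000, INT_LIMIT    = 0x1010;
    static final int MASK_START   = 0x2000, MASK_LIMIT   = 0x2001;
    static final int DOUBLE_START = 0x3000, DOUBLE_LIMIT = 0x3001;
    static final int STRING_START = 0x4000, STRING_LIMIT = 0x400D;

    // Returns the family a selector falls into, or "unknown" if it is in no range.
    static String kindOf(int property) {
        if (property >= BINARY_START && property < BINARY_LIMIT) return "binary";
        if (property >= INT_START    && property < INT_LIMIT)    return "int";
        if (property >= MASK_START   && property < MASK_LIMIT)   return "mask";
        if (property >= DOUBLE_START && property < DOUBLE_LIMIT) return "double";
        if (property >= STRING_START && property < STRING_LIMIT) return "string";
        return "unknown";
    }

    public static void main(String[] args) {
        System.out.println(kindOf(BINARY_START)); // first binary selector
        System.out.println(kindOf(BINARY_LIMIT)); // the limit itself is outside the range
    }
}
```

Because each limit is "one more than the last constant", a loop of the form `for (int p = BINARY_START; p < BINARY_LIMIT; p++)` visits every selector in a family without needing to know the last constant by name.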
UNKNOWN: ICU 2.4
Binary property Alphabetic.
Property for UCharacter.isUAlphabetic(), different from the property in UCharacter.isalpha().
Lu + Ll + Lt + Lm + Lo + Nl + Other_Alphabetic.
UNKNOWN: ICU 2.6
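The derivation above includes the letter-number category Nl, which is why Alphabetic is broader than a plain "is a letter" test. As a rough illustration that uses only the JDK rather than ICU4J, the sketch below compares `java.lang.Character.isLetter` with `java.lang.Character.isAlphabetic` on U+2167 ROMAN NUMERAL EIGHT (general category Nl). The class and method names are made up for this example, and it assumes the JDK methods follow their documented definitions.

```java
// JDK-only sketch: Nl code points are Alphabetic but are not letters.
class AlphabeticVsLetter {
    static final int ROMAN_NUMERAL_EIGHT = 0x2167; // general category Nl

    static boolean isLetter(int cp) {
        return Character.isLetter(cp);      // Lu, Ll, Lt, Lm, Lo only
    }

    static boolean isAlphabetic(int cp) {
        return Character.isAlphabetic(cp);  // letters + Nl + Other_Alphabetic
    }

    public static void main(String[] args) {
        System.out.println(isLetter(ROMAN_NUMERAL_EIGHT));     // false
        System.out.println(isAlphabetic(ROMAN_NUMERAL_EIGHT)); // true
    }
}
```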
UNKNOWN: ICU 2.6
UNKNOWN: ICU 2.4
Binary property Bidi_Control.
Format controls which have specific functions in the Bidi Algorithm.
UNKNOWN: ICU 2.6
Binary property Bidi_Mirrored.
Characters that may change display in RTL text.
Property for UCharacter.isMirrored().
See Bidi Algorithm; UTR 9.
UNKNOWN: ICU 2.6
UNKNOWN: ICU 2.4
One more than the last constant for binary Unicode properties.
UNKNOWN: ICU 2.6
UNKNOWN: ICU 2.6
UNKNOWN: ICU 2.4
UNKNOWN: ICU 2.4
UNKNOWN: ICU 2.4
Binary property Case_Sensitive.
Either the source of a case mapping or _in_ the target of a case mapping. Not the same as the general category Cased_Letter.
UNKNOWN: ICU 2.6
Binary property Dash.
Variations of dashes.
UNKNOWN: ICU 2.6
UNKNOWN: ICU 2.4
Binary property Default_Ignorable_Code_Point (new).
Indicates that a code point is ignorable in most processing.
Code points (2060..206F, FFF0..FFFB, E0000..E0FFF) + Other_Default_Ignorable_Code_Point + (Cf + Cc + Cs - White_Space)
UNKNOWN: ICU 2.6
Binary property Deprecated (new).
The usage of deprecated characters is strongly discouraged.
UNKNOWN: ICU 2.6
Binary property Diacritic.
Characters that linguistically modify the meaning of another character to which they apply.
UNKNOWN: ICU 2.6
UNKNOWN: ICU 2.4
UNKNOWN: ICU 2.4
UNKNOWN: ICU 2.4
Binary property Extender.
Characters that extend the value or shape of a preceding alphabetic character, e.g. length and iteration marks.
UNKNOWN: ICU 2.6
Binary property Full_Composition_Exclusion.
CompositionExclusions.txt + Singleton Decompositions + Non-Starter Decompositions.
UNKNOWN: ICU 2.6
UNKNOWN: ICU 2.4
UNKNOWN: ICU 2.4
Binary property Grapheme_Base (new).
For programmatic determination of grapheme cluster boundaries.
[0..10FFFF]-Cc-Cf-Cs-Co-Cn-Zl-Zp-Grapheme_Link-Grapheme_Extend-CGJ
UNKNOWN: ICU 2.6
UNKNOWN: ICU 3.4 This API might change or be removed in a future release.
Binary property Grapheme_Extend (new).
For programmatic determination of grapheme cluster boundaries.
Me+Mn+Mc+Other_Grapheme_Extend-Grapheme_Link-CGJ
UNKNOWN: ICU 2.6
Binary property Grapheme_Link (new).
For programmatic determination of grapheme cluster boundaries.
UNKNOWN: ICU 2.6
UNKNOWN: ICU 2.6
Binary property Hex_Digit.
Characters commonly used for hexadecimal numbers.
UNKNOWN: ICU 2.6
Binary property Hyphen.
Dashes used to mark connections between pieces of words, plus the Katakana middle dot.
UNKNOWN: ICU 2.6
Binary property Ideographic.
CJKV ideographs.
UNKNOWN: ICU 2.6
Binary property IDS_Binary_Operator (new).
For programmatic determination of Ideographic Description Sequences.
UNKNOWN: ICU 2.6
Binary property IDS_Trinary_Operator (new).
UNKNOWN: ICU 2.6
Binary property ID_Continue.
Characters that can continue an identifier.
ID_Start+Mn+Mc+Nd+Pc
UNKNOWN: ICU 2.6
Binary property ID_Start.
Characters that can start an identifier.
Lu+Ll+Lt+Lm+Lo+Nl
UNKNOWN: ICU 2.6
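The two category formulas given for ID_Start and ID_Continue can be approximated with general categories alone. The sketch below does this with `java.lang.Character.getType` from the JDK; it is not the ICU4J implementation, the class and method names are hypothetical, and it deliberately covers only the categories listed in the formulas above.

```java
// JDK-only sketch of the category formulas:
//   ID_Start    = Lu+Ll+Lt+Lm+Lo+Nl
//   ID_Continue = ID_Start+Mn+Mc+Nd+Pc
class IdentifierCategories {
    static boolean isIdStartByCategory(int cp) {
        switch (Character.getType(cp)) {
            case Character.UPPERCASE_LETTER: // Lu
            case Character.LOWERCASE_LETTER: // Ll
            case Character.TITLECASE_LETTER: // Lt
            case Character.MODIFIER_LETTER:  // Lm
            case Character.OTHER_LETTER:     // Lo
            case Character.LETTER_NUMBER:    // Nl
                return true;
            default:
                return false;
        }
    }

    static boolean isIdContinueByCategory(int cp) {
        if (isIdStartByCategory(cp)) {
            return true;
        }
        switch (Character.getType(cp)) {
            case Character.NON_SPACING_MARK:       // Mn
            case Character.COMBINING_SPACING_MARK: // Mc
            case Character.DECIMAL_DIGIT_NUMBER:   // Nd
            case Character.CONNECTOR_PUNCTUATION:  // Pc
                return true;
            default:
                return false;
        }
    }

    public static void main(String[] args) {
        System.out.println(isIdStartByCategory('7'));    // false: Nd cannot start
        System.out.println(isIdContinueByCategory('7')); // true: Nd can continue
    }
}
```

Note that under these formulas the underscore (category Pc) may continue an identifier but may not start one.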
UNKNOWN: ICU 2.4
UNKNOWN: ICU 2.4
UNKNOWN: ICU 2.4
UNKNOWN: ICU 2.4
UNKNOWN: ICU 2.4
Binary property Join_Control.
Format controls for cursive joining and ligation.
UNKNOWN: ICU 2.6
UNKNOWN: ICU 3.0 This API might change or be removed in a future release.
UNKNOWN: ICU 2.4
Binary property Logical_Order_Exception (new).
Characters that do not use logical order and require special handling in most processing.
UNKNOWN: ICU 2.6
Binary property Lowercase.
Same as UCharacter.isULowercase(), different from UCharacter.islower().
Ll+Other_Lowercase
UNKNOWN: ICU 2.6
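The formula Ll+Other_Lowercase means some code points are Lowercase without having general category Ll. A JDK-only way to see this, assuming `java.lang.Character.isLowerCase` follows its documented definition (Ll or Other_Lowercase, Java 7 and later), is U+02B0 MODIFIER LETTER SMALL H, whose general category is Lm. The class and method names below are hypothetical.

```java
// JDK-only sketch: a code point can be Lowercase without being in category Ll.
class LowercaseProperty {
    static final int MODIFIER_LETTER_SMALL_H = 0x02B0; // general category Lm

    static boolean isCategoryLl(int cp) {
        return Character.getType(cp) == Character.LOWERCASE_LETTER;
    }

    static boolean isLowercase(int cp) {
        return Character.isLowerCase(cp); // Ll or Other_Lowercase
    }

    public static void main(String[] args) {
        System.out.println(isCategoryLl(MODIFIER_LETTER_SMALL_H)); // false
        System.out.println(isLowercase(MODIFIER_LETTER_SMALL_H));  // true
    }
}
```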
UNKNOWN: ICU 2.4
UNKNOWN: ICU 2.4
UNKNOWN: ICU 2.4
Binary property Math.
Sm+Other_Math
UNKNOWN: ICU 2.6
UNKNOWN: ICU 2.4
See Also: NFD_INERT
UNKNOWN: ICU 3.0 This API might change or be removed in a future release.
UNKNOWN: ICU 3.0 This API might change or be removed in a future release.
UNKNOWN: ICU 3.0 This API might change or be removed in a future release.
UNKNOWN: ICU 3.0 This API might change or be removed in a future release.
See Also: NFD_INERT
UNKNOWN: ICU 3.0 This API might change or be removed in a future release.
UNKNOWN: ICU 3.0 This API might change or be removed in a future release.
See Also: NFD_INERT
UNKNOWN: ICU 3.0 This API might change or be removed in a future release.
UNKNOWN: ICU 3.0 This API might change or be removed in a future release.
Binary property Noncharacter_Code_Point.
Code points that are explicitly defined as illegal for the encoding of characters.
UNKNOWN: ICU 2.6
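For reference, the Unicode Standard defines the noncharacters as U+FDD0..U+FDEF plus the last two code points of every plane (those whose low 16 bits are FFFE or FFFF). The sketch below hard-codes that set in plain Java; it does not query ICU4J, and the class and method names are hypothetical.

```java
// Sketch: tests a code point against the fixed set of Unicode noncharacters.
class Noncharacters {
    static boolean isNoncharacter(int cp) {
        if (cp < 0 || cp > 0x10FFFF) {
            return false; // not a code point at all
        }
        boolean inArabicBlockRange = cp >= 0xFDD0 && cp <= 0xFDEF;
        boolean lastTwoOfPlane = (cp & 0xFFFE) == 0xFFFE; // low 16 bits FFFE or FFFF
        return inArabicBlockRange || lastTwoOfPlane;
    }

    public static void main(String[] args) {
        System.out.println(isNoncharacter(0xFFFE)); // true
        System.out.println(isNoncharacter(0x0041)); // false
    }
}
```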
UNKNOWN: ICU 2.4
UNKNOWN: ICU 2.4
UNKNOWN: ICU 3.4 This API might change or be removed in a future release.
UNKNOWN: ICU 3.4 This API might change or be removed in a future release.
UNKNOWN: ICU 3.4 This API might change or be removed in a future release.
UNKNOWN: ICU 3.4 This API might change or be removed in a future release.
UNKNOWN: ICU 3.4 This API might change or be removed in a future release.
UNKNOWN: ICU 3.4 This API might change or be removed in a future release.
UNKNOWN: ICU 3.4 This API might change or be removed in a future release.
Binary property Quotation_Mark.
UNKNOWN: ICU 2.6
Binary property Radical (new).
For programmatic determination of Ideographic Description Sequences.
UNKNOWN: ICU 2.6
UNKNOWN: ICU 2.4
UNKNOWN: ICU 3.0 This API might change or be removed in a future release.
UNKNOWN: ICU 3.4 This API might change or be removed in a future release.
UNKNOWN: ICU 2.4
UNKNOWN: ICU 2.4
UNKNOWN: ICU 2.4
UNKNOWN: ICU 2.4
Binary property Soft_Dotted (new).
Characters with a "soft dot", like i or j.
An accent placed on these characters causes the dot to disappear.
UNKNOWN: ICU 2.6
UNKNOWN: ICU 2.4
UNKNOWN: ICU 2.4
UNKNOWN: ICU 3.0 This API might change or be removed in a future release.
Binary property Terminal_Punctuation.
Punctuation characters that generally mark the end of textual units.
UNKNOWN: ICU 2.6
UNKNOWN: ICU 2.4
UNKNOWN: ICU 3.0 This API might change or be removed in a future release.
UNKNOWN: ICU 2.4
Binary property Unified_Ideograph (new).
For programmatic determination of Ideographic Description Sequences.
UNKNOWN: ICU 2.6
Binary property Uppercase.
Same as UCharacter.isUUppercase(), different from UCharacter.isUpperCase().
Lu+Other_Uppercase
UNKNOWN: ICU 2.6
UNKNOWN: ICU 2.4
UNKNOWN: ICU 3.0 This API might change or be removed in a future release.
Binary property White_Space.
Same as UCharacter.isUWhiteSpace(), different from UCharacter.isSpace() and UCharacter.isWhitespace().
Space characters+TAB+CR+LF-ZWSP-ZWNBSP
UNKNOWN: ICU 2.6
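The difference between the White_Space property and a Java-style whitespace test can be seen without ICU4J. The sketch below assumes that `java.util.regex.Pattern` supports the binary property name `IsWhite_Space` (documented for Java 7 and later) and compares it with `java.lang.Character.isWhitespace` on U+00A0 NO-BREAK SPACE and U+001C FILE SEPARATOR. The class and method names are hypothetical.

```java
import java.util.regex.Pattern;

// JDK-only sketch: Unicode White_Space versus Character.isWhitespace.
class WhiteSpaceProperty {
    private static final Pattern WHITE_SPACE = Pattern.compile("\\p{IsWhite_Space}");

    static boolean hasWhiteSpaceProperty(int cp) {
        return WHITE_SPACE.matcher(new String(Character.toChars(cp))).matches();
    }

    public static void main(String[] args) {
        int noBreakSpace = 0x00A0;
        int fileSeparator = 0x001C;
        // NO-BREAK SPACE: has White_Space, but Java excludes non-breaking spaces.
        System.out.println(hasWhiteSpaceProperty(noBreakSpace));   // true
        System.out.println(Character.isWhitespace(noBreakSpace));  // false
        // FILE SEPARATOR: Java whitespace, but lacks the White_Space property.
        System.out.println(hasWhiteSpaceProperty(fileSeparator));  // false
        System.out.println(Character.isWhitespace(fileSeparator)); // true
    }
}
```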
UNKNOWN: ICU 3.4 This API might change or be removed in a future release.
Binary property XID_Continue.
ID_Continue modified to allow closure under normalization forms NFKC and NFKD.
UNKNOWN: ICU 2.6
Binary property XID_Start.
ID_Start modified to allow closure under normalization forms NFKC and NFKD.
UNKNOWN: ICU 2.6