public static final class java.lang
Character.UnicodeBlock


Show All Login
Java SE 6
  
Extends: Character.Subset
Contained within: Character
Details
A family of character subsets representing the character blocks in the Unicode specification. Character blocks generally define characters used for a specific script or purpose. A character is contained by at most one Unicode block.
since1.2

Fields
final public static Character.UnicodeBlock BASIC_LATIN Details
Constant for the "Basic Latin" Unicode character block.
since1.2
final public static Character.UnicodeBlock LATIN_1_SUPPLEMENT Details
Constant for the "Latin-1 Supplement" Unicode character block.
since1.2
final public static Character.UnicodeBlock LATIN_EXTENDED_A Details
Constant for the "Latin Extended-A" Unicode character block.
since1.2
final public static Character.UnicodeBlock LATIN_EXTENDED_B Details
Constant for the "Latin Extended-B" Unicode character block.
since1.2
final public static Character.UnicodeBlock IPA_EXTENSIONS Details
Constant for the "IPA Extensions" Unicode character block.
since1.2
final public static Character.UnicodeBlock SPACING_MODIFIER_LETTERS Details
Constant for the "Spacing Modifier Letters" Unicode character block.
since1.2
final public static Character.UnicodeBlock COMBINING_DIACRITICAL_MARKS Details
Constant for the "Combining Diacritical Marks" Unicode character block.
since1.2
final public static Character.UnicodeBlock GREEK Details
Constant for the "Greek and Coptic" Unicode character block.

This block was previously known as the "Greek" block.
since1.2

final public static Character.UnicodeBlock CYRILLIC Details
Constant for the "Cyrillic" Unicode character block.
since1.2
final public static Character.UnicodeBlock ARMENIAN Details
Constant for the "Armenian" Unicode character block.
since1.2
final public static Character.UnicodeBlock HEBREW Details
Constant for the "Hebrew" Unicode character block.
since1.2
final public static Character.UnicodeBlock ARABIC Details
Constant for the "Arabic" Unicode character block.
since1.2
final public static Character.UnicodeBlock DEVANAGARI Details
Constant for the "Devanagari" Unicode character block.
since1.2
final public static Character.UnicodeBlock BENGALI Details
Constant for the "Bengali" Unicode character block.
since1.2
final public static Character.UnicodeBlock GURMUKHI Details
Constant for the "Gurmukhi" Unicode character block.
since1.2
final public static Character.UnicodeBlock GUJARATI Details
Constant for the "Gujarati" Unicode character block.
since1.2
final public static Character.UnicodeBlock ORIYA Details
Constant for the "Oriya" Unicode character block.
since1.2
final public static Character.UnicodeBlock TAMIL Details
Constant for the "Tamil" Unicode character block.
since1.2
final public static Character.UnicodeBlock TELUGU Details
Constant for the "Telugu" Unicode character block.
since1.2
final public static Character.UnicodeBlock KANNADA Details
Constant for the "Kannada" Unicode character block.
since1.2
final public static Character.UnicodeBlock MALAYALAM Details
Constant for the "Malayalam" Unicode character block.
since1.2
final public static Character.UnicodeBlock THAI Details
Constant for the "Thai" Unicode character block.
since1.2
final public static Character.UnicodeBlock LAO Details
Constant for the "Lao" Unicode character block.
since1.2
final public static Character.UnicodeBlock TIBETAN Details
Constant for the "Tibetan" Unicode character block.
since1.2
final public static Character.UnicodeBlock GEORGIAN Details
Constant for the "Georgian" Unicode character block.
since1.2
final public static Character.UnicodeBlock HANGUL_JAMO Details
Constant for the "Hangul Jamo" Unicode character block.
since1.2
final public static Character.UnicodeBlock LATIN_EXTENDED_ADDITIONAL Details
Constant for the "Latin Extended Additional" Unicode character block.
since1.2
final public static Character.UnicodeBlock GREEK_EXTENDED Details
Constant for the "Greek Extended" Unicode character block.
since1.2
final public static Character.UnicodeBlock GENERAL_PUNCTUATION Details
Constant for the "General Punctuation" Unicode character block.
since1.2
final public static Character.UnicodeBlock SUPERSCRIPTS_AND_SUBSCRIPTS Details
Constant for the "Superscripts and Subscripts" Unicode character block.
since1.2
final public static Character.UnicodeBlock CURRENCY_SYMBOLS Details
Constant for the "Currency Symbols" Unicode character block.
since1.2
final public static Character.UnicodeBlock COMBINING_MARKS_FOR_SYMBOLS Details
Constant for the "Combining Diacritical Marks for Symbols" Unicode character block.

This block was previously known as "Combining Marks for Symbols".
since1.2

final public static Character.UnicodeBlock LETTERLIKE_SYMBOLS Details
Constant for the "Letterlike Symbols" Unicode character block.
since1.2
final public static Character.UnicodeBlock NUMBER_FORMS Details
Constant for the "Number Forms" Unicode character block.
since1.2
final public static Character.UnicodeBlock ARROWS Details
Constant for the "Arrows" Unicode character block.
since1.2
final public static Character.UnicodeBlock MATHEMATICAL_OPERATORS Details
Constant for the "Mathematical Operators" Unicode character block.
since1.2
final public static Character.UnicodeBlock MISCELLANEOUS_TECHNICAL Details
Constant for the "Miscellaneous Technical" Unicode character block.
since1.2
final public static Character.UnicodeBlock CONTROL_PICTURES Details
Constant for the "Control Pictures" Unicode character block.
since1.2
final public static Character.UnicodeBlock OPTICAL_CHARACTER_RECOGNITION Details
Constant for the "Optical Character Recognition" Unicode character block.
since1.2
final public static Character.UnicodeBlock ENCLOSED_ALPHANUMERICS Details
Constant for the "Enclosed Alphanumerics" Unicode character block.
since1.2
final public static Character.UnicodeBlock BOX_DRAWING Details
Constant for the "Box Drawing" Unicode character block.
since1.2
final public static Character.UnicodeBlock BLOCK_ELEMENTS Details
Constant for the "Block Elements" Unicode character block.
since1.2
final public static Character.UnicodeBlock GEOMETRIC_SHAPES Details
Constant for the "Geometric Shapes" Unicode character block.
since1.2
final public static Character.UnicodeBlock MISCELLANEOUS_SYMBOLS Details
Constant for the "Miscellaneous Symbols" Unicode character block.
since1.2
final public static Character.UnicodeBlock DINGBATS Details
Constant for the "Dingbats" Unicode character block.
since1.2
final public static Character.UnicodeBlock CJK_SYMBOLS_AND_PUNCTUATION Details
Constant for the "CJK Symbols and Punctuation" Unicode character block.
since1.2
final public static Character.UnicodeBlock HIRAGANA Details
Constant for the "Hiragana" Unicode character block.
since1.2
final public static Character.UnicodeBlock KATAKANA Details
Constant for the "Katakana" Unicode character block.
since1.2
final public static Character.UnicodeBlock BOPOMOFO Details
Constant for the "Bopomofo" Unicode character block.
since1.2
final public static Character.UnicodeBlock HANGUL_COMPATIBILITY_JAMO Details
Constant for the "Hangul Compatibility Jamo" Unicode character block.
since1.2
final public static Character.UnicodeBlock KANBUN Details
Constant for the "Kanbun" Unicode character block.
since1.2
final public static Character.UnicodeBlock ENCLOSED_CJK_LETTERS_AND_MONTHS Details
Constant for the "Enclosed CJK Letters and Months" Unicode character block.
since1.2
final public static Character.UnicodeBlock CJK_COMPATIBILITY Details
Constant for the "CJK Compatibility" Unicode character block.
since1.2
final public static Character.UnicodeBlock CJK_UNIFIED_IDEOGRAPHS Details
Constant for the "CJK Unified Ideographs" Unicode character block.
since1.2
final public static Character.UnicodeBlock HANGUL_SYLLABLES Details
Constant for the "Hangul Syllables" Unicode character block.
since1.2
final public static Character.UnicodeBlock PRIVATE_USE_AREA Details
Constant for the "Private Use Area" Unicode character block.
since1.2
final public static Character.UnicodeBlock CJK_COMPATIBILITY_IDEOGRAPHS Details
Constant for the "CJK Compatibility Ideographs" Unicode character block.
since1.2
final public static Character.UnicodeBlock ALPHABETIC_PRESENTATION_FORMS Details
Constant for the "Alphabetic Presentation Forms" Unicode character block.
since1.2
final public static Character.UnicodeBlock ARABIC_PRESENTATION_FORMS_A Details
Constant for the "Arabic Presentation Forms-A" Unicode character block.
since1.2
final public static Character.UnicodeBlock COMBINING_HALF_MARKS Details
Constant for the "Combining Half Marks" Unicode character block.
since1.2
final public static Character.UnicodeBlock CJK_COMPATIBILITY_FORMS Details
Constant for the "CJK Compatibility Forms" Unicode character block.
since1.2
final public static Character.UnicodeBlock SMALL_FORM_VARIANTS Details
Constant for the "Small Form Variants" Unicode character block.
since1.2
final public static Character.UnicodeBlock ARABIC_PRESENTATION_FORMS_B Details
Constant for the "Arabic Presentation Forms-B" Unicode character block.
since1.2
final public static Character.UnicodeBlock HALFWIDTH_AND_FULLWIDTH_FORMS Details
Constant for the "Halfwidth and Fullwidth Forms" Unicode character block.
since1.2
final public static Character.UnicodeBlock SPECIALS Details
Constant for the "Specials" Unicode character block.
since1.2
final public static Character.UnicodeBlock
SURROGATES_AREA
Details
deprecatedAs of J2SE 5, use #HIGH_SURROGATES, #HIGH_PRIVATE_USE_SURROGATES, and #LOW_SURROGATES. These new constants match the block definitions of the Unicode Standard. The #of(char) and #of(int) methods return the new constants, not SURROGATES_AREA.
final public static Character.UnicodeBlock SYRIAC Details
Constant for the "Syriac" Unicode character block.
since1.4
final public static Character.UnicodeBlock THAANA Details
Constant for the "Thaana" Unicode character block.
since1.4
final public static Character.UnicodeBlock SINHALA Details
Constant for the "Sinhala" Unicode character block.
since1.4
final public static Character.UnicodeBlock MYANMAR Details
Constant for the "Myanmar" Unicode character block.
since1.4
final public static Character.UnicodeBlock ETHIOPIC Details
Constant for the "Ethiopic" Unicode character block.
since1.4
final public static Character.UnicodeBlock CHEROKEE Details
Constant for the "Cherokee" Unicode character block.
since1.4
final public static Character.UnicodeBlock UNIFIED_CANADIAN_ABORIGINAL_SYLLABICS Details
Constant for the "Unified Canadian Aboriginal Syllabics" Unicode character block.
since1.4
final public static Character.UnicodeBlock OGHAM Details
Constant for the "Ogham" Unicode character block.
since1.4
final public static Character.UnicodeBlock RUNIC Details
Constant for the "Runic" Unicode character block.
since1.4
final public static Character.UnicodeBlock KHMER Details
Constant for the "Khmer" Unicode character block.
since1.4
final public static Character.UnicodeBlock MONGOLIAN Details
Constant for the "Mongolian" Unicode character block.
since1.4
final public static Character.UnicodeBlock BRAILLE_PATTERNS Details
Constant for the "Braille Patterns" Unicode character block.
since1.4
final public static Character.UnicodeBlock CJK_RADICALS_SUPPLEMENT Details
Constant for the "CJK Radicals Supplement" Unicode character block.
since1.4
final public static Character.UnicodeBlock KANGXI_RADICALS Details
Constant for the "Kangxi Radicals" Unicode character block.
since1.4
final public static Character.UnicodeBlock IDEOGRAPHIC_DESCRIPTION_CHARACTERS Details
Constant for the "Ideographic Description Characters" Unicode character block.
since1.4
final public static Character.UnicodeBlock BOPOMOFO_EXTENDED Details
Constant for the "Bopomofo Extended" Unicode character block.
since1.4
final public static Character.UnicodeBlock CJK_UNIFIED_IDEOGRAPHS_EXTENSION_A Details
Constant for the "CJK Unified Ideographs Extension A" Unicode character block.
since1.4
final public static Character.UnicodeBlock YI_SYLLABLES Details
Constant for the "Yi Syllables" Unicode character block.
since1.4
final public static Character.UnicodeBlock YI_RADICALS Details
Constant for the "Yi Radicals" Unicode character block.
since1.4
final public static Character.UnicodeBlock CYRILLIC_SUPPLEMENTARY Details
Constant for the "Cyrillic Supplementary" Unicode character block.
since1.5
final public static Character.UnicodeBlock TAGALOG Details
Constant for the "Tagalog" Unicode character block.
since1.5
final public static Character.UnicodeBlock HANUNOO Details
Constant for the "Hanunoo" Unicode character block.
since1.5
final public static Character.UnicodeBlock BUHID Details
Constant for the "Buhid" Unicode character block.
since1.5
final public static Character.UnicodeBlock TAGBANWA Details
Constant for the "Tagbanwa" Unicode character block.
since1.5
final public static Character.UnicodeBlock LIMBU Details
Constant for the "Limbu" Unicode character block.
since1.5
final public static Character.UnicodeBlock TAI_LE Details
Constant for the "Tai Le" Unicode character block.
since1.5
final public static Character.UnicodeBlock KHMER_SYMBOLS Details
Constant for the "Khmer Symbols" Unicode character block.
since1.5
final public static Character.UnicodeBlock PHONETIC_EXTENSIONS Details
Constant for the "Phonetic Extensions" Unicode character block.
since1.5
final public static Character.UnicodeBlock MISCELLANEOUS_MATHEMATICAL_SYMBOLS_A Details
Constant for the "Miscellaneous Mathematical Symbols-A" Unicode character block.
since1.5
final public static Character.UnicodeBlock SUPPLEMENTAL_ARROWS_A Details
Constant for the "Supplemental Arrows-A" Unicode character block.
since1.5
final public static Character.UnicodeBlock SUPPLEMENTAL_ARROWS_B Details
Constant for the "Supplemental Arrows-B" Unicode character block.
since1.5
final public static Character.UnicodeBlock MISCELLANEOUS_MATHEMATICAL_SYMBOLS_B Details
Constant for the "Miscellaneous Mathematical Symbols-B" Unicode character block.
since1.5
final public static Character.UnicodeBlock SUPPLEMENTAL_MATHEMATICAL_OPERATORS Details
Constant for the "Supplemental Mathematical Operators" Unicode character block.
since1.5
final public static Character.UnicodeBlock MISCELLANEOUS_SYMBOLS_AND_ARROWS Details
Constant for the "Miscellaneous Symbols and Arrows" Unicode character block.
since1.5
final public static Character.UnicodeBlock KATAKANA_PHONETIC_EXTENSIONS Details
Constant for the "Katakana Phonetic Extensions" Unicode character block.
since1.5
final public static Character.UnicodeBlock YIJING_HEXAGRAM_SYMBOLS Details
Constant for the "Yijing Hexagram Symbols" Unicode character block.
since1.5
final public static Character.UnicodeBlock VARIATION_SELECTORS Details
Constant for the "Variation Selectors" Unicode character block.
since1.5
final public static Character.UnicodeBlock LINEAR_B_SYLLABARY Details
Constant for the "Linear B Syllabary" Unicode character block.
since1.5
final public static Character.UnicodeBlock LINEAR_B_IDEOGRAMS Details
Constant for the "Linear B Ideograms" Unicode character block.
since1.5
final public static Character.UnicodeBlock AEGEAN_NUMBERS Details
Constant for the "Aegean Numbers" Unicode character block.
since1.5
final public static Character.UnicodeBlock OLD_ITALIC Details
Constant for the "Old Italic" Unicode character block.
since1.5
final public static Character.UnicodeBlock GOTHIC Details
Constant for the "Gothic" Unicode character block.
since1.5
final public static Character.UnicodeBlock UGARITIC Details
Constant for the "Ugaritic" Unicode character block.
since1.5
final public static Character.UnicodeBlock DESERET Details
Constant for the "Deseret" Unicode character block.
since1.5
final public static Character.UnicodeBlock SHAVIAN Details
Constant for the "Shavian" Unicode character block.
since1.5
final public static Character.UnicodeBlock OSMANYA Details
Constant for the "Osmanya" Unicode character block.
since1.5
final public static Character.UnicodeBlock CYPRIOT_SYLLABARY Details
Constant for the "Cypriot Syllabary" Unicode character block.
since1.5
final public static Character.UnicodeBlock BYZANTINE_MUSICAL_SYMBOLS Details
Constant for the "Byzantine Musical Symbols" Unicode character block.
since1.5
final public static Character.UnicodeBlock MUSICAL_SYMBOLS Details
Constant for the "Musical Symbols" Unicode character block.
since1.5
final public static Character.UnicodeBlock TAI_XUAN_JING_SYMBOLS Details
Constant for the "Tai Xuan Jing Symbols" Unicode character block.
since1.5
final public static Character.UnicodeBlock MATHEMATICAL_ALPHANUMERIC_SYMBOLS Details
Constant for the "Mathematical Alphanumeric Symbols" Unicode character block.
since1.5
final public static Character.UnicodeBlock CJK_UNIFIED_IDEOGRAPHS_EXTENSION_B Details
Constant for the "CJK Unified Ideographs Extension B" Unicode character block.
since1.5
final public static Character.UnicodeBlock CJK_COMPATIBILITY_IDEOGRAPHS_SUPPLEMENT Details
Constant for the "CJK Compatibility Ideographs Supplement" Unicode character block.
since1.5
final public static Character.UnicodeBlock TAGS Details
Constant for the "Tags" Unicode character block.
since1.5
final public static Character.UnicodeBlock VARIATION_SELECTORS_SUPPLEMENT Details
Constant for the "Variation Selectors Supplement" Unicode character block.
since1.5
final public static Character.UnicodeBlock SUPPLEMENTARY_PRIVATE_USE_AREA_A Details
Constant for the "Supplementary Private Use Area-A" Unicode character block.
since1.5
final public static Character.UnicodeBlock SUPPLEMENTARY_PRIVATE_USE_AREA_B Details
Constant for the "Supplementary Private Use Area-B" Unicode character block.
since1.5
final public static Character.UnicodeBlock HIGH_SURROGATES Details
Constant for the "High Surrogates" Unicode character block. This block represents codepoint values in the high surrogate range: 0xD800 through 0xDB7F
since1.5
final public static Character.UnicodeBlock HIGH_PRIVATE_USE_SURROGATES Details
Constant for the "High Private Use Surrogates" Unicode character block. This block represents codepoint values in the high surrogate range: 0xDB80 through 0xDBFF
since1.5
final public static Character.UnicodeBlock LOW_SURROGATES Details
Constant for the "Low Surrogates" Unicode character block. This block represents codepoint values in the high surrogate range: 0xDC00 through 0xDFFF
since1.5

Methods
final public static Character.UnicodeBlock forName(String blockName) Details
Returns the UnicodeBlock with the given name. Block names are determined by The Unicode Standard. The file Blocks-<version>.txt defines blocks for a particular version of the standard. The Character class specifies the version of the standard that it supports.

This method accepts block names in the following forms:

  1. Canonical block names as defined by the Unicode Standard. For example, the standard defines a "Basic Latin" block. Therefore, this method accepts "Basic Latin" as a valid block name. The documentation of each UnicodeBlock provides the canonical name.
  2. Canonical block names with all spaces removed. For example, "BasicLatin" is a valid block name for the "Basic Latin" block.
  3. The text representation of each constant UnicodeBlock identifier. For example, this method will return the #BASIC_LATIN block if provided with the "BASIC_LATIN" name. This form replaces all spaces and hyphens in the canonical name with underscores.
Finally, character case is ignored for all of the valid block name forms. For example, "BASIC_LATIN" and "basic_latin" are both valid block names. The en_US locale's case mapping rules are used to provide case-insensitive string comparisons for block name validation.

If the Unicode Standard changes block names, both the previous and current names will be accepted.
blockNameA UnicodeBlock name.
returnThe UnicodeBlock instance identified by blockName
ThrowsIllegalArgumentException: if blockName is an invalid name
ThrowsNullPointerException: if blockName is null
since1.5

public static Character.UnicodeBlock of(char c) Details
Returns the object representing the Unicode block containing the given character, or null if the character is not a member of a defined block.

Note: This method cannot handle supplementary characters. To support all Unicode characters, including supplementary characters, use the #of(int) method.
cThe character in question
returnThe UnicodeBlock instance representing the Unicode block of which this character is a member, or null if the character is not a member of any Unicode block

public static Character.UnicodeBlock of(int codePoint) Details
Returns the object representing the Unicode block containing the given character (Unicode code point), or null if the character is not a member of a defined block.
codePointthe character (Unicode code point) in question.
returnThe UnicodeBlock instance representing the Unicode block of which this character is a member, or null if the character is not a member of any Unicode block
ThrowsIllegalArgumentException: if the specified codePoint is an invalid Unicode code point.
since1.5
See also isValidCodePoint(int)