Balinese (Unicode block)

Last updated
Balinese
RangeU+1B00..U+1B7F
(128 code points)
Plane BMP
Scripts Balinese
Major alphabetsaksara Bali
Assigned127 code points
Unused1 reserved code points
Unicode version history
5.0 (2006)121 (+121)
14.0 (2021)124 (+3)
16.0 (2024)127 (+3)
Unicode documentation
Code chart ∣ Web page
Note: [1] [2]

Balinese is a Unicode block containing characters of Balinese script for the Balinese language. Balinese language is mainly spoken on the island of Bali, Indonesia.

Contents

Block

Balinese [1] [2]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+1B0x
U+1B1x
U+1B2x
U+1B3xᬿ
U+1B4x
U+1B5x
U+1B6x
U+1B7x᭿
Notes
1. ^ As of Unicode version 16.0
2. ^ Grey area indicates non-assigned code point

History

The following Unicode-related documents record the purpose and process of defining specific characters in the Balinese block:

See also

Related Research Articles

<span class="mw-page-title-main">Michael Everson</span> American-Irish type designer (born 1963)

Michael Everson is an American and Irish linguist, script encoder, typesetter, type designer and publisher. He runs a publishing company called Evertype, through which he has published over one hundred books since 2006.

Miscellaneous Technical is a Unicode block ranging from U+2300 to U+23FF. It contains various common symbols which are related to and used in the various technical, programming language, and academic professions. For example:

Specials is a short Unicode block of characters allocated at the very end of the Basic Multilingual Plane, at U+FFF0–FFFF. Of these 16 code points, five have been assigned since Unicode 3.0:

Enclosed Alphanumerics is a Unicode block of typographical symbols of an alphanumeric within a circle, a bracket or other not-closed enclosure, or ending in a full stop.

Latin Extended Additional is a Unicode block.

Unified Canadian Aboriginal Syllabics is a Unicode block containing syllabic characters for writing Inuktitut, Carrier, Cree, Ojibwe, Blackfoot and Canadian Athabascan languages. Additions for some Cree dialects, Ojibwe, and Dene can be found at the Unified Canadian Aboriginal Syllabics Extended block.

Enclosed Alphanumeric Supplement is a Unicode block consisting of Latin alphabet characters and Arabic numerals enclosed in circles, ovals or boxes, used for a variety of purposes. It is encoded in the range U+1F100–U+1F1FF in the Supplementary Multilingual Plane.

Cyrillic is a Unicode block containing the characters used to write the most widely used languages with a Cyrillic orthography. The core of the block is based on the ISO 8859-5 standard, with additions for minority languages and historic orthographies.

Hebrew is a Unicode block containing characters for writing the Hebrew, Yiddish, Ladino, and other Jewish diaspora languages.

Georgian is a Unicode block containing the Mkhedruli and Asomtavruli Georgian characters used to write Modern Georgian, Svan, and Mingrelian languages. Another lower case, Nuskhuri, is encoded in a separate Georgian Supplement block, which is used with the Asomtavruli to write the ecclesiastical Khutsuri Georgian script.

Devanagari is a Unicode block containing characters for writing languages such as Hindi, Marathi, Bodo, Maithili, Sindhi, Nepali, and Sanskrit, among others. In its original incarnation, the code points U+0900..U+0954 were a direct copy of the characters A0-F4 from the 1988 ISCII standard. The Bengali, Gurmukhi, Gujarati, Oriya, Tamil, Telugu, Kannada, and Malayalam blocks were similarly all based on their ISCII encodings.

Lao is a Unicode block containing characters for the languages of Laos. The characters of the Lao block are allocated so as to be equivalent to the similarly positioned characters of the Thai block immediately preceding it.

Cherokee is a Unicode block containing the syllabic characters for writing the Cherokee language. When Cherokee was first added to Unicode in version 3.0 it was treated as a unicameral alphabet, but in version 8.0 it was redefined as a bicameral script. The Cherokee block contains all the uppercase letters plus six lowercase letters. The Cherokee Supplement block, added in version 8.0, contains the rest of the lowercase letters. For backwards compatibility, the Unicode case folding algorithm—which usually converts a string to lowercase characters—maps Cherokee characters to uppercase.

Hiragana is a Unicode block containing hiragana characters for the Japanese language.

Katakana is a Unicode block containing katakana characters for the Japanese and Ainu languages.

Katakana Phonetic Extensions is a Unicode block containing additional small katakana characters for writing the Ainu language, in addition to characters in the Katakana block.

Sundanese is a Unicode block containing modern characters for writing the Sundanese script of the Sundanese language of the island of Java, Indonesia.

Javanese is a Unicode block containing aksara Jawa characters traditionally used for writing the Javanese language.

Cherokee Supplement is a Unicode block containing the syllabic characters for writing the Cherokee language. When Cherokee was first added to Unicode in version 3.0 it was treated as a unicameral alphabet, but in version 8.0 it was redefined as a bicameral script. The Cherokee Supplement block contains lowercase letters only, whereas the Cherokee block contains all the uppercase letters, together with six lowercase letters. For backwards compatibility, the Unicode case folding algorithm—which usually converts a string to lowercase characters—maps Cherokee characters to uppercase.

Kawi is a Unicode block containing characters for Kawi script. The script was used historically in insular Southeast Asia to write the Old Javanese, Sanskrit, Old Malay, Old Balinese, and Old Sundanese languages.

References

  1. "Unicode character database". The Unicode Standard. Retrieved 2023-07-26.
  2. "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2023-07-26.