Kawi | |
---|---|
Range | U+11F00..U+11F5F (96 code points) |
Plane | SMP |
Scripts | Kawi |
Assigned | 86 code points |
Unused | 10 reserved code points |
Unicode version history | |
15.0 (2022) | 86 (+86) |
Unicode documentation | |
Code chart ∣ Web page | |
Note: [1] [2] |
Kawi is a Unicode block containing characters for Kawi script. The script was used historically in insular Southeast Asia to write the Old Javanese, Sanskrit, Old Malay, Old Balinese, and Old Sundanese languages. [3]
Kawi [1] [2] Official Unicode Consortium code chart (PDF) | ||||||||||||||||
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
U+11F0x | 𑼀 | 𑼁 | 𑼂 | 𑼃 | 𑼄 | 𑼅 | 𑼆 | 𑼇 | 𑼈 | 𑼉 | 𑼊 | 𑼋 | 𑼌 | 𑼍 | 𑼎 | 𑼏 |
U+11F1x | 𑼐 | 𑼒 | 𑼓 | 𑼔 | 𑼕 | 𑼖 | 𑼗 | 𑼘 | 𑼙 | 𑼚 | 𑼛 | 𑼜 | 𑼝 | 𑼞 | 𑼟 | |
U+11F2x | 𑼠 | 𑼡 | 𑼢 | 𑼣 | 𑼤 | 𑼥 | 𑼦 | 𑼧 | 𑼨 | 𑼩 | 𑼪 | 𑼫 | 𑼬 | 𑼭 | 𑼮 | 𑼯 |
U+11F3x | 𑼰 | 𑼱 | 𑼲 | 𑼳 | 𑼴 | 𑼵 | 𑼶 | 𑼷 | 𑼸 | 𑼹 | 𑼺 | 𑼾 | 𑼿 | |||
U+11F4x | 𑽀 | 𑽁 | 𑽂 | 𑽃 | 𑽄 | 𑽅 | 𑽆 | 𑽇 | 𑽈 | 𑽉 | 𑽊 | 𑽋 | 𑽌 | 𑽍 | 𑽎 | 𑽏 |
U+11F5x | 𑽐 | 𑽑 | 𑽒 | 𑽓 | 𑽔 | 𑽕 | 𑽖 | 𑽗 | 𑽘 | 𑽙 | ||||||
Notes |
The following Unicode-related documents record the purpose and process of defining specific characters in the Kawi block:
Version | Final code points [lower-alpha 1] | Count | L2 ID | WG2 ID | Document |
---|---|---|---|---|---|
15.0 | U+11F00..11F10, 11F12..11F3A, 11F3E..11F59 | 86 | L2/12-125 | N4266 | Pandey, Anshuman (2012-04-25), Preliminary Proposal to Encode the Kawi Script |
L2/12-147 | Anderson, Deborah; McGowan, Rick; Whistler, Ken (2012-04-25), "XI. KAWI", Review of Indic-related L2 documents and Recommendations to the UTC | ||||
L2/12-167 | Ganesan, Naga (2012-05-03), Comment on L2/12-125: Encoding Model for Kawi Script | ||||
L2/13-028 | Anderson, Deborah; McGowan, Rick; Whistler, Ken; Pournader, Roozbeh (2013-01-28), "22", Recommendations to UTC on Script Proposals | ||||
L2/20-256 | Perdana, Aditya Bayu; Nurwansah, Ilham (2020-09-30), Preliminary Proposal to encode Kawi | ||||
L2/20-250 | Anderson, Deborah; Whistler, Ken; Pournader, Roozbeh; Moore, Lisa; Constable, Peter; Liang, Hai (2020-10-01), "15. Kawi", Recommendations to UTC #165 October 2020 on Script Proposals | ||||
L2/21-026 | Sinclair, Iain (2020-11-17), Comments on Kawi, L2/20-256 | ||||
L2/20-283 | Lindenberg, Norbert (2020-12-06), Repha representation for Kawi | ||||
L2/20-287 | Syarifuddin, M. Mahali (2020-12-08), Comments on L2/20-284 regarding disunificaton of KAWI VOWEL SIGN AA | ||||
L2/20-284R | Perdana, Aditya Bayu; Nurwansah, Ilham (2020-12-22), Proposal to encode Kawi (revised) | ||||
L2/21-016R | Anderson, Deborah; Whistler, Ken; Pournader, Roozbeh; Moore, Lisa; Liang, Hai (2021-01-14), "18. Kawi", Recommendations to UTC #166 January 2021 on Script Proposals | ||||
L2/21-009 | Moore, Lisa (2021-01-27), "Consensus 166-C30", UTC #166 Minutes | ||||
L2/21-048 | Perdana, Aditya Bayu; Lindenberg, Norbert (2021-02-16), Reply to 'On the Kawi Space Filler and the Names of Punctuation Characters' | ||||
L2/21-073 | Anderson, Deborah; Whistler, Ken; Pournader, Roozbeh; Moore, Lisa; Liang, Hai (2021-04-23), "Kawi", Recommendations to UTC #167 April 2021 on Script Proposals | ||||
L2/21-176 | Syarifuddin, M. Mahali (2021-07-30), Proposal to Unencode KAWI VOWEL SIGN VOCALIC L | ||||
L2/21-180 | Perdana, Aditya Bayu; Nurwansah, Ilham (2021-09-23), Note on Kawi Sign Vocalic L U+11F3C and Possible Cognate of U+1B00 | ||||
L2/21-174 | Anderson, Deborah; Whistler, Ken; Pournader, Roozbeh; Liang, Hai (2021-10-01), "12. Kawi", Recommendations to UTC #169 October 2021 on Script Proposals | ||||
L2/21-167 | Cummings, Craig (2022-01-27), "Consensus 169-C15", Approved Minutes of UTC Meeting 169 | ||||
L2/22-056 | Silva, Eduardo Marín (2022-03-05), "Kawi", Review of the code charts of Unicode 15 Alpha (PRI 442) | ||||
L2/22-093 | Kinaṇṭi, Waṣkiṭa; Sh., Rikza F. (2022-03-21), Review of Kawi code chart of Unicode 15 Alpha (PRI 442) | ||||
L2/22-068 | Anderson, Deborah; Whistler, Ken; Pournader, Roozbeh; Constable, Peter (2022-04-15), "19. Kawi Name Change and Annotation, 20. Kawi Review of Code Chart for 15.0", Recommendations to UTC #171 April 2022 on Script Proposals | ||||
L2/22-080R2 | Lindenberg, Norbert; et al. (2022-08-22), Line breaking at orthographic syllable boundaries | ||||
L2/22-061 | Constable, Peter (2022-07-27), "D.1 Section 18, D.1 Section 19, D.1 Section 20, D.2", Approved Minutes of UTC Meeting 171 | ||||
|
Devanagari is an Indic script used in the northern Indian subcontinent. Also simply called Nāgari, it is a left-to-right abugida, based on the ancient Brāhmi script. It is one of the official scripts of the Republic of India and Nepal. It was developed and in regular use by the 7th century CE and achieved its modern form by 1000 CE. The Devanāgari script, composed of 48 primary characters, including 14 vowels and 34 consonants, is the fourth most widely adopted writing system in the world, being used for over 120 languages.
Unicode, formally The Unicode Standard, is a text encoding standard maintained by the Unicode Consortium designed to support the use of text in all of the world's writing systems that can be digitized. Version 15.1 of the standard defines 149813 characters and 161 scripts used in various ordinary, literary, academic, and technical contexts.
The Brahmic scripts, also known as Indic scripts, are a family of abugida writing systems. They are used throughout the Indian subcontinent, Southeast Asia and parts of East Asia. They are descended from the Brahmi script of ancient India and are used by various languages in several language families in South, East and Southeast Asia: Indo-Aryan, Dravidian, Tibeto-Burman, Mongolic, Austroasiatic, Austronesian, and Tai. They were also the source of the dictionary order (gojūon) of Japanese kana.
Baybayin is a Philippine script. The script is an abugida belonging to the family of the Brahmic scripts. Geographically, it was widely used in Luzon and other parts of the Philippines prior to and during the 16th and 17th centuries before being replaced by the Latin alphabet during the period of Spanish colonization. It was used in the Tagalog language and, to a lesser extent, Kapampangan-speaking areas; its use spread to the Ilocanos in the early 17th century. In the 19th and 20th centuries, baybayin survived and evolved into multiple forms—the Tagbanwa script of Palawan, and the Hanuno'o and Buhid scripts of Mindoro—and was used to create the constructed modern Kulitan script of the Kapampangan and the Ibalnan script of the Palawan people. Under the Unicode Standard and ISO 15924, the script is encoded as the Tagalog block.
The Batak script is a writing system used to write the Austronesian Batak languages spoken by several million people on the Indonesian island of Sumatra. The script may be derived from the Kawi and Pallava script, ultimately derived from the Brahmi script of India, or from the hypothetical Proto-Sumatran script influenced by Pallava.
Surat Buhid is an abugida used to write the Buhid language. As a Brahmic script indigenous to the Philippines, it closely related to Baybayin and Hanunó'o. It is still used today by the Mangyans, found mainly on island of Mindoro, to write their language, Buhid, together with the Filipino latin script.
Tagbanwa is one of the scripts indigenous to the Philippines, used by the Tagbanwa and the Palawan people as their ethnic writing system.
The Kawi or Old Javanese script is a Brahmic script found primarily in Java and used across much of Maritime Southeast Asia between the 8th century and the 16th century. The script is an abugida, meaning that characters are read with an inherent vowel. Diacritics are used, either to suppress the vowel and represent a pure consonant, or to represent other vowels.
The Cham script is a Brahmic abugida used to write Cham, an Austronesian language spoken by some 245,000 Chams in Vietnam and Cambodia. It is written horizontally left to right, just like other Brahmic abugidas.
In Unicode, a Private Use Area (PUA) is a range of code points that, by definition, will not be assigned characters by the Unicode Consortium. Three private use areas are defined: one in the Basic Multilingual Plane, and one each in, and nearly covering, planes 15 and 16. The code points in these areas cannot be considered as standardized characters in Unicode itself. They are intentionally left undefined so that third parties may define their own characters without conflicting with Unicode Consortium assignments. Under the Unicode Stability Policy, the Private Use Areas will remain allocated for that purpose in all future Unicode versions.
In the Unicode standard, a plane is a contiguous group of 65,536 (216) code points. There are 17 planes, identified by the numbers 0 to 16, which corresponds with the possible values 00–1016 of the first two positions in six position hexadecimal format (U+hhhhhh). Plane 0 is the Basic Multilingual Plane (BMP), which contains most commonly used characters. The higher planes 1 through 16 are called "supplementary planes". The last code point in Unicode is the last code point in plane 16, U+10FFFF. As of Unicode version 15.1, five of the planes have assigned code points (characters), and seven are named.
Alphabetic Presentation Forms is a Unicode block containing standard ligatures for the Latin, Armenian, and Hebrew scripts.
The Unicode Standard assigns various properties to each Unicode character and code point.
CJK Symbols and Punctuation is a Unicode block containing symbols and punctuation used for writing the Chinese, Japanese and Korean languages. It also contains one Chinese character.
The Mon–Burmese script is an abugida that derives from the Pallava Grantha script of southern India and later of Southeast Asia. It is the basis of the alphabets used for modern Burmese, Mon, Shan, Rakhine, Jingpho and Karen.
Halfwidth and Fullwidth Forms is the name of a Unicode block U+FF00–FFEF, provided so that older encodings containing both halfwidth and fullwidth characters can have lossless translation to/from Unicode. It is the second-to-last block of the Basic Multilingual Plane, followed only by the short Specials block at U+FFF0–FFFF. Its block name in Unicode 1.0 was Halfwidth and Fullwidth Variants.
Noto is a font family comprising over 100 individual computer fonts, which are together designed to cover all the scripts encoded in the Unicode standard. As of October 2016, Noto fonts cover all 93 scripts defined in Unicode version 6.1, although fewer than 30,000 of the nearly 75,000 CJK unified ideographs in version 6.0 are covered. In total, Noto fonts cover over 77,000 characters, which is around half of the 149,186 characters defined in Unicode 15.0.
The Mon alphabet is a Brahmic abugida used for writing the Mon language. It is an example of the Mon-Burmese script, which derives from the Pallava Grantha script of southern India.
Hanifi Rohingya is a Unicode block containing characters for Hanifi Rohingya script used for writing the Rohingya language in Myanmar and Bangladesh.
Old Sogdian is a Unicode block containing characters for a group of related, non-cursive Sogdian writing systems used to write historic Sogdian in the 3rd to 5th centuries CE.