Makasar (Unicode block)

Last updated
Makasar
RangeU+11EE0..U+11EFF
(32 code points)
Plane SMP
Scripts Makasar
Assigned25 code points
Unused7 reserved code points
Unicode version history
11.0 (2018)25 (+25)
Unicode documentation
Code chart ∣ Web page
Note: [1] [2]

Makasar is a Unicode block containing characters for Makasar script (also known as "Old Makassarese" or "Makassarese bird script" in English-language scholarly works). [3] The script was used historically in South Sulawesi, Indonesia for writing the Makassarese language. [4]

Contents

Block

Makasar [1] [2]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+11EEx𑻠𑻡𑻢𑻣𑻤𑻥𑻦𑻧𑻨𑻩𑻪𑻫𑻬𑻭𑻮𑻯
U+11EFx𑻰𑻱𑻲𑻳𑻴𑻵𑻶𑻷𑻸
Notes
1. ^ As of Unicode version 15.0
2. ^ Grey areas indicate non-assigned code points

History

The following Unicode-related documents record the purpose and process of defining specific characters in the Makasar block:

Version Final code points [lower-alpha 1] Count L2  IDDocument
11.0U+11EE0..11EF825 L2/15-100 Pandey, Anshuman (2015-06-24), Preliminary Proposal to Encode the Makassarese Bird Script
L2/15-179 Pandey, Anshuman (2015-07-18), Proposal to Encode the Old Makassarese Script
L2/15-312 Anderson, Deborah; Whistler, Ken; McGowan, Rick; Pournader, Roozbeh; Glass, Andrew; Iancu, Laurențiu (2015-11-01), "4. Makasar Script", Recommendations to UTC #145 November 2015 on Script Proposals
L2/15-233 Pandey, Anshuman (2015-11-02), Proposal to encode the Makasar script
L2/15-254 Moore, Lisa (2015-11-16), "D.4", UTC #145 Minutes
  1. Proposed code points and characters names may differ from final code points and names

Related Research Articles

<span class="mw-page-title-main">Makassarese language</span> Austronesian language of South Sulawesi, Indonesia

Makassarese, sometimes called Makasar, Makassar, or Macassar, is a language of the Makassarese people, spoken in South Sulawesi province of Indonesia. It is a member of the South Sulawesi group of the Austronesian language family, and thus closely related to, among others, Buginese.

Macassar, Makassar or Makasar may refer to:

The Lontara script, also known as the Bugis script, Bugis-Makassar script, or Urupu Sulapa’ Eppa’ "four-cornered letters", is one of Indonesia's traditional scripts developed in the South Sulawesi and West Sulawesi region. The script is primarily used to write the Buginese language, followed by Makassarese and Mandar. Closely related variants of Lontara are also used to write several languages outside of Sulawesi such as Bima, Ende, and Sumbawa. The script was actively used by several South Sulawesi societies for day-to-day and literary texts from at least mid-15th Century CE until the mid-20th Century CE, before its function was gradually supplanted by the Latin alphabet. Today the script is taught in South Sulawesi Province as part of the local curriculum, but with very limited usage in everyday life.

The Basic Latin Unicode block, sometimes informally called C0 Controls and Basic Latin, is the first block of the Unicode standard, and the only block which is encoded in one byte in UTF-8. The block contains all the letters and control codes of the ASCII encoding. It ranges from U+0000 to U+007F, contains 128 characters and includes the C0 controls, ASCII punctuation and symbols, ASCII digits, both the uppercase and lowercase of the English alphabet and a control character.

Gurmukhi is a Unicode block containing characters for the Punjabi language, in the Gurmukhi script. In its original incarnation, the code points U+0A02..U+0A4C were a direct copy of the Gurmukhi characters A2-EC from the 1988 ISCII standard. The Devanagari, Bengali, Gujarati, Oriya, Tamil, Telugu, Kannada, and Malayalam blocks were similarly all based on their ISCII encodings.

Gujarati is a Unicode block containing characters for writing the Gujarati language. In its original incarnation, the code points U+0A81..U+0AD0 were a direct copy of the Gujarati characters A1-F0 from the 1988 ISCII standard. The Devanagari, Bengali, Gurmukhi, Oriya, Tamil, Telugu, Kannada, and Malayalam blocks were similarly all based on their ISCII encodings.

Oriya is a Unicode block containing characters for the Odia, Khondi and Santali languages of the state of Odisha in India. In its original incarnation, the code points U+0B01..U+0B4D were a direct copy of the Odia characters A1-ED from the 1988 ISCII standard. The Devanagari, Bengali, Gurmukhi, Gujarati, Tamil, Telugu, Kannada, and Malayalam blocks were similarly all based on their ISCII encodings.

Myanmar is a Unicode block containing characters for the Burmese, Mon, Shan, Palaung, and the Karen languages of Myanmar, as well as the Aiton and Phake languages of Northeast India. It is also used to write Pali and Sanskrit in Myanmar.

Cherokee is a Unicode block containing the syllabic characters for writing the Cherokee language. When Cherokee was first added to Unicode in version 3.0 it was treated as a unicameral alphabet, but in version 8.0 it was redefined as a bicameral script. The Cherokee block contains all the uppercase letters plus six lowercase letters. The Cherokee Supplement block, added in version 8.0, contains the rest of the lowercase letters. For backwards compatibility, the Unicode case folding algorithm—which usually converts a string to lowercase characters—maps Cherokee characters to uppercase.

Balinese is a Unicode block containing characters of Balinese script for the Balinese language. Balinese language is mainly spoken on the island of Bali, Indonesia.

Sundanese is a Unicode block containing modern characters for writing the Sundanese script of the Sundanese language of the island of Java, Indonesia.

Modi is a Unicode block containing the Modi alphabet characters for writing the Marathi language.

Ahom is a Unicode block containing characters used for writing the Ahom alphabet, which was used to write the Ahom language spoken by the Ahom people in Assam between the 13th and the 18th centuries.

Cherokee Supplement is a Unicode block containing the syllabic characters for writing the Cherokee language. When Cherokee was first added to Unicode in version 3.0 it was treated as a unicameral alphabet, but in version 8.0 it was redefined as a bicameral script. The Cherokee Supplement block contains lowercase letters only, whereas the Cherokee block contains all the uppercase letters, together with six lowercase letters. For backwards compatibility, the Unicode case folding algorithm—which usually converts a string to lowercase characters—maps Cherokee characters to uppercase.

<span class="mw-page-title-main">Dogra (Unicode block)</span> Unicode character block

Dogra is a Unicode block for the Dogri script, for writing the Dogri language in Jammu and Kashmir in the northern part of the Indian subcontinent. The Takri script version of Jammu is known as Dogra Akkhar.

Gunjala Gondi is a Unicode block containing characters of Gunjala Gondi script used for writing the Adilabad dialect of the Gondi language.

Hanifi Rohingya is a Unicode block containing characters for Hanifi Rohingya script used for writing the Rohingya language in Myanmar and Bangladesh.

Sogdian is a Unicode block containing characters used to write the Sogdian language from the 7th to 14th centuries CE.

The Makasar script, also known as Ukiri' Jangang-jangang or Old Makasar script, is a historical Indonesian Writing system that was used in South Sulawesi to write the Makassarese language between the 17th and 19th centuries until it was supplanted by the Lontara Bugis script.

Kawi is a Unicode block containing characters for Kawi script. The script was used historically in insular Southeast Asia to write the Old Javanese, Sanskrit, Old Malay, Old Balinese, and Old Sundanese languages.

References

  1. "Unicode character database". The Unicode Standard. Retrieved 2023-07-26.
  2. "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2023-07-26.
  3. Pandey, Anshuman (2015-11-02). "L2/15-233: Proposal to encode the Makasar script in Unicode" (PDF).
  4. "Chapter 17: Indonesia and Oceania". The Unicode Standard, Version 11.0 (PDF). Mountain View, CA: Unicode, Inc. June 2018. ISBN   978-1-936213-19-1.