Makasar (Unicode block)

Makasar
Makasar
Range	U+11EE0..U+11EFF; (32 code points)
Plane	SMP
Scripts	Makasar
Assigned	25 code points
Unused	7 reserved code points
Unicode version history
11.0 (2018)	25 (+25)
Unicode documentation
	Code chart ∣ Web page
	Note:

Last updated July 27, 2024

Makasar is a Unicode block containing characters for Makasar script (also known as "Old Makassarese" or "Makassarese bird script" in English-language scholarly works).^[3] The script was used historically in South Sulawesi, Indonesia for writing the Makassarese language.^[4]

Block

Makasar ^[1]^[2] Official Unicode Consortium code chart (PDF)
	0	1	2	3	4	5	6	7	8	9	A	B	C	D	E	F
U+11EEx	𑻠	𑻡	𑻢	𑻣	𑻤	𑻥	𑻦	𑻧	𑻨	𑻩	𑻪	𑻫	𑻬	𑻭	𑻮	𑻯
U+11EFx	𑻰	𑻱	𑻲	𑻳	𑻴	𑻵	𑻶	𑻷	𑻸
Notes 1. ^ As of Unicode version 15.1 2. ^ Grey areas indicate non-assigned code points

History

The following Unicode-related documents record the purpose and process of defining specific characters in the Makasar block:

Version	Final code points^{[lower-alpha 1]}	Count	L2 ID	Document
11.0	U+11EE0..11EF8	25	L2/15-100	Pandey, Anshuman (2015-06-24), Preliminary Proposal to Encode the Makassarese Bird Script
			L2/15-179	Pandey, Anshuman (2015-07-18), Proposal to Encode the Old Makassarese Script
			L2/15-312	Anderson, Deborah; Whistler, Ken; McGowan, Rick; Pournader, Roozbeh; Glass, Andrew; Iancu, Laurențiu (2015-11-01), "4. Makasar Script", Recommendations to UTC #145 November 2015 on Script Proposals
			L2/15-233	Pandey, Anshuman (2015-11-02), Proposal to encode the Makasar script
			L2/15-254	Moore, Lisa (2015-11-16), "D.4", UTC #145 Minutes
↑ Proposed code points and characters names may differ from final code points and names

Related Research Articles

Makassarese, sometimes called Makasar, Makassar, or Macassar, is a language of the Makassarese people, spoken in South Sulawesi province of Indonesia. It is a member of the South Sulawesi group of the Austronesian language family, and thus closely related to, among others, Buginese, also known as Bugis. The areas where Makassarese is spoken include the Gowa, Sinjai, Maros, Takalar, Jeneponto, Bantaeng, Pangkajene and Islands, Bulukumba, and Selayar Islands Regencies, and Makassar. Within the Austronesian language family, Makassarese is part of the South Sulawesi language group, although its vocabulary is considered divergent compared to its closest relatives. In 2000, Makassarese had approximately 2.1 million native speakers.

Macassar, Makassar or Makasar may refer to:

In Unicode, a Private Use Area (PUA) is a range of code points that, by definition, will not be assigned characters by the Unicode Consortium. Three private use areas are defined: one in the Basic Multilingual Plane, and one each in, and nearly covering, planes 15 and 16. The code points in these areas cannot be considered as standardized characters in Unicode itself. They are intentionally left undefined so that third parties may define their own characters without conflicting with Unicode Consortium assignments. Under the Unicode Stability Policy, the Private Use Areas will remain allocated for that purpose in all future Unicode versions.

The Lontara script, also known as the Bugis script, Bugis-Makassar script, or Urupu Sulapa’ Eppa’ "four-cornered letters", is one of Indonesia's traditional scripts developed in the South Sulawesi and West Sulawesi region. The script is primarily used to write the Buginese language, followed by Makassarese and Mandar. Closely related variants of Lontara are also used to write several languages outside of Sulawesi such as Bima, Ende, and Sumbawa. The script was actively used by several South Sulawesi societies for day-to-day and literary texts from at least mid-15th Century CE until the mid-20th Century CE, before its function was gradually supplanted by the Latin alphabet. Today the script is taught in South Sulawesi Province as part of the local curriculum, but with very limited usage in everyday life.

The Basic Latin Unicode block, sometimes informally called C0 Controls and Basic Latin, is the first block of the Unicode standard, and the only block which is encoded in one byte in UTF-8. The block contains all the letters and control codes of the ASCII encoding. It ranges from U+0000 to U+007F, contains 128 characters and includes the C0 controls, ASCII punctuation and symbols, ASCII digits, both the uppercase and lowercase of the English alphabet and a control character.

The ISO basic Latin alphabet is an international standard for a Latin-script alphabet that consists of two sets of 26 letters, codified in various national and international standards and used widely in international communication. They are the same letters that comprise the current English alphabet. Since medieval times, they are also the same letters of the modern Latin alphabet. The order is also important for sorting words into alphabetical order.

The Unicode Standard assigns various properties to each Unicode character and code point.

Gurmukhi is a Unicode block containing characters for the Punjabi language, in the Gurmukhi script. In its original incarnation, the code points U+0A02..U+0A4C were a direct copy of the Gurmukhi characters A2-EC from the 1988 ISCII standard. The Devanagari, Bengali, Gujarati, Oriya, Tamil, Telugu, Kannada, and Malayalam blocks were similarly all based on their ISCII encodings.

Gujarati is a Unicode block containing characters for writing the Gujarati language. In its original incarnation, the code points U+0A81..U+0AD0 were a direct copy of the Gujarati characters A1-F0 from the 1988 ISCII standard. The Devanagari, Bengali, Gurmukhi, Oriya, Tamil, Telugu, Kannada, and Malayalam blocks were similarly all based on their ISCII encodings.

Oriya is a Unicode block containing characters for the Odia, Khondi and Santali languages of the state of Odisha in India. In its original incarnation, the code points U+0B01..U+0B4D were a direct copy of the Odia characters A1-ED from the 1988 ISCII standard. The Devanagari, Bengali, Gurmukhi, Gujarati, Tamil, Telugu, Kannada, and Malayalam blocks were similarly all based on their ISCII encodings.

Cherokee is a Unicode block containing the syllabic characters for writing the Cherokee language. When Cherokee was first added to Unicode in version 3.0 it was treated as a unicameral alphabet, but in version 8.0 it was redefined as a bicameral script. The Cherokee block contains all the uppercase letters plus six lowercase letters. The Cherokee Supplement block, added in version 8.0, contains the rest of the lowercase letters. For backwards compatibility, the Unicode case folding algorithm—which usually converts a string to lowercase characters—maps Cherokee characters to uppercase.

Balinese is a Unicode block containing characters of Balinese script for the Balinese language. Balinese language is mainly spoken on the island of Bali, Indonesia.

Sundanese is a Unicode block containing modern characters for writing the Sundanese script of the Sundanese language of the island of Java, Indonesia.

Cherokee Supplement is a Unicode block containing the syllabic characters for writing the Cherokee language. When Cherokee was first added to Unicode in version 3.0 it was treated as a unicameral alphabet, but in version 8.0 it was redefined as a bicameral script. The Cherokee Supplement block contains lowercase letters only, whereas the Cherokee block contains all the uppercase letters, together with six lowercase letters. For backwards compatibility, the Unicode case folding algorithm—which usually converts a string to lowercase characters—maps Cherokee characters to uppercase.

<span class="mw-page-title-main">Dogra (Unicode block)</span> Unicode character block

Dogra is a Unicode block for the Dogri script, for writing the Dogri language in Jammu and Kashmir in the northern part of the Indian subcontinent. The Takri script version of Jammu is known as Dogra Akkhar.

Gunjala Gondi is a Unicode block containing characters of Gunjala Gondi script used for writing the Adilabad dialect of the Gondi language.

Hanifi Rohingya is a Unicode block containing characters for Hanifi Rohingya script used for writing the Rohingya language in Myanmar and Bangladesh.

Sogdian is a Unicode block containing characters used to write the Sogdian language from the 7th to 14th centuries CE.

The Makasar script, also known as Ukiri' Jangang-jangang or Old Makasar script, is a historical Indonesian writing system that was used in South Sulawesi to write the Makassarese language between the 17th and 19th centuries until it was supplanted by the Lontara Bugis script.

Kawi is a Unicode block containing characters for Kawi script. The script was used historically in insular Southeast Asia to write the Old Javanese, Sanskrit, Old Malay, Old Balinese, and Old Sundanese languages.

References

↑ "Unicode character database". The Unicode Standard. Retrieved 2023-07-26.
↑ "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2023-07-26.
↑ Pandey, Anshuman (2015-11-02). "L2/15-233: Proposal to encode the Makasar script in Unicode" (PDF).
↑ "Chapter 17: Indonesia and Oceania". The Unicode Standard, Version 11.0 (PDF). Mountain View, CA: Unicode, Inc. June 2018. ISBN 978-1-936213-19-1.

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.

[final-5] Proposed code points and characters names may differ from final code points and names

[1] "Unicode character database". The Unicode Standard. Retrieved 2023-07-26.

[2] "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2023-07-26.

[3] Pandey, Anshuman (2015-11-02). "L2/15-233: Proposal to encode the Makasar script in Unicode" (PDF).

[4] "Chapter 17: Indonesia and Oceania". The Unicode Standard, Version 11.0 (PDF). Mountain View, CA: Unicode, Inc. June 2018. ISBN 978-1-936213-19-1.

[1]

[2]

[3]

[4]

[lower-alpha 1]

Makasar
Range	U+11EE0..U+11EFF (32 code points)
Plane	SMP
Scripts	Makasar
Assigned	25 code points
Unused	7 reserved code points
Unicode version history

11.0 (2018)	25 (+25)

Unicode documentation
Code chart ∣ Web page
Note: ^[1]^[2]

Makasar (Unicode block)

Contents

Block

History

Related Research Articles

References