Tagalog | |
---|---|
Range | U+1700..U+171F (32 code points) |
Plane | BMP |
Scripts | Tagalog |
Major alphabets | Baybayin |
Assigned | 23 code points |
Unused | 9 reserved code points |
Unicode version history | |
3.2 (2002) | 20 (+20) |
14.0 (2021) | 23 (+3) |
Unicode documentation | |
Code chart ∣ Web page | |
Note: [1] [2] |
Tagalog is a Unicode block containing characters of the Baybayin script, specifically the variety used for writing the Tagalog language before and during Spanish colonization of the Philippines eventually led to the adoption of the Latin alphabet. It has been a part of the Unicode Standard since version 3.2 in April 2002. Tagalog characters can be found in the Noto Sans Tagalog font, among others. The Tagalog Baybayin script was originally proposed for inclusion in Unicode alongside its descendant Hanunoo, Buhid and Tagbanwa scripts as a single block called "Philippine Scripts" and two punctuation marks are only part of the Hanunoo block. In 2021, with version 14.0, the Unicode Standard was updated to add three new characters: the "ra" and archaic "ra", and the pamudpod.
Tagalog [1] [2] Official Unicode Consortium code chart (PDF) | ||||||||||||||||
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
U+170x | ||||||||||||||||
U+171x | ||||||||||||||||
Notes |
The following Unicode-related documents record the purpose and process of defining specific characters in the Tagalog block:
Version | Final code points [lower-alpha 1] | Count | L2 ID | WG2 ID | Document |
---|---|---|---|---|---|
3.2 | U+1700..170C, 170E..1714 | 20 | L2/98-217 | N1755 (pdf, Attach) | Everson, Michael (1998-05-25), Proposal for encoding the Philippine scripts in the BMP of ISO/IEC 10646 |
L2/98-397 | Everson, Michael (1998-11-23), Revised proposal for encoding the Philippine scripts in the UCS | ||||
L2/99-014 | N1933 | Everson, Michael (1998-11-23), Revised proposal for encoding the Philippine scripts in the UCS | |||
L2/98-419 (pdf, doc) | Aliprand, Joan (1999-02-05), "Philippine Scripts", Approved Minutes -- UTC #78 & NCITS Subgroup L2 # 175 Joint Meeting, San Jose, CA -- December 1-4, 1998, [#78-M8] Motion:To accept document L2/98-397, Revised proposal for encoding Philippine scripts, for addition to the Unicode Standard after Version 3.0. | ||||
L2/99-232 | N2003 | Umamaheswaran, V. S. (1999-08-03), "9.4.1", Minutes of WG 2 meeting 36, Fukuoka, Japan, 1999-03-09--15 | |||
L2/00-097 | N2194 | Sato, T. K. (2000-02-22), Philippino characters (status report) | |||
L2/00-357 | Everson, Michael (2000-10-16), Philippine Scripts (draft block description) | ||||
L2/01-050 | N2253 | Umamaheswaran, V. S. (2001-01-21), "7.14 Philippine scripts", Minutes of the SC2/WG2 meeting in Athens, September 2000 | |||
14.0 | U+170D, 171F | 2 | L2/19-258R | Brennan, Fredrick R. (2019-07-18), The baybayin "ra", its origins and a plea for its formal recognition | |
L2/19-286 | Anderson, Deborah; Whistler, Ken; Pournader, Roozbeh; Moore, Lisa; Liang, Hai (2019-07-22), "12. Tagalog", Recommendations to UTC #160 July 2019 on Script Proposals | ||||
L2/19-270 | Moore, Lisa (2019-10-07), "Consensus 160-C24", UTC #160 Minutes | ||||
U+1715 | 1 | L2/20-257 | Brennan, Fredrick R. (2020-09-23), "18 Tagalog and Hanunoo", Please reclassify the Philippine pamudpod | ||
L2/20-250 | Anderson, Deborah; Whistler, Ken; Pournader, Roozbeh; Moore, Lisa; Constable, Peter; Liang, Hai (2020-10-01), "14. Hanunoo / Tagalog", Recommendations to UTC #165 October 2020 on Script Proposals | ||||
L2/20-272 | Brennan, Fredrick R. (2020-10-03), Amended proposal to encode the Tagalog pamudpod | ||||
L2/20-237 | Moore, Lisa (2020-10-27), "Consensus 165-C18", UTC #165 Minutes | ||||
L2/21-117 | Pournader, Roozbeh (2021-05-20), Pamudpod properties (Tagalog and Hanunoo) | ||||
L2/21-130 | Anderson, Deborah; Whistler, Ken; Pournader, Roozbeh; Liang, Hai (2021-07-26), "18 Tagalog and Hanunoo", Recommendations to UTC #168 July 2021 on Script Proposals | ||||
L2/21-123 | Cummings, Craig (2021-08-03), "Consensus 168-C29", Draft Minutes of UTC Meeting 168 | ||||
|
Baybayin is a Philippine script. The script is an abugida belonging to the family of the Brahmic scripts. Geographically, it was widely used in Luzon and other parts of the Philippines prior to and during the 16th and 17th centuries before being replaced by the Latin alphabet during the period of Spanish colonization. It was used in the Tagalog language and, to a lesser extent, Kapampangan-speaking areas; its use spread to the Ilocanos in the early 17th century. In the 19th and 20th centuries, baybayin survived and evolved into multiple forms—the Tagbanwa script of Palawan, and the Hanuno'o and Buhid scripts of Mindoro—and was used to create the constructed modern Kulitan script of the Kapampangan and the Ibalnan script of the Palawan people. Under the Unicode Standard and ISO 15924, the script is encoded as the Tagalog block.
Michael Everson is an American and Irish linguist, script encoder, typesetter, type designer and publisher. He runs a publishing company called Evertype, through which he has published over one hundred books since 2006.
Surat Buhid is an abugida used to write the Buhid language. As a Brahmic script indigenous to the Philippines, it closely related to Baybayin and Hanunó'o. It is still used today by the Mangyans, found mainly on island of Mindoro, to write their language, Buhid, together with the Filipino latin script.
Tagbanwa is one of the scripts indigenous to the Philippines, used by the Tagbanwa and the Palawan people as their ethnic writing system.
The Kawi, Indonesian: aksara kawi, aksara carakan kuna) or Old Javanese script is a Brahmic script found primarily in Java and used across much of Maritime Southeast Asia between the 8th century and the 16th century. The script is an abugida, meaning that characters are read with an inherent vowel. Diacritics are used, either to suppress the vowel and represent a pure consonant, or to represent other vowels.
In Unicode, a Private Use Area (PUA) is a range of code points that, by definition, will not be assigned characters by the Unicode Consortium. Three private use areas are defined: one in the Basic Multilingual Plane, and one each in, and nearly covering, planes 15 and 16. The code points in these areas cannot be considered as standardized characters in Unicode itself. They are intentionally left undefined so that third parties may define their own characters without conflicting with Unicode Consortium assignments. Under the Unicode Stability Policy, the Private Use Areas will remain allocated for that purpose in all future Unicode versions.
Over a thousand characters from the Latin script are encoded in the Unicode Standard, grouped in several basic and extended Latin blocks. The extended ranges contain mainly precomposed letters plus diacritics that are equivalently encoded with combining diacritics, as well as some ligatures and distinct letters, used for example in the orthographies of various African languages and the Vietnamese alphabet. Latin Extended-C contains additions for Uighur and the Claudian letters. Latin Extended-D comprises characters that are mostly of interest to medievalists. Latin Extended-E mostly comprises characters used for German dialectology (Teuthonista). Latin Extended-F and -G contain characters for phonetic transcription.
The Basic Latin Unicode block, sometimes informally called C0 Controls and Basic Latin, is the first block of the Unicode standard, and the only block which is encoded in one byte in UTF-8. The block contains all the letters and control codes of the ASCII encoding. It ranges from U+0000 to U+007F, contains 128 characters and includes the C0 controls, ASCII punctuation and symbols, ASCII digits, both the uppercase and lowercase of the English alphabet and a control character.
Tagalog may refer to:
Hanunoo, also rendered Hanunó'o, is one of the scripts indigenous to the Philippines and is used by the Mangyan peoples of southern Mindoro to write the Hanunó'o language.
Kulitan, also known as súlat Kapampángan and pamagkulit, is one of the various indigenous suyat writing systems in the Philippines. It was used for writing Kapampangan, a language mainly spoken in Central Luzon, until it was gradually replaced by the Latin alphabet.
Cherokee is a Unicode block containing the syllabic characters for writing the Cherokee language. When Cherokee was first added to Unicode in version 3.0 it was treated as a unicameral alphabet, but in version 8.0 it was redefined as a bicameral script. The Cherokee block contains all the uppercase letters plus six lowercase letters. The Cherokee Supplement block, added in version 8.0, contains the rest of the lowercase letters. For backwards compatibility, the Unicode case folding algorithm—which usually converts a string to lowercase characters—maps Cherokee characters to uppercase.
Hanunoo is a Unicode block containing characters used for writing the Hanunó'o language. It also contains the two punctuation marks which are unified characters for all the Philippine scripts.
The Takri block U+11680–U+116CF was added to the Unicode Standard in January 2012 with the release of version 6.1.
Cherokee Supplement is a Unicode block containing the syllabic characters for writing the Cherokee language. When Cherokee was first added to Unicode in version 3.0 it was treated as a unicameral alphabet, but in version 8.0 it was redefined as a bicameral script. The Cherokee Supplement block contains lowercase letters only, whereas the Cherokee block contains all the uppercase letters, together with six lowercase letters. For backwards compatibility, the Unicode case folding algorithm—which usually converts a string to lowercase characters—maps Cherokee characters to uppercase.
Tagalog alphabet may refer to:
Adlam is a Unicode block containing characters from the Adlam script, an alphabetic script devised during the late 1980s for writing the Fula language in Guinea, Nigeria, Liberia, and other nearby countries.
Suyat is the modern collective name of the indigenous scripts of various ethnolinguistic groups in the Philippines prior to Spanish colonization in the 16th century up to the independence era in the 21st century. The scripts are highly varied; nonetheless, the term was suggested and used by cultural organizations in the Philippines to denote a unified neutral terminology for Philippine indigenous scripts.
Dogra is a Unicode block for the Dogri script, for writing the Dogri language in Jammu and Kashmir in the northern part of the Indian subcontinent. The Takri script version of Jammu is known as Dogra Akkhar.