Wancho (Unicode block)

Last updated
Wancho
RangeU+1E2C0..U+1E2FF
(64 code points)
Plane SMP
Scripts Wancho
Assigned59 code points
Unused5 reserved code points
Unicode version history
12.0 (2019)59 (+59)
Unicode documentation
Code chart ∣ Web page
Note: [1] [2]

Wancho is a Unicode block containing the characters of the script used to write the Wancho language. [3]

Wancho [1] [2]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+1E2Cx𞋀𞋁𞋂𞋃𞋄𞋅𞋆𞋇𞋈𞋉𞋊𞋋𞋌𞋍𞋎𞋏
U+1E2Dx𞋐𞋑𞋒𞋓𞋔𞋕𞋖𞋗𞋘𞋙𞋚𞋛𞋜𞋝𞋞𞋟
U+1E2Ex𞋠𞋡𞋢𞋣𞋤𞋥𞋦𞋧𞋨𞋩𞋪𞋫𞋬𞋭𞋮𞋯
U+1E2Fx𞋰𞋱𞋲𞋳𞋴𞋵𞋶𞋷𞋸𞋹𞋿
Notes
1. ^ As of Unicode version 15.0
2. ^ Grey areas indicate non-assigned code points

History

The following Unicode-related documents record the purpose and process of defining specific characters in the Wancho block:

Version Final code points [lower-alpha 1] Count L2  ID WG2  IDDocument
12.0U+1E2C0..1E2F9, 1E2FF59 L2/17-042 N4785 Everson, Michael (2017-01-23), Preliminary proposal to encode the Wancho script
L2/17-153 Anderson, Deborah (2017-05-17), "9. Wancho", Recommendations to UTC #151 May 2017 on Script Proposals
L2/17-255 Anderson, Deborah; Whistler, Ken; Pournader, Roozbeh; Moore, Lisa; Liang, Hai (2017-07-28), "13. Wancho", Recommendations to UTC #152 July-August 2017 on Script Proposals
L2/17-222 Moore, Lisa (2017-08-11), "C.3", UTC #152 Minutes
N4953 (pdf, doc)"9.2.2", Unconfirmed minutes of WG 2 meeting 66, 2018-03-23
L2/17-067R2 N4787R2 Everson, Michael (2017-10-22), Proposal to encode the Wancho script
L2/17-362 Moore, Lisa (2018-02-02), "C.18.2 Corrections to two Wancho character names", UTC #153 Minutes
N4976R5 Evidence for Four Wancho Diacritics, 2018-06-16
L2/18-264 Anderson, Deborah (2018-08-06), Error in three Wancho character names
L2/18-300 Anderson, Deborah; et al. (2018-09-14), "10. Wancho", Recommendations to UTC #157 on Script Proposals
L2/18-272 Moore, Lisa (2018-10-29), "D.4", UTC #157 Minutes
L2/18-183 Moore, Lisa (2018-11-20), "Consensus 156-C11", UTC #156 Minutes
N5020 (pdf, doc)Umamaheswaran, V. S. (2019-01-11), "7.4.1 T4", Unconfirmed minutes of WG 2 meeting 67
L2/20-121 Scheuren, Zachary; Losu, Banwang (2020-02-22), Proposal to change the code chart font for Wancho
L2/20-105 Anderson, Deborah; Whistler, Ken; Pournader, Roozbeh; Moore, Lisa; Constable, Peter; Liang, Hai (2020-04-20), "10. Wancho", Recommendations to UTC #163 April 2020 on Script Proposals
L2/20-102 Moore, Lisa (2020-05-06), "Action Item 163-A55", UTC #163 Minutes, Prepare an erratum for the Wancho font
  1. Proposed code points and characters names may differ from final code points and names

Related Research Articles

Letterlike Symbols is a Unicode block containing 80 characters which are constructed mainly from the glyphs of one or more letters. In addition to this block, Unicode includes full styled mathematical alphabets, although Unicode does not explicitly categorise these characters as being "letterlike".

Number Forms is a Unicode block containing Unicode compatibility characters that have specific meaning as numbers, but are constructed from other characters. They consist primarily of vulgar fractions and Roman numerals. In addition to the characters in the Number Forms block, three fractions were inherited from ISO-8859-1, which was incorporated whole as the Latin-1 Supplement block.

Phonetic Extensions is a Unicode block containing phonetic characters used in the Uralic Phonetic Alphabet, Old Irish phonetic notation, the Oxford English dictionary and American dictionaries, and Americanist and Russianist phonetic notations. Its character set is continued in the following Unicode block, Phonetic Extensions Supplement.

The Basic Latin Unicode block, sometimes informally called C0 Controls and Basic Latin, is the first block of the Unicode standard, and the only block which is encoded in one byte in UTF-8. The block contains all the letters and control codes of the ASCII encoding. It ranges from U+0000 to U+007F, contains 128 characters and includes the C0 controls, ASCII punctuation and symbols, ASCII digits, both the uppercase and lowercase of the English alphabet and a control character.

Arabic is a Unicode block, containing the standard letters and the most common diacritics of the Arabic script, and the Arabic-Indic digits.

<span class="mw-page-title-main">Wancho language</span> Language

Wancho (वांचो‎) is a Konyak language of north-eastern India. Wancho is spoken in 36 villages of southeastern Longding district, Tirap district, Arunachal Pradesh, as well as in Assam and Nagaland (Ethnologue). Alternate names include Banpara Naga, Joboka, Jokoba.

Mandaic is a Unicode block containing characters of the Mandaic script used for writing the historic Eastern Aramaic, also called Classical Mandaic, and the modern Neo-Mandaic language.

Georgian is a Unicode block containing the Mkhedruli and Asomtavruli Georgian characters used to write Modern Georgian, Svan, and Mingrelian languages. Another lower case, Nuskhuri, is encoded in a separate Georgian Supplement block, which is used with the Asomtavruli to write the ecclesiastical Khutsuri Georgian script.

Georgian Supplement is a Unicode block containing characters for the ecclesiastical form of the Georgian script, Nuskhuri. To write the full ecclesiastical Khutsuri orthography, the Asomtavruli capitals encoded in the Georgian block.

Sinhala is a Unicode block containing characters for the Sinhala and Pali languages of Sri Lanka, and is also used for writing Sanskrit in Sri Lanka. The Sinhala allocation is loosely based on the ISCII standard, except that Sinhala contains extra prenasalized consonant letters, leading to inconsistencies with other ISCII-Unicode script allocations.

Cherokee is a Unicode block containing the syllabic characters for writing the Cherokee language. When Cherokee was first added to Unicode in version 3.0 it was treated as a unicameral alphabet, but in version 8.0 it was redefined as a bicameral script. The Cherokee block contains all the uppercase letters plus six lowercase letters. The Cherokee Supplement block, added in version 8.0, contains the rest of the lowercase letters. For backwards compatibility, the Unicode case folding algorithm—which usually converts a string to lowercase characters—maps Cherokee characters to uppercase.

Vedic Extensions is a Unicode block containing characters for representing tones and other vedic symbols in Devanagari and other Indic scripts. Related symbols are defined in two other blocks: Devanagari (U+0900–U+097F) and Devanagari Extended (U+A8E0–U+A8FF).

Tagalog is a Unicode block containing characters of the Baybayin script, specifically the variety used for writing the Tagalog language before Spanish colonization of the Philippines eventually led to the adoption of the Latin alphabet. It has been a part of the Unicode Standard since version 3.2 in April 2002. Tagalog characters can be found in the Noto Sans Tagalog font, among others. The Tagalog Baybayin script was originally proposed for inclusion in Unicode alongside its descendant Hanunoo, Buhid and Tagbanwa scripts as a single block called "Philippine Scripts" and two punctuation marks are only part of the Hanunoo block. In 2021, with version 14.0, the Unicode Standard was updated to add three new characters: the "ra" and archaic "ra", and the pamudpod.

Javanese is a Unicode block containing aksara Jawa characters traditionally used for writing the Javanese language.

Cham is a Unicode block containing characters of the Cham script, which is used for writing the Cham language, primarily used for the Eastern dialect in Cambodia and Vietnam.

Manichaean is a Unicode block containing characters historically used for writing Sogdian, Parthian, and the dialects of Fars.

Cherokee Supplement is a Unicode block containing the syllabic characters for writing the Cherokee language. When Cherokee was first added to Unicode in version 3.0 it was treated as a unicameral alphabet, but in version 8.0 it was redefined as a bicameral script. The Cherokee Supplement block contains lowercase letters only, whereas the Cherokee block contains all the uppercase letters, together with six lowercase letters. For backwards compatibility, the Unicode case folding algorithm—which usually converts a string to lowercase characters—maps Cherokee characters to uppercase.

Ideographic Symbols and Punctuation is a Unicode block containing symbols and punctuation marks used by ideographic scripts such as Tangut and Nüshu.

Wancho script is an alphabet created between 2001 and 2012 by middle school teacher Banwang Losu in Longding district, Arunachal Pradesh for writing the Wancho language. Letters represent consonants and vowels. Conjunct consonants are not used. Tone is indicated with diacritical marks on vowel letters.

Khitan Small Script is a Unicode block containing characters from the Khitan small script, which was used for writing the Khitan language spoken by the Khitan people in northern China during the Liao dynasty.

References

  1. "Unicode character database". The Unicode Standard. Retrieved 2023-07-26.
  2. "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2023-07-26.
  3. Everson, Michael (2017-07-26). "L2/17-067R: Proposal to encode the Wancho script in the UCS" (PDF).