Tirhuta (Unicode block)

Last updated
Tirhuta
RangeU+11480..U+114DF
(96 code points)
Plane SMP
Scripts Tirhuta
Major alphabets Maithili
Assigned82 code points
Unused14 reserved code points
Unicode version history
7.0 (2014)82 (+82)
Unicode documentation
Code chart ∣ Web page
Note: [1] [2]

Tirhuta is a Unicode block containing characters for Brahmi-derived Tirhuta script which was the primary writing system for Maithili in Bihar, India and Madhesh, Nepal until the 20th century. [3]

Contents

Block

Tirhuta [1] [2]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+1148x𑒀𑒁𑒂𑒃𑒄𑒅𑒆𑒇𑒈𑒉𑒊𑒋𑒌𑒍𑒎𑒏
U+1149x𑒐𑒑𑒒𑒓𑒔𑒕𑒖𑒗𑒘𑒙𑒚𑒛𑒜𑒝𑒞𑒟
U+114Ax𑒠𑒡𑒢𑒣𑒤𑒥𑒦𑒧𑒨𑒩𑒪𑒫𑒬𑒭𑒮𑒯
U+114Bx𑒰𑒱𑒲𑒳𑒴𑒵𑒶𑒷𑒸𑒻𑒻𑒼𑒽𑒾𑒿
U+114Cx𑓀𑓁𑓃𑓂𑓄𑓅𑓆𑓇
U+114Dx𑓐𑓑𑓒𑓓𑓔𑓕𑓖𑓗𑓘𑓙
Notes
1. ^ As of Unicode version 15.1
2. ^ Grey areas indicate non-assigned code points

History

The following Unicode-related documents record the purpose and process of defining specific characters in the Tirhuta block:

Related Research Articles

<span class="mw-page-title-main">Tirhuta script</span> Script of Maithili language

The Tirhuta or Maithili script was the primary historical script for the Maithili language, as well as one of the historical scripts for Sanskrit. It is believed to have originated in the 10th century CE. It is very similar to Bengali–Assamese script, with most consonants being effectively identical in appearance. For the most part, writing in Maithili has switched to the Devanagari script, which is used to write neighbouring Central Indic languages to the west and north such as Hindi and Nepali, and the number of people with a working knowledge of Tirhuta has dropped considerably in recent years.

The Basic Latin Unicode block, sometimes informally called C0 Controls and Basic Latin, is the first block of the Unicode standard, and the only block which is encoded in one byte in UTF-8. The block contains all the letters and control codes of the ASCII encoding. It ranges from U+0000 to U+007F, contains 128 characters and includes the C0 controls, ASCII punctuation and symbols, ASCII digits, both the uppercase and lowercase of the English alphabet and a control character.

Phaistos Disc is a Unicode block containing the characters found on the undeciphered Phaistos Disc artefact.

Meroitic Hieroglyphs is a Unicode block formal hieroglyphic containing characters for writing the Meroitic language.

Meroitic Cursive is a Unicode block containing demotic-style characters for writing the Meroitic language.

Caucasian Albanian is a Unicode block containing characters used by the Caucasian Albanian peoples of Azerbaijan and Dagestan for writing Northeast Caucasian languages.

Khudawadi is a Unicode block containing characters of the Khudabadi script used by some Sindhis in India for writing the Sindhi language.

Linear A is a Unicode block containing the characters of the ancient, undeciphered Linear A.

Mahajani is a Unicode block containing characters historically used for writing Punjabi and Marwari.

Manichaean is a Unicode block containing characters historically used for writing Sogdian, Parthian, and the dialects of Fars.

Modi is a Unicode block containing the Modi alphabet characters for writing the Marathi language.

Nabataean is a Unicode block containing characters for writing the ancient Nabataean language.

Old Permic is a Unicode block containing Old Permic characters for writing the Komi language.

Pahawh Hmong is a Unicode block containing characters for writing Hmong languages.

Psalter Pahlavi is a Unicode block containing characters for writing Middle Persian. The script derives its name from the "Pahlavi Psalter", a 6th- or 7th-century translation of a Syriac book of psalms.

Siddham is a Unicode block containing characters for the historical, Brahmi-derived Siddham script used for writing Sanskrit between the years c. 550 – c. 1200.

Multani is a Unicode block containing characters used for writing the Multani alphabet, a Brahmic script used in the Multan region of Punjab and in northern Sindh in Pakistan. The script is now obsolete, but was historically used to write the Saraiki language.

Newa is a Unicode block containing characters from the Newa alphabet, which is used to write Nepal Bhasa.

Osage is a Unicode block containing characters from the Osage alphabet, which was devised in 2006 for writing the Osage language spoken by the Osage people of Oklahoma, United States.

Elymaic is a Unicode block containing characters for the Elymaic alphabet, used in the ancient state of Elymais.

References

  1. "Unicode character database". The Unicode Standard. Retrieved 2023-07-26.
  2. "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2023-07-26.
  3. Pandey, Anshuman (2011-05-05). "N4035: Proposal to Encode the Tirhuta Script in ISO/IEC 10646" (PDF). Working Group Document, ISO/IEC JTC1/SC2/WG2.