Sinhala Archaic Numbers

Range: U+111E0..U+111FF (32 code points)
Plane: SMP
Scripts: Sinhala
Symbol sets: Sinhala Illakkam
Assigned: 20 code points
Unused: 12 reserved code points
Unicode version history: 7.0 (2014): 20 (+20)
Unicode documentation: Code chart ∣ Web page
Note: [1] [2]

Sinhala Archaic Numbers is a Unicode block containing Sinhala Illakkam number characters.

Block

Sinhala Archaic Numbers [1] [2]
Official Unicode Consortium code chart (PDF)
         0  1  2  3  4  5  6  7  8  9  A  B  C  D  E  F
U+111Ex  -  𑇡  𑇢  𑇣  𑇤  𑇥  𑇦  𑇧  𑇨  𑇩  𑇪  𑇫  𑇬  𑇭  𑇮  𑇯
U+111Fx  𑇰  𑇱  𑇲  𑇳  𑇴  -  -  -  -  -  -  -  -  -  -  -
Notes
1. ^ As of Unicode version 15.1
2. ^ A hyphen (-) marks a non-assigned code point
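
The chart above can be cross-checked programmatically. The following is a minimal sketch, assuming a Python interpreter whose bundled `unicodedata` module covers Unicode 7.0 or later (the version in which this block was added). The helper name `assigned_code_points` is hypothetical and is not part of any library.

```python
import unicodedata

# Block boundaries as given in the article's range entry.
BLOCK_START, BLOCK_END = 0x111E0, 0x111FF


def assigned_code_points(start=BLOCK_START, end=BLOCK_END):
    """Return the code points in [start, end] that are assigned.

    General category "Cn" marks unassigned code points in the
    Unicode database that ships with the running interpreter.
    """
    return [
        cp for cp in range(start, end + 1)
        if unicodedata.category(chr(cp)) != "Cn"
    ]


assigned = assigned_code_points()
for cp in assigned:
    ch = chr(cp)
    # name() and numeric() accept a default for characters lacking the property.
    name = unicodedata.name(ch, "<unnamed>")
    value = unicodedata.numeric(ch, None)
    print(f"U+{cp:04X}  {name}  numeric value: {value}")

total = BLOCK_END - BLOCK_START + 1
print(f"{len(assigned)} assigned, {total - len(assigned)} unassigned of {total}")
```

With a sufficiently recent Unicode database, the counts should agree with the 20 assigned and 12 reserved code points listed for the block.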

History

The following Unicode-related documents record the purpose and process of defining specific characters in the Sinhala Archaic Numbers block:

Related Research Articles

Number Forms is a Unicode block containing Unicode compatibility characters that have specific meaning as numbers, but are constructed from other characters. They consist primarily of vulgar fractions and Roman numerals. In addition to the characters in the Number Forms block, three fractions were inherited from ISO-8859-1, which was incorporated whole as the Latin-1 Supplement block.

In the Unicode standard, a plane is a contiguous group of 65,536 (2^16) code points. There are 17 planes, identified by the numbers 0 to 16, which correspond to the possible values 00–10 (hexadecimal) of the first two positions in the six-position hexadecimal format (U+hhhhhh). Plane 0 is the Basic Multilingual Plane (BMP), which contains most commonly used characters. The higher planes 1 through 16 are called "supplementary planes". The last code point in Unicode is the last code point in plane 16, U+10FFFF. As of Unicode version 15.1, seven of the planes have assigned code points (characters), and five are named.
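
Because each plane holds 2^16 code points, the plane number is simply the code point shifted right by 16 bits. A minimal sketch of that arithmetic follows; the function name `plane_of` is chosen here for illustration.

```python
def plane_of(code_point: int) -> int:
    """Return the plane number (0-16) that contains the given code point."""
    if not 0 <= code_point <= 0x10FFFF:
        raise ValueError(f"not a Unicode code point: {code_point:#x}")
    # Each plane spans 0x10000 code points, so the bits above the
    # low 16 give the plane number directly.
    return code_point >> 16


print(plane_of(0x0041))    # 0  (Basic Multilingual Plane)
print(plane_of(0x111E1))   # 1  (Supplementary Multilingual Plane)
print(plane_of(0x10FFFF))  # 16 (last code point in Unicode)
```

The second call shows why the Sinhala Archaic Numbers block is listed under the SMP: its code points all begin with the plane digit 1.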

The Latin-1 Supplement is the second Unicode block in the Unicode standard. It encodes the upper range of ISO 8859-1: 80 (U+0080) - FF (U+00FF). C1 Controls (0080–009F) are not graphic. This block ranges from U+0080 to U+00FF, contains 128 characters and includes the C1 controls, Latin-1 punctuation and symbols, 30 pairs of majuscule and minuscule accented Latin characters and 2 mathematical operators.
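
The correspondence with ISO 8859-1 can be illustrated with Python's built-in `latin-1` codec, which maps each byte to the code point of the same numeric value. This is only a sketch of that one-to-one mapping for the upper range 80–FF. The variable names are illustrative.

```python
# Every byte value in the upper half of ISO 8859-1 (0x80-0xFF).
upper_bytes = bytes(range(0x80, 0x100))

# Decoding as Latin-1 yields one character per byte.
text = upper_bytes.decode("latin-1")

# Each decoded character has the same number as the byte it came from,
# so the result covers exactly U+0080..U+00FF.
same_value = all(ord(ch) == b for ch, b in zip(text, upper_bytes))

print(len(text), same_value)
print(f"U+{ord(text[0]):04X}..U+{ord(text[-1]):04X}")
```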

Enclosed Alphanumerics is a Unicode block of typographical symbols of an alphanumeric within a circle, a bracket or other not-closed enclosure, or ending in a full stop.

Greek and Coptic is the Unicode block for representing modern (monotonic) Greek. It was originally also used for writing Coptic, using the similar Greek letters in addition to the uniquely Coptic additions. Beginning with version 4.1 of the Unicode Standard, a separate Coptic block has been included in Unicode, allowing for mixed Greek/Coptic text that is stylistically contrastive, as is convention in scholarly works. Writing polytonic Greek requires the use of combining characters or the precomposed vowel + tone characters in the Greek Extended character block.

Hangul Jamo Extended-A is a Unicode block containing choseong forms of archaic Hangul consonant clusters. They can be used to dynamically compose syllables that are not available as precomposed Hangul syllables in Unicode; specifically, syllables that are not used in standard modern Korean.

Hangul Jamo Extended-B is a Unicode block containing positional forms of archaic Hangul vowel and consonant clusters. They can be used to dynamically compose syllables that are not available as precomposed Hangul syllables in Unicode; specifically, syllables that are not used in standard modern Korean.

Tibetan is a Unicode block containing characters for the Tibetan, Dzongkha, and other languages of China, Bhutan, Nepal, Mongolia, northern India, eastern Pakistan and Russia.

Sinhala is a Unicode block containing characters for the Sinhala and Pali languages of Sri Lanka, and is also used for writing Sanskrit in Sri Lanka. The Sinhala allocation is loosely based on the ISCII standard, except that Sinhala contains extra prenasalized consonant letters, leading to inconsistencies with other ISCII-Unicode script allocations.

Kana Supplement is a Unicode block containing one archaic katakana character and 255 hentaigana characters. Additional hentaigana characters are encoded in the Kana Extended-A block.

General Punctuation is a Unicode block containing punctuation, spacing, and formatting characters for use with all scripts and writing systems. Included are the defined-width spaces, joining formats, directional formats, smart quotes, archaic and novel punctuation such as the interrobang, and invisible mathematical operators.

Tagalog is a Unicode block containing characters of the Baybayin script, specifically the variety used for writing the Tagalog language before and during Spanish colonization of the Philippines, which eventually led to the adoption of the Latin alphabet. It has been a part of the Unicode Standard since version 3.2 in April 2002. Tagalog characters can be found in the Noto Sans Tagalog font, among others. The Tagalog Baybayin script was originally proposed for inclusion in Unicode alongside its descendant Hanunoo, Buhid and Tagbanwa scripts as a single block called "Philippine Scripts"; two punctuation marks are encoded only in the Hanunoo block. In 2021, with version 14.0, the Unicode Standard was updated to add three new characters: the "ra", the archaic "ra", and the pamudpod.

Javanese is a Unicode block containing aksara Jawa characters traditionally used for writing the Javanese language.

Aegean Numbers is a Unicode block containing punctuation, number, and unit characters for Linear A, Linear B, and the Cypriot syllabary, collectively known as Aegean numerals.

Ancient Greek Numbers is a Unicode block containing acrophonic numerals used in ancient Greece, including ligatures and special symbols.

Phoenician is a Unicode block containing characters used across the Mediterranean world from the 12th century BCE to the 3rd century CE. The Phoenician alphabet was added to the Unicode Standard in July 2006 with the release of version 5.0. An alternative proposal to handle it as a font variation of Hebrew was turned down.

Coptic Epact Numbers is a Unicode block containing Old Coptic number forms.

Early Dynastic Cuneiform is the name of a Unicode block of the Supplementary Multilingual Plane (SMP), at U+12480–U+1254F, introduced in version 8.0. It is a supplement to the earlier encoding of the cuneiform script in the two blocks U+12000–U+123FF "Cuneiform" and U+12400–U+1247F "Cuneiform Numbers and Punctuation".

Indic Siyaq Numbers is a Unicode block containing a specialized subset of the Arabic script that was used for accounting in India under the Mughals from the 17th century through the middle of the 20th century.

Ottoman Siyaq Numbers is a Unicode block containing a specialized subset of the Arabic script that was used for accounting in Ottoman Turkish documents.

References

  1. "Unicode character database". The Unicode Standard. Retrieved 2023-07-26.
  2. "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2023-07-26.