Caucasian Albanian (Unicode block)

Last updated
Caucasian Albanian
RangeU+10530..U+1056F
(64 code points)
Plane SMP
Scripts Caucasian Albanian
Major alphabetsCaucasian Albanian
Assigned53 code points
Unused11 reserved code points
Unicode version history
7.0 (2014)53 (+53)
Unicode documentation
Code chart ∣ Web page
Note: [1] [2]

Caucasian Albanian is a Unicode block containing characters used by the Caucasian Albanian peoples of Azerbaijan and Dagestan for writing Northeast Caucasian languages. [3]

Caucasian Albanian [1] [2]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+1053x𐔰𐔱𐔲𐔳𐔴𐔵𐔶𐔷𐔸𐔹𐔺𐔻𐔼𐔽𐔾𐔿
U+1054x𐕀𐕁𐕂𐕃𐕄𐕅𐕆𐕇𐕈𐕉𐕊𐕋𐕌𐕍𐕎𐕏
U+1055x𐕐𐕑𐕒𐕓𐕔𐕕𐕖𐕗𐕘𐕙𐕚𐕛𐕜𐕝𐕞𐕟
U+1056x𐕠𐕡𐕢𐕣𐕯
Notes
1. ^ As of Unicode version 15.1
2. ^ Grey areas indicate non-assigned code points

Typefaces supporting this script include such as Arian AMU [hy], Noto Sans Caucasian Albanian and Unifont Upper. Recent versions of Windows also support Calibri and Microsoft Sans Serif.

History

The following Unicode-related documents record the purpose and process of defining specific characters in the Caucasian Albanian block:

Version Final code points [lower-alpha 1] Count L2  ID WG2  IDDocument
7.0U+10530..10563, 1056F53 L2/11-143 N4039 Everson, Michael (2011-05-02), Preliminary proposal for encoding the Caucasian Albanian script in the SMP of the UCS
L2/11-296R N4131 Everson, Michael (2011-10-28), Proposal for encoding the Caucasian Albanian script in the SMP of the UCS
N4243 Everson, Michael; Gippert, Jost (2012-02-14), Documentation for Two Characters FE2B and FE2C for Caucasian Albanian (N4131R)
L2/12-112 Moore, Lisa (2012-05-17), "Consensus 131-C24", UTC #131 / L2 #228 Minutes
N4253 (pdf, doc)"M59.14", Unconfirmed minutes of WG 2 meeting 59, 2012-09-12
  1. Proposed code points and characters names may differ from final code points and names

Related Research Articles

Mathematical Alphanumeric Symbols is a Unicode block comprising styled forms of Latin and Greek letters and decimal digits that enable mathematicians to denote different notions with different letter styles. The letters in various fonts often have specific, fixed meanings in particular areas of mathematics. By providing uniformity over numerous mathematical articles and books, these conventions help to read mathematical formulas. These also may be used to differentiate between concepts that share a letter in a single problem.

Geometric Shapes is a Unicode block of 96 symbols at code point range U+25A0–25FF.

Letterlike Symbols is a Unicode block containing 80 characters which are constructed mainly from the glyphs of one or more letters. In addition to this block, Unicode includes full styled mathematical alphabets, although Unicode does not explicitly categorize these characters as being "letterlike."

Number Forms is a Unicode block containing Unicode compatibility characters that have specific meaning as numbers, but are constructed from other characters. They consist primarily of vulgar fractions and Roman numerals. In addition to the characters in the Number Forms block, three fractions were inherited from ISO-8859-1, which was incorporated whole as the Latin-1 Supplement block.

Mandaic is a Unicode block containing characters of the Mandaic script used for writing the historic Eastern Aramaic, also called Classical Mandaic, and the modern Neo-Mandaic language.

Currency Symbols is a Unicode block containing characters for representing unique monetary signs. Many currency signs can be found in other Unicode blocks, especially when the currency symbol is unique to a country that uses a script not generally used outside that country.

Tagalog is a Unicode block containing characters of the Baybayin script, specifically the variety used for writing the Tagalog language before Spanish colonization of the Philippines eventually led to the adoption of the Latin alphabet. It has been a part of the Unicode Standard since version 3.2 in April 2002. Tagalog characters can be found in the Noto Sans Tagalog font, among others. The Tagalog Baybayin script was originally proposed for inclusion in Unicode alongside its descendant Hanunoo, Buhid and Tagbanwa scripts as a single block called "Philippine Scripts" and two punctuation marks are only part of the Hanunoo block. In 2021, with version 14.0, the Unicode Standard was updated to add three new characters: the "ra" and archaic "ra", and the pamudpod.

Lisu is a Unicode block containing characters of the Fraser alphabet, which is used to write the Lisu language. This alphabet consists of glyphs resembling capital letters in the basic Latin alphabet in their standard form and turned.

Javanese is a Unicode block containing aksara Jawa characters traditionally used for writing the Javanese language.

<span class="mw-page-title-main">Noto fonts</span> Multilingual font family from Google

Noto is a font family comprising over 100 individual fonts, which are together designed to cover all the scripts encoded in the Unicode standard. As of October 2016, Noto fonts cover all 93 scripts defined in Unicode version 6.1, although fewer than 30,000 of the nearly 75,000 CJK unified ideographs in version 6.0 are covered. In total, Noto fonts cover over 77,000 characters, which is around half of the 149,186 characters defined in Unicode 15.0.

Latin Extended-E is a Unicode block containing Latin script characters used in German dialectology (Teuthonista), Anthropos alphabet, Sakha and Americanist usage.

Old North Arabian is a Unicode block containing characters for writing the Ancient North Arabian language.

Cherokee Supplement is a Unicode block containing the syllabic characters for writing the Cherokee language. When Cherokee was first added to Unicode in version 3.0 it was treated as a unicameral alphabet, but in version 8.0 it was redefined as a bicameral script. The Cherokee Supplement block contains lowercase letters only, whereas the Cherokee block contains all the uppercase letters, together with six lowercase letters. For backwards compatibility, the Unicode case folding algorithm—which usually converts a string to lowercase characters—maps Cherokee characters to uppercase.

Adlam is a Unicode block containing characters from the Adlam script, an alphabetic script devised during the late 1980s for writing the Fula language in Guinea, Nigeria, Liberia, and other nearby countries.

Ideographic Symbols and Punctuation is a Unicode block containing symbols and punctuation marks used by ideographic scripts such as Tangut and Nüshu.

<span class="mw-page-title-main">Dogra (Unicode block)</span> Unicode character block

Dogra is a Unicode block for the Dogri script, for writing the Dogri language in Jammu and Kashmir in the northern part of the Indian subcontinent. The Takri script version of Jammu is known as Dogra Akkhar.

Old Sogdian is a Unicode block containing characters for a group of related, non-cursive Sogdian writing systems used to write historic Sogdian in the 3rd to 5th centuries CE.

Sogdian is a Unicode block containing characters used to write the Sogdian language from the 7th to 14th centuries CE.

Tangut Supplement is a Unicode block containing characters from the Tangut script, which was used for writing the Tangut language spoken by the Tangut people in the Western Xia Empire, and in China during the Yuan dynasty and early Ming dynasty. This block is a supplement to the main Tangut block.

Vithkuqi is a Unicode block containing characters for Naum Veqilharxhi's script for writing Albanian.

References

  1. "Unicode character database". The Unicode Standard. Retrieved 2023-07-26.
  2. "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2023-07-26.
  3. Everson, Michael; Gippert, Jost (2011-10-28). "N4131R: Proposal for encoding the Caucasian Albanian script in the SMP of the UCS" (PDF). Working Group Document, ISO/IEC JTC1/SC2/WG2.