Vithkuqi (Unicode block)

Vithkuqi
Range: U+10570..U+105BF (80 code points)
Plane: SMP
Scripts: Vithkuqi
Assigned: 70 code points
Unused: 10 reserved code points
Unicode version history: 14.0 (2021): 70 (+70)
Chart: Code chart
Note: [1] [2]

Vithkuqi is a Unicode block containing characters for Naum Veqilharxhi's script for writing Albanian.
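As an illustration of what the block range means in practice, the following minimal Python sketch tests whether a character falls inside U+10570..U+105BF and shows how such a character is encoded. The function and constant names are hypothetical, chosen for this example only. Because the block lies in the Supplementary Multilingual Plane (plane 1), each character takes two UTF-16 code units (a surrogate pair) and four bytes in UTF-8.

```python
# Block bounds as stated for the Vithkuqi block: U+10570..U+105BF.
VITHKUQI_START = 0x10570
VITHKUQI_END = 0x105BF


def in_vithkuqi_block(ch: str) -> bool:
    """Return True if the single character ch lies within the block range.

    This checks the range only; it does not say whether the code point
    is actually assigned to a character.
    """
    return VITHKUQI_START <= ord(ch) <= VITHKUQI_END


def plane_of(ch: str) -> int:
    """Return the Unicode plane number (0 = BMP, 1 = SMP, ...)."""
    return ord(ch) >> 16


def utf16_units(ch: str) -> list[int]:
    """Return the UTF-16 code units of ch as integers."""
    data = ch.encode("utf-16-be")
    return [int.from_bytes(data[i:i + 2], "big") for i in range(0, len(data), 2)]


if __name__ == "__main__":
    first = chr(VITHKUQI_START)
    print(in_vithkuqi_block(first), plane_of(first))
    print([f"{unit:04X}" for unit in utf16_units(first)])
    print(first.encode("utf-8").hex(" "))
```

The range test uses plain integer comparison, so it works regardless of which Unicode version the Python runtime's own character database supports.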

Vithkuqi [1] [2]
Official Unicode Consortium code chart (PDF)
        0 1 2 3 4 5 6 7 8 9 A B C D E F
U+1057x 𐕰 𐕱 𐕲 𐕳 𐕴 𐕵 𐕶 𐕷 𐕸 𐕹 𐕺 - 𐕼 𐕽 𐕾 𐕿
U+1058x 𐖀 𐖁 𐖂 𐖃 𐖄 𐖅 𐖆 𐖇 𐖈 𐖉 𐖊 - 𐖌 𐖍 𐖎 𐖏
U+1059x 𐖐 𐖑 𐖒 - 𐖔 𐖕 - 𐖗 𐖘 𐖙 𐖚 𐖛 𐖜 𐖝 𐖞 𐖟
U+105Ax 𐖠 𐖡 - 𐖣 𐖤 𐖥 𐖦 𐖧 𐖨 𐖩 𐖪 𐖫 𐖬 𐖭 𐖮 𐖯
U+105Bx 𐖰 𐖱 - 𐖳 𐖴 𐖵 𐖶 𐖷 𐖸 𐖹 - 𐖻 𐖼 - - -
Notes
1. As of Unicode version 15.0
2. A hyphen (-) marks a non-assigned code point
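The split between assigned and non-assigned code points in the chart can be reproduced from the ranges listed in the History section. The following is a minimal Python sketch under that assumption; the names `ASSIGNED_RANGES`, `assigned_code_points` and `reserved_code_points` are hypothetical, and the ranges are copied from this article rather than read from the Unicode Character Database.

```python
# Block bounds and assigned ranges as listed in this article
# (final code points for Unicode 14.0). Both ends are inclusive.
BLOCK_START, BLOCK_END = 0x10570, 0x105BF

ASSIGNED_RANGES = [
    (0x10570, 0x1057A),
    (0x1057C, 0x1058A),
    (0x1058C, 0x10592),
    (0x10594, 0x10595),
    (0x10597, 0x105A1),
    (0x105A3, 0x105B1),
    (0x105B3, 0x105B9),
    (0x105BB, 0x105BC),
]


def assigned_code_points() -> list[int]:
    """Expand the inclusive ranges into a flat list of code points."""
    return [cp for first, last in ASSIGNED_RANGES for cp in range(first, last + 1)]


def reserved_code_points() -> list[int]:
    """Return the code points in the block that are not in any assigned range."""
    assigned = set(assigned_code_points())
    return [cp for cp in range(BLOCK_START, BLOCK_END + 1) if cp not in assigned]


if __name__ == "__main__":
    print(len(assigned_code_points()))   # 70 assigned code points
    print(len(reserved_code_points()))   # 10 reserved code points
    print(" ".join(f"U+{cp:05X}" for cp in reserved_code_points()))
```

The two counts agree with the figures given for the block: 70 assigned and 10 reserved code points out of 80.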

History

The following Unicode-related documents record the purpose and process of defining specific characters in the Vithkuqi block:

Version: 14.0
Final code points [note 1]: U+10570..1057A, 1057C..1058A, 1058C..10592, 10594..10595, 10597..105A1, 105A3..105B1, 105B3..105B9, 105BB..105BC
Count: 70
Documents (L2 ID, WG2 ID where assigned, document):
L2/09-328 Anderson, Deborah; Glavy, Jason (2009-11-30), Old Albanian Scripts
L2/17-316 N4854 Everson, Michael (2017-09-08), Preliminary proposal for encoding the Vithkuqi script
L2/17-384 Anderson, Deborah; Whistler, Ken; Pournader, Roozbeh; Moore, Lisa; Liang, Hai (2017-10-22), "3. Vithkuqi", Recommendations to UTC #153 October 2017 on Script Proposals
L2/20-169 Anderson, Deborah; Whistler, Ken; Pournader, Roozbeh; Moore, Lisa; Constable, Peter; Liang, Hai (2020-07-21), "5. Vithkuqi", Recommendations to UTC #164 July 2020 on Script Proposals
L2/20-250 Anderson, Deborah; Whistler, Ken; Pournader, Roozbeh; Moore, Lisa; Constable, Peter; Liang, Hai (2020-10-01), "3. Vithkuqi", Recommendations to UTC #165 October 2020 on Script Proposals
L2/20-237 Moore, Lisa (2020-10-27), "B.1 (Section 3, Vithkuqi)", UTC #165 Minutes
L2/20-187R2 N5138R2 Everson, Michael (2020-12-07), Proposal for encoding the Vithkuqi script in the SMP
L2/21-016R Anderson, Deborah; Whistler, Ken; Pournader, Roozbeh; Moore, Lisa; Liang, Hai (2021-01-14), "5 Vithkuqi", Recommendations to UTC #166 January 2021 on Script Proposals
L2/21-009 Moore, Lisa (2021-01-27), "B.1 — 5", UTC #166 Minutes
  1. Proposed code points and character names may differ from final code points and names

References

  1. "Unicode character database". The Unicode Standard. Retrieved 2021-09-15.
  2. "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2021-09-15.