Armenian (Unicode block)

Last updated
Armenian
RangeU+0530..U+058F
(96 code points)
Plane BMP
Scripts Armenian
Major alphabets Armenian alphabet
Assigned91 code points
Unused5 reserved code points
Unicode version history
1.0.0 (1991)84 (+84)
1.1 (1993)85 (+1)
3.0 (1999)86 (+1)
6.1 (2012)87 (+1)
7.0 (2014)89 (+2)
11.0 (2018)91 (+2)
Unicode documentation
Code chart ∣ Web page
Note: [1] [2]

Armenian is a Unicode block containing characters for writing the Armenian language, both the classical and reformed orthographies. Five Armenian ligatures are encoded in the Alphabetic Presentation Forms block.

Contents

Block

Armenian [1] [2]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+053xԱԲԳԴԵԶԷԸԹԺԻԼԽԾԿ
U+054xՀՁՂՃՄՅՆՇՈՉՊՋՌՍՎՏ
U+055xՐՑՒՓՔՕՖՙ՚՛՜՝՞՟
U+056xՠաբգդեզէըթժիլխծկ
U+057xհձղճմյնշոչպջռսվտ
U+058xրցւփքօֆևֈ։֊֍֎֏
Notes
1. ^ As of Unicode version 16.0
2. ^ Grey areas indicate non-assigned code points

U+2019RIGHT SINGLE QUOTATION MARK is preferred over U+055A՚ARMENIAN APOSTROPHE. [3] U+02BBʻMODIFIER LETTER TURNED COMMA is preferred over U+0559ՙARMENIAN MODIFIER LETTER LEFT HALF RING. [3]

History

The following Unicode-related documents record the purpose and process of defining specific characters in the Armenian block:

Related Research Articles

<span class="mw-page-title-main">ArmSCII</span> Set of obsolete single-byte character encodings

ArmSCII or ARMSCII is a set of obsolete single-byte character encodings for the Armenian alphabet defined by Armenian national standard 166–9. ArmSCII is an acronym for Armenian Standard Code for Information Interchange, similar to ASCII for the American standard. It has been superseded by the Unicode standard.

Mathematical Alphanumeric Symbols is a Unicode block comprising styled forms of Latin and Greek letters and decimal digits that enable mathematicians to denote different notions with different letter styles. The letters in various fonts often have specific, fixed meanings in particular areas of mathematics. By providing uniformity over numerous mathematical articles and books, these conventions help to read mathematical formulas. These also may be used to differentiate between concepts that share a letter in a single problem.

Unicode has subscripted and superscripted versions of a number of characters including a full set of Arabic numerals. These characters allow any polynomial, chemical and certain other equations to be represented in plain text without using any form of markup like HTML or TeX.

As of Unicode version 16.0, Cyrillic script is encoded across several blocks:

<span class="mw-page-title-main">Modifier letter apostrophe</span> Phonetic modifier letter (ʼ)

The modifier letter apostropheʼ is a letter found in Unicode encoding, used primarily for various glottal sounds.

Spacing Modifier Letters is a Unicode block containing characters for the IPA, UPA, and other phonetic transcriptions. Included are the IPA tone marks, and modifiers for aspiration and palatalization. The word spacing indicates that these characters occupy their own horizontal space within a line of text. Its block name in Unicode 1.0 was simply Modifier Letters.

Yi Syllables is a Unicode block containing the 1,165 characters of the Liangshan Standard Yi script for writing the Nuosu language.

The Basic Latin Unicode block, sometimes informally called C0 Controls and Basic Latin, is the first block of the Unicode standard, and the only block which is encoded in one byte in UTF-8. The block contains all the letters and control codes of the ASCII encoding. It ranges from U+0000 to U+007F, contains 128 characters and includes the C0 controls, ASCII punctuation and symbols, ASCII digits, both the uppercase and lowercase of the English alphabet and a control character.

IPA Extensions is a block (U+0250–U+02AF) of the Unicode standard that contains full size letters used in the International Phonetic Alphabet (IPA). Both modern and historical characters are included, as well as former and proposed IPA signs and non-IPA phonetic letters. Additional characters employed for phonetics, like the palatalization sign, are encoded in the blocks Phonetic Extensions (1D00–1D7F) and Phonetic Extensions Supplement (1D80–1DBF). Diacritics are found in the Spacing Modifier Letters (02B0–02FF) and Combining Diacritical Marks (0300–036F) blocks. Its block name in Unicode 1.0 was Standard Phonetic.

Modifier Tone Letters is a Unicode block containing tone markings for Chinese, Chinantec, Africanist, and other phonetic transcriptions. It does not contain the standard IPA tone marks, which are found in Spacing Modifier Letters.

Miscellaneous Symbols and Pictographs is a Unicode block containing meteorological and astronomical symbols, emoji characters largely for compatibility with Japanese telephone carriers' implementations of Shift JIS, and characters originally from the Wingdings and Webdings fonts found in Microsoft Windows.

Bopomofo is a Unicode block containing phonetic characters for Chinese. The original set of 40 Bopomofo characters is based on the Chinese standard GB 2312. Additional Bopomofo characters can be found in the Bopomofo Extended block.

Bopomofo Extended is a Unicode block containing additional Bopomofo characters for writing phonetic Min Nan, Hakka Chinese, Cantonese, Hmu, and Ge. The basic set of Bopomofo characters can be found in the Bopomofo block.

Tags is a Unicode block containing formatting tag characters. The block is designed to mirror ASCII. It was originally intended for language tags, but has now been repurposed as emoji modifiers, specifically for region flags.

Emoticons is a Unicode block containing emoticons or emoji. Most of them are intended as representations of faces, although some of them include hand gestures or non-human characters.

Variation Selectors is a Unicode block containing 16 variation selectors used to specify a glyph variant for a preceding character. They are currently used to specify standardized variation sequences for mathematical symbols, emoji symbols, 'Phags-pa letters, and CJK unified ideographs corresponding to CJK compatibility ideographs. At present only standardized variation sequences with VS1–VS4, VS7, VS15 and VS16 have been defined; VS15 and VS16 are reserved to request that a character should be displayed as text or as an emoji respectively.

Latin Extended-E is a Unicode block containing Latin script characters used in German dialectology (Teuthonista), Anthropos alphabet, Sakha and Americanist usage.

Supplemental Symbols and Pictographs is a Unicode block containing emoji characters. It extends the set of symbols included in the Miscellaneous Symbols and Pictographs block. It also includes Typikon symbols.

A number of Greek letters, variants, digits, and other symbols are supported by the Unicode character encoding standard.

Latin Extended-F is a Unicode block containing modifier letters, nearly all IPA and extIPA, for phonetic transcription. The Latin Extended-F and -G blocks contain the first Latin characters defined outside of the Basic Multilingual Plane (BMP). They were added to the free Gentium Plus and Andika fonts with version 6.2 in February 2023. Some computers have 𐞃, 𐞎 and 𐞥 supported on the font Calibri.

References

  1. "Unicode character database". The Unicode Standard. Retrieved 2023-07-26.
  2. "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2023-07-26.
  3. 1 2 "7.6 Armenian". The Unicode Standard, Version 15.0 (PDF). Mountain View, CA: Unicode, Inc. September 2022.