Armenian (Unicode block)

Armenian
Armenian
Range	U+0530..U+058F; (96 code points)
Plane	BMP
Scripts	Armenian
Major alphabets	Armenian alphabet
Assigned	91 code points
Unused	5 reserved code points
Unicode version history
1.0.0 (1991)	84 (+84)
1.1 (1993)	85 (+1)
3.0 (1999)	86 (+1)
6.1 (2012)	87 (+1)
7.0 (2014)	89 (+2)
11.0 (2018)	91 (+2)
Unicode documentation
	Code chart ∣ Web page
	Note: This article contains Armenian text. Without proper rendering support, you may see question marks, boxes, or other symbols instead of Armenian letters.

Last updated January 06, 2025

Armenian is a Unicode block containing characters for writing the Armenian language, both the classical and reformed orthographies. Five Armenian ligatures are encoded in the Alphabetic Presentation Forms block.

Block

Armenian ^[1]^[2] Official Unicode Consortium code chart (PDF)
	0	1	2	3	4	5	6	7	8	9	A	B	C	D	E	F
U+053x		Ա	Բ	Գ	Դ	Ե	Զ	Է	Ը	Թ	Ժ	Ի	Լ	Խ	Ծ	Կ
U+054x	Հ	Ձ	Ղ	Ճ	Մ	Յ	Ն	Շ	Ո	Չ	Պ	Ջ	Ռ	Ս	Վ	Տ
U+055x	Ր	Ց	Ւ	Փ	Ք	Օ	Ֆ			ՙ	՚	՛	՜	՝	՞	՟
U+056x	ՠ	ա	բ	գ	դ	ե	զ	է	ը	թ	ժ	ի	լ	խ	ծ	կ
U+057x	հ	ձ	ղ	ճ	մ	յ	ն	շ	ո	չ	պ	ջ	ռ	ս	վ	տ
U+058x	ր	ց	ւ	փ	ք	օ	ֆ	և	ֈ	։	֊			֍	֎	֏
Notes 1. ^ As of Unicode version 16.0 2. ^ Grey areas indicate non-assigned code points

U+2019’RIGHT SINGLE QUOTATION MARK is preferred over U+055A՚ARMENIAN APOSTROPHE.^[3]U+02BBʻMODIFIER LETTER TURNED COMMA is preferred over U+0559ՙARMENIAN MODIFIER LETTER LEFT HALF RING.^[3]

History

The following Unicode-related documents record the purpose and process of defining specific characters in the Armenian block:

Version	Final code points^[a]	Count	L2 ID	WG2 ID	Document
1.0.0	U+0531..0556, 0559..055F, 0561..0586, 0589	84			(to be determined)
			L2/17-315		Golev, Yury; Anderson, Deborah (2017-09-07), Evidence of diaeresis in Armenian
			L2/17-384		Anderson, Deborah; Whistler, Ken; Pournader, Roozbeh; Moore, Lisa; Liang, Hai (2017-10-22), "4. Armenian", Recommendations to UTC #153 October 2017 on Script Proposals
			L2/17-362		Moore, Lisa (2018-02-02), "C.5. Evidence of diaeresis in Armenian", UTC #153 Minutes
1.1	U+0587	1			(to be determined)
3.0	U+058A	1		N1395	Youatt, Richard (1996-06-04), Armenian Repertoire Proposal Summary Form
				N1444	Everson, Michael (1996-08-07), Proposed amendments to N 1395 - Armenian
				N1446	U.S. Position on Armenian (N 1395), 1996-08-09
				N1453	Ksar, Mike; Umamaheswaran, V. S. (1996-12-06), "8.11", WG 2 Minutes - Quebec Meeting 31
				N1560	Youatt, Richard (1997-05-16), Response on US feedback on Armenian
			L2/97-161	N1616	Suignard, Michel (1997-07-03), Armenian ad hoc report
			L2/97-288	N1603	Umamaheswaran, V. S. (1997-10-24), "8.23", Unconfirmed Meeting Minutes, WG 2 Meeting # 33, Heraklion, Crete, Greece, 20 June – 4 July 1997
			L2/98-004R	N1681	Text of ISO 10646 – AMD 18 for PDAM registration and FPDAM ballot, 1997-12-22
			L2/98-318	N1894	Revised text of 10646-1/FPDAM 18, AMENDMENT 18: Symbols and Others, 1998-10-22
6.1	U+058F	1	L2/10-015R		Moore, Lisa (2010-02-09), "D.5", UTC #122 / L2 #219 Minutes
			L2/10-008R	N3771	Pentzlin, Karl (2010-02-10), Proposal to encode an Armenian Dram currency symbol
				N3824-SARM	SARM – Armenian NB Comments on SC 2 N 4125, ISO/IEC FCD 10646) [Proposal for Armenian symbols], 2010-04-19
			L2/10-136	N3827	Suignard, Michel (2010-04-21), "Proposal to add DRAM SIGN", Disposition of comments on SC2 N 4125 (ISO/IEC FCD 10646)
				N3803 (pdf, doc)	"M56.08g", Unconfirmed minutes of WG 2 meeting no. 56, 2010-09-24
7.0	U+058D..058E	2	L2/98-426	N1981	Armenian Comments on SC 2 N 3134, Application for Registration No.221, Armenian alphabet coded character set for bibliographic information interchange, 1998-12-16
			L2/99-047	N1984	Suignard, Michel (1999-02-05), Encoding of the Armenian script in ISO/IEC 10646, answer to SC2 N3222
			L2/99-054R		Aliprand, Joan (1999-06-21), "Armenian", Approved Minutes from the UTC/L2 meeting in Palo Alto, February 3-5, 1999
			L2/99-232	N2003	Umamaheswaran, V. S. (1999-08-03), "7.2.1.5", Minutes of WG 2 meeting 36, Fukuoka, Japan, 1999-03-09--15
			L2/10-133	N3824	Summary of Voting on SC 2 N 4125, ISO/IEC FCD 10646, 2010-04-19
				N3824-SARM	SARM – Armenian NB Comments on SC 2 N 4125, ISO/IEC FCD 10646) [Proposal for Armenian symbols], 2010-04-19
			L2/10-136	N3827	Suignard, Michel (2010-04-21), "Proposal to add ETERNITY SIGN", Disposition of comments on SC2 N 4125 (ISO/IEC FCD 10646)
			L2/10-354	N3924	Everson, Michael (2010-09-24), Proposal to encode two symbols for Armenian
			L2/10-372	N3921	Summary of Voting on SC 2 N 4146, ISO/IEC CD 10646 (3rd Ed.), 2010-09-24
			L2/10-373	N3923	Pentzlin, Karl (2010-09-24), Proposal to add an Armenian Eternity Sign to the UCS
				N3903 (pdf, doc)	"M57.13", Unconfirmed minutes of WG2 meeting 57, 2011-03-31
			L2/11-261R2		Moore, Lisa (2011-08-16), "Consensus 128-C29", UTC #128 / L2 #225 Minutes, Approve revised code points for the two Armenian eternity signs...
				N4103	"T.3. Miscellaneous Pictographic Symbols", Unconfirmed minutes of WG 2 meeting 58, 2012-01-03
11.0	U+0560, 0588	2	L2/17-032	N4806	Baronian, Luc V. (2017-01-19), Armenian Phonetic Characters in Unicode
			L2/17-037		Anderson, Deborah; Whistler, Ken; Pournader, Roozbeh; Glass, Andrew; Iancu, Laurențiu; Moore, Lisa; Liang, Hai; Ishida, Richard; Misra, Karan; McGowan, Rick (2017-01-21), "1. Armenian", Recommendations to UTC #150 January 2017 on Script Proposals
			L2/17-016		Moore, Lisa (2017-02-08), "C.13", UTC #150 Minutes
				N4953 (pdf, doc)	"M66.03a", Unconfirmed minutes of WG 2 meeting 66, 2018-03-23, 058B ARMENIAN SMALL LETTER TURNED AYB is moved to 0560, and 058C ARMENIAN SMALL LETTER YI WITH STROKE is moved to 0588.
			L2/17-353		Anderson, Deborah; Whistler, Ken (2017-10-02), "A. Armenian", WG2 Consent Docket
			L2/17-362		Moore, Lisa (2018-02-02), "Consensus 153-C1", UTC #153 Minutes
↑ Proposed code points and characters names may differ from final code points and names

Related Research Articles

<span class="mw-page-title-main">ArmSCII</span> Set of obsolete single-byte character encodings

ArmSCII or ARMSCII is a set of obsolete single-byte character encodings for the Armenian alphabet defined by Armenian national standard 166–9. ArmSCII is an acronym for Armenian Standard Code for Information Interchange, similar to ASCII for the American standard. It has been superseded by the Unicode standard.

Mathematical Alphanumeric Symbols is a Unicode block comprising styled forms of Latin and Greek letters and decimal digits that enable mathematicians to denote different notions with different letter styles. The letters in various fonts often have specific, fixed meanings in particular areas of mathematics. By providing uniformity over numerous mathematical articles and books, these conventions help to read mathematical formulas. These also may be used to differentiate between concepts that share a letter in a single problem.

Unicode has subscripted and superscripted versions of a number of characters including a full set of Arabic numerals. These characters allow any polynomial, chemical and certain other equations to be represented in plain text without using any form of markup like HTML or TeX.

As of Unicode version 16.0, Cyrillic script is encoded across several blocks:

The modifier letter apostropheʼ is a letter found in Unicode encoding, used primarily for various glottal sounds.

Spacing Modifier Letters is a Unicode block containing characters for the IPA, UPA, and other phonetic transcriptions. Included are the IPA tone marks, and modifiers for aspiration and palatalization. The word spacing indicates that these characters occupy their own horizontal space within a line of text. Its block name in Unicode 1.0 was simply Modifier Letters.

Yi Syllables is a Unicode block containing the 1,165 characters of the Liangshan Standard Yi script for writing the Nuosu language.

The Basic Latin Unicode block, sometimes informally called C0 Controls and Basic Latin, is the first block of the Unicode standard, and the only block which is encoded in one byte in UTF-8. The block contains all the letters and control codes of the ASCII encoding. It ranges from U+0000 to U+007F, contains 128 characters and includes the C0 controls, ASCII punctuation and symbols, ASCII digits, both the uppercase and lowercase of the English alphabet and a control character.

IPA Extensions is a block (U+0250–U+02AF) of the Unicode standard that contains full size letters used in the International Phonetic Alphabet (IPA). Both modern and historical characters are included, as well as former and proposed IPA signs and non-IPA phonetic letters. Additional characters employed for phonetics, like the palatalization sign, are encoded in the blocks Phonetic Extensions (1D00–1D7F) and Phonetic Extensions Supplement (1D80–1DBF). Diacritics are found in the Spacing Modifier Letters (02B0–02FF) and Combining Diacritical Marks (0300–036F) blocks. Its block name in Unicode 1.0 was Standard Phonetic.

Modifier Tone Letters is a Unicode block containing tone markings for Chinese, Chinantec, Africanist, and other phonetic transcriptions. It does not contain the standard IPA tone marks, which are found in Spacing Modifier Letters.

Miscellaneous Symbols and Pictographs is a Unicode block containing meteorological and astronomical symbols, emoji characters largely for compatibility with Japanese telephone carriers' implementations of Shift JIS, and characters originally from the Wingdings and Webdings fonts found in Microsoft Windows.

Bopomofo is a Unicode block containing phonetic characters for Chinese. The original set of 40 Bopomofo characters is based on the Chinese standard GB 2312. Additional Bopomofo characters can be found in the Bopomofo Extended block.

Bopomofo Extended is a Unicode block containing additional Bopomofo characters for writing phonetic Min Nan, Hakka Chinese, Cantonese, Hmu, and Ge. The basic set of Bopomofo characters can be found in the Bopomofo block.

Tags is a Unicode block containing formatting tag characters. The block is designed to mirror ASCII. It was originally intended for language tags, but has now been repurposed as emoji modifiers, specifically for region flags.

Emoticons is a Unicode block containing emoticons or emoji. Most of them are intended as representations of faces, although some of them include hand gestures or non-human characters.

Variation Selectors is a Unicode block containing 16 variation selectors used to specify a glyph variant for a preceding character. They are currently used to specify standardized variation sequences for mathematical symbols, emoji symbols, 'Phags-pa letters, and CJK unified ideographs corresponding to CJK compatibility ideographs. At present only standardized variation sequences with VS1–VS4, VS7, VS15 and VS16 have been defined; VS15 and VS16 are reserved to request that a character should be displayed as text or as an emoji respectively.

Latin Extended-E is a Unicode block containing Latin script characters used in German dialectology (Teuthonista), Anthropos alphabet, Sakha and Americanist usage.

Supplemental Symbols and Pictographs is a Unicode block containing emoji characters. It extends the set of symbols included in the Miscellaneous Symbols and Pictographs block. It also includes Typikon symbols.

A number of Greek letters, variants, digits, and other symbols are supported by the Unicode character encoding standard.

Latin Extended-F is a Unicode block containing modifier letters, nearly all IPA and extIPA, for phonetic transcription. The Latin Extended-F and -G blocks contain the first Latin characters defined outside of the Basic Multilingual Plane (BMP). They were added to the free Gentium Plus and Andika fonts with version 6.2 in February 2023. Some computers have 𐞃, 𐞎 and 𐞥 supported on the font Calibri.

References

↑ "Unicode character database". The Unicode Standard. Retrieved 2023-07-26.
↑ "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2023-07-26.
1 2 "7.6 Armenian". The Unicode Standard, Version 15.0 (PDF). Mountain View, CA: Unicode, Inc. September 2022.

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.

[final-4] Proposed code points and characters names may differ from final code points and names

[1] "Unicode character database". The Unicode Standard. Retrieved 2023-07-26.

[2] "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2023-07-26.

[tus76-3] 1 2 "7.6 Armenian". The Unicode Standard, Version 15.0 (PDF). Mountain View, CA: Unicode, Inc. September 2022.

[1]

[2]

[3]

[a]

Armenian
Range	U+0530..U+058F (96 code points)
Plane	BMP
Scripts	Armenian
Major alphabets	Armenian alphabet
Assigned	91 code points
Unused	5 reserved code points
Unicode version history

1.0.0 (1991)	84 (+84)
1.1 (1993)	85 (+1)
3.0 (1999)	86 (+1)
6.1 (2012)	87 (+1)
7.0 (2014)	89 (+2)
11.0 (2018)	91 (+2)

Unicode documentation
Code chart ∣ Web page
Note: ^[1]^[2] This article contains Armenian text. Without proper rendering support, you may see question marks, boxes, or other symbols instead of Armenian letters.

Armenian (Unicode block)

Contents

Block

History

Related Research Articles

References