Cyrillic Supplement

Cyrillic Supplement
Cyrillic Supplement
Range	U+0500..U+052F; (48 code points)
Plane	BMP
Scripts	Cyrillic
Major alphabets	Abkhaz; Komi; Mordvin; Aleut
Assigned	48 code points
Unused	0 reserved code points
Unicode version history
3.2 (2002)	16 (+16)
5.0 (2006)	20 (+4)
5.1 (2008)	36 (+16)
5.2 (2009)	38 (+2)
6.0 (2010)	40 (+2)
7.0 (2014)	48 (+8)
Unicode documentation
	Code chart ∣ Web page
	Note:

Last updated March 06, 2024

Cyrillic Supplement is a Unicode block containing Cyrillic letters for writing several minority languages, including Abkhaz, Kurdish, Komi, Mordvin, Aleut, Azerbaijani, and Jakovlev's Chuvash orthography.

Block

Cyrillic Supplement ^[1] Official Unicode Consortium code chart (PDF)
	0	1	2	3	4	5	6	7	8	9	A	B	C	D	E	F
U+050x	Ԁ	ԁ	Ԃ	ԃ	Ԅ	ԅ	Ԇ	ԇ	Ԉ	ԉ	Ԋ	ԋ	Ԍ	ԍ	Ԏ	ԏ
U+051x	Ԑ	ԑ	Ԓ	ԓ	Ԕ	ԕ	Ԗ	ԗ	Ԙ	ԙ	Ԛ	ԛ	Ԝ	ԝ	Ԟ	ԟ
U+052x	Ԡ	ԡ	Ԣ	ԣ	Ԥ	ԥ	Ԧ	ԧ	Ԩ	ԩ	Ԫ	ԫ	Ԭ	ԭ	Ԯ	ԯ
Notes 1. ^ As of Unicode version 15.1

History

The following Unicode-related documents record the purpose and process of defining specific characters in the Cyrillic Supplement block:

Version	Final code points^{[lower-alpha 1]}	Count	L2 ID	WG2 ID	Document
3.2	U+0500..050F	16	L2/98-211	N1744	Everson, Michael (1998-05-25), Additional Cyrillic characters for the UCS
			L2/98-292R (pdf, html, Figure 1)		"2.3", Comments on proposals to add characters from ISO standards developed by ISO/TC 46/SC 4, 1998-08-19
			L2/98-292	N1840	"2.3", Comments on proposals to add characters from ISO standards developed by ISO/TC 46/SC 4, 1998-08-25
			L2/98-293	N1885	"2.4", Comments on proposals to add various characters to ISO/IEC 10646, 1998-08-25
			L2/98-301	N1847	Everson, Michael (1998-09-12), Responses to NCITS/L2 and Unicode Consortium comments on numerous proposals
			L2/98-372	N1884R2 (pdf, doc)	Whistler, Ken; et al. (1998-09-22), Additional Characters for the UCS
			L2/99-010	N1903 (pdf, html, doc)	Umamaheswaran, V. S. (1998-12-30), "8.1.5.1", Minutes of WG 2 meeting 35, London, U.K.; 1998-09-21--25
			L2/00-182	N2224	Everson, Michael; Ruppel, Klaas; Trosterud, Trond (2000-06-09), Archaic Komi Cyrillic characters for the BMP of the UCS
			L2/00-187		Moore, Lisa (2000-08-23), "Komi Cyrillic", UTC minutes -- Boston, August 8-11, 2000
			L2/01-050	N2253	Umamaheswaran, V. S. (2001-01-21), "7.15 Komi Cyrillic", Minutes of the SC2/WG2 meeting in Athens, September 2000
5.0	U+0510..0513	4	L2/05-080R2	N2933	Priest, Lorna (2005-08-02), Proposal to Encode Additional Cyrillic Characters (rev 2005/08/18)
			L2/05-215		Anderson, Deborah (2005-08-03), Feedback on Cyrillic letters EL WITH HOOK and HA WITH HOOK (L2/05-080)
			L2/05-230		Priest, Lorna (2005-08-11), Nameslist annotations for new Cyrillic letters
			L2/05-180		Moore, Lisa (2005-08-17), "Cyrillic (C.18)", UTC #104 Minutes
				N2953 (pdf, doc)	Umamaheswaran, V. S. (2006-02-16), "7.2.4", Unconfirmed minutes of WG 2 meeting 47, Sophia Antipolis, France; 2005-09-12/15
5.1	U+0514..0523	16	L2/98-211	N1744	Everson, Michael (1998-05-25), Additional Cyrillic characters for the UCS
			L2/98-301	N1847	Everson, Michael (1998-09-12), Responses to NCITS/L2 and Unicode Consortium comments on numerous proposals
			L2/98-372	N1884R2 (pdf, doc)	Whistler, Ken; et al. (1998-09-22), Additional Characters for the UCS
			L2/99-010	N1903 (pdf, html, doc)	Umamaheswaran, V. S. (1998-12-30), "8.1.5.1", Minutes of WG 2 meeting 35, London, U.K.; 1998-09-21--25
			L2/01-050	N2253	Umamaheswaran, V. S. (2001-01-21), "7.15 Komi Cyrillic", Minutes of the SC2/WG2 meeting in Athens, September 2000
			L2/06-042		Cleminson, Ralph (2006-01-26), Proposal for additional Cyrillic characters
			L2/06-181		Anderson, Deborah (2006-05-08), Responses to the UTC regarding L2/06-042, Proposal for Additional Cyrillic Characters
			L2/06-359		Cleminson, Ralph (2006-10-31), Proposal for additional Cyrillic characters
			L2/07-003	N3194	Everson, Michael; Birnbaum, David; Cleminson, Ralph; Derzhanski, Ivan; Dorosh, Vladislav; Kryukov, Alexey; Paliga, Sorin; Ruppel, Klaas (2007-01-12), Proposal to encode additional Cyrillic characters in the BMP of the UCS
			L2/07-055		Cleminson, Ralph (2007-01-19), Comments on Additional Cyrillic Characters (L2/07-003 = WG2 N3194)
			L2/07-015		Moore, Lisa (2007-02-08), "Cyrillic (C.13)", UTC #110 Minutes
			L2/07-268	N3253 (pdf, doc)	Umamaheswaran, V. S. (2007-07-26), "M50.11", Unconfirmed minutes of WG 2 meeting 50, Frankfurt-am-Main, Germany; 2007-04-24/27
5.2	U+0524..0525	2	L2/08-144	N3435R	Everson, Michael; Priest, Lorna (2008-04-11), Proposal to encode two Cyrillic characters for Abkhaz
			L2/08-134	N3429	Ankwab, Alexander; Bzhanija, Akhara (2008-04-14), Glyph Changes for Abkhaz
			L2/08-161R2		Moore, Lisa (2008-11-05), "Motion 115-M2", UTC #115 Minutes
			L2/08-412	N3553 (pdf, doc)	Umamaheswaran, V. S. (2008-11-05), "M53.11", Unconfirmed minutes of WG 2 meeting 53
6.0	U+0526..0527	2	L2/08-182	N3481	Priest, Lorna (2008-07-28), Proposal to Encode Additional Latin and Cyrillic Characters
			L2/08-161R2		Moore, Lisa (2008-11-05), "Consensus 115-C29", UTC #115 Minutes
			L2/08-412	N3553 (pdf, doc)	Umamaheswaran, V. S. (2008-11-05), "M53.24a", Unconfirmed minutes of WG 2 meeting 53
7.0	U+0528..0529	2	L2/11-265	N4137	Yevlampiev, Ilya; Pentzlin, Karl (2011-07-06), Proposal to encode a missing Cyrillic letter pair for the Orok language
			L2/11-261R2		Moore, Lisa (2011-08-16), "Consensus 128-C39", UTC #128 / L2 #225 Minutes
				N4253 (pdf, doc)	"M59.16h", Unconfirmed minutes of WG 2 meeting 59, 2012-09-12
	U+052A..052D	4	L2/12-040	N4199	Everson, Michael (2012-01-31), Proposal to encode four Cyrillic characters in the BMP of the UCS
			L2/12-007		Moore, Lisa (2012-02-14), "C.14", UTC #130 / L2 #227 Minutes
				N4253 (pdf, doc)	"M59.16i", Unconfirmed minutes of WG 2 meeting 59, 2012-09-12
	U+052E..052F	2	L2/97-146	N1590	Trosterud, Trond (1997-06-09), Proposal to add 10 Cyrillic Sámi characters to ISO/IEC 10646
			L2/97-288	N1603	Umamaheswaran, V. S. (1997-10-24), "8.24.7", Unconfirmed Meeting Minutes, WG 2 Meeting # 33, Heraklion, Crete, Greece, 20 June – 4 July 1997
			L2/98-211	N1744	Everson, Michael (1998-05-25), Additional Cyrillic characters for the UCS
			L2/98-281R (pdf, html)		Aliprand, Joan (1998-07-31), "Cyrillic characters (IV.C.4)", Unconfirmed Minutes – UTC #77 & NCITS Subgroup L2 # 174 JOINT MEETING, Redmond, WA -- July 29-31, 1998
			L2/98-301	N1847	Everson, Michael (1998-09-12), Responses to NCITS/L2 and Unicode Consortium comments on numerous proposals
			L2/98-372	N1884R2 (pdf, doc)	Whistler, Ken; et al. (1998-09-22), Additional Characters for the UCS
			L2/99-010	N1903 (pdf, html, doc)	Umamaheswaran, V. S. (1998-12-30), "8.1.5.1", Minutes of WG 2 meeting 35, London, U.K.; 1998-09-21--25
			L2/01-050	N2253	Umamaheswaran, V. S. (2001-01-21), "7.15 Komi Cyrillic", Minutes of the SC2/WG2 meeting in Athens, September 2000
			L2/12-052R	N4219R	Anderson, Deborah (2012-02-09), Request for 2 New Cyrillic Characters for the Khanty and Nenets Languages
			L2/12-007		Moore, Lisa (2012-02-14), "C.14", UTC #130 / L2 #227 Minutes
				N4253 (pdf, doc)	"M59.16j", Unconfirmed minutes of WG 2 meeting 59, 2012-09-12
↑ Proposed code points and characters names may differ from final code points and names

Related Research Articles

As of Unicode version 15.1, Cyrillic script is encoded across several blocks:

Phonetic Extensions is a Unicode block containing phonetic characters used in the Uralic Phonetic Alphabet, Old Irish phonetic notation, the Oxford English dictionary and American dictionaries, and Americanist and Russianist phonetic notations. Its character set is continued in the following Unicode block, Phonetic Extensions Supplement.

The Latin-1 Supplement is the second Unicode block in the Unicode standard. It encodes the upper range of ISO 8859-1: 80 (U+0080) - FF (U+00FF). C1 Controls (0080–009F) are not graphic. This block ranges from U+0080 to U+00FF, contains 128 characters and includes the C1 controls, Latin-1 punctuation and symbols, 30 pairs of majuscule and minuscule accented Latin characters and 2 mathematical operators.

Cyrillic Extended-A is a Unicode block containing Cyrillic combining characters used in Old Church Slavonic texts.

Cyrillic Extended-B is a Unicode block containing Cyrillic characters for writing Old Cyrillic and Old Abkhazian, and combining numeric signs for Cyrillic numerals used in early Slavic or Church Slavonic texts.

Enclosed Alphanumerics is a Unicode block of typographical symbols of an alphanumeric within a circle, a bracket or other not-closed enclosure, or ending in a full stop.

Enclosed Alphanumeric Supplement is a Unicode block consisting of Latin alphabet characters and Arabic numerals enclosed in circles, ovals or boxes, used for a variety of purposes. It is encoded in the range U+1F100–U+1F1FF in the Supplementary Multilingual Plane.

Cyrillic is a Unicode block containing the characters used to write the most widely used languages with a Cyrillic orthography. The core of the block is based on the ISO 8859-5 standard, with additions for minority languages and historic orthographies.

Georgian Supplement is a Unicode block containing characters for the ecclesiastical form of the Georgian script, Nuskhuri. To write the full ecclesiastical Khutsuri orthography, the Asomtavruli capitals encoded in the Georgian block.

Cherokee is a Unicode block containing the syllabic characters for writing the Cherokee language. When Cherokee was first added to Unicode in version 3.0 it was treated as a unicameral alphabet, but in version 8.0 it was redefined as a bicameral script. The Cherokee block contains all the uppercase letters plus six lowercase letters. The Cherokee Supplement block, added in version 8.0, contains the rest of the lowercase letters. For backwards compatibility, the Unicode case folding algorithm—which usually converts a string to lowercase characters—maps Cherokee characters to uppercase.

Hiragana is a Unicode block containing hiragana characters for the Japanese language.

Katakana is a Unicode block containing katakana characters for the Japanese and Ainu languages.

Variation Selectors Supplement is a Unicode block containing additional variation selectors beyond those found in the Variation Selectors block.

Kana Supplement is a Unicode block containing one archaic katakana character and 255 hentaigana characters. Additional hentaigana characters are encoded in the Kana Extended-A block.

Enclosed Ideographic Supplement is a Unicode block containing forms of characters and words from Chinese, Japanese and Korean enclosed within or stylised as squares, brackets, or circles. It contains three such characters containing one or more kana, and many containing CJK ideographs. Many of its characters were added for compatibility with the Japanese ARIB STD-B24 standard. Six symbols from Chinese folk religion were added in Unicode version 10.

Cherokee Supplement is a Unicode block containing the syllabic characters for writing the Cherokee language. When Cherokee was first added to Unicode in version 3.0 it was treated as a unicameral alphabet, but in version 8.0 it was redefined as a bicameral script. The Cherokee Supplement block contains lowercase letters only, whereas the Cherokee block contains all the uppercase letters, together with six lowercase letters. For backwards compatibility, the Unicode case folding algorithm—which usually converts a string to lowercase characters—maps Cherokee characters to uppercase.

Cyrillic Extended-C is a Unicode block containing Cyrillic characters for facsimile reprinting Old Believer service books. They are (contextual) graphic variants of standard Cyrillic rather than distinct letters.

Kana Extended-A is a Unicode block containing hentaigana and historic kana characters. Additional hentaigana characters are encoded in the Kana Supplement block.

Kana Extended-B is a Unicode block containing Taiwanese kana.

Cyrillic Extended-D is a Unicode block containing superscript and subscript Cyrillic characters used in Cyrillic-based phonetic transcription. The block contains the first Cyrillic characters defined outside of the Basic Multilingual Plane (BMP).

References

↑ "Unicode character database". The Unicode Standard. Retrieved 2023-07-26.
↑ "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2023-07-26.

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.

[final-3] Proposed code points and characters names may differ from final code points and names

[1] "Unicode character database". The Unicode Standard. Retrieved 2023-07-26.

[2] "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2023-07-26.

[1]

[2]

[1]

[lower-alpha 1]

Cyrillic Supplement
Range	U+0500..U+052F (48 code points)
Plane	BMP
Scripts	Cyrillic
Major alphabets	Abkhaz Komi Mordvin Aleut
Assigned	48 code points
Unused	0 reserved code points
Unicode version history

3.2 (2002)	16 (+16)
5.0 (2006)	20 (+4)
5.1 (2008)	36 (+16)
5.2 (2009)	38 (+2)
6.0 (2010)	40 (+2)
7.0 (2014)	48 (+8)

Unicode documentation
Code chart ∣ Web page
Note: ^[1]^[2]

Cyrillic Supplement

Contents

Block

History

Related Research Articles

References