Cyrillic Extended-C

Cyrillic Extended-C
Cyrillic Extended-C
Range	U+1C80..U+1C8F; (16 code points)
Plane	BMP
Scripts	Cyrillic
Major alphabets	Old Cyrillic
Assigned	11 code points
Unused	5 reserved code points
Unicode version history
9.0 (2016)	9 (+9)
16.0 (2024)	11 (+2)
Unicode documentation
	Code chart ∣ Web page
	Note:

Last updated September 11, 2024

Cyrillic Extended-C is a Unicode block containing Cyrillic characters for facsimile reprinting Old Believer service books. They are (contextual) graphic variants of standard Cyrillic rather than distinct letters.

Cyrillic Extended-C ^[1]^[2] Official Unicode Consortium code chart (PDF)
	0	1	2	3	4	5	6	7	8	9	A	B	C	D	E	F
U+1C8x	ᲀ	ᲁ	ᲂ	ᲃ	ᲄ	ᲅ	ᲆ	ᲇ	ᲈ	Ᲊ	ᲊ
Notes 1. ^ As of Unicode version 16.0 2. ^ Grey areas indicate non-assigned code points

History

The following Unicode-related documents record the purpose and process of defining specific characters in the Cyrillic Extended-C block:

Version	Final code points^{[lower-alpha 1]}	Count	L2 ID	WG2 ID	Document
9.0	U+1C80..1C88	9	L2/00-128		Bunz, Carl-Martin (2000-03-01), Scripts from the Past in Future Versions of Unicode
			L2/13-153		Andreev, Aleksandr; Shardt, Yuri; Simmons, Nikita (2013-07-20), Proposal to Use Standardized Variation Sequences to Encode Church Slavonic Glyph Variants in Unicode
			L2/13-164		Cleminson, Ralph; Birnbaum, David (2013-07-25), Feedback from Experts on Cyrillic proposals
			L2/14-196	N4607 ^{[lower-alpha 2]}	Andreev, Aleksandr; Shardt, Yuri; Simmons, Nikita (2014-08-06), Proposal to Encode Additional Cyrillic Characters used in Early Church Slavonic Printed Books
				N4645	Kravetsky, Aleksandr G. (2014-09-27), Letter in support of "Proposal to Encode Additional Cyrillic Characters used in Early Church Slavonic Printed Books"
			L2/14-177		Moore, Lisa (2014-10-17), "Proposal to encode additional Cyrillic characters (C.7.1.1)", UTC #140 Minutes
			L2/16-052	N4603 (pdf, doc)	Umamaheswaran, V. S. (2015-09-01), "M63.08", Unconfirmed minutes of WG 2 meeting 63
16.0	U+1C89..1C8A	2	L2/22-119		Manulov, Nikita (2022-06-16), Proposal to encode Cyrillic letter Khanty Tje
			L2/22-128		Anderson, Deborah; Whistler, Ken; Pournader, Roozbeh; Constable, Peter (2022-07-20), "1a Khanty Cyrillic Letter", Recommendations to UTC #172 July 2022 on Script Proposals
			L2/22-121		Constable, Peter (2022-08-01), "Consensus 172-C4", Draft Minutes of UTC Meeting 172, The UTC accepts U+1C89 CYRILLIC CAPITAL LETTER TJE and U+1C8A CYRILLIC SMALL LETTER TJE
			L2/23-234		Scherer, Markus; Hadley, Josh (2023-10-25), "2.2", UTC #177 properties feedback & recommendations
			L2/23-231		Constable, Peter (2023-12-08), "Note 177-N8", UTC #177 Minutes, The PAG has corrected the Titlecase_Mapping for U+1C8A CYRILLIC SMALL LETTER TJE
			L2/24-104		Manulov, Nikita (2024-03-30), Annotation for Cyrillic Tje
			L2/24-068		Anderson, Deborah; Goregaokar, Manish; Kučera, Jan; Whistler, Ken; Pournader, Roozbeh; Constable, Peter (2024-04-18), "3. Cyrillic", Recommendations to UTC #179 April 2024 on Script Proposals
↑ Proposed code points and characters names may differ from final code points and names ↑ WG2 document differs from the L2 document

Related Research Articles

Unicode has subscripted and superscripted versions of a number of characters including a full set of Arabic numerals. These characters allow any polynomial, chemical and certain other equations to be represented in plain text without using any form of markup like HTML or TeX.

As of Unicode version 15.1, Cyrillic script is encoded across several blocks:

Phonetic Extensions is a Unicode block containing phonetic characters used in the Uralic Phonetic Alphabet, Old Irish phonetic notation, the Oxford English Dictionary and American dictionaries, and Americanist and Russianist phonetic notations. Its character set is continued in the following Unicode block, Phonetic Extensions Supplement.

Latin Extended-C is a Unicode block containing Latin characters for Uighur New Script, the Uralic Phonetic Alphabet, Shona, Claudian Latin and the Swedish Dialect Alphabet.

Combining Half Marks is a Unicode block containing diacritical combining characters for spanning multiple characters.

Latin Extended-B is the fourth block (0180-024F) of the Unicode Standard. It has been included since version 1.0, where it was only allocated to the code points 0180-01FF and contained 113 characters. During unification with ISO 10646 for version 1.1, the block range was extended by 80 code points and another 35 characters were assigned. In version 3.0 and later, the last 60 available code points in the block were assigned. Its block name in Unicode 1.0 was Extended Latin.

Cyrillic Extended-A is a Unicode block containing Cyrillic combining characters used in Old Church Slavonic texts.

Cyrillic Extended-B is a Unicode block containing Cyrillic characters for writing Old Cyrillic and Old Abkhazian, and combining numeric signs for Cyrillic numerals used in early Slavic or Church Slavonic texts.

Cyrillic Supplement is a Unicode block containing Cyrillic letters for writing several minority languages, including Abkhaz, Kurdish, Komi, Mordvin, Aleut, Azerbaijani, and Jakovlev's Chuvash orthography.

Enclosed Alphanumerics is a Unicode block of typographical symbols of an alphanumeric within a circle, a bracket or other not-closed enclosure, or ending in a full stop.

Cyrillic is a Unicode block containing the characters used to write the most widely used languages with a Cyrillic orthography. The core of the block is based on the ISO 8859-5 standard, with additions for minority languages and historic orthographies.

Hiragana is a Unicode block containing hiragana characters for the Japanese language.

Katakana is a Unicode block containing katakana characters for the Japanese and Ainu languages.

Myanmar Extended-A is a Unicode block containing Myanmar characters for writing the Khamti Shan and Aiton languages.

Kana Supplement is a Unicode block containing one archaic katakana character and 255 hentaigana characters. Additional hentaigana characters are encoded in the Kana Extended-A block.

Myanmar Extended-B is a Unicode block containing Burmese script characters for writing Pali and Tai Laing.

Kana Extended-A is a Unicode block containing hentaigana and historic kana characters. Additional hentaigana characters are encoded in the Kana Supplement block.

Kana Extended-B is a Unicode block containing Taiwanese kana.

Cyrillic Extended-D is a Unicode block containing superscript and subscript Cyrillic characters used in Cyrillic-based phonetic transcription. The block contains the first Cyrillic characters defined outside of the Basic Multilingual Plane (BMP).

References

↑ "Unicode character database". The Unicode Standard. Retrieved 2023-07-26.
↑ "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2023-07-26.

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.

[final-3] Proposed code points and characters names may differ from final code points and names

[wg2differs-4] WG2 document differs from the L2 document

[1] "Unicode character database". The Unicode Standard. Retrieved 2023-07-26.

[2] "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2023-07-26.

[1]

[2]

[lower-alpha 1]

[lower-alpha 2]

Cyrillic Extended-C
Range	U+1C80..U+1C8F (16 code points)
Plane	BMP
Scripts	Cyrillic
Major alphabets	Old Cyrillic
Assigned	11 code points
Unused	5 reserved code points
Unicode version history

9.0 (2016)	9 (+9)
16.0 (2024)	11 (+2)

Unicode documentation
Code chart ∣ Web page
Note: ^[1]^[2]