Cyrillic Supplement

Range: U+0500..U+052F (48 code points)
Plane: BMP
Scripts: Cyrillic
Major alphabets: Abkhaz, Komi, Mordvin, Aleut
Assigned: 48 code points
Unused: 0 reserved code points
Unicode version history
3.2 (2002): 16 (+16)
5.0 (2006): 20 (+4)
5.1 (2008): 36 (+16)
5.2 (2009): 38 (+2)
6.0 (2010): 40 (+2)
7.0 (2014): 48 (+8)
Unicode documentation
Code chart ∣ Web page
Note: [1] [2]

Cyrillic Supplement is a Unicode block containing Cyrillic letters for writing several minority languages, including Abkhaz, Kurdish, Komi, Mordvin, Aleut, Azerbaijani, and Jakovlev's Chuvash orthography.

Block

Cyrillic Supplement [1]
Official Unicode Consortium code chart (PDF)
       0 1 2 3 4 5 6 7 8 9 A B C D E F
U+050x Ԁ ԁ Ԃ ԃ Ԅ ԅ Ԇ ԇ Ԉ ԉ Ԋ ԋ Ԍ ԍ Ԏ ԏ
U+051x Ԑ ԑ Ԓ ԓ Ԕ ԕ Ԗ ԗ Ԙ ԙ Ԛ ԛ Ԝ ԝ Ԟ ԟ
U+052x Ԡ ԡ Ԣ ԣ Ԥ ԥ Ԧ ԧ Ԩ ԩ Ԫ ԫ Ԭ ԭ Ԯ ԯ
Notes
1. As of Unicode version 15.1
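
The chart layout above follows directly from the block's range, so it can be regenerated from the two boundary code points. The following is a minimal sketch, assuming only the range U+0500..U+052F stated in this article; the names `BLOCK_START`, `BLOCK_END`, `in_cyrillic_supplement`, and `chart_rows` are illustrative and not part of any Unicode library.

```python
# Block bounds as stated in this article (illustrative constants).
BLOCK_START = 0x0500
BLOCK_END = 0x052F


def in_cyrillic_supplement(ch):
    """Return True if the single character ch lies in U+0500..U+052F."""
    return BLOCK_START <= ord(ch) <= BLOCK_END


def chart_rows():
    """Build one text row per 16 code points, mirroring the chart layout."""
    rows = []
    for base in range(BLOCK_START, BLOCK_END + 1, 16):
        # 0x0500 >> 4 == 0x050, which gives the row label "U+050x".
        label = f"U+{base >> 4:03X}x"
        cells = " ".join(chr(base + offset) for offset in range(16))
        rows.append(f"{label} {cells}")
    return rows


if __name__ == "__main__":
    for row in chart_rows():
        print(row)
```

Because all 48 code points in the block are assigned, the sketch does not need to skip any reserved positions; a block with gaps would need an extra check per cell.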

History

The following Unicode-related documents record the purpose and process of defining specific characters in the Cyrillic Supplement block:

Version Final code points [lower-alpha 1] Count L2 ID WG2 ID Document
3.2 U+0500..050F 16 L2/98-211 N1744 Everson, Michael (1998-05-25), Additional Cyrillic characters for the UCS
L2/98-292R (pdf, html, Figure 1) "2.3", Comments on proposals to add characters from ISO standards developed by ISO/TC 46/SC 4, 1998-08-19
L2/98-292 N1840 "2.3", Comments on proposals to add characters from ISO standards developed by ISO/TC 46/SC 4, 1998-08-25
L2/98-293 N1885 "2.4", Comments on proposals to add various characters to ISO/IEC 10646, 1998-08-25
L2/98-301 N1847 Everson, Michael (1998-09-12), Responses to NCITS/L2 and Unicode Consortium comments on numerous proposals
L2/98-372 N1884R2 (pdf, doc) Whistler, Ken; et al. (1998-09-22), Additional Characters for the UCS
L2/99-010 N1903 (pdf, html, doc) Umamaheswaran, V. S. (1998-12-30), "8.1.5.1", Minutes of WG 2 meeting 35, London, U.K.; 1998-09-21--25
L2/00-182 N2224 Everson, Michael; Ruppel, Klaas; Trosterud, Trond (2000-06-09), Archaic Komi Cyrillic characters for the BMP of the UCS
L2/00-187 Moore, Lisa (2000-08-23), "Komi Cyrillic", UTC minutes -- Boston, August 8-11, 2000
L2/01-050 N2253 Umamaheswaran, V. S. (2001-01-21), "7.15 Komi Cyrillic", Minutes of the SC2/WG2 meeting in Athens, September 2000
5.0 U+0510..0513 4 L2/05-080R2 N2933 Priest, Lorna (2005-08-02), Proposal to Encode Additional Cyrillic Characters (rev 2005/08/18)
L2/05-215 Anderson, Deborah (2005-08-03), Feedback on Cyrillic letters EL WITH HOOK and HA WITH HOOK (L2/05-080)
L2/05-230 Priest, Lorna (2005-08-11), Nameslist annotations for new Cyrillic letters
L2/05-180 Moore, Lisa (2005-08-17), "Cyrillic (C.18)", UTC #104 Minutes
N2953 (pdf, doc) Umamaheswaran, V. S. (2006-02-16), "7.2.4", Unconfirmed minutes of WG 2 meeting 47, Sophia Antipolis, France; 2005-09-12/15
5.1 U+0514..0523 16 L2/98-211 N1744 Everson, Michael (1998-05-25), Additional Cyrillic characters for the UCS
L2/98-301 N1847 Everson, Michael (1998-09-12), Responses to NCITS/L2 and Unicode Consortium comments on numerous proposals
L2/98-372 N1884R2 (pdf, doc) Whistler, Ken; et al. (1998-09-22), Additional Characters for the UCS
L2/99-010 N1903 (pdf, html, doc) Umamaheswaran, V. S. (1998-12-30), "8.1.5.1", Minutes of WG 2 meeting 35, London, U.K.; 1998-09-21--25
L2/01-050 N2253 Umamaheswaran, V. S. (2001-01-21), "7.15 Komi Cyrillic", Minutes of the SC2/WG2 meeting in Athens, September 2000
L2/06-042 Cleminson, Ralph (2006-01-26), Proposal for additional Cyrillic characters
L2/06-181 Anderson, Deborah (2006-05-08), Responses to the UTC regarding L2/06-042, Proposal for Additional Cyrillic Characters
L2/06-359 Cleminson, Ralph (2006-10-31), Proposal for additional Cyrillic characters
L2/07-003 N3194 Everson, Michael; Birnbaum, David; Cleminson, Ralph; Derzhanski, Ivan; Dorosh, Vladislav; Kryukov, Alexey; Paliga, Sorin; Ruppel, Klaas (2007-01-12), Proposal to encode additional Cyrillic characters in the BMP of the UCS
L2/07-055 Cleminson, Ralph (2007-01-19), Comments on Additional Cyrillic Characters (L2/07-003 = WG2 N3194)
L2/07-015 Moore, Lisa (2007-02-08), "Cyrillic (C.13)", UTC #110 Minutes
L2/07-268 N3253 (pdf, doc) Umamaheswaran, V. S. (2007-07-26), "M50.11", Unconfirmed minutes of WG 2 meeting 50, Frankfurt-am-Main, Germany; 2007-04-24/27
5.2 U+0524..0525 2 L2/08-144 N3435R Everson, Michael; Priest, Lorna (2008-04-11), Proposal to encode two Cyrillic characters for Abkhaz
L2/08-134 N3429 Ankwab, Alexander; Bzhanija, Akhara (2008-04-14), Glyph Changes for Abkhaz
L2/08-161R2 Moore, Lisa (2008-11-05), "Motion 115-M2", UTC #115 Minutes
L2/08-412 N3553 (pdf, doc) Umamaheswaran, V. S. (2008-11-05), "M53.11", Unconfirmed minutes of WG 2 meeting 53
6.0 U+0526..0527 2 L2/08-182 N3481 Priest, Lorna (2008-07-28), Proposal to Encode Additional Latin and Cyrillic Characters
L2/08-161R2 Moore, Lisa (2008-11-05), "Consensus 115-C29", UTC #115 Minutes
L2/08-412 N3553 (pdf, doc) Umamaheswaran, V. S. (2008-11-05), "M53.24a", Unconfirmed minutes of WG 2 meeting 53
7.0 U+0528..0529 2 L2/11-265 N4137 Yevlampiev, Ilya; Pentzlin, Karl (2011-07-06), Proposal to encode a missing Cyrillic letter pair for the Orok language
L2/11-261R2 Moore, Lisa (2011-08-16), "Consensus 128-C39", UTC #128 / L2 #225 Minutes
N4253 (pdf, doc) "M59.16h", Unconfirmed minutes of WG 2 meeting 59, 2012-09-12
U+052A..052D 4 L2/12-040 N4199 Everson, Michael (2012-01-31), Proposal to encode four Cyrillic characters in the BMP of the UCS
L2/12-007 Moore, Lisa (2012-02-14), "C.14", UTC #130 / L2 #227 Minutes
N4253 (pdf, doc) "M59.16i", Unconfirmed minutes of WG 2 meeting 59, 2012-09-12
U+052E..052F 2 L2/97-146 N1590 Trosterud, Trond (1997-06-09), Proposal to add 10 Cyrillic Sámi characters to ISO/IEC 10646
L2/97-288 N1603 Umamaheswaran, V. S. (1997-10-24), "8.24.7", Unconfirmed Meeting Minutes, WG 2 Meeting # 33, Heraklion, Crete, Greece, 20 June – 4 July 1997
L2/98-211 N1744 Everson, Michael (1998-05-25), Additional Cyrillic characters for the UCS
L2/98-281R (pdf, html) Aliprand, Joan (1998-07-31), "Cyrillic characters (IV.C.4)", Unconfirmed Minutes – UTC #77 & NCITS Subgroup L2 # 174 JOINT MEETING, Redmond, WA -- July 29-31, 1998
L2/98-301 N1847 Everson, Michael (1998-09-12), Responses to NCITS/L2 and Unicode Consortium comments on numerous proposals
L2/98-372 N1884R2 (pdf, doc) Whistler, Ken; et al. (1998-09-22), Additional Characters for the UCS
L2/99-010 N1903 (pdf, html, doc) Umamaheswaran, V. S. (1998-12-30), "8.1.5.1", Minutes of WG 2 meeting 35, London, U.K.; 1998-09-21--25
L2/01-050 N2253 Umamaheswaran, V. S. (2001-01-21), "7.15 Komi Cyrillic", Minutes of the SC2/WG2 meeting in Athens, September 2000
L2/12-052R N4219R Anderson, Deborah (2012-02-09), Request for 2 New Cyrillic Characters for the Khanty and Nenets Languages
L2/12-007 Moore, Lisa (2012-02-14), "C.14", UTC #130 / L2 #227 Minutes
N4253 (pdf, doc) "M59.16j", Unconfirmed minutes of WG 2 meeting 59, 2012-09-12
  1. Proposed code points and character names may differ from the final code points and names.
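
The per-version counts in the table above can be cross-checked against the cumulative totals in the version history (16, 20, 36, 38, 40, 48). The sketch below is only an arithmetic check, assuming the final code point ranges exactly as listed in the table; `ADDITIONS` and `cumulative_counts` are hypothetical names introduced for illustration.

```python
# (Unicode version, first code point, last code point), copied from the table.
ADDITIONS = [
    ("3.2", 0x0500, 0x050F),
    ("5.0", 0x0510, 0x0513),
    ("5.1", 0x0514, 0x0523),
    ("5.2", 0x0524, 0x0525),
    ("6.0", 0x0526, 0x0527),
    ("7.0", 0x0528, 0x0529),
    ("7.0", 0x052A, 0x052D),
    ("7.0", 0x052E, 0x052F),
]


def cumulative_counts(additions):
    """Map each version to the block's total assigned count as of that version."""
    totals = {}
    running = 0
    for version, first, last in additions:
        running += last - first + 1  # ranges are inclusive
        totals[version] = running    # later rows for the same version overwrite
    return totals


if __name__ == "__main__":
    for version, total in cumulative_counts(ADDITIONS).items():
        print(version, total)
```

The three 7.0 rows add 2, 4, and 2 code points respectively, which accounts for the "+8" shown for that version.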

References

  1. "Unicode character database". The Unicode Standard. Retrieved 2023-07-26.
  2. "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2023-07-26.