Cyrillic (Unicode block)

Last updated
Cyrillic
RangeU+0400..U+04FF
(256 code points)
Plane BMP
Scripts Cyrillic (254 characters)
Inherited (2 characters)
Major alphabets Russian
Ukrainian
Belarusian
Bulgarian
Serbian
Macedonian
Abkhaz
Assigned256 code points
Unused0 reserved code points
Source standards ISO 8859-5
Unicode version history
1.0.0 (1991)192 (+192)
1.0.1 (1992)188 (-4)
1.1 (1993)226 (+38)
3.0 (1999)238 (+12)
3.2 (2002)246 (+8)
4.1 (2005)248 (+2)
5.0 (2006)255 (+7)
5.1 (2008)256 (+1)
Unicode documentation
Code chart ∣ Web page
Note: Four characters (two upper and lower case letter pairs) were removed from the Cyrillic block in version 1.0.1 during the process of unifying with ISO 10646. [1] [2] [3]

Cyrillic is a Unicode block containing the characters used to write the most widely used languages with a Cyrillic orthography. The core of the block is based on the ISO 8859-5 standard, with additions for minority languages and historic orthographies.

Contents

Block

Cyrillic [1]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+040x Ѐ Ё Ђ Ѓ Є Ѕ І Ї Ј Љ Њ Ћ Ќ Ѝ Ў Џ
U+041x А Б В Г Д Е Ж З И Й К Л М Н О П
U+042x Р С Т У Ф Х Ц Ч Ш Щ Ъ Ы Ь Э Ю Я
U+043x а б в г д е ж з и й к л м н о п
U+044x р с т у ф х ц ч ш щ ъ ы ь э ю я
U+045x ѐ ё ђ ѓ є ѕ і ї ј љ њ ћ ќ ѝ ў џ
U+046x Ѡ ѡ Ѣ ѣ Ѥ ѥ Ѧ ѧ Ѩ ѩ Ѫ ѫ Ѭ ѭ Ѯ ѯ
U+047x Ѱ ѱ Ѳ ѳ Ѵ ѵ Ѷ ѷ Ѹ ѹ Ѻ ѻ Ѽ ѽ Ѿ ѿ
U+048x Ҁ ҁ ҂ ҃ ҄ ҅ ҆ ҇ ҈ ҉ Ҋ ҋ Ҍ ҍ Ҏ ҏ
U+049x Ґ ґ Ғ ғ Ҕ ҕ Җ җ Ҙ ҙ Қ қ Ҝ ҝ Ҟ ҟ
U+04Ax Ҡ ҡ Ң ң Ҥ ҥ Ҧ ҧ Ҩ ҩ Ҫ ҫ Ҭ ҭ Ү ү
U+04Bx Ұ ұ Ҳ ҳ Ҵ ҵ Ҷ ҷ Ҹ ҹ Һ һ Ҽ ҽ Ҿ ҿ
U+04Cx Ӏ Ӂ ӂ Ӄ ӄ Ӆ ӆ Ӈ ӈ Ӊ ӊ Ӌ ӌ Ӎ ӎ ӏ
U+04Dx Ӑ ӑ Ӓ ӓ Ӕ ӕ Ӗ ӗ Ә ә Ӛ ӛ Ӝ ӝ Ӟ ӟ
U+04Ex Ӡ ӡ Ӣ ӣ Ӥ ӥ Ӧ ӧ Ө ө Ӫ ӫ Ӭ ӭ Ӯ ӯ
U+04Fx Ӱ ӱ Ӳ ӳ Ӵ ӵ Ӷ ӷ Ӹ ӹ Ӻ ӻ Ӽ ӽ Ӿ ӿ
Notes
1. ^ As of Unicode version 16.0

List of characters

History

The following Unicode-related documents record the purpose and process of defining specific characters in the Cyrillic block:

Related Research Articles

<span class="mw-page-title-main">Cyrillic script</span> Writing system used for various Eurasian languages

The Cyrillic script, Slavonic script or simply Slavic script is a writing system used for various languages across Eurasia. It is the designated national script in various Slavic, Turkic, Mongolic, Uralic, Caucasian and Iranic-speaking countries in Southeastern Europe, Eastern Europe, the Caucasus, Central Asia, North Asia, and East Asia, and used by many other minority languages.

<span class="mw-page-title-main">Ya (Cyrillic)</span> Cyrillic letter

Ya, Ia or Ja is a letter of the Cyrillic script, the civil script variant of Old Cyrillic Little Yus, and possibly Iotated A. Among modern Slavic languages, it is used in the East Slavic languages and Bulgarian. It is also used in the Cyrillic alphabets used by Mongolian and many Uralic, Caucasian and Turkic languages of the former Soviet Union.

<span class="mw-page-title-main">Russian alphabet</span> Alphabet that uses letters from the Cyrillic script

The Russian alphabet is the script used to write the Russian language. It is derived from the Cyrillic script, which was modified in the 9th century to capture accurately the phonology of the first Slavic literary language, Old Slavonic. Initially an old variant of the Bulgarian alphabet, it was used in Kievan Rus' from the 10th century onward to write what would become the modern Russian language.

<span class="mw-page-title-main">Ghe with upturn</span> Cyrillic letter

Ge or G is a letter of the Cyrillic script. It is part of the Ukrainian alphabet, the Pannonian Rusyn alphabet and both the Carpathian Rusyn alphabets, and also some variants of the Urum and Belarusian alphabets. In these languages it is usually called ge, while the letter it follows, ⟨Г г⟩ is called he.

The European ordering rules define an ordering for strings written in languages that are written with the Latin, Greek and Cyrillic alphabets. The standard covers languages used by the European Union, the European Free Trade Association, and parts of the former Soviet Union. It is a tailoring of the Common Tailorable Template of ISO/IEC 14651. EOR can in turn be tailored for different (European) languages. But in inter-European contexts, EOR can be used without further tailoring.

<span class="mw-page-title-main">Ukrainian alphabet</span> Alphabet that uses letters from the Cyrillic script

The Ukrainian alphabet is the set of letters used to write Ukrainian, which is the official language of Ukraine. It is one of several national variations of the Cyrillic script. It comes from the Cyrillic script, which was devised in the 9th century for the first Slavic literary language, called Old Slavonic. In the 10th century, it became used in Kievan Rus' to write Old East Slavic, from which the Belarusian, Russian, Rusyn, and Ukrainian alphabets later evolved. The modern Ukrainian alphabet has 33 letters in total: 21 consonants, 1 semivowel, 10 vowels and 1 palatalization sign. Sometimes the apostrophe (') is also included, which has a phonetic meaning and is a mandatory sign in writing, but is not considered as a letter and is not included in the alphabet.

<span class="mw-page-title-main">Faux Cyrillic</span> Using Cyrillic letters to represent Latin ones

Faux Cyrillic, pseudo-Cyrillic, pseudo-Russian or faux Russian typography is the use of Cyrillic letters in Latin text, usually to evoke the Soviet Union or Russia, though it may be used in other contexts as well. It is a common Western trope used in book covers, film titles, comic book lettering, artwork for computer games, or product packaging which are set in or wish to evoke Eastern Europe, the Soviet Union, or Russia. A typeface designed to emulate Cyrillic is classed as a mimicry typeface.

<span class="mw-page-title-main">Dje</span> Cyrillic letter used in Serbian

Dje is a letter of the Cyrillic script.

<span class="mw-page-title-main">Kazakh alphabets</span> Alphabets used to write the Kazakh language

Three alphabets are used to write Kazakh: the Cyrillic, Latin and Arabic scripts. The Cyrillic script is used in Kazakhstan, Russia, and Mongolia. An October 2017 Presidential Decree in Kazakhstan ordered that the transition from Cyrillic to a Latin script be completed by 2031. The Arabic script is used in Saudi Arabia, Iran, Afghanistan, and parts of China.

The romanization of Macedonian is the transliteration of text in Macedonian from the Macedonian Cyrillic alphabet into the Latin alphabet. Romanization can be used for various purposes, such as rendering of proper names in foreign contexts, or for informal writing of Macedonian in environments where Cyrillic is not easily available. Official use of romanization by North Macedonia's authorities is found, for instance, on road signage and in passports. Several different codified standards of transliteration currently exist and there is widespread variability in practice.

YUSCII is an informal name for several JUS standards for 7-bit character encoding. These include:

<span class="mw-page-title-main">U with macron (Cyrillic)</span> Cyrillic letter

U with macron is a letter of the Cyrillic script, derived from the Cyrillic letter U (У у У у).

<span class="mw-page-title-main">Yakut scripts</span> Scripts used to write the Yakut language

There are 4 stages in the history of Yakut writing systems:

The Ukrainian orthography is the orthography for the Ukrainian language, a system of generally accepted rules that determine the ways of transmitting speech in writing.

<span class="mw-page-title-main">Cyrillic alphabets</span> Related alphabets based on Cyrillic scripts

Numerous Cyrillic alphabets are based on the Cyrillic script. The early Cyrillic alphabet was developed in the 9th century AD and replaced the earlier Glagolitic script developed by the theologians Cyril and Methodius. It is the basis of alphabets used in various languages, past and present, Slavic origin, and non-Slavic languages influenced by Russian. As of 2011, around 252 million people in Eurasia use it as the official alphabet for their national languages. About half of them are in Russia. Cyrillic is one of the most-used writing systems in the world. The creator is Saint Clement of Ohrid from the Preslav literary school in the First Bulgarian Empire.

JCUKEN is the main Cyrillic keyboard layout for the Russian language in computers and typewriters.

The Komi language, a Uralic language spoken in the north-eastern part of European Russia, has been written in several different alphabets. Currently, Komi writing uses letters from the Cyrillic script. There have been five distinct stages in the history of Komi writing:

Since its inception in the 18th century and up to the present, it is based on the Cyrillic alphabet to write the Udmurt language. Attempts were also made to use the Latin alphabet to write the Udmurt language. In its modern form, the Udmurt alphabet was approved in 1937.

There are several conventions for phonetic transcription using the Cyrillic script, typically augmented with Latin and Greek to fill in missing sounds. The details vary by author, and depend on which letters are available for the language of the text. For instance, in a work written in Ukrainian, ⟨г⟩ may be used for, whereas in Russian texts, ⟨г⟩ is used for. This article follows common Russian usage.

Khakass alphabets are the alphabets used to write the Khakas language.

References

  1. "Unicode 1.0.1 Addendum" (PDF). The Unicode Standard. 1992-11-03. Retrieved 2016-07-09.
  2. "Unicode character database". The Unicode Standard. Retrieved 2023-07-26.
  3. "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2023-07-26.