Spacing Modifier Letters

Last updated
Spacing Modifier Letters
RangeU+02B0..U+02FF
(80 code points)
Plane BMP
Scripts Bopomofo (2 char.)
Latin (14 char.)
Common (64 char.)
Major alphabets IPA
Assigned80 code points
Unused0 reserved code points
Unicode version history
1.0.0 (1991)57 (+57)
3.0 (1999)63 (+6)
4.0 (2003)80 (+17)
Chart
Code chart
Note: [1] [2]

Spacing Modifier Letters is a Unicode block containing characters for the IPA, UPA, and other phonetic transcriptions. Included are the IPA tone marks, and modifiers for aspiration and palatalization. The word spacing indicates that these characters occupy their own horizontal space within a line of text. Its block name in Unicode 1.0 was simply Modifier Letters. [3]

Contents

Character table

CodeGlyph (with guide)DecimalDescription
Latin superscript modifier letters
U+02B0◌ʰʰModifier Letter Small H
U+02B1◌ʱʱModifier Letter Small H with hook
U+02B2◌ʲʲModifier Letter Small J
U+02B3◌ʳʳModifier Letter Small R
U+02B4◌ʴʴModifier Letter Small Turned R
U+02B5◌ʵʵModifier Letter Small Turned R with hook
U+02B6◌ʶʶModifier Letter Small Capital Inverted R
U+02B7◌ʷʷModifier Letter Small W
U+02B8◌ʸʸModifier Letter Small Y
Miscellaneous phonetic modifiers
U+02B9◌ʹʹModifier Letter Prime
U+02BA◌ʺʺModifier Letter Double Prime
U+02BB◌ʻʻModifier Letter Turned Comma
U+02BC◌ʼʼ Modifier Letter Apostrophe
U+02BD◌ʽʽModifier Letter Reversed Comma
U+02BE◌ʾʾ Modifier Letter Right Half Ring
U+02BF◌ʿʿ Modifier Letter Left Half Ring
U+02C0◌ˀˀModifier Letter Glottal Stop
U+02C1◌ˁˁModifier Letter Reversed Glottal Stop
U+02C2◌˂˂Modifier Letter Left Arrowhead
U+02C3◌˃˃Modifier Letter Right Arrowhead
U+02C4◌˄˄Modifier Letter Up Arrowhead
U+02C5◌˅˅Modifier Letter Down Arrowhead
U+02C6◌ˆˆModifier Letter Circumflex Accent
U+02C7◌ˇˇ Caron
U+02C8◌ˈˈModifier Letter Vertical Line
U+02C9◌ˉˉModifier Letter Macron
U+02CA◌ˊˊModifier Letter Acute Accent
U+02CB◌ˋˋModifier Letter Grave Accent
U+02CC◌ˌˌModifier Letter Low Vertical Line
U+02CD◌ˍˍModifier Letter Low Macron
U+02CE◌ˎˎModifier Letter Low Grave Accent
U+02CF◌ˏˏModifier Letter Low Acute Accent
U+02D0◌ːːModifier Letter Triangular Colon
U+02D1◌ˑˑModifier Letter Half Triangular Colon
U+02D2◌˒˒Modifier Letter Centered Right Half Ring
U+02D3◌˓˓Modifier Letter Centered Left Half Ring
U+02D4◌˔˔Modifier Letter Up Tack
U+02D5◌˕˕Modifier Letter Down Tack
U+02D6◌˖˖Modifier Letter Plus Sign
U+02D7◌˗˗Modifier Letter Minus Sign
Spacing clones of diacritics
U+02D8◌˘˘ Breve
U+02D9◌˙˙Dot Above
U+02DA◌˚˚Ring Above
U+02DB◌˛˛ Ogonek
U+02DC◌˜˜Small Tilde
U+02DD◌˝˝Double Acute Accent
Additions based on 1989 IPA
U+02DE◌˞˞Modifier Letter Rhotic Hook
U+02DF◌˟˟Modifier Letter Cross Accent
U+02E0◌ˠˠModifier Letter Small Gamma
U+02E1◌ˡˡModifier Letter Small L
U+02E2◌ˢˢModifier Letter Small S
U+02E3◌ˣˣModifier Letter Small X
U+02E4◌ˤˤModifier Letter Small Reversed Glottal Stop
Tone letters
U+02E5◌˥˥ Modifier Letter Extra-High Tone Bar
U+02E6◌˦˦ Modifier Letter High Tone Bar
U+02E7◌˧˧ Modifier Letter Mid Tone Bar
U+02E8◌˨˨ Modifier Letter Low Tone Bar
U+02E9◌˩˩ Modifier Letter Extra-Low Tone Bar
Extended Bopomofo tone marks
U+02EA◌˪˪Extended Bopomofo Yin Departing (for Minnan and Hakka languages)
U+02EB◌˫˫Extended Bopomofo Yang Departing (for Minnan and Hakka languages)
IPA modifiers
U+02EC◌ˬˬModifier Letter Voicing
U+02ED◌˭˭Modifier Letter Unaspirated
Other modifier letter
U+02EE◌ˮˮ Modifier Letter Double Apostrophe
UPA modifiers
U+02EF◌˯˯Modifier Letter Low Down Arrowhead
U+02F0◌˰˰Modifier Letter Low Up Arrowhead
U+02F1◌˱˱Modifier Letter Low Left Arrowhead
U+02F2◌˲˲Modifier Letter Low Right Arrowhead
U+02F3◌˳˳Modifier Letter Low Ring
U+02F4◌˴˴Modifier Letter Middle Grave Accent
U+02F5◌˵˵Modifier Letter Middle Double Grave Accent
U+02F6◌˶˶Modifier Letter Middle Double Acute Accent
U+02F7◌˷˷Modifier Letter Low Tilde
U+02F8◌˸˸Modifier Letter Raised Colon
U+02F9◌˹˹Modifier Letter Begin High Tone
U+02FA◌˺˺Modifier Letter End High Tone
U+02FB◌˻˻Modifier Letter Begin Low Tone
U+02FC◌˼˼Modifier Letter End Low Tone
U+02FD◌˽˽Modifier Letter Shelf
U+02FE◌˾˾Modifier Letter Open Shelf
U+02FF◌˿˿Modifier Letter Low Left Arrow
CodeGlyphDecimalDescription

Compact table

Spacing Modifier Letters [1]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+02Bx ʰ ʱ ʲ ʳ ʴ ʵ ʶ ʷ ʸ ʹ ʺ ʻ ʼ ʽ ʾ ʿ
U+02Cx ˀ ˁ ˂ ˃ ˄ ˅ ˆ ˇ ˈ ˉ ˊ ˋ ˌ ˍ ˎ ˏ
U+02Dx ː ˑ ˒ ˓ ˔ ˕ ˖ ˗ ˘ ˙ ˚ ˛ ˜ ˝ ˞ ˟
U+02Ex ˠ ˡ ˢ ˣ ˤ ˥ ˦ ˧ ˨ ˩ ˪ ˫ ˬ ˭ ˮ ˯
U+02Fx ˰ ˱ ˲ ˳ ˴ ˵ ˶ ˷ ˸ ˹ ˺ ˻ ˼ ˽ ˾ ˿
Notes
1. ^ As of Unicode version 14.0

History

The following Unicode-related documents record the purpose and process of defining specific characters in the Spacing Modifier Letters block:

See also

Related Research Articles

In digital typography, combining characters are characters that are intended to modify other characters. The most common combining characters in the Latin script are the combining diacritical marks.

Glottal stop (letter) Letter of the Latin alphabet

The character ⟨ʔ⟩, called glottal stop, is an alphabetic letter in some Latin alphabets, most notably in several languages of Canada where it indicates a glottal stop sound. Such usage derives from phonetic transcription, for example the International Phonetic Alphabet (IPA), that use this letter for the glottal stop sound. The letter derives graphically from use of the apostrophe ⟨ʼ⟩ or the symbol ʾ for glottal stop.

Unicode has subscripted and superscripted versions of a number of characters including a full set of Arabic numerals. These characters allow any polynomial, chemical and certain other equations to be represented in plain text without using any form of markup like HTML or TeX.

As of Unicode version 14.0 Cyrillic script is encoded across several blocks, all in the BMP:

The modifier letter apostropheʼ is a letter in Unicode encoding, used primarily for various glottal sounds.

Over a thousand characters from the Latin script are encoded in the Unicode Standard, grouped in several basic and extended Latin blocks. The extended ranges contain mainly precomposed letters plus diacritics that are equivalently encoded with combining diacritics, as well as some ligatures and distinct letters, used for example in the orthographies of various African languages and the Vietnamese alphabet. Latin Extended-C contains additions for Uighur and the Claudian letters. Latin Extended-D comprises characters that are mostly of interest to medievalists. Latin Extended-E mostly comprises characters used for German dialectology (Teuthonista). Latin Extended-F and -G contain characters for phonetic transcription.

Unicode supports several phonetic scripts and notations through the existing writing systems and the addition of extra blocks with phonetic characters. These phonetic extras are derived from an existing script, usually Latin, Greek or Cyrillic. Apart from International Phonetic Alphabet (IPA), extensions to the IPA and obsolete and nonstandard IPA symbols, these blocks also contain characters from the Uralic Phonetic Alphabet and the Americanist Phonetic Alphabet.

<span class="mw-page-title-main">GNU FreeFont</span> Font family

GNU FreeFont is a family of free OpenType, TrueType and WOFF vector fonts, implementing as much of the Universal Character Set (UCS) as possible, aside from the very large CJK Asian character set. The project was initiated in 2002 by Primož Peterlin and is now maintained by Steve White.

Macron below is a combining diacritical mark that is used in various orthographies.

Bopomofo, or Mandarin Phonetic Symbols, also named Zhuyin, is a Chinese transliteration system for Mandarin Chinese and other related languages and dialects. More commonly used in Taiwanese Mandarin, it may also be used to transcribe other varieties of Chinese, particularly other varieties of Mandarin Chinese dialects, as well as Taiwanese Hokkien. Consisting of 37 characters and five tone marks, it transcribes all possible sounds in Mandarin.

The Latin-1 Supplement is the second Unicode block in the Unicode standard. It encodes the upper range of ISO 8859-1: 80 (U+0080) - FF (U+00FF). Controls C1 (0080–009F) are not graphic. This block ranges from U+0080 to U+00FF, contains 128 characters and includes the C1 controls, Latin-1 punctuation and symbols, 30 pairs of majuscule and minuscule accented Latin characters and 2 mathematical operators.

Latin Extended-B is the fourth block (0180-024F) of the Unicode Standard. It has been included since version 1.0, where it was only allocated to the code points 0180-01FF and contained 113 characters. During unification with ISO 10646 for version 1.1, the block range was extended by 80 code points and another 35 characters were assigned. In version 3.0 and later, the last 60 available code points in the block were assigned. Its block name in Unicode 1.0 was Extended Latin.

IPA Extensions is a block (0250–02AF) of the Unicode standard that contains full size letters used in the International Phonetic Alphabet (IPA). Both modern and historical characters are included, as well as former and proposed IPA signs and non-IPA phonetic letters. Additional characters employed for phonetics, like the palatalization sign, are encoded in the blocks Phonetic Extensions (1D00–1D7F) and Phonetic Extensions Supplement (1D80–1DBF). Diacritics are found in the Spacing Modifier Letters (02B0–02FF) and Combining Diacritical Marks (0300–036F) blocks. Its block name in Unicode 1.0 was Standard Phonetic.

Modifier Tone Letters is a Unicode block containing tone markings for Chinese, Chinantec, Africanist, and other phonetic transcriptions. It does not contain the standard IPA tone marks, which are found in Spacing Modifier Letters.

Greek and Coptic is the Unicode block for representing modern (monotonic) Greek. It was originally used for writing Coptic, using the similar Greek letters, in addition to the uniquely Coptic additions. Beginning with version 4.1 of the Unicode Standard, a separate Coptic block has been included in Unicode, allowing for mixed Greek/Coptic text that is stylistically contrastive, as is convention in scholarly works. Writing polytonic Greek requires the use of combining characters or the precomposed vowel + tone characters in the Greek Extended character block.

Superscripts and Subscripts is a Unicode block containing superscript and subscript numerals, mathematical operators, and letters used in mathematics and phonetics. The use of subscripts and superscripts in Unicode allows any polynomial, chemical and certain other equations to be represented in plain text without using any form of markup like HTML or TeX. Other superscript letters can be found in the Spacing Modifier Letters, Phonetic Extensions and Phonetic Extensions Supplement blocks, while the superscript 1, 2, and 3, inherited from ISO 8859-1, were included in the Latin-1 Supplement block.

Bopomofo is a Unicode block containing phonetic characters for Chinese. The original set of 40 Bopomofo characters is based on the Chinese standard GB 2312. Additional Bopomofo characters can be found in the Bopomofo Extended block.

Bopomofo Extended is a Unicode block containing additional Bopomofo characters for writing phonetic Min Nan, Hakka Chinese, Cantonese, Hmu, and Ge. The basic set of Bopomofo characters can be found in the Bopomofo block.

PragmataPro Typeface

PragmataPro is a monospaced font family designed for programming, created by Fabrizio Schiavi. It is a narrow programming font designed for legibility. The font implements Unicode characters, including (polytonic) Greek, Cyrillic, Arabic, Hebrew and the APL codepoints. The font specifically implements ligatures for programming, such as multiple-character operators. The characters are hinted by hand.

Latin Extended-F is a Unicode block containing modifier letters, nearly all IPA and extIPA, for phonetic transcription. The Latin Extended-F and -G blocks contain the first Latin characters defined outside of the Basic Multilingual Plane (BMP).

References

  1. "Unicode character database". The Unicode Standard. Retrieved 2016-07-09.
  2. "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2016-07-09.
  3. "3.8: Block-by-Block Charts" (PDF). The Unicode Standard. version 1.0. Unicode Consortium.