Unicode has subscripted and superscripted versions of a number of characters including a full set of Arabic numerals. [1] These characters allow any polynomial, chemical and certain other equations to be represented in plain text without using any form of markup like HTML or TeX.
The World Wide Web Consortium and the Unicode Consortium have made recommendations on the choice between using markup and using superscript and subscript characters:
When used in mathematical context (MathML) it is recommended to consistently use style markup for superscripts and subscripts […] However, when super and sub-scripts are to reflect semantic distinctions, it is easier to work with these meanings encoded in text rather than markup, for example, in phonetic or phonemic transcription. [2]
The intended use [2] when these characters were added to Unicode was to produce true superscripts and subscripts so that chemical and algebraic formulas could be written without markup. Thus "H₂O" (using a subscript 2 character) is supposed to be identical to "H2O" (with subscript markup).
In reality, many fonts that include these characters ignore the Unicode definition, and instead design the digits for mathematical numerator and denominator glyphs, [3] [4] which are aligned with the cap line and the baseline, respectively. When used with the solidus, these glyphs are a common substitute for diagonal fractions, such as ³/₄ for the ¾ glyph. This change was made because using markup does not give a good graphic approximation of fractions (compare markup 3/4 with super/sub-script ³/₄). The change also makes the superscript letters useful for ordinal indicators, more closely matching the ª and º characters. However, it makes them incorrect for normal superscript and subscript, and so chemical and algebraic formulas are better rendered by using markup.
Unicode intended that diagonal fractions be rendered by a different mechanism: the fraction slash U+2044 is visually similar to the solidus, but when used with the ordinary digits (not the superscripts and subscripts), it instructs the layout system that a fraction such as ¾ is to be rendered using automatic glyph substitution. [5] [lower-alpha 1] User-end support was quite poor for a number of years, but fonts, browsers, [lower-alpha 2] word processors [lower-alpha 3] and so on increasingly support the intended Unicode behavior. A selection of supporting fonts is displayed in the table below. (These will not display properly if you do not have the fonts installed, or if your browser does not support this behavior.)
Font | U+00BDVULGAR FRACTION ONE HALF | U+0031DIGIT ONEU+2044FRACTION SLASHU+0032DIGIT TWO |
---|---|---|
Browser default font | ½ | 1⁄2 |
Andika | ½ | 1⁄2 |
Arno Pro | ½ | 1⁄2 |
URW Bookman | ½ | 1⁄2 |
Brill | ½ | 1⁄2 |
Brioso Pro | ½ | 1⁄2 |
Calibri | ½ | 1⁄2 |
Candara | ½ | 1⁄2 |
Carlito | ½ | 1⁄2 |
Cantarell | ½ | 1⁄2 |
FiraGO | ½ | 1⁄2 |
EB Garamond | ½ | 1⁄2 |
Gentium Book | ½ | 1⁄2 |
URW Gothic | ½ | 1⁄2 |
Lato | ½ | 1⁄2 |
Linux Libertine | ½ | 1⁄2 |
Nimbus Roman | ½ | 1⁄2 |
Nimbus Sans | ½ | 1⁄2 |
Noto Sans | ½ | 1⁄2 |
Noto Serif | ½ | 1⁄2 |
Open Sans | ½ | 1⁄2 |
Ubuntu | ½ | 1⁄2 |
Yrsa | ½ | 1⁄2 |
The most common superscript digits (1, 2, and 3) were in ISO-8859-1 and were therefore carried over into those positions in the Latin-1 range of Unicode. The rest were placed in a dedicated section of Unicode at U+2070 to U+209F. The two tables below show these characters. Each superscript or subscript character is preceded by a normal x to show the subscripting/superscripting. The table on the left contains the actual Unicode characters; the one on the right contains the equivalents using HTML markup for the subscript or superscript.
|
|
Unicode version 15.1 also includes subscript and superscript characters that are intended for semantic usage, in the following blocks: [1] [6]
Consolidated, the Unicode standard contains superscript and subscript versions of a subset of Latin, Greek and Cyrillic letters. Here they are arranged in alphabetical order for comparison (or for copy and paste convenience). Since these characters appear in different Unicode ranges, they may not appear to be the same size or position due to font substitution in the browser. Shaded cells mark small capitals that are not very distinct from minuscules, and Greek letters that are indistinguishable from Latin, and so would not be expected to be supported by Unicode.
Little punctuation is encoded. Parentheses and the exclamation mark are shown above. A question mark may be created with a superscript gelded question mark and a combining dot: ⟨ˀ̣⟩, although some fonts do not render it properly.
A | B | C | D | E | F | G | H | I | J | K | L | M | N | O | P | Q | R | S | T | U | V | W | X | Y | Z | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Superscript capital | ᴬ | ᴮ | ꟲ | ᴰ | ᴱ | ꟳ | ᴳ | ᴴ | ᴵ | ᴶ | ᴷ | ᴸ | ᴹ | ᴺ | ᴼ | ᴾ | ꟴ | ᴿ | [8] | ᵀ | ᵁ | ⱽ | ᵂ | |||
Superscript small cap | 𐞄 | 𐞒 | 𐞖 | ᶦ | ᶫ | ᶰ | 𐞪 | ᶸ | 𐞲 | |||||||||||||||||
Superscript minuscule | ᵃ | ᵇ | ᶜ | ᵈ | ᵉ | ᶠ | ᵍ | ʰ | ⁱ | ʲ | ᵏ | ˡ | ᵐ | ⁿ | ᵒ | ᵖ | 𐞥 | ʳ | ˢ | ᵗ | ᵘ | ᵛ | ʷ | ˣ | ʸ | ᶻ |
Overscript capital | ◌ᷛ | ◌ᷞ | ◌ᷟ | ◌ᷡ | ◌ᷢ | |||||||||||||||||||||
Overscript minuscule | ◌ͣ | ◌ᷨ | ◌ͨ | ◌ͩ | ◌ͤ | ◌ᷫ | ◌ᷚ | ◌ͪ | ◌ͥ | ◌ᷜ | ◌ᷝ | ◌ͫ | ◌ᷠ | ◌ͦ | ◌ᷮ | ◌ͬ | ◌ᷤ | ◌ͭ | ◌ͧ | ◌ͮ | ◌ᷱ | ◌ͯ | ◌ᷦ | |||
Subscript minuscule | ₐ | ₑ | ₕ | ᵢ | ⱼ | ₖ | ₗ | ₘ | ₙ | ₒ | ₚ | ᵣ | ₛ | ₜ | ᵤ | ᵥ | ₓ | |||||||||
Underscript minuscule | ◌᷊ | ◌ᪿ |
Α | Β | Γ | Δ | Ε | Ζ | Η | Θ | Ι | Κ | Λ | Μ | Ν | Ξ | Ο | Π | Ρ | Σ | Τ | Υ | Φ | Χ | Ψ | Ω | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Superscript minuscule | [upper-alpha 1] | ᵝ | ᵞ | ᵟ | ᵋ | ᶿ | ᶥ | [upper-alpha 1] | ᵠ | ᵡ | ||||||||||||||
Overscript minuscule | ◌ᷩ | |||||||||||||||||||||||
Subscript minuscule | ᵦ | ᵧ | ᵨ | ᵩ | ᵪ |
А | Ә | Б | В | Г | Ґ | Д | Е | Є | Ж | З | Ѕ | Ꚉ | И | І | Ї | Ј | К | Л | М | Н | О | Ө | П | Р | С | Ҫ | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Superscript | 𞀰 | 𞁋 | 𞀱 | 𞀲 | 𞀳 | 𞀴 | 𞀵 | 𞀶 | 𞀷 | 𞁊 | 𞀸 | 𞁌 | 𞁍 | 𞀹 | 𞀺 | 𞀻 | ᵸ | 𞀼 | 𞁎 | 𞀽 | 𞀾 | 𞀿 | 𞁫 | ||||
Overscript | ◌ⷶ | ◌ⷠ | ◌ⷡ | ◌ⷢ | ◌ⷣ | ◌ⷷ | ◌ꙴ | ◌ⷤ | ◌ⷥ | ◌ꙵ | ◌𞂏 | ◌ꙶ | ◌ⷦ | ◌ⷧ | ◌ⷨ | ◌ⷩ | ◌ⷪ | ◌ⷫ | ◌ⷬ | ◌ⷭ | |||||||
Subscript | 𞁑 | 𞁒 | 𞁓 | 𞁔 | 𞁧 | 𞁕 | 𞁖 | 𞁗 | 𞁘 | 𞁩 | 𞁙 | 𞁨 | 𞁚 | 𞁛 | 𞁜 | 𞁝 | 𞁞 | ||||||||||
Т | У | Ү | Ұ | Ꙋ | Ф | Х | Ѡ | Ц | Ч | Џ | Ш | Щ | Ъ | Ꙑ | Ы | Ь | Ѣ | Э | Ю | Ꙗ | Ѥ | Ѧ | Ѫ | Ѭ | Ѳ | Ӏ | |
Superscript | 𞁀 | 𞁁 | 𞁏 | 𞁭 | 𞁂 | 𞁃 | 𞁄 | 𞁅 | 𞁆 | ꚜ | 𞁬 | 𞁇 | ꚝ | 𞁈 | 𞁉 | 𞁐 | |||||||||||
Overscript | ◌ⷮ | ◌ꙷ | ◌ⷹ | ◌ꚞ | ◌ⷯ | ◌ꙻ | ◌ⷰ | ◌ⷱ | ◌ⷲ | ◌ⷳ | ◌ꙸ | ◌ꙹ | ◌ꙺ | ◌ⷺ | ◌ⷻ | ◌ⷼ | ◌ꚟ | ◌ⷽ | ◌ⷾ | ◌ⷿ | ◌ⷴ | ||||||
Subscript | 𞁟 | 𞁠 | 𞁡 | 𞁢 | 𞁣 | 𞁪 | 𞁤 | 𞁥 | 𞁦 |
Many of the Cyrillic characters were added to the Cyrillic Extended-D block, which was added to the free Gentium Plus and Andika fonts with version 6.2 in February 2023.
See also small caps in Unicode.
The Latin Extended-F block was created for the remaining superscript IPA letters. They were added to the free Gentium Plus and Andika fonts with version 6.2 in February 2023.
The Unicode characters for superscript (modifier) IPA and extIPA consonant letters are as follows. Characters for sounds with secondary articulation are set off in parentheses and placed below the base letters:
Bilabial | Labiodental | Dental | Alveolar | Postalveolar | Retroflex | Palatal | Velar | Uvular | Pharyngeal | Glottal | ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Nasal | m ᵐ 1D50 | ɱ ᶬ 1DAC | n ⁿ 207F | ɳ ᶯ 1DAF | ɲ ᶮ 1DAE | ŋ ᵑ 1D51 | ɴ ᶰ 1DB0 | |||||||||||||||
Plosive | p ᵖ 1D56 | b ᵇ 1D47 | t ᵗ 1D57 | d ᵈ 1D48 | ʈ 𐞯 107AF | ɖ 𐞋 1078B | c ᶜ 1D9C | ɟ ᶡ 1DA1 | k ᵏ 1D4F | ɡ ᶢ/g ᵍ 1DA2/1D4D | q 𐞥 107A5 | ɢ 𐞒 10792 | ʡ 𐞳 107B3 | ʔ ˀ 02C0 | ||||||||
Affricate | ʦ 𐞬 107AC | ʣ 𐞇 10787 | ʧ 𐞮 107AE (ʨ𐞫) 107AB | ʤ 𐞊 1078A (ʥ𐞉) 10789 | ꭧ 𐞭 107AD | ꭦ 𐞈 10788 | ||||||||||||||||
Fricative | ɸ ᶲ 1DB2 | β ᵝ 1D5D | f ᶠ 1DA0 | v ᵛ 1D5B | θ ᶿ 1DBF | ð ᶞ 1D9E | s ˢ 02E2 | z ᶻ 1DBB | ʃ ᶴ 1DB4 (ɕᶝ) 1D9D | ʒ ᶾ 1DBE (ʑᶽ) 1DBD | ʂ ᶳ 1DB3 | ʐ ᶼ 1DBC | ç ᶜ̧ 1D9C + 0327 [lower-alpha 4] | ʝ ᶨ 1DA8 | x ˣ 02E3 (ɧ 𐞗) 10797 | ɣ ˠ 02E0 | χ ᵡ 1D61 | ʁ ʶ 02B6 | ħ 𐞕 10795 (ʩ 𐞐) 10790 | ʕ ˤ 02E4 [lower-alpha 5] | h ʰ 02B0 | ɦ ʱ 02B1 |
Approximant | ʋ ᶹ 1DB9 | ɹ ʴ 02B4 | ɻ ʵ 02B5 | j ʲ 02B2 (ɥ ᶣ) 1DA3 | (ʍ ꭩ) AB69 | ɰ ᶭ 1DAD (w ʷ) 02B7 | ||||||||||||||||
Tap/flap | ⱱ 𐞰 107B0 | ɾ 𐞩 107A9 | ɽ 𐞨 107A8 | |||||||||||||||||||
Trill | ʙ 𐞄 10784 | r ʳ 02B3 | ʀ 𐞪 107AA | ʜ 𐞖 10796 | ʢ 𐞴 107B4 | |||||||||||||||||
Lateral fricative | ɬ 𐞛 1079B (ʪ 𐞙) 10799 | ɮ 𐞞 1079E (ʫ 𐞚) 1079A | ꞎ 𐞝 1079D | 𝼅 𐞟 1079F | 𝼆 𐞡 107A1 | 𝼄 𐞜 1079C | ||||||||||||||||
Lateral approximant | l ˡ 02E1 (ɫ ꭞ) AB5E [lower-alpha 6] | ɭ ᶩ 1DA9 | ʎ 𐞠 107A0 | ʟ ᶫ 1DAB | ||||||||||||||||||
Lateral tap/flap | ɺ 𐞦 107A6 | 𝼈 𐞧 107A7 | ||||||||||||||||||||
Implosive | ƥ – (TBA) | ɓ 𐞅 10785 | ƭ – (TBA) | ɗ 𐞌 1078C | 𝼉 – (TBA) | ᶑ 𐞍 1078D | ƈ – (TBA) | ʄ 𐞘 10798 | ƙ – (TBA) | ɠ 𐞓 10793 | ʠ – (TBA) | ʛ 𐞔 10794 | ||||||||||
Click release [lower-alpha 7] | ʘ 𐞵 107B5 | ǀ 𐞶 107B6 | ʇ 107BB | ǃ ꜝ A71D | ʗ 107BD | 𝼊 𐞹 107B9 | ψ – (TBA) | ǂ 𐞸 107B8 | 𝼋 107BF | (ʞ ) 107BE | ||||||||||||
Lateral click release | ǁ 𐞷 107B7 | ʖ 107BC | ||||||||||||||||||||
Percussive | ¡ ꜞ A71E [lower-alpha 8] |
The spacing diacritic for ejective consonants, U+2BC, works with superscript letters despite not being superscript itself: ⟨ᵖʼ ᵗʼ ᶜʼ ᵏˣʼ⟩. If a distinction needs to be made, the combining apostrophe U+315 may be used: ⟨ᵖ̕ ᵗ̕ ᶜ̕ ᵏˣ̕⟩. The spacing diacritic should be used for a baseline letter with a superscript release, such as [tˢʼ] or [kˣʼ], where the scope of the apostrophe includes the non-superscript letter, but the combining apostrophe U+315 might be used to indicate a weakly articulated ejective consonant like [ᵗ̕] or [ᵏ̕], where the whole consonant is written as a superscript, or together with U+2BC when separate apostrophes have scope over the base and modifier letters, as in ⟨pʼᵏˣ̕⟩. [9]
Spacing diacritics, as in ⟨tʲ⟩, cannot be secondarily superscripted in plain text: ⟨ᵗʲ⟩. (In this instance, the old IPA letter for [tʲ], ⟨ƫ⟩, has a superscript variant in Unicode, U+1DB5 ⟨ᶵ⟩, as does the lateral, U+1DDA ⟨ᶪ⟩, but that is not generally the case.)
Among older letters, ⟨ꜧ⟩ (U+A727), a graphic variant of ⟨ɮ⟩, is supported at ⟨ꭜ⟩ (U+AB5C).
The Unicode characters for superscript (modifier) IPA vowel letters, plus an extended letter found in English dictionaries, are as follows. Most recently retired alternative letters are also supported; they are set off in parentheses and placed below the standard IPA letters:
Front | Central | Back | ||||
---|---|---|---|---|---|---|
Close | i ⁱ 2071 | y ʸ 02B8 | ɨ ᶤ 1DA4 | ʉ ᶶ 1DB6 | ɯ ᵚ 1D5A | u ᵘ 1D58 |
Near-close | ɪ ᶦ 1DA6 (ɩ ᶥ) 1DA5 | ʏ 𐞲 107B2 | ᵻ ᶧ 1DA7 | ʊ ᶷ 1DB7 (ɷ 𐞤) 107A4 | ||
Close-mid | e ᵉ 1D49 | ø 𐞢 107A2 | ɘ 𐞎 1078E | ɵ ᶱ 1DB1 | ɤ 𐞑 10791 | o ᵒ 1D52 |
Mid | ə ᵊ 1D4A | |||||
Open-mid | ɛ ᵋ 1D4B | œ ꟹ A7F9 | ɜ ᶟ 1D9F (ᴈ ᵌ) 1D4C | ɞ 𐞏 1078F | ʌ ᶺ 1DBA | ɔ ᵓ 1D53 |
Near-open | æ 𐞃 10783 | ɶ 𐞣 107A3 | ɐ ᵄ 1D44 | ɑ ᵅ 1D45 | ɒ ᶛ 1D9B | |
Open | a ᵃ 1D43 |
The precomposed Unicode rhotic vowel letters ⟨ɚ ɝ⟩ are not directly supported. The rhotic diacritic U+02DE ◌˞ should be used instead: ⟨ᵊ˞ ᶟ˞⟩. [10]
⟨ɜ⟩ and ⟨ᶟ⟩ are reversed ɛ. The older IPA turned ɛ, ⟨ᴈ⟩, is also supported, at U+1D4C ⟨ᵌ⟩. However, the briefly resurrected vowel letter ⟨ʚ⟩ (U+029A) is not supported, only its reversed replacement ⟨ɞ⟩ is.
Among older vowels, ⟨ᴜ⟩ (U+1D1C), a graphic variant of ⟨ʊ⟩, is supported at ⟨ᶸ⟩ (U+1DB8).
The two length marks are also supported:
Long | Half-long |
---|---|
ː 𐞁 10781 | ˑ 𐞂 10782 |
Superscript wildcards (full caps) are partially supported: e.g. ᴺC (prenasalized consonant), ꟲN (prestopped nasal), Pꟳ (fricative release), NᴾF (epenthetic plosive), CVNᵀ (tone-bearing syllable), Cᴸ (liquid or lateral release), Cᴿ (rhotic or resonant release), Vᴳ (off-glide/diphthong), Cⱽ (fleeting vowel). Superscript S for sibilant release has preliminary approval for Unicode 17 (as ); superscript Ʞ for fleeting/epenthetic click does not. Other basic Latin superscript wildcards for tone and weak indeterminate sounds, as described in the article on the International Phonetic Alphabet, are mostly supported. (See table in previous section.)
In addition, a very few IPA letters beyond the basic Latin alphabet have combining forms or are supported as subscripts:
ɑ | æ | ç | ð | ə | ʃ | ʍ | ʔ | ʼ | |
---|---|---|---|---|---|---|---|---|---|
Overscript | ◌ᷧ | ◌ᷔ | ◌ᷗ | ◌ᷙ | ◌ᷪ | ◌ᷯ | ◌̉ [lower-alpha 9] | ◌̓ | |
Subscript | ₔ | ||||||||
Underscript | ◌ᫀ | ◌̦ |
Primarily for compatibility with earlier character sets, Unicode contains a number of characters that compose super- and subscripts with other symbols. [1] In most fonts these render much better than attempts to construct these symbols from the above characters or by using markup.
A diacritic is a glyph added to a letter or to a basic glyph. The term derives from the Ancient Greek διακριτικός, from διακρίνω. The word diacritic is a noun, though it is sometimes used in an attributive sense, whereas diacritical is only an adjective. Some diacritics, such as the acute ⟨á⟩, grave ⟨à⟩, and circumflex ⟨â⟩, are often called accents. Diacritics may appear above or below a letter or in some other position such as within the letter or between two letters.
The International Phonetic Alphabet (IPA) is an alphabetic system of phonetic notation based primarily on the Latin script. It was devised by the International Phonetic Association in the late 19th century as a standardized representation of speech sounds in written form. The IPA is used by lexicographers, foreign language students and teachers, linguists, speech–language pathologists, singers, actors, constructed language creators, and translators.
In digital typography, combining characters are characters that are intended to modify other characters. The most common combining characters in the Latin script are the combining diacritical marks.
The International Alphabet of Sanskrit Transliteration (IAST) is a transliteration scheme that allows the lossless romanisation of Indic scripts as employed by Sanskrit and related Indic languages. It is based on a scheme that emerged during the 19th century from suggestions by Charles Trevelyan, William Jones, Monier Monier-Williams and other scholars, and formalised by the Transliteration Committee of the Geneva Oriental Congress, in September 1894. IAST makes it possible for the reader to read the Indic text unambiguously, exactly as if it were in the original Indic script. It is this faithfulness to the original scripts that accounts for its continuing popularity amongst scholars.
Gentium is a Unicode serif typeface designed by Victor Gaultney. Gentium fonts are free and open source software, and are released under the SIL Open Font License (OFL), which permits modification and redistribution. Gentium has wide support for languages using the Latin, Greek, and Cyrillic alphabets, and the International Phonetic Alphabet (IPA). Gentium Plus variants released since November 2010 now include over 5,500 glyphs and advanced typographic features through OpenType and formerly Graphite.
Bitstream Cyberbit is a commercial serif Unicode font designed by Bitstream Inc. It is freeware for non-commercial uses. It was one of the first widely available fonts to support a large portion of the Unicode repertoire.
Apple Symbols is a font introduced in Mac OS X 10.3 “Panther”. This is a TrueType font intended to provide coverage for characters defined as symbols in the Unicode Standard. It continues to ship with Mac OS X as part of the default installation. Prior to Mac OS X 10.5, its path was /Library/Fonts/Apple Symbols.ttf
. From Mac OS X 10.5 onward, it is to be found at /System/Library/Fonts/Apple Symbols.ttf
, meaning it is now considered an essential part of the system software, not to be deleted by users.
The Yi scripts are two scripts used to write the Yi languages; Classical Yi, and the later Yi syllabary. The script is historically known in Chinese as Cuan Wen or Wei Shu and various other names (夷字、倮語、倮倮文、畢摩文), among them "tadpole writing" (蝌蚪文).
Diacritical marks of two dots¨, placed side-by-side over or under a letter, are used in a number of languages for several different purposes. The most familiar to English-language speakers are the diaeresis and the umlaut, though there are numerous others. For example, in Albanian, ë represents a schwa. Such diacritics are also sometimes used for stylistic reasons.
Monospace is a monospaced Unicode font, developed by George Williams. It is based on the typeface Courier. This font contains 2860 glyphs. It includes characters in the following unicode ranges: Basic Latin, Latin-1 Supplement, Latin Extended-A, Latin Extended-B, IPA Extensions, Spacing Modifier Letters, Combining Diacritical Marks, Greek, Cyrillic, Hebrew, Latin Extended Additional, Greek Extended, General Punctuation, Superscripts and Subscripts, Currency Symbols, Combining Diacritical Marks for Symbols, Letterlike Symbols, Number Forms, Arrows, Mathematical Operators, Miscellaneous Technical, Control Pictures, Enclosed Alphanumerics, Box Drawing, Block Elements, Geometric Shapes, Miscellaneous Symbols, Alphabetic Presentation Forms, Halfwidth and Fullwidth Forms.
L, or l, is the twelfth letter of the Latin alphabet, used in the modern English alphabet, the alphabets of other western European languages and others worldwide. Its name in English is el, plural els.
Unicode supports several phonetic scripts and notations through its existing scripts and the addition of extra blocks with phonetic characters. These phonetic characters are derived from an existing script, usually Latin, Greek or Cyrillic. Apart from the International Phonetic Alphabet (IPA), extensions to the IPA and obsolete and nonstandard IPA symbols, these blocks also contain characters from the Uralic Phonetic Alphabet and the Americanist Phonetic Alphabet.
Dz is a digraph of the Latin script, consisting of the consonants D and Z. It may represent, , or, depending on the language.
Microsoft Sans Serif is a sans-serif typeface introduced with early Microsoft Windows versions. It is the successor of MS Sans Serif, formerly Helv, a proportional bitmap font introduced in Windows 1.0. Both typefaces are very similar in design to Arial and Helvetica. The typeface was designed to match the MS Sans bitmap included in the early releases of Microsoft Windows.
Unicode equivalence is the specification by the Unicode character encoding standard that some sequences of code points represent essentially the same character. This feature was introduced in the standard to allow compatibility with preexisting standard character sets, which often included similar or identical characters.
In Unicode and the UCS, a compatibility character is a character that is encoded solely to maintain round-trip convertibility with other, often older, standards. As the Unicode Glossary says:
A character that would not have been encoded except for compatibility and round-trip convertibility with other standards
A subscript or superscript is a character that is set slightly below or above the normal line of type, respectively. It is usually smaller than the rest of the text. Subscripts appear at or below the baseline, while superscripts are above. Subscripts and superscripts are perhaps most often used in formulas, mathematical expressions, and specifications of chemical compounds and isotopes, but have many other uses as well.
In typesetting, the hook or tail is a diacritic mark attached to letters in many alphabets. In shape it looks like a hook and it can be attached below as a descender, on top as an ascender and sometimes to the side. The orientation of the hook can change its meaning: when it is below and curls to the left it can be interpreted as a palatal hook, and when it curls to the right is called hook tail or tail and can be interpreted as a retroflex hook. It should not be mistaken with the hook above, a diacritical mark used in Vietnamese, or the rhotic hook, used in the International Phonetic Alphabet.
PragmataPro is a monospaced font family designed for programming, created by Fabrizio Schiavi. It is a narrow programming font designed for legibility. The font implements Unicode characters, including (polytonic) Greek, Cyrillic, Arabic, Hebrew and the APL codepoints. The font specifically implements ligatures for programming, such as multiple-character operators. The characters are hinted by hand.
Latin Extended-F is a Unicode block containing modifier letters, nearly all IPA and extIPA, for phonetic transcription. The Latin Extended-F and -G blocks contain the first Latin characters defined outside of the Basic Multilingual Plane (BMP). They were added to the free Gentium Plus and Andika fonts with version 6.2 in February 2023. Some computers have 𐞃, 𐞎 and 𐞥 supported on the font Calibri.