Latin Extended-B | |
---|---|
Range | U+0180..U+024F (208 code points) |
Plane | BMP |
Scripts | Latin |
Major alphabets | Africa alphabet Americanist Azerbaijani Khoisan Pan-Nigerian Pinyin Romanian |
Assigned | 208 code points |
Unused | 0 reserved code points |
Unicode version history | |
1.0.0 (1991) | 113 (+113) |
1.1 (1993) | 148 (+35) |
3.0 (1999) | 178 (+30) |
3.2 (2002) | 179 (+1) |
4.0 (2003) | 183 (+4) |
4.1 (2005) | 194 (+11) |
5.0 (2006) | 208 (+14) |
Unicode documentation | |
Code chart ∣ Web page | |
Note: Block range was extended by 80 code points in Unicode 1.1 during the unification with ISO 10646. [1] [2] |
Latin Extended-B is the fourth block (0180-024F) of the Unicode Standard. It has been included since version 1.0, where it was only allocated to the code points 0180-01FF and contained 113 characters. During unification with ISO 10646 for version 1.1, the block range was extended by 80 code points and another 35 characters were assigned. In version 3.0 and later, the last 60 available code points in the block were assigned. Its block name in Unicode 1.0 was Extended Latin. [3]
Code | Glyph | Decimal | Description |
---|---|---|---|
Non-European and historic Latin | |||
U+0180 | ƀ | Latin Small Letter B with Stroke | |
U+0181 | Ɓ | Latin Capital Letter B with Hook | |
U+0182 | Ƃ | Latin Capital Letter B with Top Bar | |
U+0183 | ƃ | Latin Small Letter B with Top Bar | |
U+0184 | Ƅ | Latin Capital Letter Tone Six | |
U+0185 | ƅ | Latin Small Letter Tone Six | |
U+0186 | Ɔ | Latin Capital Letter Open O | |
U+0187 | Ƈ | Latin Capital Letter C with Hook | |
U+0188 | ƈ | Latin Small Letter C with Hook | |
U+0189 | Ɖ | Latin Capital Letter African D | |
U+018A | Ɗ | Latin Capital Letter D with Hook | |
U+018B | Ƌ | Latin Capital Letter D with Top Bar | |
U+018C | ƌ | Latin Small Letter D with Top Bar | |
U+018D | ƍ | Latin Small Letter Turned Delta | |
U+018E | Ǝ | Latin Capital Letter Reversed E | |
U+018F | Ə | Latin Capital Letter Schwa | |
U+0190 | Ɛ | Latin Capital Letter Open E (= Latin Capital Letter Epsilon) | |
U+0191 | Ƒ | Latin Capital Letter F with Hook | |
U+0192 | ƒ | Latin Small Letter F with Hook | |
U+0193 | Ɠ | Latin Capital Letter G with Hook | |
U+0194 | Ɣ | Latin Capital Letter Gamma | |
U+0195 | ƕ | Latin Small Letter HV | |
U+0196 | Ɩ | Latin Capital Letter Iota | |
U+0197 | Ɨ | Latin Capital Letter I with Stroke | |
U+0198 | Ƙ | Latin Capital Letter K with Hook | |
U+0199 | ƙ | Latin Small Letter K with Hook | |
U+019A | ƚ | Latin Small Letter L with Bar | |
U+019B | ƛ | Latin Small Letter Lambda with Stroke | |
U+019C | Ɯ | Latin Capital Letter Turned M | |
U+019D | Ɲ | Latin Capital Letter N with Left Hook | |
U+019E | ƞ | Latin Small Letter N with Long Right Leg | |
U+019F | Ɵ | Latin Capital Letter O with Middle Tilde | |
U+01A0 | Ơ | Latin Capital Letter O with Horn | |
U+01A1 | ơ | Latin Small Letter O with Horn | |
U+01A2 | Ƣ | Latin Capital Letter OI (= Latin Capital Letter Gha) | |
U+01A3 | ƣ | Latin Small Letter OI (= Latin Small Letter Gha) | |
U+01A4 | Ƥ | Latin Capital Letter P with Hook | |
U+01A5 | ƥ | Latin Small Letter P with Hook | |
U+01A6 | Ʀ | Latin Letter YR | |
U+01A7 | Ƨ | Latin Capital Letter Tone Two | |
U+01A8 | ƨ | Latin Small Letter Tone Two | |
U+01A9 | Ʃ | Latin Capital Letter Esh | |
U+01AA | ƪ | Latin Letter Reversed Esh Loop | |
U+01AB | ƫ | Latin Small Letter T with Palatal Hook | |
U+01AC | Ƭ | Latin Capital Letter T with Hook | |
U+01AD | ƭ | Latin Small Letter T with Hook | |
U+01AE | Ʈ | Latin Capital Letter T with Retroflex Hook | |
U+01AF | Ư | Latin Capital Letter U with Horn | |
U+01B0 | ư | Latin Small Letter U with Horn | |
U+01B1 | Ʊ | Latin Capital Letter Upsilon | |
U+01B2 | Ʋ | Latin Capital Letter V with Hook | |
U+01B3 | Ƴ | Latin Capital Letter Y with Hook | |
U+01B4 | ƴ | Latin Small Letter Y with Hook | |
U+01B5 | Ƶ | Latin Capital Letter Z with Stroke | |
U+01B6 | ƶ | Latin Small Letter Z with Stroke | |
U+01B7 | Ʒ | Latin Capital Letter Ezh | |
U+01B8 | Ƹ | Latin Capital Letter Ezh Reversed | |
U+01B9 | ƹ | Latin Small Letter Ezh Reversed | |
U+01BA | ƺ | Latin Small Letter Ezh with Tail | |
U+01BB | ƻ | Latin Letter Two with Stroke | |
U+01BC | Ƽ | Latin Capital Letter Tone Five | |
U+01BD | ƽ | Latin Small Letter Tone Five | |
U+01BE | ƾ | Latin Letter Inverted Glottal Stop with Stroke | |
U+01BF | ƿ | Latin Letter Wynn | |
African letters for clicks | |||
U+01C0 | ǀ | Latin Letter Dental Click | |
U+01C1 | ǁ | Latin Letter Lateral Click | |
U+01C2 | ǂ | Latin Letter Alveolar Click | |
U+01C3 | ǃ | Latin Letter Retroflex Click | |
Croatian digraphs matching Serbian Cyrillic letters | |||
U+01C4 | DŽ | Latin Capital Letter DZ with Caron | |
U+01C5 | Dž | Latin Capital Letter D with Small Letter Z with Caron | |
U+01C6 | dž | Latin Small Letter DZ with Caron | |
U+01C7 | LJ | Latin Capital Letter LJ | |
U+01C8 | Lj | Latin Capital Letter L with Small Letter J | |
U+01C9 | lj | Latin Small Letter LJ | |
U+01CA | NJ | Latin Capital Letter NJ | |
U+01CB | Nj | Latin Capital Letter N with Small Letter J | |
U+01CC | nj | Latin Small Letter NJ | |
Pinyin diacritic-vowel combinations | |||
U+01CD | Ǎ | Latin Capital Letter A with Caron | |
U+01CE | ǎ | Latin Small Letter A with Caron | |
U+01CF | Ǐ | Latin Capital Letter I with Caron | |
U+01D0 | ǐ | Latin Small Letter I with Caron | |
U+01D1 | Ǒ | Latin Capital Letter O with Caron | |
U+01D2 | ǒ | Latin Small Letter O with Caron | |
U+01D3 | Ǔ | Latin Capital Letter U with Caron | |
U+01D4 | ǔ | Latin Small Letter U with Caron | |
U+01D5 | Ǖ | Latin Capital Letter U with Diaeresis and Macron | |
U+01D6 | ǖ | Latin Small Letter U with Diaeresis and Macron | |
U+01D7 | Ǘ | Latin Capital Letter U with Diaeresis and Acute | |
U+01D8 | ǘ | Latin Small Letter U with Diaeresis and Acute | |
U+01D9 | Ǚ | Latin Capital Letter U with Diaeresis and Caron | |
U+01DA | ǚ | Latin Small Letter U with Diaeresis and Caron | |
U+01DB | Ǜ | Latin Capital Letter U with Diaeresis and Grave | |
U+01DC | ǜ | Latin Small Letter U with Diaeresis and Grave | |
Phonetic and historic letters | |||
U+01DD | ǝ | Latin Small Letter Turned E | |
U+01DE | Ǟ | Latin Capital Letter A with Diaeresis and Macron | |
U+01DF | ǟ | Latin Small Letter A with Diaeresis and Macron | |
U+01E0 | Ǡ | Latin Capital Letter A with Dot Above and Macron | |
U+01E1 | ǡ | Latin Small Letter A with Dot Above and Macron | |
U+01E2 | Ǣ | Latin Capital Letter AE with Macron | |
U+01E3 | ǣ | Latin Small Letter AE with Macron | |
U+01E4 | Ǥ | Latin Capital Letter G with Stroke | |
U+01E5 | ǥ | Latin Small Letter G with Stroke | |
U+01E6 | Ǧ | Latin Capital Letter G with Caron | |
U+01E7 | ǧ | Latin Small Letter G with Caron | |
U+01E8 | Ǩ | Latin Capital Letter K with Caron | |
U+01E9 | ǩ | Latin Small Letter K with Caron | |
U+01EA | Ǫ | Latin Capital Letter O with Ogonek | |
U+01EB | ǫ | Latin Small Letter O with Ogonek | |
U+01EC | Ǭ | Latin Capital Letter O with Ogonek and Macron (=Latin Capital Letter O with Macron and Ogonek) | |
U+01ED | ǭ | Latin Small Letter O with Ogonek and Macron (=Latin Small Letter O with Macron and Ogonek) | |
U+01EE | Ǯ | Latin Capital Letter Ezh with Caron | |
U+01EF | ǯ | Latin Small Letter Ezh with Caron | |
U+01F0 | ǰ | Latin Small Letter J with Caron | |
U+01F1 | DZ | Latin Capital Letter DZ | |
U+01F2 | Dz | Latin Capital Letter D with Small Letter Z | |
U+01F3 | dz | Latin Small Letter DZ | |
U+01F4 | Ǵ | Latin Capital Letter G with Acute | |
U+01F5 | ǵ | Latin Small Letter G with Acute | |
U+01F6 | Ƕ | Latin Capital Letter Hwair | |
U+01F7 | Ƿ | Latin Capital Letter Wynn | |
U+01F8 | Ǹ | Latin Capital Letter N with Grave | |
U+01F9 | ǹ | Latin Small Letter N with Grave | |
U+01FA | Ǻ | Latin Capital Letter A with Ring Above and Acute | |
U+01FB | ǻ | Latin Small Letter A with Ring Above and Acute | |
U+01FC | Ǽ | Latin Capital Letter AE with Acute | |
U+01FD | ǽ | Latin Small Letter AE with Acute | |
U+01FE | Ǿ | Latin Capital Letter O with Stroke and Acute | |
U+01FF | ǿ | Latin Small Letter O with Stroke and Acute | |
Additions for Slovenian and Croatian | |||
U+0200 | Ȁ | Latin Capital Letter A with Double Grave | |
U+0201 | ȁ | Latin Small Letter A with Double Grave | |
U+0202 | Ȃ | Latin Capital Letter A with Inverted Breve | |
U+0203 | ȃ | Latin Small Letter A with Inverted Breve | |
U+0204 | Ȅ | Latin Capital Letter E with Double Grave | |
U+0205 | ȅ | Latin Small Letter E with Double Grave | |
U+0206 | Ȇ | Latin Capital Letter E with Inverted Breve | |
U+0207 | ȇ | Latin Small Letter E with Inverted Breve | |
U+0208 | Ȉ | Latin Capital Letter I with Double Grave | |
U+0209 | ȉ | Latin Small Letter I with Double Grave | |
U+020A | Ȋ | Latin Capital Letter I with Inverted Breve | |
U+020B | ȋ | Latin Small Letter I with Inverted Breve | |
U+020C | Ȍ | Latin Capital Letter O with Double Grave | |
U+020D | ȍ | Latin Small Letter O with Double Grave | |
U+020E | Ȏ | Latin Capital Letter O with Inverted Breve | |
U+020F | ȏ | Latin Small Letter O with Inverted Breve | |
U+0210 | Ȑ | Latin Capital Letter R with Double Grave | |
U+0211 | ȑ | Latin Small Letter R with Double Grave | |
U+0212 | Ȓ | Latin Capital Letter R with Inverted Breve | |
U+0213 | ȓ | Latin Small Letter R with Inverted Breve | |
U+0214 | Ȕ | Latin Capital Letter U with Double Grave | |
U+0215 | ȕ | Latin Small Letter U with Double Grave | |
U+0216 | Ȗ | Latin Capital Letter U with Inverted Breve | |
U+0217 | ȗ | Latin Small Letter U with Inverted Breve | |
Additions for Romanian | |||
U+0218 | Ș | Latin Capital Letter S with Comma Below | |
U+0219 | ș | Latin Small Letter S with Comma Below | |
U+021A | Ț | Latin Capital Letter T with Comma Below | |
U+021B | ț | Latin Small Letter T with Comma Below | |
Miscellaneous additions | |||
U+021C | Ȝ | Latin Capital Letter Yogh | |
U+021D | ȝ | Latin Small Letter Yogh | |
U+021E | Ȟ | Latin Capital Letter H with Caron | |
U+021F | ȟ | Latin Small Letter H with Caron | |
U+0220 | Ƞ | Latin Capital Letter N with Long Right Leg | |
U+0221 | ȡ | Latin Small Letter D with Curl | |
U+0222 | Ȣ | Latin Capital Letter OU | |
U+0223 | ȣ | Latin Small Letter OU | |
U+0224 | Ȥ | Latin Capital Letter Z with Hook | |
U+0225 | ȥ | Latin Small Letter Z with Hook | |
U+0226 | Ȧ | Latin Capital Letter A with Dot Above | |
U+0227 | ȧ | Latin Small Letter A with Dot Above | |
U+0228 | Ȩ | Latin Capital Letter E with Cedilla | |
U+0229 | ȩ | Latin Small Letter E with Cedilla | |
Additions for Livonian | |||
U+022A | Ȫ | Latin Capital Letter O with Diaeresis and Macron | |
U+022B | ȫ | Latin Small Letter O with Diaeresis and Macron | |
U+022C | Ȭ | Latin Capital Letter O with Tilde and Macron | |
U+022D | ȭ | Latin Small Letter O with Tilde and Macron | |
U+022E | Ȯ | Latin Capital Letter O with Dot Above | |
U+022F | ȯ | Latin Small Letter O with Dot Above | |
U+0230 | Ȱ | Latin Capital Letter O with Dot Above and Macron | |
U+0231 | ȱ | Latin Small Letter O with Dot Above and Macron | |
U+0232 | Ȳ | Latin Capital Letter Y with Macron | |
U+0233 | ȳ | Latin Small Letter Y with Macron | |
Additions for Sinology | |||
U+0234 | ȴ | Latin Small Letter L with Curl | |
U+0235 | ȵ | Latin Small Letter N with Curl | |
U+0236 | ȶ | Latin Small Letter T with Curl | |
Miscellaneous addition | |||
U+0237 | ȷ | Latin Small Letter Dotless J | |
Additions for Africanist linguistics | |||
U+0238 | ȸ | Latin Small Letter DB Digraph | |
U+0239 | ȹ | Latin Small Letter QP Digraph | |
Additions for Sencoten | |||
U+023A | Ⱥ | Latin Capital Letter A with Stroke | |
U+023B | Ȼ | Latin Capital Letter C with Stroke | |
U+023C | ȼ | Latin Small Letter C with Stroke | |
U+023D | Ƚ | Latin Capital Letter L with Bar | |
U+023E | Ⱦ | Latin Capital Letter T with Diagonal Stroke | |
Additions for Africanist linguistics | |||
U+023F | ȿ | Latin Small Letter S with Swash Tail | |
U+0240 | ɀ | Latin Small Letter Z with Swash Tail | |
Miscellaneous additions | |||
U+0241 | Ɂ | Latin Capital Letter Glottal Stop | |
U+0242 | ɂ | Latin Small Letter Glottal Stop | |
U+0243 | Ƀ | Latin Capital Letter B with Stroke | |
U+0244 | Ʉ | Latin Capital Letter U Bar | |
U+0245 | Ʌ | Latin Capital Letter Turned V | |
U+0246 | Ɇ | Latin Capital Letter E with Stroke | |
U+0247 | ɇ | Latin Small Letter E with Stroke | |
U+0248 | Ɉ | Latin Capital Letter J with Stroke | |
U+0249 | ɉ | Latin Small Letter J with Stroke | |
U+024A | Ɋ | Latin Capital Letter Q with Hook Tail | |
U+024B | ɋ | Latin Small Letter Q with Hook Tail | |
U+024C | Ɍ | Latin Capital Letter R with Stroke | |
U+024D | ɍ | Latin Small Letter R with Stroke | |
U+024E | Ɏ | Latin Capital Letter Y with Stroke | |
U+024F | ɏ | Latin Small Letter Y with Stroke |
The Latin Extended-B block contains ten subheadings for groups of characters: Non-European and historic Latin, African letters for clicks, Croatian digraphs matching Serbian Cyrillic letters, Pinyin diacritic-vowel combinations, Phonetic and historic letters, Additions for Slovenian and Croatian, Additions for Romanian, Miscellaneous additions, Additions for Livonian, and Additions for Sinology. The Non-European and historic, African clicks, Croatian digraphs, Pinyin, and the first part of the Phonetic and historic letters were present in Unicode 1.0; additional Phonetic and historic letters were added for version 3.0; and other Phonetic and historic, as well as the rest of the sub-blocks were the characters added for version 1.1.
The Non-European and historic Latin subheading contains the first 64 characters of the block, and includes various variant letters for use in Zhuang, Americanist phonetic transcription, African languages, and other Latin script alphabets. It does not contain any standard letters with diacritics.
The four African letters for clicks are used in Khoisan orthography.
The Croatian digraphs matching Serbian Cyrillic letters are three sets of three case mappings (lower case, upper case, and title case) of Latin digraphs used for compatibility with Cyrillic texts, Serbo-Croatian being a digraphic language.
The 16 Pinyin diacritic-vowel combinations are used to represent the standard Mandarin Chinese vowel sounds with tone marks.
The 35 Phonetic and historic letters are largely various standard and variant Latin letters with diacritic marks.
The 24 Additions for Slovenian and Croatian are all standard Latin letters with unusual diacritics, like the double grave and inverted breve.
The Additions for Romanian are 4 characters that were erroneously unified as having a cedilla, when they have a comma below. The conflation of S and T with cedilla vs. comma below continues to plague Romanian language implementation up to the present. [4]
The Miscellaneous additions subheading contains 39 characters of various description and origin.
The Additions for Livonian are 10 letters with diacritics for writing the Livonian language.
The Additions for Sinology are three lowercase letters with curls used in the study of classical Chinese language.
The Additions for Africanist linguistics are two lowercase letter with swash tails used in Africanist linguistics.
The Additions for Sencoten are 5 letters with strokes for writing Saanich.
This section needs to be updated.(July 2018) |
The following table shows the number of letters in the Latin Extended-B block.
Type of subheading | Number of symbols | Range of characters |
---|---|---|
Non-European and historic Latin | 64 various letters for use in Zhuang, Americanist phonetic transcription, African languages, and other Latin script alphabets. | U+0180 to U+01BF |
African letters for clicks | Four African letters for clicks are used in Khoisan orthography. | U+01C0 to U+01C3 |
Croatian digraphs matching Serbian Cyrillic letters | Three sets of three case mappings (lower case, upper case, and title case) of Latin digraphs used for compatibility with Cyrillic texts. | U+01C4 to U+01CC |
Pinyin diacritic-vowel combinations | Sixteen diacritic-vowel combinations which are used to represent the standard Mandarin Chinese vowel sounds with tone marks. | U+01CD to U+01DC |
Phonetic and historic letters | 35 Phonetic and historic letters which are largely various standard and variant Latin letters with diacritic marks. | U+01DD to U+01FF |
Additions for Slovenian and Croatian | 24 Additions for Slovenian and Croatian are all standard Latin letters with unusual diacritics, like the double grave and inverted breve. | U+0200 to U+0217 |
Additions for Romanian | 4 characters that were erroneously unified as having a cedilla, when they have a comma below. | U+0218 to U+021B |
Miscellaneous additions | 14 characters of various description and origin. | U+021C to U+0229 |
Additions for Livonian | 10 letters with diacritics for writing the Livonian language. | U+022A to U+0233 |
Additions for Sinology | Three lowercase letters with curls used in the study of classical Chinese language. | U+0234 to U+0236 |
Latin Extended-B [1] Official Unicode Consortium code chart (PDF) | ||||||||||||||||
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
U+018x | ƀ | Ɓ | Ƃ | ƃ | Ƅ | ƅ | Ɔ | Ƈ | ƈ | Ɖ | Ɗ | Ƌ | ƌ | ƍ | Ǝ | Ə |
U+019x | Ɛ | Ƒ | ƒ | Ɠ | Ɣ | ƕ | Ɩ | Ɨ | Ƙ | ƙ | ƚ | ƛ | Ɯ | Ɲ | ƞ | Ɵ |
U+01Ax | Ơ | ơ | Ƣ | ƣ | Ƥ | ƥ | Ʀ | Ƨ | ƨ | Ʃ | ƪ | ƫ | Ƭ | ƭ | Ʈ | Ư |
U+01Bx | ư | Ʊ | Ʋ | Ƴ | ƴ | Ƶ | ƶ | Ʒ | Ƹ | ƹ | ƺ | ƻ | Ƽ | ƽ | ƾ | ƿ |
U+01Cx | ǀ | ǁ | ǂ | ǃ | DŽ | Dž | dž | LJ | Lj | lj | NJ | Nj | nj | Ǎ | ǎ | Ǐ |
U+01Dx | ǐ | Ǒ | ǒ | Ǔ | ǔ | Ǖ | ǖ | Ǘ | ǘ | Ǚ | ǚ | Ǜ | ǜ | ǝ | Ǟ | ǟ |
U+01Ex | Ǡ | ǡ | Ǣ | ǣ | Ǥ | ǥ | Ǧ | ǧ | Ǩ | ǩ | Ǫ | ǫ | Ǭ | ǭ | Ǯ | ǯ |
U+01Fx | ǰ | DZ | Dz | dz | Ǵ | ǵ | Ƕ | Ƿ | Ǹ | ǹ | Ǻ | ǻ | Ǽ | ǽ | Ǿ | ǿ |
U+020x | Ȁ | ȁ | Ȃ | ȃ | Ȅ | ȅ | Ȇ | ȇ | Ȉ | ȉ | Ȋ | ȋ | Ȍ | ȍ | Ȏ | ȏ |
U+021x | Ȑ | ȑ | Ȓ | ȓ | Ȕ | ȕ | Ȗ | ȗ | Ș | ș | Ț | ț | Ȝ | ȝ | Ȟ | ȟ |
U+022x | Ƞ | ȡ | Ȣ | ȣ | Ȥ | ȥ | Ȧ | ȧ | Ȩ | ȩ | Ȫ | ȫ | Ȭ | ȭ | Ȯ | ȯ |
U+023x | Ȱ | ȱ | Ȳ | ȳ | ȴ | ȵ | ȶ | ȷ | ȸ | ȹ | Ⱥ | Ȼ | ȼ | Ƚ | Ⱦ | ȿ |
U+024x | ɀ | Ɂ | ɂ | Ƀ | Ʉ | Ʌ | Ɇ | ɇ | Ɉ | ɉ | Ɋ | ɋ | Ɍ | ɍ | Ɏ | ɏ |
Notes
|
The following Unicode-related documents record the purpose and process of defining specific characters in the Latin Extended-B block:
Version | Final code points [lower-alpha 1] | Count | UTC ID | L2 ID | WG2 ID | Document |
---|---|---|---|---|---|---|
1.0.0 | U+0180..01EF | 112 | (to be determined) | |||
X3L2/94-077 | N994 | Davis, Mark (1994-03-03), ISO/IEC 10646-1 - Proposed Draft Corrigendum 1 | ||||
X3L2/94-098 | N1033 (pdf, doc) | Umamaheswaran, V. S.; Ksar, Mike (1994-06-01), "8.1.15", Unconfirmed Minutes of ISO/IEC JTC 1/SC 2/WG 2 Meeting 25, Falez Hotel, Antalya, Turkey, 1994-04-18--22 | ||||
N1105 | Everson, Michael (1994-10-11), Defect Report - Character names in Latin Extended-B | |||||
N1162 | Everson, Michael (1995-03-10), Input document to WG2 N 1105 (Defect Report - Character names in Latin Extended-B | |||||
L2/04-399 | N2859 | LaBonté, Alain (2004-10-19), Correcting reference glyph for U+01B3 LATIN CAPITAL LETTER Y WITH HOOK | ||||
L2/05-137 | Freytag, Asmus (2005-05-10), Handling "defective" names | |||||
L2/05-108R | Moore, Lisa (2005-08-26), "Consensus 103-C7", UTC #103 Minutes, Create a "Normative Name Alias" property and file in the UCD. Populate the property with names from the sections "Typos" and "Bad or misleading names" from document L2/05-137. | |||||
L2/06-157 | McGowan, Rick (2006-05-12), "Proposed Additional Name Aliases (BETA FEEDBACK) [U+01A2, U+01A2]", Comments on Public Review Issues (January 31, 2006 - May 12, 2006) | |||||
L2/06-108 | Moore, Lisa (2006-05-25), "B.11.10 [U+01A2, U+01A2]", UTC #107 Minutes | |||||
L2/19-201 | Zhong, Juechen (2019-05-14), Proposal to correct LATIN SMALL LETTER TONE SIX (U+0185) | |||||
L2/19-286 | Anderson, Deborah; Whistler, Ken; Pournader, Roozbeh; Moore, Lisa; Liang, Hai (2019-07-22), "a. LATIN SMALL LETTER TONE SIX", Recommendations to UTC #160 July 2019 on Script Proposals | |||||
L2/21-083 | Jiang, Kushim (2021-02-03), Proposal to provide the glyphs of Latin letter tone six | |||||
L2/21-073 | Anderson, Deborah; Whistler, Ken; Pournader, Roozbeh; Moore, Lisa; Liang, Hai (2021-04-23), "1a. Latin Letter Tone Six", Recommendations to UTC #167 April 2021 on Script Proposals | |||||
U+01F0 | 1 | UTC/1991-047 | Becker, Joe, Extended Latin, Standard Phonetic, Modifier Letters, General Diacritical Marks, Greek, Cyrillic | |||
UTC/1991-048B | Whistler, Ken (1991-03-27), "IPA additions", Draft Minutes from the UTC meeting #46 day 2, 3/27 at Apple | |||||
1.1 | U+01F1..01F5, 01FA..0217 | 35 | (to be determined) | |||
X3L2/94-077 | N994 | Davis, Mark (1994-03-03), ISO/IEC 10646-1 - Proposed Draft Corrigendum 1 | ||||
X3L2/94-098 | N1033 (pdf, doc) | Umamaheswaran, V. S.; Ksar, Mike (1994-06-01), "8.1.15", Unconfirmed Minutes of ISO/IEC JTC 1/SC 2/WG 2 Meeting 25, Falez Hotel, Antalya, Turkey, 1994-04-18--22 | ||||
3.0 | U+01F6..01F7, 021C..021D | 4 | N1166 | Everson, Michael (1995-03-10), Proposal Summary for HWAIR and WYNN | ||
L2/97-117 | N1547 | Everson, Michael (1997-04-14), Reminder about 4 medieval English Latin characters | ||||
L2/97-119 | N1549 | Everson, Michael (1997-04-20), On the derivation of YOGH and EZH | ||||
L2/97-288 | N1603 | Umamaheswaran, V. S. (1997-10-24), "8.19", Unconfirmed Meeting Minutes, WG 2 Meeting # 33, Heraklion, Crete, Greece, 20 June – 4 July 1997 | ||||
L2/98-004R | N1681 | Text of ISO 10646 – AMD 18 for PDAM registration and FPDAM ballot, 1997-12-22 | ||||
L2/98-318 | N1894 | Revised text of 10646-1/FPDAM 18, AMENDMENT 18: Symbols and Others, 1998-10-22 | ||||
U+01F8..01F9 | 2 | X3L2/95-112 | N1282 | Two Pinyin letters with Tone Mark missing, 1995-11-07 | ||
N1303 (html, doc) | Umamaheswaran, V. S.; Ksar, Mike (1996-01-26), "8.12 Missing Pinyin characters", Minutes of Meeting 29, Tokyo | |||||
X3L2/96-033 | N1355 | Suignard, Michel (1996-04-17), US response to WG 2 resolution 29.10 at the meeting in Tokyo | ||||
N1353 | Umamaheswaran, V. S.; Ksar, Mike (1996-06-25), "8.10", Draft minutes of WG2 Copenhagen Meeting # 30 | |||||
N1461 | Pinyin proposal summary form, 1996-08-15 | |||||
N1453 | Ksar, Mike; Umamaheswaran, V. S. (1996-12-06), "8.3.4", WG 2 Minutes - Quebec Meeting 31 | |||||
UTC/1996-xxx | Greenfield, Steve (1996-12-13), "Motion #70-6", Action Items & Resolutions Generated at UTC #70 | |||||
L2/98-004R | N1681 | Text of ISO 10646 – AMD 18 for PDAM registration and FPDAM ballot, 1997-12-22 | ||||
L2/98-318 | N1894 | Revised text of 10646-1/FPDAM 18, AMENDMENT 18: Symbols and Others, 1998-10-22 | ||||
U+0218..021B | 4 | UTC/1991-048B | Whistler, Ken (1991-03-27), "7 Add Roman S cedilla, Roman S comma to extended Latin", Draft Minutes from the UTC meeting #46 day 2, 3/27 at Apple | |||
X3L2/96-053 | N1361 | Proposal for addition of Latin characters, 1995-04-22 | ||||
N1353 | Umamaheswaran, V. S.; Ksar, Mike (1996-06-25), "8.2.3", Draft minutes of WG2 Copenhagen Meeting # 30 | |||||
UTC/1996-027.2 | Greenfield, Steve (1996-07-01), "I. Additional Latin Characters", UTC #69 Minutes (PART 2) | |||||
N1440 | Confirmation of request for 4 additional Latin characters used in Romania - reference document WG2 N 1361, 1996-07-09 | |||||
X3L2/96-109 | N1507 | Suignard, Michel (1996-12-06), Comment on Romanian issues concerning characters S and T with Cedilla/Comma | ||||
N1453 | Ksar, Mike; Umamaheswaran, V. S. (1996-12-06), "8.3.1 Four Romanian characters", WG 2 Minutes - Quebec Meeting 31 | |||||
X3L2/96-123 | Aliprand, Joan; Winkler, Arnold (1996-12-18), "4.6 Romanian", Preliminary Minutes - UTC #71 & X3L2 #168 ad hoc meeting, San Diego - December 5-6, 1996 | |||||
L2/97-030 | N1503 (pdf, doc) | Umamaheswaran, V. S.; Ksar, Mike (1997-04-01), "8.11 Romanian - 4 additional characters", Unconfirmed Minutes of WG 2 Meeting #32, Singapore; 1997-01-20--24 | ||||
L2/97-288 | N1603 | Umamaheswaran, V. S. (1997-10-24), "8.24.5", Unconfirmed Meeting Minutes, WG 2 Meeting # 33, Heraklion, Crete, Greece, 20 June – 4 July 1997 | ||||
L2/98-004R | N1681 | Text of ISO 10646 – AMD 18 for PDAM registration and FPDAM ballot, 1997-12-22 | ||||
L2/98-318 | N1894 | Revised text of 10646-1/FPDAM 18, AMENDMENT 18: Symbols and Others, 1998-10-22 | ||||
L2/99-010 | N1903 (pdf, html, doc) | Umamaheswaran, V. S. (1998-12-30), Minutes of WG 2 meeting 35, London, U.K.; 1998-09-21--25 | ||||
U+021E..021F | 2 | L2/97-164 | Proposal to add 2 Latin characters, 1997-07-01 | |||
L2/98-062 | N1619 | Kolehmainen, Erkki I.; Ruppel, Klaas (1997-07-01), Proposal to add 2 Latin Characters to ISO/IEC 10646 | ||||
L2/98-070 | Aliprand, Joan; Winkler, Arnold, "4.C.3", Minutes of the joint UTC and L2 meeting from the meeting in Cupertino, February 25-27, 1998 | |||||
L2/98-286 | N1703 | Umamaheswaran, V. S.; Ksar, Mike (1998-07-02), "8.3", Unconfirmed Meeting Minutes, WG 2 Meeting #34, Redmond, WA, USA; 1998-03-16--20 | ||||
L2/98-321 | N1905 | Revised text of 10646-1/FPDAM 23, AMENDMENT 23: Bopomofo Extended and other characters, 1998-10-22 | ||||
U+0222..0225 | 4 | L2/98-208 | N1741 | Everson, Michael (1998-05-25), Additional Latin characters for the UCS | ||
L2/98-281R (pdf, html) | Aliprand, Joan (1998-07-31), "Additional Latin Characters (IV.C.1)", Unconfirmed Minutes – UTC #77 & NCITS Subgroup L2 # 174 JOINT MEETING, Redmond, WA -- July 29-31, 1998 | |||||
L2/98-292R (pdf, html, Figure 1) | "2.1", Comments on proposals to add characters from ISO standards developed by ISO/TC 46/SC 4, 1998-08-19 | |||||
L2/98-292 | N1840 | "2.1", Comments on proposals to add characters from ISO standards developed by ISO/TC 46/SC 4, 1998-08-25 | ||||
L2/98-293 | N1885 | "2.1", Comments on proposals to add various characters to ISO/IEC 10646, 1998-08-25 | ||||
L2/98-301 | N1847 | Everson, Michael (1998-09-12), Responses to NCITS/L2 and Unicode Consortium comments on numerous proposals | ||||
L2/98-372 | N1884R2 (pdf, doc) | Whistler, Ken; et al. (1998-09-22), Additional Characters for the UCS | ||||
L2/98-329 | N1920 | Combined PDAM registration and consideration ballot on WD for ISO/IEC 10646-1/Amd. 30, AMENDMENT 30: Additional Latin and other characters, 1998-10-28 | ||||
L2/99-010 | N1903 (pdf, html, doc) | Umamaheswaran, V. S. (1998-12-30), "8.1.5.1", Minutes of WG 2 meeting 35, London, U.K.; 1998-09-21--25 | ||||
U+0226..0229 | 4 | L2/98-357 | N1838 (html, doc) | Davis, Mark (1998-09-15), Proposal to add four binary completion letters to the BMP | ||
L2/98-329 | N1920 | Combined PDAM registration and consideration ballot on WD for ISO/IEC 10646-1/Amd. 30, AMENDMENT 30: Additional Latin and other characters, 1998-10-28 | ||||
L2/99-010 | N1903 (pdf, html, doc) | Umamaheswaran, V. S. (1998-12-30), "8.2.3", Minutes of WG 2 meeting 35, London, U.K.; 1998-09-21--25 | ||||
U+022A..0233 | 10 | X3L2/95-126 | N1322 | Everson, Michael; Ruppel, Klaas; Mētra, Imants (1995-11-01), Proposal for addition of Latin characters for Livonian | ||
N1353 | Umamaheswaran, V. S.; Ksar, Mike (1996-06-25), "8.2.1", Draft minutes of WG2 Copenhagen Meeting # 30 | |||||
N1453 | Ksar, Mike; Umamaheswaran, V. S. (1996-12-06), "8.3.2 Livonian characters", WG 2 Minutes - Quebec Meeting 31 | |||||
L2/98-375 | N1888 | Blumberga, G. (1998-09-11), On inclusion of additional Livonian characters in the UCS | ||||
L2/98-329 | N1920 | Combined PDAM registration and consideration ballot on WD for ISO/IEC 10646-1/Amd. 30, AMENDMENT 30: Additional Latin and other characters, 1998-10-28 | ||||
L2/99-010 | N1903 (pdf, html, doc) | Umamaheswaran, V. S. (1998-12-30), "8.1.5.2", Minutes of WG 2 meeting 35, London, U.K.; 1998-09-21--25 | ||||
3.2 | U+0220 | 1 | L2/00-423 | N2306R | Everson, Michael (2000-11-29), Proposal to add LATIN CAPITAL LETTER N WITH LONG RIGHT LEG to the UCS | |
L2/01-012R | Moore, Lisa (2001-05-21), "Motion 86-M31", Minutes UTC #86 in Mountain View, Jan 2001, Accept the addition of the character LATIN CAPITAL LETTER N WITH LONG RIGHT LEG at U+0220. | |||||
L2/01-344 | N2353 (pdf, doc) | Umamaheswaran, V. S. (2001-09-09), "7.3", Minutes from SC2/WG2 meeting #40 -- Mountain View, April 2001 | ||||
4.0 | U+0221, 0234..0236 | 4 | L2/01-272 | N2366 | Cook, Richard; Everson, Michael (2001-07-02), Proposal to add five phonetic characters to the UCS | |
L2/01-295R | Moore, Lisa (2001-11-06), Minutes from the UTC/L2 meeting #88 | |||||
L2/01-347 | N2366R | Cook, Richard; Everson, Michael (2001-09-20), Proposal to add six phonetic characters to the UCS | ||||
L2/01-420 | Whistler, Ken (2001-10-30), "c. Phonetic letter additions", WG2 (Singapore) Resolution Consent Docket for UTC | |||||
L2/01-405R | Moore, Lisa (2001-12-12), "Consensus 89-C22", Minutes from the UTC/L2 meeting in Mountain View, November 6-9, 2001 | |||||
L2/02-154 | N2403 | Umamaheswaran, V. S. (2002-04-22), Draft minutes of WG 2 meeting 41, Hotel Phoenix, Singapore, 2001-10-15/19 | ||||
4.1 | U+0237 | 1 | L2/03-194 | N2590 | Freytag, Asmus (2003-06-09), Additional Mathematical and Letterlike Characters | |
U+0238..0239, 023C | 3 | L2/03-190R | Constable, Peter (2003-06-08), Proposal to Encode Additional Phonetic Symbols in the UCS | |||
L2/04-047 | Constable, Peter (2004-02-01), Revised Proposal to Encode Additional Phonetic Symbols in the UCS | |||||
L2/04-132 | N2740 | Constable, Peter (2004-04-19), Proposal to add additional phonetic characters to the UCS | ||||
L2/04-145 | Starner, David (2004-04-30), C with stroke character examples from BAE report 1884 (Dorsey) | |||||
L2/04-003R | Moore, Lisa (2004-05-17), "Additional Phonetic Symbols (B.14.13)", UTC #98 Minutes | |||||
L2/04-173 | McGowan, Rick (2004-05-28), "35", Comments on Public Review Issues (Feb 5 - May 28, 2004) | |||||
L2/04-202 | Anderson, Deborah (2004-06-07), Slashed C Feedback | |||||
L2/04-156R2 | Moore, Lisa (2004-08-13), "SENĆOŦEN Latin characters (A.19) and Encoding LATIN SMALL LETTER C WITH STROKE (C.12.11)", UTC #99 Minutes | |||||
U+023A..023B, 023D..023E | 4 | L2/04-170 | N2784R | Elliot, John; Brand, Peter; Harvey, Chris (2004-05-05), Proposal to Add Four SENĆOŦEN Latin Charaters [sic] | ||
L2/04-182 | Anderson, Deborah (2004-06-03), Message in support of SENCOTEN proposal | |||||
L2/04-211 | Anderson, Deborah (2004-06-07), Further Information on SENCOTEN | |||||
L2/04-156R2 | Moore, Lisa (2004-08-13), "SENĆOŦEN Latin characters (A.19)", UTC #99 Minutes | |||||
U+023F..0240 | 2 | L2/03-190R | Constable, Peter (2003-06-08), Proposal to Encode Additional Phonetic Symbols in the UCS | |||
L2/04-242 | N2799 | Everson, Michael; Constable, Peter (2004-06-12), Proposal to add two Africanist phonetic characters | ||||
U+0241 | 1 | L2/04-065 | Constable, Peter (2004-02-01), Proposal to Encode Orthographic Glottal Stops in the UCS | |||
L2/04-224 | N2789 | Kew, Jonathan; Constable, Peter (2004-06-08), Revised Proposal to Encode Orthographic Glottal Stops in the UCS | ||||
L2/05-072 | Constable, Peter (2005-02-09), Case mapping and glottal stops | |||||
5.0 | U+0242 | 1 | L2/04-065 | Constable, Peter (2004-02-01), Proposal to Encode Orthographic Glottal Stops in the UCS | ||
L2/05-057 | Constable, Peter (2005-02-02), Proposed Principles for Character Disunifications | |||||
L2/05-076 | Davis, Mark (2005-02-10), Stability of Case Folding | |||||
L2/05-224 | Priest, Lorna (2005-08-08), Letter in Support of L2/05-194 | |||||
L2/05-194R | N2962R | Everson, Michael; Harvey, Chris (2005-08-10), Proposal to add LATIN SMALL LETTER GLOTTAL STOP to the UCS | ||||
N2942 | Freytag, Asmus; Whistler, Ken (2005-08-12), Proposal to add nine lowercase characters | |||||
L2/05-180 | Moore, Lisa (2005-08-17), "C.16", UTC #104 Minutes | |||||
L2/05-108R | Moore, Lisa (2005-08-26), "Stability of Case Folding (B.14.2)", UTC #103 Minutes | |||||
N2953 (pdf, doc) | Umamaheswaran, V. S. (2006-02-16), "8.2.2", Unconfirmed minutes of WG 2 meeting 47, Sophia Antipolis, France; 2005-09-12/15 | |||||
U+0243..024F | 13 | L2/04-246R | Priest, Lorna (2004-07-26), Revised Proposal for Additional Latin Phonetic and Orthographic Characters | |||
L2/04-316 | Moore, Lisa (2004-08-19), "C.6", UTC #100 Minutes | |||||
L2/04-348 | N2906 | Priest, Lorna (2004-08-23), Revised Proposal for Additional Latin Phonetic and Orthographic Characters | ||||
N2953 (pdf, doc) | Umamaheswaran, V. S. (2006-02-16), "M47.5a", Unconfirmed minutes of WG 2 meeting 47, Sophia Antipolis, France; 2005-09-12/15 | |||||
|
A diacritic is a glyph added to a letter or to a basic glyph. The term derives from the Ancient Greek διακριτικός, from διακρίνω. The word diacritic is a noun, though it is sometimes used in an attributive sense, whereas diacritical is only an adjective. Some diacritics, such as the acute ⟨ó⟩, grave ⟨ò⟩, and circumflex ⟨ô⟩, are often called accents. Diacritics may appear above or below a letter or in some other position such as within the letter or between two letters.
The ogonek is a diacritic hook placed under the lower right corner of a vowel in the Latin alphabet used in several European languages, and directly under a vowel in several Native American languages. It is also placed on the lower right corner of consonants in some Latin transcriptions of various indigenous languages of the Caucasus mountains.
A cedilla, or cedille, is a hook or tail added under certain letters as a diacritical mark to modify their pronunciation. In Catalan, French, and Portuguese it is used only under the letter c, and the entire letter is called, respectively, c trencada, c cédille, and c cedilhado. It is used to mark vowel nasalization in many languages of Sub-Saharan Africa, including Vute from Cameroon.
A caron is a diacritic mark commonly placed over certain letters in the orthography of some languages to indicate a change of the related letter's pronunciation.
The Polish alphabet is the script of the Polish language, the basis for the Polish system of orthography. It is based on the Latin alphabet but includes certain letters (9) with diacritics: the acute accent ; the overdot ; the tail or ogonek ; and the stroke. ⟨q⟩, ⟨v⟩, and ⟨x⟩, which are used only in foreign words, are usually absent from the Polish alphabet. Additionally, before the standardization of Polish spelling, ⟨qu⟩ was sometimes used in place of ⟨kw⟩, and ⟨x⟩ in place of ⟨ks⟩.
In writing and typography, a ligature occurs where two or more graphemes or letters are joined to form a single glyph. Examples are the characters ⟨æ⟩ and ⟨œ⟩ used in English and French, in which the letters ⟨a⟩ and ⟨e⟩ are joined for the first ligature and the letters ⟨o⟩ and ⟨e⟩ are joined for the second ligature. For stylistic and legibility reasons, ⟨f⟩ and ⟨i⟩ are often merged to create ⟨fi⟩ ; the same is true of ⟨s⟩ and ⟨t⟩ to create ⟨st⟩. The common ampersand, ⟨&⟩, developed from a ligature in which the handwritten Latin letters ⟨e⟩ and ⟨t⟩ were combined.
A digraph or digram is a pair of characters used in the orthography of a language to write either a single phoneme, or a sequence of phonemes that does not correspond to the normal values of the two characters combined.
The Romanian alphabet is a variant of the Latin alphabet used for writing the Romanian language. It is a modification of the classical Latin alphabet and consists of 31 letters, five of which have been modified from their Latin originals for the phonetic requirements of the language.
T-comma is a letter which consists of a t with a diacritical comma underneath it, and is distinct from t-cedilla. It is part of the Romanian alphabet, used to represent the Romanian language sound, the voiceless alveolar affricate. The letter is also a part of the Finno-Ugric Livonian language alphabet, representing the sound.
Unicode has subscripted and superscripted versions of a number of characters including a full set of Arabic numerals. These characters allow any polynomial, chemical and certain other equations to be represented in plain text without using any form of markup like HTML or TeX.
Diacritical marks of two dots¨, placed side-by-side over or under a letter, are used in several languages for several different purposes. The most familiar to English-language speakers are the diaeresis and the umlaut, though there are numerous others. For example, in Albanian, ë represents a schwa. Such diacritics are also sometimes used for stylistic reasons.
C, or c, is the third letter of the Latin alphabet, used in the modern English alphabet, the alphabets of other western European languages and others worldwide. Its name in English is cee, plural cees.
Over a thousand characters from the Latin script are encoded in the Unicode Standard, grouped in several basic and extended Latin blocks. The extended ranges contain mainly precomposed letters plus diacritics that are equivalently encoded with combining diacritics, as well as some ligatures and distinct letters, used for example in the orthographies of various African languages and the Vietnamese alphabet. Latin Extended-C contains additions for Uighur and the Claudian letters. Latin Extended-D comprises characters that are mostly of interest to medievalists. Latin Extended-E mostly comprises characters used for German dialectology (Teuthonista). Latin Extended-F and -G contain characters for phonetic transcription.
Unicode supports several phonetic scripts and notation systems through its existing scripts and the addition of extra blocks with phonetic characters. These phonetic characters are derived from an existing script, usually Latin, Greek or Cyrillic. Apart from the International Phonetic Alphabet (IPA), extensions to the IPA and obsolete and nonstandard IPA symbols, these blocks also contain characters from the Uralic Phonetic Alphabet and the Americanist Phonetic Alphabet.
The Latin script, also known as the Roman script, is a writing system based on the letters of the classical Latin alphabet, derived from a form of the Greek alphabet which was in use in the ancient Greek city of Cumae in Magna Graecia. The Greek alphabet was altered by the Etruscans, and subsequently their alphabet was altered by the Ancient Romans. Several Latin-script alphabets exist, which differ in graphemes, collation and phonetic values from the classical Latin alphabet.
Yi Syllables is a Unicode block containing the 1,165 characters of the Liangshan Standard Yi script for writing the Nuosu language.
D-comma is a letter that was part of the Romanian alphabet to represent the sound or if it was derived from a Latin d. It was the equivalent of the Cyrillic letters З and Ѕ.
IPA Extensions is a block (U+0250–U+02AF) of the Unicode standard that contains full size letters used in the International Phonetic Alphabet (IPA). Both modern and historical characters are included, as well as former and proposed IPA signs and non-IPA phonetic letters. Additional characters employed for phonetics, like the palatalization sign, are encoded in the blocks Phonetic Extensions (1D00–1D7F) and Phonetic Extensions Supplement (1D80–1DBF). Diacritics are found in the Spacing Modifier Letters (02B0–02FF) and Combining Diacritical Marks (0300–036F) blocks. Its block name in Unicode 1.0 was Standard Phonetic.
The ISO basic Latin alphabet is an international standard for a Latin-script alphabet that consists of two sets of 26 letters, codified in various national and international standards and used widely in international communication. They are the same letters that comprise the current English alphabet. Since medieval times, they are also the same letters of the modern Latin alphabet. The order is also important for sorting words into alphabetical order.