Latin Extended-E | |
---|---|
Range | U+AB30..U+AB6F (64 code points) |
Plane | BMP |
Scripts | Latin (56 char.) Greek (1 char.) Common (3 char.) |
Major alphabets | German dialectology, Americanist, Sakha |
Assigned | 60 code points |
Unused | 4 reserved code points |
Unicode version history | |
7.0 (2014) | 50 (+50) |
8.0 (2015) | 54 (+4) |
12.0 (2019) | 56 (+2) |
13.0 (2020) | 60 (+4) |
Unicode documentation | |
Code chart ∣ Web page | |
Note: [1] [2] |
Latin Extended-E is a Unicode block containing Latin script characters used in German dialectology (Teuthonista), [3] Anthropos alphabet, Sakha and Americanist usage.
Latin Extended-E [1] [2] Official Unicode Consortium code chart (PDF) | ||||||||||||||||
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
U+AB3x | ꬰ | ꬱ | ꬲ | ꬳ | ꬴ | ꬵ | ꬶ | ꬷ | ꬸ | ꬹ | ꬺ | ꬻ | ꬼ | ꬽ | ꬾ | ꬿ |
U+AB4x | ꭀ | ꭁ | ꭂ | ꭃ | ꭄ | ꭅ | ꭆ | ꭇ | ꭈ | ꭉ | ꭊ | ꭋ | ꭌ | ꭍ | ꭎ | ꭏ |
U+AB5x | ꭐ | ꭑ | ꭒ | ꭓ | ꭔ | ꭕ | ꭖ | ꭗ | ꭘ | ꭙ | ꭚ | ꭛ | ꭜ | ꭝ | ꭞ | ꭟ |
U+AB6x | ꭠ | ꭡ | ꭢ | ꭣ | ꭤ | ꭥ | ꭦ | ꭧ | ꭨ | ꭩ | ꭪ | ꭫ | ||||
Notes |
The following Unicode-related documents record the purpose and process of defining specific characters in the Latin Extended-E block:
Version | Final code points [a] | Count | L2 ID | WG2 ID | Document |
---|---|---|---|---|---|
7.0 | U+AB30..AB5F | 48 | L2/08-428 | N3555 | Everson, Michael (2008-11-27), Exploratory proposal to encode Germanicist, Nordicist, and other phonetic characters in the UCS |
L2/09-256 | Ellert, Mattias (2009-07-31), Comments on ISO/IEC JTC1/SC2/WG2 N3555 / L2/08-428 | ||||
L2/10-346 | N3907 | Everson, Michael; Wandl-Vogt, Eveline; Dicklberger, Alois (2010-09-23), Preliminary proposal to encode "Teuthonista" phonetic characters in the UCS | |||
L2/11-137 | N4031 | Everson, Michael; Wandl-Vogt, Eveline; Dicklberger, Alois (2011-05-09), Proposal to encode "Teuthonista" phonetic characters in the UCS | |||
L2/11-203 | N4082 | Everson, Michael; et al. (2011-05-27), Support for "Teuthonista" encoding proposal | |||
L2/11-202 | N4081 | Everson, Michael; Dicklberger, Alois; Pentzlin, Karl; Wandl-Vogt, Eveline (2011-06-02), Revised proposal to encode "Teuthonista" phonetic characters in the UCS | |||
L2/11-240 | N4106 | Everson, Michael; Pentzlin, Karl (2011-06-09), Report on the ad hoc re "Teuthonista" (SC2/WG2 N4081) held during the SC2/WG2 meeting at Helsinki | |||
L2/11-261R2 | Moore, Lisa (2011-08-16), "Consensus 128-C38", UTC #128 / L2 #225 Minutes, Approve 85 characters for German dialectology... | ||||
N4103 | "11.16 Teuthonista phonetic characters", Unconfirmed minutes of WG 2 meeting 58, 2012-01-03 | ||||
L2/12-269 | N4296 | Request to change the names of three Teuthonista characters under ballot, 2012-07-26 | |||
L2/12-343R2 | Moore, Lisa (2012-12-04), "Consensus 133-C3, 133-C5", UTC #133 Minutes | ||||
N4353 (pdf, doc) | "M60.01", Unconfirmed minutes of WG 2 meeting 60, 2013-05-23 | ||||
N4553 (pdf, doc) | Umamaheswaran, V. S. (2014-09-16), "M62.01b, M62.01g", Minutes of WG 2 meeting 62 Adobe, San Jose, CA, USA | ||||
L2/22-101R | Jacquerye, Denis Moyogo (2022-06-14), Proposal to revise the glyph of LATIN SMALL LETTER BARRED ALPHA [affects U+AB30 annotation] | ||||
L2/22-199 | Jacquerye, Denis Moyogo (2022-06-27), On LATIN SMALL LETTER Y WITH SHORT RIGHT LEG [Affects U+AB5A] | ||||
L2/22-128 | Anderson, Deborah; Whistler, Ken; Pournader, Roozbeh; Constable, Peter (2022-07-20), "2c Latin Small Letter Barred Alpha", Recommendations to UTC #172 July 2022 on Script Proposals | ||||
L2/22-121 | Constable, Peter (2022-08-01), "Action Item 172-A106", Draft Minutes of UTC Meeting 172, Add an annotation to U+AB30 LATIN SMALL LETTER BARRED ALPHA | ||||
L2/22-198 | Jacquerye, Denis Moyogo (2022-08-19), On LATIN SMALL LETTER BLACKLETTER O WITH STROKE [Affects U+AB3E] | ||||
L2/22-248 | Anderson, Deborah; et al. (2022-10-31), "1b SMALL LETTER Y WITH SHORT RIGHT LEG [Affects U+AB5A] and 1c SMALL LETTER BLACKLETTER O WITH STROKE [Affects U+AB3E annotation]", Recommendations to UTC #173 October 2022 on Script Proposals | ||||
L2/22-241 | Constable, Peter (2022-11-09), "Consensus 173-C25 [Affects U+AB5A] and Action Item 173-A112 [Affects U+AB3E annotation]", Approved Minutes of UTC Meeting 173 | ||||
U+AB64..AB65 | 2 | L2/12-266 | N4307 | Schneidemesser, Luanne von; et al. (2012-07-31), Proposal for Two Phonetic Characters | |
L2/12-239 | Moore, Lisa (2012-08-14), "C.13, D.13", UTC #132 Minutes | ||||
L2/13-132 | Moore, Lisa (2013-07-29), "Consensus 136-C7", UTC #136 Minutes | ||||
N4403 (pdf, doc) | Umamaheswaran, V. S. (2014-01-28), "Resolution M61.01", Unconfirmed minutes of WG 2 meeting 61, Holiday Inn, Vilnius, Lithuania; 2013-06-10/14 | ||||
8.0 | U+AB60..AB63 | 4 | L2/11-340 | Yevlampiev, Ilya; Jumagueldinov, Nurlan; Pentzlin, Karl (2011-09-12), Proposal to encode four historic Latin letters for Sakha (Yakut) | |
L2/11-422 | Salminen, Tapani; Anderson, Deborah (2011-10-31), Comments on L2/11-360 Latin letters used in the Former Soviet Union and L2/11- 340 Proposal to encode four historic Latin letters for Sakha (Yakut) | ||||
L2/12-044 | N4213 | Yevlampiev, Ilya; Jumagueldinov, Nurlan; Pentzlin, Karl (2012-04-26), Second revised proposal to encode four historic Latin letters for Sakha (Yakut) | |||
L2/13-028 | Anderson, Deborah; McGowan, Rick; Whistler, Ken; Pournader, Roozbeh (2013-01-28), "1", Recommendations to UTC on Script Proposals | ||||
L2/13-011 | Moore, Lisa (2013-02-04), "C.6.1", UTC #134 Minutes | ||||
L2/13-132 | Moore, Lisa (2013-07-29), "Consensus 136-C12", UTC #136 Minutes, Approve the name change for U+AB60 LATIN SMALL LETTER SAKHA IOTIFIED A to U+AB60 LATIN SMALL LETTER SAKHA YAT. | ||||
N4403 (pdf, doc) | Umamaheswaran, V. S. (2014-01-28), "10.3.1 Four Historic Latin letters for Sakha (Yakut)", Unconfirmed minutes of WG 2 meeting 61, Holiday Inn, Vilnius, Lithuania; 2013-06-10/14 | ||||
12.0 | U+AB66..AB67 | 2 | L2/17-299 | N4842 | Everson, Michael (2017-08-17), Proposal to add two Sinological Latin letters |
L2/17-367 | N4885 | Anderson, Deborah; Whistler, Ken; Pournader, Roozbeh; Moore, Lisa (2017-09-18), "1. Latin", Comments on WG2 #66 (Sept. 2017) documents | |||
N4953 (pdf, doc) | "M66.15d", Unconfirmed minutes of WG 2 meeting 66, 2018-03-23 | ||||
L2/17-362 | Moore, Lisa (2018-02-02), "Consensus 153-C7", UTC #153 Minutes | ||||
13.0 | U+AB68..AB6B | 4 | L2/19-075R | N5036R | Everson, Michael (2019-05-05), Proposal to add six phonetic characters for Scots to the UCS |
L2/19-173 | Anderson, Deborah; et al. (2019-04-29), "Phonetic characters for Scots", Recommendations to UTC #159 April-May 2019 on Script Proposals | ||||
L2/19-122 | Moore, Lisa (2019-05-08), "C.6", UTC #159 Minutes | ||||
N5122 | "M68.05", Unconfirmed minutes of WG 2 meeting 68, 2019-12-31 | ||||
L2/20-052 | Pournader, Roozbeh (2020-01-15), Changes to Identifier_Type of some Unicode 13.0 characters | ||||
L2/20-015R | Moore, Lisa (2020-05-14), "B.13.4 Changes to Identifier_Type of some Unicode 13.0 characters", Draft Minutes of UTC Meeting 162 | ||||
|
Unicode has subscripted and superscripted versions of a number of characters including a full set of Arabic numerals. These characters allow any polynomial, chemical and certain other equations to be represented in plain text without using any form of markup like HTML or TeX.
Heng is a letter of the Latin alphabet, originating as a typographic ligature of h and ŋ. It is used for a voiceless y-like sound, such as in Dania transcription of the Danish language.
Letterlike Symbols is a Unicode block containing 80 characters which are constructed mainly from the glyphs of one or more letters. In addition to this block, Unicode includes full styled mathematical alphabets, although Unicode does not explicitly categorize these characters as being "letterlike."
Combining Diacritical Marks Supplement is a Unicode block containing combining characters for the Uralic Phonetic Alphabet, Medievalist notations, and German dialectology (Teuthonista). It is an extension of the diacritic characters found in the Combining Diacritical Marks block.
Over a thousand characters from the Latin script are encoded in the Unicode Standard, grouped in several basic and extended Latin blocks. The extended ranges contain mainly precomposed letters plus diacritics that are equivalently encoded with combining diacritics, as well as some ligatures and distinct letters, used for example in the orthographies of various African languages and the Vietnamese alphabet. Latin Extended-C contains additions for Uighur and the Claudian letters. Latin Extended-D comprises characters that are mostly of interest to medievalists. Latin Extended-E mostly comprises characters used for German dialectology (Teuthonista). Latin Extended-F and -G contain characters for phonetic transcription.
Phonetic Extensions is a Unicode block containing phonetic characters used in the Uralic Phonetic Alphabet, Old Irish phonetic notation, the Oxford English Dictionary and American dictionaries, and Americanist and Russianist phonetic notations. Its character set is continued in the following Unicode block, Phonetic Extensions Supplement.
Phonetic Extensions Supplement is a Unicode block containing characters for specialized and deprecated forms of the International Phonetic Alphabet.
Latin Extended-C is a Unicode block containing Latin characters for Uighur New Script, the Uralic Phonetic Alphabet, Shona, Claudian Latin and the Swedish Dialect Alphabet.
The Basic Latin Unicode block, sometimes informally called C0 Controls and Basic Latin, is the first block of the Unicode standard, and the only block which is encoded in one byte in UTF-8. The block contains all the letters and control codes of the ASCII encoding. It ranges from U+0000 to U+007F, contains 128 characters and includes the C0 controls, ASCII punctuation and symbols, ASCII digits, both the uppercase and lowercase of the English alphabet and a control character.
Latin Extended-A is a Unicode block and is the third block of the Unicode standard. It encodes Latin letters from the Latin ISO character sets other than Latin-1 and also legacy characters from the ISO 6937 standard.
Alphabetic Presentation Forms is a Unicode block containing standard ligatures for the Latin, Armenian, and Hebrew scripts.
Latin Extended-B is the fourth block (0180-024F) of the Unicode Standard. It has been included since version 1.0, where it was only allocated to the code points 0180-01FF and contained 113 characters. During unification with ISO 10646 for version 1.1, the block range was extended by 80 code points and another 35 characters were assigned. In version 3.0 and later, the last 60 available code points in the block were assigned. Its block name in Unicode 1.0 was Extended Latin.
The ISO basic Latin alphabet is an international standard for a Latin-script alphabet that consists of two sets of 26 letters, codified in various national and international standards and used widely in international communication. They are the same letters that comprise the current English alphabet. Since medieval times, they are also the same letters of the modern Latin alphabet. The order is also important for sorting words into alphabetical order.
Latin Extended Additional is a Unicode block.
Enclosed Alphanumeric Supplement is a Unicode block consisting of Latin alphabet characters and Arabic numerals enclosed in circles, ovals or boxes, used for a variety of purposes. It is encoded in the range U+1F100–U+1F1FF in the Supplementary Multilingual Plane.
Lisu is a Unicode block containing characters of the Fraser alphabet, which is used to write the Lisu language. This alphabet consists of glyphs resembling capital letters in the basic Latin alphabet in their standard form and turned.
Combining Diacritical Marks Extended is a Unicode block containing diacritical marks used in German dialectology (Teuthonista).
Latin Extended-F is a Unicode block containing modifier letters, nearly all IPA and extIPA, for phonetic transcription. The Latin Extended-F and -G blocks contain the first Latin characters defined outside of the Basic Multilingual Plane (BMP). They were added to the free Gentium Plus and Andika fonts with version 6.2 in February 2023. Some computers have 𐞃, 𐞎 and 𐞥 supported on the font Calibri.
Latin Extended-G is a Unicode block containing additional characters for phonetic transcription. The Latin Extended-F and -G blocks contain the first Latin characters defined outside of the Basic Multilingual Plane (BMP).