Latin Extended-E

Range: U+AB30..U+AB6F (64 code points)
Plane: BMP
Scripts: Latin (56 char.), Greek (1 char.), Common (3 char.)
Major alphabets: German dialectology, Americanist, Sakha
Assigned: 60 code points
Unused: 4 reserved code points
Unicode version history:
7.0 (2014): 50 (+50)
8.0 (2015): 54 (+4)
12.0 (2019): 56 (+2)
13.0 (2020): 60 (+4)
Note: [1] [2]

Latin Extended-E is a Unicode block containing Latin script characters used in German dialectology (Teuthonista), [3] the Anthropos alphabet, Sakha, and Americanist phonetic notation.
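
As a rough illustration of the block's range (U+AB30..U+AB6F), the Python sketch below tests whether a character falls inside the block and shows the UTF-8 bytes of its first and last code points. Because the block lies in the BMP above U+07FF, each code point takes three bytes in UTF-8. The names `BLOCK_START`, `BLOCK_END`, and `in_latin_extended_e` are illustrative only and are not part of any Unicode API.

```python
# Block boundaries as listed for Latin Extended-E (U+AB30..U+AB6F).
BLOCK_START = 0xAB30
BLOCK_END = 0xAB6F


def in_latin_extended_e(ch: str) -> bool:
    """Return True if the single character ch lies in the block's range.

    This is a range check only: the reserved (unassigned) code points
    at the end of the block also pass.
    """
    return BLOCK_START <= ord(ch) <= BLOCK_END


# Size of the block, counting assigned and reserved code points.
block_size = BLOCK_END - BLOCK_START + 1

# UTF-8 encodings of the first and last code points of the block.
first_utf8 = chr(BLOCK_START).encode("utf-8")
last_utf8 = chr(BLOCK_END).encode("utf-8")
```

A range check like this says nothing about whether a code point is assigned; that needs the per-version data recorded in the History section.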

Block

Latin Extended-E [1] [2]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+AB3x ꬿ
U+AB4x
U+AB5x
U+AB6x
Notes
1. As of Unicode version 15.0
2. Grey areas indicate non-assigned code points
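
The 16-column layout of the chart can be regenerated from code points alone. The following Python sketch is one way to do that, assuming the assigned ranges recorded in the History section of this article; it does not consult the Unicode Character Database. The names `ASSIGNED_RANGES`, `is_assigned`, and `chart_rows` are hypothetical, and `.` stands in for the grey (non-assigned) cells.

```python
# Assigned ranges (inclusive), as recorded in the History section.
ASSIGNED_RANGES = [
    (0xAB30, 0xAB5F),
    (0xAB60, 0xAB63),
    (0xAB64, 0xAB65),
    (0xAB66, 0xAB67),
    (0xAB68, 0xAB6B),
]


def is_assigned(cp: int) -> bool:
    """Return True if cp falls in one of the assigned ranges above."""
    return any(lo <= cp <= hi for lo, hi in ASSIGNED_RANGES)


def chart_rows(start: int = 0xAB30, end: int = 0xAB6F) -> list:
    """Build a header row plus one text row per 16 code points."""
    # Header: column digits 0..F, indented to align under the row labels.
    rows = [" " * 7 + " ".join(f"{col:X}" for col in range(16))]
    for base in range(start, end + 1, 16):
        cells = [
            chr(cp) if is_assigned(cp) else "."  # "." marks a reserved cell
            for cp in range(base, base + 16)
        ]
        label = f"U+{base // 16:03X}x"  # e.g. U+AB3x
        rows.append(label + " " + " ".join(cells))
    return rows


if __name__ == "__main__":
    print("\n".join(chart_rows()))
```

Whether the glyphs actually display depends on the fonts installed; the code only places the characters in the grid.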

History

The following Unicode-related documents record the purpose and process of defining specific characters in the Latin Extended-E block:

Entries are grouped by version and final code points [a], with the count of code points in parentheses. Each entry gives the L2 ID and/or WG2 ID, followed by the document.

Version 7.0, U+AB30..AB5F (48)
L2/08-428, N3555: Everson, Michael (2008-11-27), Exploratory proposal to encode Germanicist, Nordicist, and other phonetic characters in the UCS
L2/09-256: Ellert, Mattias (2009-07-31), Comments on ISO/IEC JTC1/SC2/WG2 N3555 / L2/08-428
L2/10-346, N3907: Everson, Michael; Wandl-Vogt, Eveline; Dicklberger, Alois (2010-09-23), Preliminary proposal to encode "Teuthonista" phonetic characters in the UCS
L2/11-137, N4031: Everson, Michael; Wandl-Vogt, Eveline; Dicklberger, Alois (2011-05-09), Proposal to encode "Teuthonista" phonetic characters in the UCS
L2/11-203, N4082: Everson, Michael; et al. (2011-05-27), Support for "Teuthonista" encoding proposal
L2/11-202, N4081: Everson, Michael; Dicklberger, Alois; Pentzlin, Karl; Wandl-Vogt, Eveline (2011-06-02), Revised proposal to encode "Teuthonista" phonetic characters in the UCS
L2/11-240, N4106: Everson, Michael; Pentzlin, Karl (2011-06-09), Report on the ad hoc re "Teuthonista" (SC2/WG2 N4081) held during the SC2/WG2 meeting at Helsinki
L2/11-261R2: Moore, Lisa (2011-08-16), "Consensus 128-C38", UTC #128 / L2 #225 Minutes, Approve 85 characters for German dialectology...
N4103: "11.16 Teuthonista phonetic characters", Unconfirmed minutes of WG 2 meeting 58, 2012-01-03
L2/12-269, N4296: Request to change the names of three Teuthonista characters under ballot, 2012-07-26
L2/12-343R2: Moore, Lisa (2012-12-04), "Consensus 133-C3, 133-C5", UTC #133 Minutes
N4353: "M60.01", Unconfirmed minutes of WG 2 meeting 60, 2013-05-23
N4553: Umamaheswaran, V. S. (2014-09-16), "M62.01b, M62.01g", Minutes of WG 2 meeting 62, Adobe, San Jose, CA, USA
L2/22-101R: Jacquerye, Denis Moyogo (2022-06-14), Proposal to revise the glyph of LATIN SMALL LETTER BARRED ALPHA [affects U+AB30 annotation]
L2/22-199: Jacquerye, Denis Moyogo (2022-06-27), On LATIN SMALL LETTER Y WITH SHORT RIGHT LEG [affects U+AB5A]
L2/22-128: Anderson, Deborah; Whistler, Ken; Pournader, Roozbeh; Constable, Peter (2022-07-20), "2c Latin Small Letter Barred Alpha", Recommendations to UTC #172 July 2022 on Script Proposals
L2/22-121: Constable, Peter (2022-08-01), "Action Item 172-A106", Draft Minutes of UTC Meeting 172, Add an annotation to U+AB30 LATIN SMALL LETTER BARRED ALPHA
L2/22-198: Jacquerye, Denis Moyogo (2022-08-19), On LATIN SMALL LETTER BLACKLETTER O WITH STROKE [affects U+AB3E]
L2/22-248: Anderson, Deborah; et al. (2022-10-31), "1b SMALL LETTER Y WITH SHORT RIGHT LEG [affects U+AB5A] and 1c SMALL LETTER BLACKLETTER O WITH STROKE [affects U+AB3E annotation]", Recommendations to UTC #173 October 2022 on Script Proposals
L2/22-241: Constable, Peter (2022-11-09), "Consensus 173-C25 [affects U+AB5A] and Action Item 173-A112 [affects U+AB3E annotation]", Approved Minutes of UTC Meeting 173

Version 7.0, U+AB64..AB65 (2)
L2/12-266, N4307: Schneidemesser, Luanne von; et al. (2012-07-31), Proposal for Two Phonetic Characters
L2/12-239: Moore, Lisa (2012-08-14), "C.13, D.13", UTC #132 Minutes
L2/13-132: Moore, Lisa (2013-07-29), "Consensus 136-C7", UTC #136 Minutes
N4403: Umamaheswaran, V. S. (2014-01-28), "Resolution M61.01", Unconfirmed minutes of WG 2 meeting 61, Holiday Inn, Vilnius, Lithuania; 2013-06-10/14

Version 8.0, U+AB60..AB63 (4)
L2/11-340: Yevlampiev, Ilya; Jumagueldinov, Nurlan; Pentzlin, Karl (2011-09-12), Proposal to encode four historic Latin letters for Sakha (Yakut)
L2/11-422: Salminen, Tapani; Anderson, Deborah (2011-10-31), Comments on L2/11-360 Latin letters used in the Former Soviet Union and L2/11-340 Proposal to encode four historic Latin letters for Sakha (Yakut)
L2/12-044, N4213: Yevlampiev, Ilya; Jumagueldinov, Nurlan; Pentzlin, Karl (2012-04-26), Second revised proposal to encode four historic Latin letters for Sakha (Yakut)
L2/13-028: Anderson, Deborah; McGowan, Rick; Whistler, Ken; Pournader, Roozbeh (2013-01-28), "1", Recommendations to UTC on Script Proposals
L2/13-011: Moore, Lisa (2013-02-04), "C.6.1", UTC #134 Minutes
L2/13-132: Moore, Lisa (2013-07-29), "Consensus 136-C12", UTC #136 Minutes, Approve the name change for U+AB60 LATIN SMALL LETTER SAKHA IOTIFIED A to U+AB60 LATIN SMALL LETTER SAKHA YAT.
N4403: Umamaheswaran, V. S. (2014-01-28), "10.3.1 Four Historic Latin letters for Sakha (Yakut)", Unconfirmed minutes of WG 2 meeting 61, Holiday Inn, Vilnius, Lithuania; 2013-06-10/14

Version 12.0, U+AB66..AB67 (2)
L2/17-299, N4842: Everson, Michael (2017-08-17), Proposal to add two Sinological Latin letters
L2/17-367, N4885: Anderson, Deborah; Whistler, Ken; Pournader, Roozbeh; Moore, Lisa (2017-09-18), "1. Latin", Comments on WG2 #66 (Sept. 2017) documents
N4953: "M66.15d", Unconfirmed minutes of WG 2 meeting 66, 2018-03-23
L2/17-362: Moore, Lisa (2018-02-02), "Consensus 153-C7", UTC #153 Minutes

Version 13.0, U+AB68..AB6B (4)
L2/19-075R, N5036R: Everson, Michael (2019-05-05), Proposal to add six phonetic characters for Scots to the UCS
L2/19-173: Anderson, Deborah; et al. (2019-04-29), "Phonetic characters for Scots", Recommendations to UTC #159 April-May 2019 on Script Proposals
L2/19-122: Moore, Lisa (2019-05-08), "C.6", UTC #159 Minutes
N5122: "M68.05", Unconfirmed minutes of WG 2 meeting 68, 2019-12-31
L2/20-052: Pournader, Roozbeh (2020-01-15), Changes to Identifier_Type of some Unicode 13.0 characters
L2/20-015R: Moore, Lisa (2020-05-14), "B.13.4 Changes to Identifier_Type of some Unicode 13.0 characters", Draft Minutes of UTC Meeting 162

  a. Proposed code points and character names may differ from the final code points and names.
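
The per-version totals given for this block (50, 54, 56, 60) follow directly from the code-point ranges listed above. The short Python sketch below recomputes them as a consistency check. The `ADDITIONS` table is transcribed from this article, and the names `ADDITIONS` and `cumulative_counts` are illustrative only.

```python
# (version, year, inclusive ranges added in that version), as listed above.
ADDITIONS = [
    ("7.0", 2014, [(0xAB30, 0xAB5F), (0xAB64, 0xAB65)]),
    ("8.0", 2015, [(0xAB60, 0xAB63)]),
    ("12.0", 2019, [(0xAB66, 0xAB67)]),
    ("13.0", 2020, [(0xAB68, 0xAB6B)]),
]


def cumulative_counts(additions):
    """Return (version, running total, newly added) for each version in order."""
    total = 0
    result = []
    for version, _year, ranges in additions:
        added = sum(hi - lo + 1 for lo, hi in ranges)
        total += added
        result.append((version, total, added))
    return result


history = cumulative_counts(ADDITIONS)

# Reserved code points: block size (U+AB30..U+AB6F) minus the final total.
reserved = (0xAB6F - 0xAB30 + 1) - history[-1][1]
```

Here `history` ends with a total of 60 assigned code points, leaving 4 reserved in the 64-code-point block.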


References

  1. "Unicode character database". The Unicode Standard. Retrieved 2023-07-26.
  2. "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2023-07-26.
  3. Everson, Michael; Dicklberger, Alois; Pentzlin, Karl; Wandl-Vogt, Eveline (2011-06-02). "Revised proposal to encode 'Teuthonista' phonetic characters in the UCS" (PDF).