Modifier Tone Letters

Modifier Tone Letters
Modifier Tone Letters
Range	U+A700..U+A71F; (32 code points)
Plane	BMP
Scripts	Common
Symbol sets	Tone marks
Assigned	32 code points
Unused	0 reserved code points
Unicode version history
4.1 (2005)	23 (+23)
5.0 (2006)	27 (+4)
5.1 (2008)	32 (+5)
Unicode documentation
	Code chart ∣ Web page
	Note:

Last updated August 06, 2024

Modifier Tone Letters is a Unicode block containing tone markings for Chinese, Chinantec, Africanist, and other phonetic transcriptions. It does not contain the standard IPA tone marks, which are found in Spacing Modifier Letters.

⟨꜀◌ ꜁◌ ꜂◌ ꜃◌ ◌꜄ ◌꜅ ◌꜆ ◌꜇⟩ are used to mark yin and (underlined) yang splits of the ping, shang, qu and ru tones, respectively, in the etymological four-tone analysis of Chinese. The dotted tone letters ⟨꜈ ꜉ ꜊ ꜋ ꜌⟩ are used for the pitch of neutral tones, while the reversed tone letters ⟨꜒ ꜓ ꜔ ꜕ ꜖⟩ and neutral ⟨꜍ ꜎ ꜏ ꜐ ꜑⟩ are used for tone sandhi. ⟨◌ꜗ ◌ꜘ ◌ꜙ ◌ꜚ⟩ are used in Ozumacín Chinantec. ⟨ꜛ ꜜ⟩ are the IPA diacritics for upstep and downstep, while ⟨ꜝ ꜞ ꜟ⟩ are substitutes people used before broad font support of the IPA, and still preferred by some.

Block

Modifier Tone Letters ^[1] Official Unicode Consortium code chart (PDF)
	0	1	2	3	4	5	6	7	8	9	A	B	C	D	E	F
U+A70x	꜀	꜁	꜂	꜃	꜄	꜅	꜆	꜇	꜈	꜉	꜊	꜋	꜌	꜍	꜎	꜏
U+A71x	꜐	꜑	꜒	꜓	꜔	꜕	꜖	ꜗ	ꜘ	ꜙ	ꜚ	ꜛ	ꜜ	ꜝ	ꜞ	ꜟ
Notes 1. ^ As of Unicode version 15.1

History

The following Unicode-related documents record the purpose and process of defining specific characters in the Modifier Tone Letters block:

Version	Final code points^{[lower-alpha 1]}	Count	L2 ID	WG2 ID	Document
4.1	U+A700..A716	23	L2/03-317	N2626	Proposal on IPA Extensions & Combining Diacritical Marks for ISO/IEC 10646 in BMP, 2003-09-27
			L2/03-339	N2646	Constable, Peter (2003-10-08), Comments on N2626, Proposal on IPA Extensions and Combining Diacritic Marks for ISO/IEC 10646 in BMP
			L2/03-372	N2673	Anderson, Deborah (2003-10-22), Comments on N2626, "Proposal on IPA Extensions & Combining Diacritic Marks for ISO 10646 in BMP"
			L2/04-107	N2713	Revised Proposal for encoding A Supplemented Set of IPA Combining Marks, Modifier Letters & Five-Degree Contour Tone Marks, 2004-03-20
			L2/04-156R2		Moore, Lisa (2004-08-13), "Supplemental set of IPA combing marks, modifier letters, and five-degree contour tone marks (A.7)", UTC #99 Minutes
5.0	U+A717..A71A	4	L2/04-246R		Priest, Lorna (2004-07-26), Revised Proposal for Additional Latin Phonetic and Orthographic Characters
			L2/04-349		Priest, Lorna (2004-08-27), Proposal to Encode Chinantec Tone Marks and Orthographic 'at' Characters
			L2/04-349R	N2883	Priest, Lorna (2004-12-09), Proposal to Encode Chinantec Tone Marks
5.1	U+A71B..A71F	5		N2945	Priest, Lorna; Constable, Peter (2005-08-09), Proposal to Encode Additional Latin Phonetic and Orthographic Characters
				N2953 (pdf, doc)	Umamaheswaran, V. S. (2006-02-16), "7.2.8", Unconfirmed minutes of WG 2 meeting 47, Sophia Antipolis, France; 2005-09-12/15
				N3103 (pdf, doc)	Umamaheswaran, V. S. (2006-08-25), "M48.1", Unconfirmed minutes of WG 2 meeting 48, Mountain View, CA, USA; 2006-04-24/27
↑ Proposed code points and characters names may differ from final code points and names

Related Research Articles

The International Phonetic Alphabet (IPA) is an alphabetic system of phonetic notation based primarily on the Latin script. It was devised by the International Phonetic Association in the late 19th century as a standard written representation for the sounds of speech. The IPA is used by lexicographers, foreign language students and teachers, linguists, speech–language pathologists, singers, actors, constructed language creators, and translators.

Unicode has subscripted and superscripted versions of a number of characters including a full set of Arabic numerals. These characters allow any polynomial, chemical and certain other equations to be represented in plain text without using any form of markup like HTML or TeX.

L, or l, is the twelfth letter of the Latin alphabet, used in the modern English alphabet, the alphabets of other western European languages and others worldwide. Its name in English is el, plural els.

Over a thousand characters from the Latin script are encoded in the Unicode Standard, grouped in several basic and extended Latin blocks. The extended ranges contain mainly precomposed letters plus diacritics that are equivalently encoded with combining diacritics, as well as some ligatures and distinct letters, used for example in the orthographies of various African languages and the Vietnamese alphabet. Latin Extended-C contains additions for Uighur and the Claudian letters. Latin Extended-D comprises characters that are mostly of interest to medievalists. Latin Extended-E mostly comprises characters used for German dialectology (Teuthonista). Latin Extended-F and -G contain characters for phonetic transcription.

Unicode supports several phonetic scripts and notation systems through its existing scripts and the addition of extra blocks with phonetic characters. These phonetic characters are derived from an existing script, usually Latin, Greek or Cyrillic. Apart from the International Phonetic Alphabet (IPA), extensions to the IPA and obsolete and nonstandard IPA symbols, these blocks also contain characters from the Uralic Phonetic Alphabet and the Americanist Phonetic Alphabet.

Spacing Modifier Letters is a Unicode block containing characters for the IPA, UPA, and other phonetic transcriptions. Included are the IPA tone marks, and modifiers for aspiration and palatalization. The word spacing indicates that these characters occupy their own horizontal space within a line of text. Its block name in Unicode 1.0 was simply Modifier Letters.

Yi Syllables is a Unicode block containing the 1,165 characters of the Liangshan Standard Yi script for writing the Nuosu language.

Bopomofo, also called Zhuyin Fuhao, or simply Zhuyin, is a transliteration system for Standard Chinese and other Sinitic languages. It is commonly used in Taiwan. It consists of 37 characters and five tone marks, which together can transcribe all possible sounds in Mandarin Chinese.

Tone letters are letters that represent the tones of a language, most commonly in languages with contour tones.

Latin Extended-B is the fourth block (0180-024F) of the Unicode Standard. It has been included since version 1.0, where it was only allocated to the code points 0180-01FF and contained 113 characters. During unification with ISO 10646 for version 1.1, the block range was extended by 80 code points and another 35 characters were assigned. In version 3.0 and later, the last 60 available code points in the block were assigned. Its block name in Unicode 1.0 was Extended Latin.

IPA Extensions is a block (U+0250–U+02AF) of the Unicode standard that contains full size letters used in the International Phonetic Alphabet (IPA). Both modern and historical characters are included, as well as former and proposed IPA signs and non-IPA phonetic letters. Additional characters employed for phonetics, like the palatalization sign, are encoded in the blocks Phonetic Extensions (1D00–1D7F) and Phonetic Extensions Supplement (1D80–1DBF). Diacritics are found in the Spacing Modifier Letters (02B0–02FF) and Combining Diacritical Marks (0300–036F) blocks. Its block name in Unicode 1.0 was Standard Phonetic.

Miscellaneous Symbols and Pictographs is a Unicode block containing meteorological and astronomical symbols, emoji characters largely for compatibility with Japanese telephone carriers' implementations of Shift JIS, and characters originally from the Wingdings and Webdings fonts found in Microsoft Windows.

In typesetting, the hook or tail is a diacritic mark attached to letters in many alphabets. In shape it looks like a hook and it can be attached below as a descender, on top as an ascender and sometimes to the side. The orientation of the hook can change its meaning: when it is below and curls to the left it can be interpreted as a palatal hook, and when it curls to the right is called hook tail or tail and can be interpreted as a retroflex hook. It should not be mistaken with the hook above, a diacritical mark used in Vietnamese, or the rhotic hook, used in the International Phonetic Alphabet.

Superscripts and Subscripts is a Unicode block containing superscript and subscript numerals, mathematical operators, and letters used in mathematics and phonetics. The use of subscripts and superscripts in Unicode allows any polynomial, chemical and certain other equations to be represented in plain text without using any form of markup like HTML or TeX. Other superscript letters can be found in the Spacing Modifier Letters, Phonetic Extensions and Phonetic Extensions Supplement blocks, while the superscript 1, 2, and 3, inherited from ISO 8859-1, were included in the Latin-1 Supplement block.

Bopomofo is a Unicode block containing phonetic characters for Chinese. The original set of 40 Bopomofo characters is based on the Chinese standard GB 2312. Additional Bopomofo characters can be found in the Bopomofo Extended block.

Bopomofo Extended is a Unicode block containing additional Bopomofo characters for writing phonetic Min Nan, Hakka Chinese, Cantonese, Hmu, and Ge. The basic set of Bopomofo characters can be found in the Bopomofo block.

Latin Extended-E is a Unicode block containing Latin script characters used in German dialectology (Teuthonista), Anthropos alphabet, Sakha and Americanist usage.

Latin Extended-F is a Unicode block containing modifier letters, nearly all IPA and extIPA, for phonetic transcription. The Latin Extended-F and -G blocks contain the first Latin characters defined outside of the Basic Multilingual Plane (BMP). They were added to the free Gentium Plus and Andika fonts with version 6.2 in February 2023. Some computers have 𐞃, 𐞎 and 𐞥 supported on the font Calibri.

Cyrillic Extended-D is a Unicode block containing superscript and subscript Cyrillic characters used in Cyrillic-based phonetic transcription. The block contains the first Cyrillic characters defined outside of the Basic Multilingual Plane (BMP).

References

↑ "Unicode character database". The Unicode Standard. Retrieved 2023-07-26.
↑ "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2023-07-26.

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.

[final-3] Proposed code points and characters names may differ from final code points and names

[1] "Unicode character database". The Unicode Standard. Retrieved 2023-07-26.

[2] "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2023-07-26.

[1]

[2]

[lower-alpha 1]

Modifier Tone Letters

Contents

Block

History

See also

Related Research Articles

References

Modifier Tone Letters
Range	U+A700..U+A71F (32 code points)
Plane	BMP
Scripts	Common
Symbol sets	Tone marks
Assigned	32 code points
Unused	0 reserved code points
Unicode version history

4.1 (2005)	23 (+23)
5.0 (2006)	27 (+4)
5.1 (2008)	32 (+5)

Unicode documentation
Code chart ∣ Web page
Note: ^[1]^[2]