Comparison of ASCII encodings of the International Phonetic Alphabet

Last updated

The International Phonetic Alphabet (IPA) consists of more than 100 letters and diacritics. Before Unicode became widely available, several ASCII-based encoding systems of the IPA were proposed. The alphabet went through a large revision at the Kiel Convention of 1989, and the vowel symbols again in 1993. [1] Systems devised before these revisions inevitably lack support for the additions they introduced.

Contents

Only language-neutral systems are discussed below because language-dependent ones (such as ARPABET) do not allow for a systematic comparison.

General information

SystemAuthor(s)CreatedLast
updated
NoteRef
Branner (unnamed)David Prager Branner at the University of Washington 1994 ? [2]
Millar & Oasa (unnamed)J. Bruce Millar and Hiroaki Oasa at Australian National University 19811981 [3]
PHONASCIIGeorge D. Allen at Purdue University 19881988Not a direct mapping of the IPA. Segments are separated by spaces, and diacritics by commas. [4]
Praat Paul Boersma and David Weenink at the University of Amsterdam 19922025A subset of a larger set of backslash trigraphs also comprising international and mathematical characters. The symbols can be typed in ASCII but will be visualized as the normal Unicode glyphs. [5]
IPA (SIL) Keyboard SIL International 19942021 [6]
UCLA Phonological Segment Inventory Database (UPSID) Ian Maddieson at the University of California, Los Angeles 1984 ?Presented here is the scheme used for representing phonemes in the database of phonological inventories. Consequently, it is not designed for transcription of multiple segments and does not have symbols for values not found phonemically in the languages sampled. [7]
Usenet IPA/ASCII transcription Participants in sci.lang and alt.usage.english newsgroups (later maintained by Evan Kirshenbaum at HP Labs)19912011Also known variously as "ASCII-IPA", "Kirshenbaum", etc. [8] IETF language subtags register fonkirsh to identify text in this convention. [9] [10]
WorldbetJames L. Hieronymus at AT&T Bell Laboratories 19941994Segments are separated by spaces. [11]
X-SAMPA John C. Wells at University College London 19952000 IETF language subtags register fonxsamp to identify text in this convention. [9] [12]

Symbols

Only the symbols in the latest IPA chart are included. The numbers in the leftmost column, according to which the symbols are sorted, are the IPA Numbers. Some of the IPA symbols to which a system lacks a corresponding symbol may still be represented in that system by use of a modifier (diacritic), but such combinations are not included unless the documentation explicitly assigns one for the value.

Coverage

ScopeBrannerMillar & OasaPHONASCIIPraatSILUPSIDUsenetWorldbetX-SAMPA
Consonants (80)79 (69 (67 (80 (80 (75 (73 (73 (79 (
Vowels (29)29 (27 (26 (29 (28 (28 (28 (26 (29 (
Diacritics (35)34 (15 (25 (35 (34 (12 (17 (25 (26 (
Suprasegmentals (28)28 (20 (21 (28 (28 (2 (4 (11 (28 (
Total (172)170 (131 (139 (172 (170 (117 (122 (135 (162 (

See also

Notes

  1. 1 2 3 4 5 6 7 8 9 10 11 12 In Worldbet, these combinations are given as merely proposed for values "for which no machine-readable coding has yet been proposed".
  2. The uvular approximant is represented by R in PHONASCII.
  3. 1 2 3 L represents either a voiceless alveolar lateral fricative, a velar approximant, or a velarized alveolar lateral approximant in the Usenet IPA/ASCII transcription.
  4. 1 2 c! represents either an alveolar or palatal click in the Usenet IPA/ASCII transcription.
  5. 1 2 - represents either retracted or "velarized or pharyngealized" in Millar & Oasa's system.
  6. 1 2 ¿ and ¡ are not part of ASCII, but are nonetheless proposed as encoding advanced and retracted tongue root, respectively, in Worldbet.
  7. . represents either raised or palatalized in Millar & Oasa's system.
  8. 1 2 * represents either non-syllabic or extra-short in Millar & Oasa's system.
  9. )) representing a tie bar is placed after both segments, as in ts)), in Branner's system.

References

  1. International Phonetic Association (1993). "Council actions on revisions of the IPA". Journal of the International Phonetic Association. 23 (1): 32–34. doi:10.1017/S002510030000476X. S2CID   249420050.
  2. Branner, David Prager (1994). "Proposal for an ASCII Version of the IPA". University of Washington. Archived from the original on 9 February 1999.
  3. Millar, J. B.; Oasa, H. (1981). "Proposal for ASCII coded phonetic script". Journal of the International Phonetic Association. 11 (2): 62–74. doi:10.1017/S0025100300002279. S2CID   146352996.
  4. Allen, George D. (1988). "The PHONASCII system". Journal of the International Phonetic Association. 18 (1): 9–25. doi:10.1017/S0025100300003509. S2CID   143899772.
  5. Boersma, Paul; Weenink, David (2004–2025). "Phonetic symbols". Praat.
  6. "IPA (SIL) Keyboard Help". Keyman Help. SIL International.
  7. Reetz, Henning (23 May 2018). "Simple UPSID interface". Universität Frankfurt.
  8. Gómez-Vilda, Pedro; Ferrández-Vicente, José Manuel; Rodellar-Biarge, Victoria; Álvarez-Marquina, Agustín; Mazaira-Fernández, Luis Miguel; Martínez-Olalla, Rafael; Muñoz-Mulas, Cristina (2009). "Detection of Speech Dynamics by Neuromorphic Units". In Mira, José; Ferrández, José Manuel; Álvarez, José R.; de la Paz, Félix; Toledo, F. Javier (eds.). Methods and Models in Artificial and Natural Computation: A Homage to Professor Mira's Scientific Legacy – Third International Work-Conference on the Interplay Between Natural and Artificial Computation, IWINAC 2009, Santiago de Compostela, Spain, June 22-26, 2009, Proceedings, Part I. Springer. pp. 67–78. doi:10.1007/978-3-642-02264-7_8. ISBN   978-3-642-02263-0. Page 74.
  9. 1 2 "Language Subtag Registry". IANA. 2021-03-05. Retrieved 30 April 2021.
  10. Kirshenbaum, Evan (6 September 2011). "Representing IPA phonetics in ASCII" (PDF). Archived from the original (PDF) on 19 April 2016.
  11. Hieronymus, James L. (1994). "ASCII Phonetic Symbols for the World's Languages: Worldbet". AT&T Bell Laboratories Technical Memorandum. CiteSeerX   10.1.1.225.9914 .
  12. Wells, John (3 May 2000). "Computer-coding the IPA: a proposed extension of SAMPA". University College London.