International Phonetic Alphabet | |
---|---|
Script type | Alphabet – partially featural |
Time period | 1888 to present |
Languages | Used for phonetic and phonemic transcription of any oral language |
Related scripts | |
Parent systems | |
Unicode | |
See Phonetic symbols in Unicode § Unicode blocks | |
The International Phonetic Alphabet (IPA) is an alphabetic system of phonetic notation based primarily on the Latin script. It was devised by the International Phonetic Association in the late 19th century as a standard written representation for the sounds of speech. [1] The IPA is used by lexicographers, foreign language students and teachers, linguists, speech–language pathologists, singers, actors, constructed language creators, and translators. [2] [3]
The IPA is designed to represent those qualities of speech that are part of lexical (and, to a limited extent, prosodic) sounds in oral language: phones, intonation and the separation of syllables. [1] To represent additional qualities of speech—such as tooth gnashing, lisping, and sounds made with a cleft palate—an extended set of symbols may be used. [2]
Segments are transcribed by one or more IPA symbols of two basic types: letters and diacritics. For example, the sound of the English digraph ⟨ch⟩ may be transcribed in IPA with a single letter: [c], or with multiple letters plus diacritics: [t̠̺͡ʃʰ], depending on how precise one wishes to be. Slashes are used to signal phonemic transcription; therefore, /tʃ/ is more abstract than either [t̠̺͡ʃʰ] or [c] and might refer to either, depending on the context and language. [note 1]
Occasionally, letters or diacritics are added, removed, or modified by the International Phonetic Association. As of the most recent change in 2005, [4] there are 107 segmental letters, an indefinitely large number of suprasegmental letters, 44 diacritics (not counting composites), and four extra-lexical prosodic marks in the IPA. These are illustrated in the current IPA chart, posted below in this article and on the International Phonetic Association's website. [5]
In 1886, a group of French and English language teachers, led by the French linguist Paul Passy, formed what would be known from 1897 onwards as the International Phonetic Association (in French, l'Association phonétique internationale). [6] The idea of the alphabet had been suggested to Passy by Otto Jespersen. It was developed by Passy along with other members of the association, principally Daniel Jones. The original IPA alphabet was based on the Romic alphabet, an English spelling reform created by Henry Sweet that in turn was based on the Palaeotype alphabet of Alexander John Ellis, but to make it usable for other languages the values of the symbols were allowed to vary from language to language. [note 2] For example, the sound [ ʃ ] (the sh in shoe) was originally represented with the letter ⟨c⟩ for English but with ⟨x⟩ for French and German; with German, ⟨c⟩ was used for the [ x ] sound of Bach . [6] With a growing number of transcribed languages this proved impractical, and in 1888 the values of the letters were made uniform across languages. This would provide the base for all future revisions. [6] [8]
Since its creation, the IPA has undergone a number of revisions. After relatively frequent revisions and expansions from the 1890s to the 1940s, the IPA remained nearly static until the Kiel Convention in 1989, which substantially revamped the alphabet. A smaller revision took place in 1993 with the resurrection of letters for mid central vowels [2] and the retirement of letters for voiceless implosives. [9] The alphabet was last revised in May 2005 with the addition of a letter for a labiodental flap. [10] Apart from the addition and removal of symbols, changes to the IPA have consisted largely of renaming symbols and categories and in modifying typefaces. [2]
Extensions to the International Phonetic Alphabet for speech pathology (extIPA) were created in 1990 and were officially adopted by the International Clinical Phonetics and Linguistics Association in 1994. [11] They were substantially revised in 2015.
The general principle of the IPA is to provide one letter for each distinctive sound (speech segment). [note 3] This means that:
The alphabet is designed for transcribing sounds (phones), not phonemes, though it is used for phonemic transcription as well. A few letters that did not indicate specific sounds have been retired (⟨ˇ⟩, once used for the "compound" tone of Swedish and Norwegian, and ⟨ƞ⟩, once used for the moraic nasal of Japanese), though one remains: ⟨ɧ⟩, used for the sj-sound of Swedish. When the IPA is used for broad phonetic or for phonemic transcription, the letter–sound correspondence can be rather loose. The IPA has recommended that more 'familiar' letters be used when that would not cause ambiguity. [13] For example, ⟨e⟩ and ⟨o⟩ for [ɛ] and [ɔ], ⟨t⟩ for [t̪] or [ʈ], ⟨f⟩ for [ɸ], etc. Indeed, in the illustration of Hindi in the IPA Handbook, the letters ⟨c⟩ and ⟨ɟ⟩ are used for /t͡ʃ/ and /d͡ʒ/.
Among the symbols of the IPA, 107 letters represent consonants and vowels, 31 diacritics are used to modify these, and 17 additional signs indicate suprasegmental qualities such as length, tone, stress, and intonation. [note 6] These are organized into a chart; the chart displayed here is the official chart as posted at the website of the IPA.
The letters chosen for the IPA are meant to harmonize with the Latin alphabet. [note 7] For this reason, most letters are either Latin or Greek, or modifications thereof. Some letters are neither: for example, the letter denoting the glottal stop, ⟨ʔ⟩, originally had the form of a question mark with the dot removed. A few letters, such as that of the voiced pharyngeal fricative, ⟨ʕ⟩, were inspired by other writing systems (in this case, the Arabic letter ⟨ ﻉ ⟩, ʿayn, via the reversed apostrophe). [9]
Some letter forms derive from existing letters:
The International Phonetic Alphabet is based on the Latin script, and uses as few non-Latin letters as possible. [6] The Association created the IPA so that the sound values of most letters would correspond to "international usage" (approximately Classical Latin). [6] Hence, the consonant letters ⟨b⟩, ⟨d⟩, ⟨f⟩, ⟨ɡ⟩, ⟨h⟩, ⟨k⟩, ⟨l⟩, ⟨m⟩, ⟨n⟩, ⟨p⟩, ⟨s⟩, ⟨t⟩, ⟨v⟩, ⟨w⟩, and ⟨z⟩ have more or less their word-initial values in English (g as in gill, h as in hill, though p t k are unaspirated as in spill, still, skill); and the vowel letters ⟨a⟩, ⟨e⟩, ⟨i⟩, ⟨o⟩, ⟨u⟩ correspond to the (long) sound values of Latin: [i] is like the vowel in machine, [u] is as in rule, etc. Other Latin letters, particularly ⟨j⟩, ⟨r⟩ and ⟨y⟩, differ from English, but have their IPA values in Latin or other European languages.
This basic Latin inventory was extended by adding small-capital and cursive forms, diacritics and rotation. The sound values of these letters are related to those of the original letters, and their derivation may be iconic. [note 9] For example, letters with a rightward-facing hook at the bottom represent retroflex equivalents of the source letters, and small capital letters usually represent uvular equivalents of their source letters.
There are also several letters from the Greek alphabet, though their sound values may differ from Greek. For most Greek letters, subtly different glyph shapes have been devised for the IPA, specifically ⟨ɑ⟩, ⟨ꞵ⟩, ⟨ɣ⟩, ⟨ɛ⟩, ⟨ɸ⟩, ⟨ꭓ⟩ and ⟨ʋ⟩, which are encoded in Unicode separately from their parent Greek letters. One, however – ⟨θ⟩ – has only its Greek form, while for ⟨ꞵ ~ β⟩ and ⟨ꭓ ~ χ⟩, both Greek and Latin forms are in common use. [16] The tone letters are not derived from an alphabet, but from a pitch trace on a musical scale.
Beyond the letters themselves, there are a variety of secondary symbols which aid in transcription. Diacritic marks can be combined with the letters to add tone and phonetic detail such as secondary articulation. There are also special symbols for prosodic features such as stress and intonation.
There are two principal types of brackets used to set off (delimit) IPA transcriptions:
Symbol | Use |
---|---|
[ ... ] | Square brackets are used with phonetic notation, whether broad or narrow [17] – that is, for actual pronunciation, possibly including details of the pronunciation that may not be used for distinguishing words in the language being transcribed, but which the author nonetheless wishes to document. Such phonetic notation is the primary function of the IPA. |
/ ... / | Slashes [note 10] are used for abstract phonemic notation, [17] which note only features that are distinctive in the language, without any extraneous detail. For example, while the 'p' sounds of English pin and spin are pronounced differently (and this difference would be meaningful in some languages), the difference is not meaningful in English. Thus, phonemically the words are usually analyzed as /ˈpɪn/ and /ˈspɪn/, with the same phoneme /p/. To capture the difference between them (the allophones of /p/), they can be transcribed phonetically as [pʰɪn] and [spɪn]. Phonemic notation commonly uses IPA symbols that are rather close to the default pronunciation of a phoneme, but for legibility often uses simple and 'familiar' letters rather than precise notation, for example /r/ and /o/ for the English [ɹʷ] and [əʊ̯] sounds, or /c,ɟ/ for [t͜ʃ,d͜ʒ] as mentioned above. |
Less common conventions include:
Symbol | Use |
---|---|
{ ... } | Braces ("curly brackets") are used for prosodic notation. [18] See Extensions to the International Phonetic Alphabet for examples in this system. |
( ... ) | Parentheses are used for indistinguishable [17] or unidentified utterances. They are also seen for silent articulation (mouthing), [19] where the expected phonetic transcription is derived from lip-reading, and with periods to indicate silent pauses, for example (…) or (2sec). The latter usage is made official in the extIPA, with unidentified segments circled instead. [20] |
⸨ ... ⸩ | Double parentheses indicate either a transcription of obscured speech or a description of the obscuring noise. The IPA specifies that they mark the obscured sound, [18] as in ⸨2σ⸩, two audible syllables obscured by another sound. The current extIPA specifications prescribe double parentheses for the extraneous noise, such as ⸨cough⸩ for a cough by another person (not the speaker) or ⸨knock⸩ for a knock on a door, but the IPA Handbook identifies IPA and extIPA usage as equivalent. [21] Early publications of the extIPA explain double parentheses as marking "uncertainty because of noise which obscures the recording", and that within them "may be indicated as much detail as the transcriber can detect." [22] |
All three of the above are provided by the IPA Handbook. The following are not, but may be seen in IPA transcription or in associated material (especially angle brackets):
Symbol | Field | Description |
---|---|---|
⟦ ... ⟧ | Phonetics | Double square brackets are used for especially precise phonetic transcription, often finer than is normally practicable. [23] This is consistent with the IPA convention of doubling a symbol to indicate greater degree. Double brackets may indicate that a letter has its cardinal IPA value. For example, ⟦a⟧ is an open front vowel, rather than the perhaps slightly different value (such as open central) that "[a]" may be used to transcribe in a particular language. Thus, two vowels transcribed for easy legibility as [e] and [ɛ] may be clarified as actually being ⟦e̝⟧ and ⟦e⟧; [ð] may be more precisely ⟦ð̠̞ˠ⟧. [24] Double brackets may also be used for a specific token or speaker; for example, the pronunciation of a particular child as opposed to the adult pronunciation that is their target. [25] |
| Morphophonology | Double slashes are used for morphophonemic transcription. This is also consistent with the IPA convention of doubling a symbol to indicate greater degree – in this case, more abstract than phonemic transcription. Also commonly seen are the braces of set theory, especially when enclosing the set of phonemes that constitute the morphophoneme, e.g. {t d} or {t|d} or {/t/, /d/} for a conflated /t/ and /d/. Braces have a conflicting use to delimit prosodic transcription within the Voice Quality Symbols, which are an extension of IPA used in extIPA, but are not otherwise used in IPA proper. Other delimiters sometimes seen are pipes and double pipes taken from Americanist phonetic notation. However, these conflict with the pipes used in basic IPA prosodic transcription. [note 11] |
| Diaphonology | Backslashes are used for diaphonemic transcription, for example setting off pronunciations in dictionaries that do not target a specific preferred dialect. [note 10] Other delimiters are double slashes, – the same notation as for morphophonology, – exclamation marks, and pipes. |
| Graphemics | Angle brackets [note 12] are used to mark both original Latin orthography and transliteration from another script; they are also used to identify individual graphemes of any script. [28] [29] In IPA literature, they are used to indicate the IPA letters themselves rather than the sound values that they carry. For example, ⟨cot⟩ would be used for the orthography of the English word cot, as opposed to its pronunciation /ˈkɒt/. Italics are usual when words are written as themselves (as with cot in the previous sentence) rather than to specifically note their orthography. However, italics are sometimes ambiguous, and italic markup is not always accessible to sight-impaired readers who rely on screen reader technology. Double angle brackets may occasionally be useful to distinguish original orthography from transliteration, or the idiosyncratic spelling of a manuscript from the normalized orthography of the language. Pipes are sometimes used instead of double angle brackets to denote the distinct allographs of a grapheme that are known as glyphs . For example, print |g| and script |ɡ| are two glyph variants of the grapheme ⟨ g ⟩ of Latin script. [30] |
Some examples of contrasting brackets in the literature:
In some English accents, the phoneme /l/, which is usually spelled as ⟨l⟩ or ⟨ll⟩, is articulated as two distinct allophones: the clear [l] occurs before vowels and the consonant /j/, whereas the dark [ɫ]/[lˠ] occurs before consonants, except /j/, and at the end of words. [31]
the alternations /f/ –/v/ in plural formation in one class of nouns, as in knife/naɪf/ –knives/naɪvz/, which can be represented morphophonemically as {naɪV} –{naɪV+z}. The morphophoneme {V} stands for the phoneme set {/f/,/v/}. [32]
[ˈf\faɪnəlzˈhɛldɪn(.)⸨knockondoor⸩bɑɹsə{𝑝ˈloʊnəandˈmədɹɪd𝑝}]—f-finals held in Barcelona and Madrid. [33]
IPA letters have cursive forms designed for use in manuscripts and when taking field notes, but the Handbook recommended against their use, as cursive IPA is "harder for most people to decipher". [34] A braille representation of the IPA for blind or visually impaired professionals and students has also been developed. [35]
The International Phonetic Alphabet is occasionally modified by the Association. After each modification, the Association provides an updated simplified presentation of the alphabet in the form of a chart. (See History of the IPA.) Not all aspects of the alphabet can be accommodated in a chart of the size published by the IPA. The alveolo-palatal and epiglottal consonants, for example, are not included in the consonant chart for reasons of space rather than of theory (two additional columns would be required, one between the retroflex and palatal columns and the other between the pharyngeal and glottal columns), and the lateral flap would require an additional row for that single consonant, so they are listed instead under the catchall block of "other symbols". [36] The indefinitely large number of tone letters would make a full accounting impractical even on a larger page, and only a few examples are shown, and even the tone diacritics are not complete; the reversed tone letters are not illustrated at all.
The procedure for modifying the alphabet or the chart is to propose the change in the Journal of the IPA. (See, for example, December 2008 on an open central unrounded vowel [37] and August 2011 on central approximants.) [38] Reactions to the proposal may be published in the same or subsequent issues of the Journal (as in August 2009 on the open central vowel). [39] [ better source needed ] A formal proposal is then put to the Council of the IPA [40] [ clarification needed ] – which is elected by the membership [41] – for further discussion and a formal vote. [42] [43]
Many users of the alphabet, including the leadership of the Association itself, deviate from its standardized usage. [note 13] The Journal of the IPA finds it acceptable to mix IPA and extIPA symbols in consonant charts in their articles. (For instance, including the extIPA letter ⟨𝼆⟩, rather than ⟨ʎ̝̊⟩, in an illustration of the IPA.) [44]
Of more than 160 IPA symbols, relatively few will be used to transcribe speech in any one language, with various levels of precision. A precise phonetic transcription, in which sounds are specified in detail, is known as a narrow transcription. A coarser transcription with less detail is called a broad transcription. Both are relative terms, and both are generally enclosed in square brackets. [1] Broad phonetic transcriptions may restrict themselves to easily heard details, or only to details that are relevant to the discussion at hand, and may differ little if at all from phonemic transcriptions, but they make no theoretical claim that all the distinctions transcribed are necessarily meaningful in the language.
For example, the English word little may be transcribed broadly as [ˈlɪtəl], approximately describing many pronunciations. A narrower transcription may focus on individual or dialectical details: [ˈɫɪɾɫ] in General American, [ˈlɪʔo] in Cockney, or [ˈɫɪːɫ] in Southern US English.
Phonemic transcriptions, which express the conceptual counterparts of spoken sounds, are usually enclosed in slashes (/ /) and tend to use simpler letters with few diacritics. The choice of IPA letters may reflect theoretical claims of how speakers conceptualize sounds as phonemes or they may be merely a convenience for typesetting. Phonemic approximations between slashes do not have absolute sound values. For instance, in English, either the vowel of pick or the vowel of peak may be transcribed as /i/, so that pick, peak would be transcribed as /ˈpik,ˈpiːk/ or as /ˈpɪk,ˈpik/; and neither is identical to the vowel of the French pique, which would also be transcribed /pik/. By contrast, a narrow phonetic transcription of pick, peak, pique could be: [pʰɪk], [pʰiːk], [pikʲ].
IPA is popular for transcription by linguists. Some American linguists, however, use a mix of IPA with Americanist phonetic notation or Sinological phonetic notation or otherwise use nonstandard symbols for various reasons. [45] Authors who employ such nonstandard use are encouraged to include a chart or other explanation of their choices, which is good practice in general, as linguists differ in their understanding of the exact meaning of IPA symbols and common conventions change over time.
Many British dictionaries, including the Oxford English Dictionary and some learner's dictionaries such as the Oxford Advanced Learner's Dictionary and the Cambridge Advanced Learner's Dictionary , now use the International Phonetic Alphabet to represent the pronunciation of words. [46] However, most American (and some British) volumes use one of a variety of pronunciation respelling systems, intended to be more comfortable for readers of English and to be more acceptable across dialects, without the implication of a preferred pronunciation that the IPA might convey. For example, the respelling systems in many American dictionaries (such as Merriam-Webster ) use ⟨y⟩ for IPA [ j] and ⟨sh⟩ for IPA [ ʃ ], reflecting the usual spelling of those sounds in English. [47] [48] [note 14] (In IPA, [y] represents the sound of the French ⟨u⟩, as in tu, and [sh] represents the sequence of consonants in grasshopper.)
The IPA is also not universal among dictionaries in languages other than English. Monolingual dictionaries of languages with phonemic orthographies generally do not bother with indicating the pronunciation of most words, and tend to use respelling systems for words with unexpected pronunciations. Dictionaries produced in Israel use the IPA rarely and sometimes use the Hebrew alphabet for transcription of foreign words. [note 15] Bilingual dictionaries that translate from foreign languages into Russian usually employ the IPA, but monolingual Russian dictionaries occasionally use pronunciation respelling for foreign words. [note 16] The IPA is more common in bilingual dictionaries, but there are exceptions here too. Mass-market bilingual Czech dictionaries, for instance, tend to use the IPA only for sounds not found in Czech. [note 17]
IPA letters have been incorporated into the alphabets of various languages, notably via the Africa Alphabet in many sub-Saharan languages such as Hausa, Fula, Akan, Gbe languages, Manding languages, Lingala, etc. Capital case variants have been created for use in these languages. For example, Kabiyè of northern Togo has Ɖ ɖ, Ŋ ŋ, Ɣ ɣ, Ɔ ɔ, Ɛ ɛ, Ʋ ʋ. These, and others, are supported by Unicode, but appear in Latin ranges other than the IPA extensions.
In the IPA itself, however, only lower-case letters are used. The 1949 edition of the IPA handbook indicated that an asterisk ⟨*⟩ might be prefixed to indicate that a word was a proper name, [50] but this convention was not included in the 1999 Handbook, which notes the contrary use of the asterisk as a placeholder for a sound or feature that does not have a symbol. [51]
The IPA has widespread use among classical singers during preparation as they are frequently required to sing in a variety of foreign languages. They are also taught by vocal coaches to perfect diction and improve tone quality and tuning. [52] Opera librettos are authoritatively transcribed in IPA, such as Nico Castel's volumes [53] and Timothy Cheek's book Singing in Czech. [54] Opera singers' ability to read IPA was used by the site Visual Thesaurus, which employed several opera singers "to make recordings for the 150,000 words and phrases in VT's lexical database ... for their vocal stamina, attention to the details of enunciation, and most of all, knowledge of IPA". [55]
The International Phonetic Association organizes the letters of the IPA into three categories: pulmonic consonants, non-pulmonic consonants, and vowels. [note 18] [57]
Pulmonic consonant letters are arranged singly or in pairs of voiceless (tenuis) and voiced sounds, with these then grouped in columns from front (labial) sounds on the left to back (glottal) sounds on the right. In official publications by the IPA, two columns are omitted to save space, with the letters listed among "other symbols" even though theoretically they belong in the main chart. [note 19] They are arranged in rows from full closure (occlusives: stops and nasals) at top, to brief closure (vibrants: trills and taps), to partial closure (fricatives), and finally minimal closure (approximants) at bottom, again with a row left out to save space. In the table below, a slightly different arrangement is made: All pulmonic consonants are included in the pulmonic-consonant table, and the vibrants and laterals are separated out so that the rows reflect the common lenition pathway of stop → fricative → approximant, as well as the fact that several letters pull double duty as both fricative and approximant; affricates may then be created by joining stops and fricatives from adjacent cells. Shaded cells represent articulations that are judged to be impossible or not distinctive.
Vowel letters are also grouped in pairs—of unrounded and rounded vowel sounds—with these pairs also arranged from front on the left to back on the right, and from maximal closure at top to minimal closure at bottom. No vowel letters are omitted from the chart, though in the past some of the mid central vowels were listed among the "other symbols".
A pulmonic consonant is a consonant made by obstructing the glottis (the space between the vocal folds) or oral cavity (the mouth) and either simultaneously or subsequently letting out air from the lungs. Pulmonic consonants make up the majority of consonants in the IPA, as well as in human language. All consonants in English fall into this category. [59]
The pulmonic consonant table, which includes most consonants, is arranged in rows that designate manner of articulation, meaning how the consonant is produced, and columns that designate place of articulation, meaning where in the vocal tract the consonant is produced. The main chart includes only consonants with a single place of articulation.
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Notes
Non-pulmonic consonants are sounds whose airflow is not dependent on the lungs. These include clicks (found in the Khoisan languages and some neighboring Bantu languages of Africa), implosives (found in languages such as Sindhi, Hausa, Swahili and Vietnamese), and ejectives (found in many Amerindian and Caucasian languages).
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Notes
Affricates and co-articulated stops are represented by two letters joined by a tie bar, either above or below the letters with no difference in meaning. [note 21] Affricates are optionally represented by ligatures (e.g. ⟨ʧ, ʤ ⟩), though this is no longer official IPA usage [1] because a great number of ligatures would be required to represent all affricates this way. Alternatively, a superscript notation for a consonant release is sometimes used to transcribe affricates, for example ⟨tˢ⟩ for [t͜s], paralleling [kˣ] ~ [k͜x]. The letters for the palatal plosives ⟨c⟩ and ⟨ɟ⟩ are often used as a convenience for [t͜ʃ] and [d͜ʒ] or similar affricates, even in official IPA publications, so they must be interpreted with care.
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Co-articulated consonants are sounds that involve two simultaneous places of articulation (are pronounced using two parts of the vocal tract). In English, the [w] in "went" is a coarticulated consonant, being pronounced by rounding the lips and raising the back of the tongue. Similar sounds are [ʍ] and [ɥ]. In some languages, plosives can be double-articulated, for example in the name of Laurent Gbagbo.
Notes
The IPA defines a vowel as a sound which occurs at a syllable center. [68] Below is a chart depicting the vowels of the IPA. The IPA maps the vowels according to the position of the tongue.
| ||||||||||||||||||||||||||||||||
The vertical axis of the chart is mapped by vowel height. Vowels pronounced with the tongue lowered are at the bottom, and vowels pronounced with the tongue raised are at the top. For example, [ɑ] (the first vowel in father) is at the bottom because the tongue is lowered in this position. [i] (the vowel in "meet") is at the top because the sound is said with the tongue raised to the roof of the mouth.
In a similar fashion, the horizontal axis of the chart is determined by vowel backness. Vowels with the tongue moved towards the front of the mouth (such as [ɛ], the vowel in "met") are to the left in the chart, while those in which it is moved to the back (such as [ʌ], the vowel in "but") are placed to the right in the chart.
In places where vowels are paired, the right represents a rounded vowel (in which the lips are rounded) while the left is its unrounded counterpart.
Diphthongs are typically specified with a non-syllabic diacritic, as in ⟨ui̯⟩ or ⟨u̯i⟩, or with a superscript for the on- or off-glide, as in ⟨uⁱ⟩ or ⟨ᵘi⟩. Sometimes a tie bar is used: ⟨u͜i⟩, especially when it is difficult to tell if the diphthong is characterized by an on-glide or an off-glide or when it is variable.
Notes
Diacritics are used for phonetic detail. They are added to IPA letters to indicate a modification or specification of that letter's normal pronunciation. [69]
By being made superscript, any IPA letter may function as a diacritic, conferring elements of its articulation to the base letter. Those superscript letters listed below are specifically provided for by the IPA Handbook; other uses can be illustrated with ⟨tˢ⟩ ([t] with fricative release), ⟨ᵗs⟩ ([s] with affricate onset), ⟨ⁿd⟩ (prenasalized [d]), ⟨bʱ⟩ ([b] with breathy voice), ⟨mˀ⟩ (glottalized [m]), ⟨sᶴ⟩ ([s] with a flavor of [ʃ], i.e. a voiceless alveolar retracted sibilant), ⟨oᶷ⟩ ([o] with diphthongization), ⟨ɯᵝ⟩ (compressed [ɯ]). Superscript diacritics placed after a letter are ambiguous between simultaneous modification of the sound and phonetic detail at the end of the sound. For example, labialized ⟨kʷ⟩ may mean either simultaneous [k] and [w] or else [k] with a labialized release. Superscript diacritics placed before a letter, on the other hand, normally indicate a modification of the onset of the sound (⟨mˀ⟩ glottalized [m], ⟨ˀm⟩[m] with a glottal onset). (See § Superscript IPA.)
Airstream diacritics | |||||
---|---|---|---|---|---|
◌ʼ | kʼsʼ | Ejective | |||
Syllabicity diacritics | |||||
◌̩ | ɹ̩n̩ | Syllabic | ◌̯ | ɪ̯ʊ̯ | Non-syllabic |
◌̍ | ɻ̍ŋ̍ | ◌̑ | y̑ | ||
Consonant-release diacritics | |||||
◌ʰ | tʰ | Aspirated [α] | ◌̚ | p̚ | No audible release |
◌ⁿ | dⁿ | Nasal release | ◌ˡ | dˡ | Lateral release |
◌ᶿ | tᶿ | Voiceless dental fricative release | ◌ˣ | tˣ | Voiceless velar fricative release |
◌ᵊ | dᵊ | Mid central vowel release | |||
Phonation diacritics | |||||
◌̥ | n̥d̥ | Voiceless | ◌̬ | s̬t̬ | Voiced |
◌̊ | ɻ̊ŋ̊ | ||||
◌̤ | b̤a̤ | Breathy voiced [α] | ◌̰ | b̰a̰ | Creaky voiced |
Articulation diacritics | |||||
◌̪ | t̪d̪ | Dental (◌͆ is dentolabial or underbite in extIPA) | ◌̼ | t̼d̼ | Linguolabial |
◌͆ | ɮ͆ | ||||
◌̺ | t̺d̺ | Apical | ◌̻ | t̻d̻ | Laminal |
◌̟ | u̟t̟ | Advanced (fronted) | ◌̠ | i̠t̠ | Retracted (backed) |
◌᫈ | ɡ᫈ | ◌ | q [β] | ||
◌̈ | ëä | Centralized | ◌̽ | e̽ɯ̽ | Mid-centralized |
◌̝ | e̝r̝ | Raised ([r̝],[ɭ˔] are fricatives) | ◌̞ | e̞β̞ | Lowered ([β̞],[ɣ˕] are approximants) |
◌˔ | ɭ˔ | ◌˕ | y˕ɣ˕ | ||
Co-articulation diacritics | |||||
◌̹ | ɔ̹x̹ | More rounded / less spread (over-rounding) | ◌̜ | ɔ̜xʷ̜ | Less rounded / more spread (under-rounding) [γ] |
◌͗ | y͗χ͗ | ◌͑ | y͑χ͑ʷ | ||
◌ʷ | tʷdʷ | Labialized | ◌ʲ | tʲdʲ | Palatalized |
◌ˠ | tˠdˠ | Velarized | ◌̴ | ɫ ᵶ | Velarized or pharyngealized |
◌ˤ | tˤaˤ | Pharyngealized | |||
◌̘ | e̘o̘ | Advanced tongue root | ◌̙ | e̙o̙ | Retracted tongue root |
◌꭪ | y꭪ | ◌꭫ | y꭫ | ||
◌̃ | ẽz̃ | Nasalized | ◌˞ | ɚɝ | Rhoticity |
Notes:
Subdiacritics (diacritics normally placed below a letter) may be moved above a letter to avoid conflict with a descender, as in voiceless ⟨ŋ̊⟩. [69] The raising and lowering diacritics have optional spacing forms ⟨˔⟩, ⟨˕⟩ that avoid descenders.
The state of the glottis can be finely transcribed with diacritics. A series of alveolar plosives ranging from open-glottis to closed-glottis phonation is:
Open glottis | [t] | voiceless |
---|---|---|
[d̤] | breathy voice, also called murmured | |
[d̥] | slack voice | |
Sweet spot | [d] | modal voice |
[d̬] | stiff voice | |
[d̰] | creaky voice | |
Closed glottis | [ʔ͜t] | glottal closure |
Additional diacritics are provided by the Extensions to the IPA for speech pathology.
These symbols describe the features of a language above the level of individual consonants and vowels, that is, at the level of syllable, word or phrase. These include prosody, pitch, length, stress, intensity, tone and gemination of the sounds of a language, as well as the rhythm and intonation of speech. [71] Various ligatures of pitch/tone letters and diacritics are provided for by the Kiel Convention and used in the IPA Handbook despite not being found in the summary of the IPA alphabet found on the one-page chart.
Under capital letters below we will see how a carrier letter may be used to indicate suprasegmental features such as labialization or nasalization. Some authors omit the carrier letter, for e.g. suffixed [kʰuˣt̪s̟]ʷ or prefixed [ʷkʰuˣt̪s̟], [note 22] or place a spacing variant of a diacritic such as ⟨˔⟩ or ⟨˜⟩ at the beginning or end of a word to indicate that it applies to the entire word. [note 23]
Length, stress, and rhythm | |||
---|---|---|---|
ˈke | Primary stress (appears before stressed syllable) | ˌke | Secondary stress (appears before stressed syllable) |
eːkː | Long (long vowel or geminate consonant) | eˑ | Half-long |
ə̆ɢ̆ | Extra-short | ||
ek.ste eks.te | Syllable break (internal boundary) | es‿e | Linking (lack of a boundary; a phonological word) [note 24] |
Intonation | |||
| [α] | Minor or foot break | ‖ [α] | Major or intonation break |
↗︎ | Global rise [note 25] | ↘︎ | Global fall [note 25] |
Up- and down-step | |||
ꜛke | Upstep | ꜜke | Downstep |
Notes:
Pitch diacritics [note 26] | ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|
ŋ̋e̋ | Extra high | ŋ̌ě | Rising | ŋ᷄e᷄ | Mid-rising | |||||||
ŋ́é | High | ŋ̂ê | Falling | ŋ᷅e᷅ | Low-rising | |||||||
ŋ̄ē | Mid | ŋ᷈e᷈ | Peaking (rising–falling) | ŋ᷇e᷇ | High-falling | |||||||
ŋ̀è | Low | ŋ᷉e᷉ | Dipping (falling–rising) | ŋ᷆e᷆ | Mid-falling | |||||||
ŋ̏ȅ | Extra low | (etc.) [note 27] |
Chao tone letters [note 26] | ||||
---|---|---|---|---|
˥e | ꜒e | e˥ | e꜒ | High |
˦e | ꜓e | e˦ | e꜓ | Half-high |
˧e | ꜔e | e˧ | e꜔ | Mid |
˨e | ꜕e | e˨ | e꜕ | Half-low |
˩e | ꜖e | e˩ | e꜖ | Low |
˩˥e | ꜖꜒e | e˩˥ | e꜖꜒ | Rising (low to high or generic) |
˥˩e | ꜒꜖e | e˥˩ | e꜒꜖ | Falling (high to low or generic) |
(etc.) |
The old staveless tone letters, which are effectively obsolete, include high ⟨ˉe⟩, mid ⟨˗e⟩, low ⟨ˍe⟩, rising ⟨ˊe⟩, falling ⟨ˋe⟩, low rising ⟨ˏe⟩ and low falling ⟨ˎe⟩.
Officially, the stress marks ⟨ˈ ˌ⟩ appear before the stressed syllable, and thus mark the syllable boundary as well as stress (though the syllable boundary may still be explicitly marked with a period). [74] Occasionally the stress mark is placed immediately before the nucleus of the syllable, after any consonantal onset. [75] In such transcriptions, the stress mark does not mark a syllable boundary. The primary stress mark may be doubled ⟨ˈˈ⟩ for extra stress (such as prosodic stress). The secondary stress mark is sometimes seen doubled ⟨ˌˌ⟩ for extra-weak stress, but this convention has not been adopted by the IPA. [74] Some dictionaries place both stress marks before a syllable, ⟨¦⟩, to indicate that pronunciations with either primary or secondary stress are heard, though this is not IPA usage. [note 28]
There are three boundary markers: ⟨.⟩ for a syllable break, ⟨|⟩ for a minor prosodic break and ⟨‖⟩ for a major prosodic break. The tags 'minor' and 'major' are intentionally ambiguous. Depending on need, 'minor' may vary from a foot break to a break in list-intonation to a continuing–prosodic unit boundary (equivalent to a comma), and while 'major' is often any intonation break, it may be restricted to a final–prosodic unit boundary (equivalent to a period). The 'major' symbol may also be doubled, ⟨‖‖⟩, for a stronger break. [note 29]
Although not part of the IPA, the following additional boundary markers are often used in conjunction with the IPA: ⟨μ⟩ for a mora or mora boundary, ⟨σ⟩ for a syllable or syllable boundary, ⟨+⟩ for a morpheme boundary, ⟨#⟩ for a word boundary (may be doubled, ⟨##⟩, for e.g. a breath-group boundary), [77] ⟨$⟩ for a phrase or intermediate boundary and ⟨%⟩ for a prosodic boundary. For example, C# is a word-final consonant, %V a post-pausa vowel, and σC a syllable-initial consonant.
⟨ꜛ ꜜ⟩ are defined in the Handbook as "upstep" and "downstep", concepts from tonal languages. However, the upstep symbol can also be used for pitch reset, and the IPA Handbook uses it for prosody in the illustration for Portuguese, a non-tonal language.
Phonetic pitch and phonemic tone may be indicated by either diacritics placed over the nucleus of the syllable (e.g., high-pitch ⟨é⟩) or by Chao tone letters placed either before or after the word or syllable. There are three graphic variants of the tone letters: with or without a stave, and facing left or facing right from the stave. The stave was introduced with the 1989 Kiel Convention, as was the option of placing a staved letter after the word or syllable, while retaining the older conventions. There are therefore six ways to transcribe pitch/tone in the IPA: i.e., ⟨é⟩, ⟨˦e⟩, ⟨e˦⟩, ⟨꜓e⟩, ⟨e꜓⟩ and ⟨ˉe⟩ for a high pitch/tone. [74] [78] [79] Of the tone letters, only left-facing staved letters and a few representative combinations are shown in the summary on the Chart, and in practice it is currently more common for tone letters to occur after the syllable/word than before, as in the Chao tradition. Placement before the word is a carry-over from the pre-Kiel IPA convention, as is still the case for the stress and upstep/downstep marks. The IPA endorses the Chao tradition of using the left-facing tone letters, ⟨˥ ˦ ˧ ˨ ˩⟩, for underlying tone, and the right-facing letters, ⟨꜒ ꜓ ꜔ ꜕ ꜖⟩, for surface tone, as occurs in tone sandhi, and for the intonation of non-tonal languages. [note 30] In the Portuguese illustration in the 1999 Handbook, tone letters are placed before a word or syllable to indicate prosodic pitch (equivalent to [↗︎] global rise and [↘︎] global fall, but allowing more precision), and in the Cantonese illustration they are placed after a word/syllable to indicate lexical tone. Theoretically therefore prosodic pitch and lexical tone could be simultaneously transcribed in a single text, though this is not a formalized distinction.
Rising and falling pitch, as in contour tones, are indicated by combining the pitch diacritics and letters in the table, such as grave plus acute for rising [ě] and acute plus grave for falling [ê]. Only six combinations of two diacritics are supported, and only across three levels (high, mid, low), despite the diacritics supporting five levels of pitch in isolation. The four other explicitly approved rising and falling diacritic combinations are high/mid rising [e᷄], low rising [e᷅], high falling [e᷇], and low/mid falling [e᷆]. [note 31]
The Chao tone letters, on the other hand, may be combined in any pattern, and are therefore used for more complex contours and finer distinctions than the diacritics allow, such as mid-rising [e˨˦], extra-high falling [e˥˦], etc. There are 20 such possibilities. However, in Chao's original proposal, which was adopted by the IPA in 1989, he stipulated that the half-high and half-low letters ⟨˦ ˨⟩ may be combined with each other, but not with the other three tone letters, so as not to create spuriously precise distinctions. With this restriction, there are 8 possibilities. [80]
The old staveless tone letters tend to be more restricted than the staved letters, though not as restricted as the diacritics. Officially, they support as many distinctions as the staved letters, [note 32] but typically only three pitch levels are distinguished. Unicode supports default or high-pitch ⟨ˉ ˊ ˋ ˆ ˇ ˜ ˙⟩ and low-pitch ⟨ˍ ˏ ˎ ꞈ ˬ ˷⟩. Only a few mid-pitch tones are supported (such as ⟨˗ ˴⟩), and then only accidentally.
Although tone diacritics and tone letters are presented as equivalent on the chart, "this was done only to simplify the layout of the chart. The two sets of symbols are not comparable in this way." [81] Using diacritics, a high tone is ⟨é⟩ and a low tone is ⟨è⟩; in tone letters, these are ⟨e˥⟩ and ⟨e˩⟩. One can double the diacritics for extra-high ⟨e̋⟩ and extra-low ⟨ȅ⟩; there is no parallel to this using tone letters. Instead, tone letters have mid-high ⟨e˦⟩ and mid-low ⟨e˨⟩; again, there is no equivalent among the diacritics. Thus in a three-register tone system, ⟨é ē è⟩ are equivalent to ⟨e˥ e˧ e˩⟩, while in a four-register system, ⟨e̋ é è ȅ⟩ may be equivalent to ⟨e˥ e˦ e˨ e˩⟩. [74]
The correspondence breaks down even further once they start combining. For more complex tones, one may combine three or four tone diacritics in any permutation, [74] though in practice only generic peaking (rising-falling) e᷈ and dipping (falling-rising) e᷉ combinations are used. Chao tone letters are required for finer detail (e˧˥˧,e˩˨˩,e˦˩˧,e˨˩˦, etc.). Although only 10 peaking and dipping tones were proposed in Chao's original, limited set of tone letters, phoneticians often make finer distinctions, and indeed an example is found on the IPA Chart. [note 33] The system allows the transcription of 112 peaking and dipping pitch contours, including tones that are level for part of their length.
Register | Level [note 35] | Rising | Falling | Peaking | Dipping |
---|---|---|---|---|---|
e˩ | e˩˩ | e˩˧ | e˧˩ | e˩˧˩ | e˧˩˧ |
e˨ | e˨˨ | e˨˦ | e˦˨ | e˨˦˨ | e˦˨˦ |
e˧ | e˧˧ | e˧˥ | e˥˧ | e˧˥˧ | e˥˧˥ |
e˦ | e˦˦ | e˧˥˩ | e˧˩˥ | ||
e˥ | e˥˥ | e˩˥ | e˥˩ | e˩˥˧ | e˥˩˧ |
More complex contours are possible. Chao gave an example of [꜔꜒꜖꜔] (mid-high-low-mid) from English prosody. [80]
Chao tone letters generally appear after each syllable, for a language with syllable tone (⟨a˧vɔ˥˩⟩), or after the phonological word, for a language with word tone (⟨avɔ˧˥˩⟩). The IPA gives the option of placing the tone letters before the word or syllable (⟨˧a˥˩vɔ⟩, ⟨˧˥˩avɔ⟩), but this is rare for lexical tone. (And indeed reversed tone letters may be used to clarify that they apply to the following rather than to the preceding syllable: ⟨꜔a꜒꜖vɔ⟩, ⟨꜔꜒꜖avɔ⟩.) The staveless letters are not directly supported by Unicode, but some fonts allow the stave in Chao tone letters to be suppressed.
IPA diacritics may be doubled to indicate an extra degree (greater intensity) of the feature indicated. [82] This is a productive process, but apart from extra-high and extra-low tones being marked by doubled high- and low-tone diacritics, ⟨ə̋, ə̏⟩, the major prosodic break ⟨‖⟩ being marked as a doubled minor break ⟨|⟩, and a couple other instances, such usage is not enumerated by the IPA.
For example, the stress mark may be doubled (or even tripled, etc.) to indicate an extra degree of stress, such as prosodic stress in English. [83] An example in French, with a single stress mark for normal prosodic stress at the end of each prosodic unit (marked as a minor prosodic break), and a double or even triple stress mark for contrastive/emphatic stress: [ˈˈɑ̃ːˈtre|məˈsjø‖ˈˈvwalamaˈdam‖]Entrez monsieur, voilà madame. [84] Similarly, a doubled secondary stress mark ⟨ˌˌ⟩ is commonly used for tertiary (extra-light) stress, though a proposal to officially adopt this was rejected. [85] In a similar vein, the effectively obsolete staveless tone letters were once doubled for an emphatic rising intonation ⟨˶⟩ and an emphatic falling intonation ⟨˵⟩. [86]
Length is commonly extended by repeating the length mark, which may be phonetic, as in [ĕeeˑeːeːˑeːː] etc., as in English shhh![ʃːːː], or phonemic, as in the "overlong" segments of Estonian:
(Normally additional phonemic degrees of length are handled by the extra-short or half-long diacritic, i.e. ⟨e eˑ eː⟩ or ⟨ĕ e eː⟩, but the first two words in each of the Estonian examples are analyzed as typically short and long, /eeː/ and /nnː/, requiring a different remedy for the additional words.)
Delimiters are similar: double slashes indicate extra phonemic (morpho-phonemic), double square brackets especially precise transcription, and double parentheses especially unintelligible.
Occasionally other diacritics are doubled:
The extIPA provides combining parentheses for weak intensity, which when combined with a doubled diacritic indicate an intermediate degree. For instance, increasing degrees of nasalization of the vowel [e] might be written ⟨e ẽ᪻ ẽ ẽ̃᪻ ẽ̃⟩.
As noted above, IPA letters are often used quite loosely in broad transcription if no ambiguity would arise in a particular language. Because of that, IPA letters have not generally been created for sounds that are not distinguished in individual languages. A distinction between voiced fricatives and approximants is only partially implemented by the IPA, for example. Even with the relatively recent addition of the palatal fricative ⟨ʝ⟩ and the velar approximant ⟨ɰ⟩ to the alphabet, other letters, though defined as fricatives, are often ambiguous between fricative and approximant. For forward places, ⟨β⟩ and ⟨ð⟩ can generally be assumed to be fricatives unless they carry a lowering diacritic. Rearward, however, ⟨ʁ⟩ and ⟨ʕ⟩ are perhaps more commonly intended to be approximants even without a lowering diacritic. ⟨h⟩ and ⟨ɦ⟩ are similarly either fricatives or approximants, depending on the language, or even glottal "transitions", without that often being specified in the transcription.
Another common ambiguity is among the letters for palatal consonants. ⟨c⟩ and ⟨ɟ⟩ are not uncommonly used as a typographic convenience for affricates, typically [t͜ʃ] and [d͜ʒ], while ⟨ɲ⟩ and ⟨ʎ⟩ are commonly used for palatalized alveolar [n̠ʲ] and [l̠ʲ]. To some extent this may be an effect of analysis, but it is common to match up single IPA letters to the phonemes of a language, without overly worrying about phonetic precision.
It has been argued that the lower-pharyngeal (epiglottal) fricatives ⟨ʜ⟩ and ⟨ʢ⟩ are better characterized as trills, rather than as fricatives that have incidental trilling. [93] This has the advantage of merging the upper-pharyngeal fricatives [ħ,ʕ] together with the epiglottal plosive [ʡ] and trills [ʜʢ] into a single pharyngeal column in the consonant chart. However, in Shilha Berber the epiglottal fricatives are not trilled. [94] [95] Although they might be transcribed ⟨ħ̠ ʢ̠⟩ to indicate this, the far more common transcription is ⟨ʜ ʢ⟩, which is therefore ambiguous between languages.
Among vowels, ⟨a⟩ is officially a front vowel, but is more commonly treated as a central vowel. The difference, to the extent it is even possible, is not phonemic in any language.
For all phonetic notation, it is good practice for an author to specify exactly what they mean by the symbols that they use.
Superscript IPA letters are used to indicate secondary aspects of articulation. These may be aspects of simultaneous articulation that are considered to be in some sense less dominant than the basic sound, or may be transitional articulations that are interpreted as secondary elements. [96] Examples include secondary articulation; onsets, releases and other transitions; shades of sound; light epenthetic sounds and incompletely articulated sounds. Morphophonemically, superscripts may be used for assimilation, e.g. ⟨aʷ⟩ for the effect of labialization on a vowel /a/, which may be realized as phonemic /o/. [97] The IPA and ICPLA endorse Unicode encoding of superscript variants of all contemporary segmental letters, including the "implicit" IPA retroflex letters ⟨ꞎ 𝼅 𝼈 ᶑ 𝼊 ⟩. [44] [98] [99]
Superscripts are often used as a substitute for the tie bar, for example ⟨tᶴ⟩ for [t͜ʃ] and ⟨kᵖ⟩ or ⟨ᵏp⟩ for [k͜p]. However, in precise notation there is a difference between a fricative release in [tᶴ] and the affricate [t͜ʃ], between a velar onset in [ᵏp] and doubly articulated [k͜p]. [100]
Superscript letters can be meaningfully modified by combining diacritics, just as baseline letters can. For example, a superscript dental nasal in ⟨ⁿ̪d̪⟩, a superscript voiceless velar nasal in ⟨ᵑ̊ǂ⟩, and labial-velar prenasalization in ⟨ᵑ͡ᵐɡ͡b⟩. Although the diacritic may seem a bit oversized compared to the superscript letter it modifies, e.g. ⟨ᵓ̃⟩, this can be an aid to legibility, just as it is with the composite superscript c-cedilla ⟨ᶜ̧⟩ and rhotic vowels ⟨ᵊ˞ ᶟ˞⟩. Superscript length marks can be used to indicate the length of aspiration of a consonant, e.g. [pʰtʰ𐞂kʰ𐞁]. Another option is to use extIPA parentheses and a doubled diacritic: ⟨p⁽ʰ⁾ tʰ kʰʰ⟩. [44]
A number of IPA letters and diacritics have been retired or replaced over the years. This number includes duplicate symbols, symbols that were replaced due to user preference, and unitary symbols that were rendered with diacritics or digraphs to reduce the inventory of the IPA. The rejected symbols are now considered obsolete, though some are still seen in the literature.
The IPA once had several pairs of duplicate symbols from alternative proposals, but eventually settled on one or the other. An example is the vowel letter ⟨ɷ⟩, rejected in favor of ⟨ʊ⟩. Affricates were once transcribed with ligatures, such as ⟨ʧ ʤ ⟩ (and others, some of which not found in Unicode). These have been officially retired but are still used. Letters for specific combinations of primary and secondary articulation have also been mostly retired, with the idea that such features should be indicated with tie bars or diacritics: ⟨ƍ⟩ for [zʷ] is one. In addition, the rare voiceless implosives, ⟨ƥ ƭ ƈ ƙ ʠ ⟩, were dropped soon after their introduction and are now usually written ⟨ɓ̥ ɗ̥ ʄ̊ ɠ̊ ʛ̥ ⟩. The original set of click letters, ⟨ʇ, ʗ, ʖ, ʞ⟩, was retired but is still sometimes seen, as the current pipe letters ⟨ǀ, ǃ, ǁ, ǂ⟩ can cause problems with legibility, especially when used with brackets ([ ] or / /), the letter ⟨l⟩ (small L), or the prosodic marks ⟨|, ‖⟩. (For this reason, some publications which use the current IPA pipe letters disallow IPA brackets.) [101]
Individual non-IPA letters may find their way into publications that otherwise use the standard IPA. This is especially common with:
In addition, it is common to see ad hoc typewriter substitutions, generally capital letters, for when IPA support is not available, e.g. S for ⟨ ʃ ⟩. (See also SAMPA and X-SAMPA substitute notation.)
The Extensions to the International Phonetic Alphabet for Disordered Speech, commonly abbreviated "extIPA" and sometimes called "Extended IPA", are symbols whose original purpose was to accurately transcribe disordered speech. At the Kiel Convention in 1989, a group of linguists drew up the initial extensions, [note 39] which were based on the previous work of the PRDS (Phonetic Representation of Disordered Speech) Group in the early 1980s. [103] The extensions were first published in 1990, then modified, and published again in 1994 in the Journal of the International Phonetic Association, when they were officially adopted by the ICPLA. [104] While the original purpose was to transcribe disordered speech, linguists have used the extensions to designate a number of sounds within standard communication, such as hushing, gnashing teeth, and smacking lips, [2] as well as regular lexical sounds such as lateral fricatives that do not have standard IPA symbols.
In addition to the Extensions to the IPA for disordered speech, there are the conventions of the Voice Quality Symbols, which include a number of symbols for additional airstream mechanisms and secondary articulations in what they call "voice quality".
Capital letters and various characters on the number row of the keyboard are commonly used to extend the alphabet in various ways.
There are various punctuation-like conventions for linguistic transcription that are commonly used together with IPA. Some of the more common are:
Full capital letters are not used as IPA symbols, except as typewriter substitutes (e.g. N for ⟨ŋ⟩, S for ⟨ ʃ ⟩, O for ⟨ɔ⟩– see SAMPA). They are, however, often used in conjunction with the IPA in two cases:
Wildcards are commonly used in phonology to summarize syllable or word shapes, or to show the evolution of classes of sounds. For example, the possible syllable shapes of Mandarin can be abstracted as ranging from /V/ (an atonic vowel) to /CGVNᵀ/ (a consonant-glide-vowel-nasal syllable with tone), and word-final devoicing may be schematized as C → C̥/_#. They are also used in historical linguistics for a sound that is posited but whose nature has not been determined beyond some generic category such as {nasal} or {uvular}. In speech pathology, capital letters represent indeterminate sounds, and may be superscripted to indicate they are weakly articulated: e.g. [ᴰ] is a weak indeterminate alveolar, [ᴷ] a weak indeterminate velar. [108]
There is a degree of variation between authors as to the capital letters used, but ⟨C⟩ for {consonant}, ⟨V⟩ for {vowel} and ⟨N⟩ for {nasal} are ubiquitous in English-language material. Other common conventions are ⟨T⟩ for {tone/accent} (tonicity), ⟨P⟩ for {plosive}, ⟨F⟩ for {fricative}, ⟨S⟩ for {sibilant}, [note 40] ⟨G⟩ for {glide/semivowel}, ⟨L⟩ for {lateral} or {liquid}, ⟨R⟩ for {rhotic} or {resonant/sonorant}, [note 41] ⟨₵⟩ for {obstruent},⟨Ʞ⟩ for {click}, ⟨A, E, O, Ɨ, U⟩ for {open, front, back, close, rounded vowel} [note 42] and ⟨B, D, Ɉ, K, Q, Φ, H⟩ for {labial, alveolar, post-alveolar/palatal, velar, uvular, pharyngeal, glottal [note 43] consonant}, respectively, and ⟨X⟩ for {any sound}, as in ⟨CVX⟩ for a heavy syllable {CVC, CVV̯, CVː}. The letters can be modified with IPA diacritics, for example ⟨Cʼ⟩ for {ejective}, ⟨Ƈ ⟩ for {implosive}, ⟨N͡C⟩ or ⟨ᴺC⟩ for {prenasalized consonant}, ⟨Ṽ⟩ for {nasal vowel}, ⟨CʰV́⟩ for {aspirated CV syllable with high tone}, ⟨S̬⟩ for {voiced sibilant}, ⟨N̥⟩ for {voiceless nasal}, ⟨P͡F⟩ or ⟨Pꟳ⟩ for {affricate}, ⟨Cᴳ⟩ for a consonant with a glide as secondary articulation (e.g. ⟨Cʲ⟩ for {palatalized consonant} and ⟨Cʷ⟩ for {labialized consonant}) and ⟨D̪⟩ for {dental consonant}. ⟨H⟩, ⟨M⟩, ⟨L⟩ are also commonly used for high, mid and low tone, with ⟨LH⟩ for rising tone and ⟨HL⟩ for falling tone, rather than transcribing them overly precisely with IPA tone letters or with ambiguous digits. [note 44]
Typical examples of archiphonemic use of capital letters are ⟨I⟩ for the Turkish harmonic vowel set {iyɯu}; [note 45] ⟨D⟩ for the conflated flapped middle consonant of American English writer and rider; ⟨N⟩ for the homorganic syllable-coda nasal of languages such as Spanish and Japanese (essentially equivalent to the wild-card usage of the letter); and ⟨R⟩ in cases where a phonemic distinction between trill /r/ and flap /ɾ/ is conflated, as in Spanish enrejar/eNreˈxaR/ (the n is homorganic and the first r is a trill, but the second r is variable). [109] Similar usage is found for phonemic analysis, where a language does not distinguish sounds that have separate letters in the IPA. For instance, Castillian Spanish has been analyzed as having phonemes /Θ/ and /S/, which surface as [θ] and [s] in voiceless environments and as [ð] and [z] in voiced environments (e.g. hazte/ˈaΘte/ → [ˈaθte], vs hazme/ˈaΘme/ → [ˈaðme], or las manos/laSˈmanoS/ → [lazˈmanos]). [110]
⟨V⟩, ⟨F⟩ and ⟨C⟩ have completely different meanings as Voice Quality Symbols, where they stand for "voice" (VoQS jargon for secondary articulation), [note 46] "falsetto" and "creak". These three letters may take diacritics to indicate what kind of voice quality an utterance has, and may be used as carrier letters to extract a suprasegmental feature that occurs on all susceptible segments in a stretch of IPA. For instance, the transcription of Scottish Gaelic [kʷʰuˣʷt̪ʷs̟ʷ] 'cat' and [kʷʰʉˣʷt͜ʃʷ] 'cats' (Islay dialect) can be made more economical by extracting the suprasegmental labialization of the words: Vʷ[kʰuˣt̪s̟] and Vʷ[kʰʉˣt͜ʃ]. [111] The conventional wildcards ⟨X⟩ or ⟨C⟩ might be used instead of VoQS ⟨V⟩ so that the reader does not misinterpret ⟨Vʷ⟩ as meaning that only vowels are labialized (i.e. Xʷ[kʰuˣt̪s̟] for all segments labialized, Cʷ[kʰuˣt̪s̟] for all consonants labialized), or the carrier letter may be omitted altogether (e.g. ʷ[kʰuˣt̪s̟], [ʷkʰuˣt̪s̟] or [kʰuˣt̪s̟]ʷ). (See § Suprasegmentals for other transcription conventions.)
This summary is to some extent valid internationally, but linguistic material written in other languages may have different associations with capital letters used as wildcards. For example, in German ⟨K⟩ and ⟨V⟩ are used for Konsonant (consonant) and Vokal (vowel); in French, tone may be transcribed with ⟨H⟩ and ⟨B⟩ for haut (high) and bas (low). [112]
The blank cells on the summary IPA chart can be filled without much difficulty if the need arises.
The missing retroflex letters, namely ⟨ᶑ ꞎ 𝼅 𝼈 𝼊 ⟩, are "implicit" in the alphabet, and the IPA supported their adoption into Unicode. [44] Attested in the literature are the retroflex implosive ⟨ᶑ ⟩, the voiceless retroflex lateral fricative ⟨ꞎ ⟩, the retroflex lateral flap ⟨𝼈 ⟩ and the retroflex click ⟨𝼊 ⟩; the first is also mentioned in the IPA Handbook, and the lateral fricatives are provided for by the extIPA.
The epiglottal trill is arguably covered by the generally trilled epiglottal "fricatives" ⟨ʜ ʢ⟩. Ad hoc letters for near-close central vowels, ⟨ᵻ ᵿ⟩, are used in some descriptions of English, though those are specifically reduced vowels (forming a set with the IPA reduced vowels ⟨ə ɐ⟩), and the simple points in vowel space are easily transcribed with diacritics: ⟨ɪ̈ ʊ̈⟩ or ⟨ɨ̞ ʉ̞⟩. Diacritics are able to fill in most of the remainder of the charts. [note 47] If a sound cannot be transcribed, an asterisk ⟨*⟩ may be used, either as a letter or as a diacritic (as in ⟨k*⟩ sometimes seen for the Korean "fortis" velar).
Representations of consonant sounds outside of the core set are created by adding diacritics to letters with similar sound values. The Spanish bilabial and dental approximants are commonly written as lowered fricatives, [β̞] and [ð̞] respectively. [note 48] Similarly, voiced lateral fricatives can be written as raised lateral approximants, [ɭ˔ʎ̝ʟ̝], though the extIPA also provides ⟨𝼅⟩ for the first of these. A few languages such as Banda have a bilabial flap as the preferred allophone of what is elsewhere a labiodental flap. It has been suggested that this be written with the labiodental flap letter and the advanced diacritic, [ⱱ̟]. [114] Similarly, a labiodental trill would be written [ʙ̪] (bilabial trill and the dental sign), and the labiodental plosives are now universally ⟨p̪ b̪⟩ rather than the ad hoc letters ⟨ȹ ȸ⟩ once found in Bantuist literature. Other taps can be written as extra-short plosives or laterals, e.g. [ ɟ̆ɢ̆ʟ̆], though in some cases the diacritic would need to be written below the letter. A retroflex trill can be written as a retracted [r̠], just as non-subapical retroflex fricatives sometimes are. The remaining pulmonic consonants – the uvular laterals ([ʟ̠𝼄̠ʟ̠˔]) and the palatal trill – while not strictly impossible, are very difficult to pronounce and are unlikely to occur even as allophones in the world's languages.
The vowels are similarly manageable by using diacritics for raising, lowering, fronting, backing, centering, and mid-centering. [note 49] For example, the unrounded equivalent of [ʊ] can be transcribed as mid-centered [ɯ̽], and the rounded equivalent of [æ] as raised [ɶ̝] or lowered [œ̞] (though for those who conceive of vowel space as a triangle, simple [ɶ] already is the rounded equivalent of [æ]). True mid vowels are lowered [e̞ø̞ɘ̞ɵ̞ɤ̞o̞] or raised [ɛ̝œ̝ɜ̝ɞ̝ʌ̝ɔ̝], while centered [ɪ̈ʊ̈] and [ä] (or, less commonly, [ɑ̈]) are near-close and open central vowels, respectively.
The only known vowels that cannot be represented in this scheme are vowels with unexpected roundedness. For unambiguous transcription, such sounds would require dedicated diacritics. Possibilities include ⟨ʏʷ⟩ (or ⟨ɪʷ⟩) for protrusion and ⟨uᵝ⟩ (or VoQS ⟨ɯᶹ⟩) for compression. However, these transcriptions suggest that the sounds are diphthongs, and so while they may be clear for a language like Swedish where they are diphthongs, they may be misleading for languages such as Japanese where they are monophthongs. The extIPA 'spread' diacritic ⟨◌͍⟩ is sometimes seen for compressed ⟨u͍⟩, ⟨o͍⟩, ⟨ɔ͍⟩, ⟨ɒ͍⟩, though again the intended meaning would need to be explained or they would be interpreted as being spread the way that cardinal ⟦i⟧ is. For protrusion (w-like labialization without velarization), Ladefoged & Maddieson use the old IPA omega diacritic for labialization, ⟨◌̫⟩, for protruded ⟨y᫇⟩, ⟨ʏ̫⟩, ⟨ø̫⟩, ⟨œ̫⟩. This is an adaptation of an old IPA convention of rounding an unrounded vowel letter like i with a subscript omega (⟨◌̫⟩) and unrounding a rounded letter like u with a subscript turned omega. [116] As of 2024 [update] , the turned omega diacritic is in the pipeline for Unicode, and is under consideration for compression in extIPA. [117] Kelly & Local use a combining w diacritic ⟨◌ᪿ⟩ for protrusion (e.g. ⟨yᷱ øᪿ⟩) and a combining ʍ diacritic ⟨◌ᫀ⟩ for compression (e.g. ⟨uᫀ oᫀ⟩). [118] Because their transcriptions are manuscript, these are effectively the same symbols as the old IPA diacritics, which indeed are historically cursive w and ʍ. However, the more angular ⟨◌ᫀ⟩ of typescript might misleadingly suggest the vowel is protruded and voiceless (like [ʍ]) rather than compressed and voiced.
An IPA symbol is often distinguished from the sound it is intended to represent, since there is not necessarily a one-to-one correspondence between letter and sound in broad transcription, making articulatory descriptions such as "mid front rounded vowel" or "voiced velar stop" unreliable. While the Handbook of the International Phonetic Association states that no official names exist for its symbols, it admits the presence of one or two common names for each. [119] The symbols also have nonce names in the Unicode standard. In many cases, the names in Unicode and the IPA Handbook differ. For example, the Handbook calls ⟨ɛ⟩ "epsilon", while Unicode calls it "small letter open e".
The traditional names of the Latin and Greek letters are usually used for unmodified letters. [note 50] Letters which are not directly derived from these alphabets, such as ⟨ʕ⟩, may have a variety of names, sometimes based on the appearance of the symbol or on the sound that it represents. In Unicode, some of the letters of Greek origin have Latin forms for use in IPA; the others use the characters from the Greek block.
For diacritics, there are two methods of naming. For traditional diacritics, the IPA notes the name in a well known language; for example, ⟨é⟩ is "e-acute", based on the name of the diacritic in English and French. Non-traditional diacritics are often named after objects they resemble, so ⟨d̪⟩ is called "d-bridge".
Geoffrey Pullum and William Ladusaw [ d ] list a variety of names in use for both current and retired IPA symbols in their Phonetic Symbol Guide . Many of them found their way into Unicode. [9]
Unicode supports nearly all of the IPA. Apart from basic Latin and Greek and general punctuation, the primary blocks are IPA Extensions, Spacing Modifier Letters and Combining Diacritical Marks, with lesser support from Phonetic Extensions, Phonetic Extensions Supplement, Combining Diacritical Marks Supplement, and scattered characters elsewhere. The extended IPA is supported primarily by those blocks and Latin Extended-G.
After the Kiel Convention in 1989, most IPA symbols were assigned an identifying number to prevent confusion between similar characters during the printing of manuscripts. The codes were never much used and have been superseded by Unicode.
Many typefaces have support for IPA characters, but good diacritic rendering remains rare. [121] Web browsers generally do not need any configuration to display IPA characters, provided that a typeface capable of doing so is available to the operating system.
Typefaces that provide full IPA and nearly full extIPA support, including properly rendering the diacritics, include Gentium Plus, Charis SIL, Doulos SIL, and Andika developed by SIL International. Indeed, the IPA chose Doulos to publish their chart in Unicode format. In addition to the level of support found in commercial and system fonts, these fonts support the full range of old-style (pre-Kiel) staveless tone letters, through a character variant option that suppresses the stave of the Chao tone letters. They also have an option to maintain the [a] ~ [ɑ] vowel distinction in italics. The only notable gaps are with the extIPA: the combining parentheses, which enclose diacritics, are not supported, nor is the enclosing circle that marks unidentified sounds, and which Unicode considers to be a copy-edit mark and thus not eligible for Unicode support.
The basic Latin Noto fonts commissioned by Google also have decent IPA support, including diacritic placement, only failing with the more obscure IPA and extIPA characters and superscripts of the Latin Extended-F and Latin Extended-G blocks. The extIPA parentheses are included, but they do not enclose diacritics as they are supposed to.
DejaVu is the second free Unicode font chosen by the IPA to publish their chart. It was last updated in 2016 and so does not support the Latin F or G blocks. Stacked diacritics tend to overstrike each other.
As of 2018 [update] , the IPA was developing their own font, unitipa, based on TIPA. [122]
Calibri, the default font of Microsoft Office, has nearly complete IPA support with good diacritic rendering, though it is not as complete as some free fonts (see image at right). Other widespread Microsoft fonts, such as Arial and Times New Roman, have poor support.
The Apple system fonts Geneva, Lucida Grande and Hiragino (certain weights) have only basic IPA support.
Brill has complete IPA and extIPA coverage of characters added to Unicode by 2020, with good diacritic and tone-letter support. It is a commercial font but is freely available for non-commercial use. [123]
Several systems have been developed that map the IPA symbols to ASCII characters. Notable systems include SAMPA and X-SAMPA. The usage of mapping systems in on-line text has to some extent been adopted in the context input methods, allowing convenient keying of IPA characters that would be otherwise unavailable on standard keyboard layouts.
IETF language tags have registered fonipa as a variant subtag identifying text as written in IPA. [124] Thus, an IPA transcription of English could be tagged as en-fonipa. For the use of IPA without attribution to a concrete language, und-fonipa is available.
Online IPA keyboard utilities are available, though none of them cover the complete range of IPA symbols and diacritics. Examples are the IPA 2018 i-charts hosted by the IPA, [125] IPA character picker by Richard Ishida at GitHub, [126] Type IPA phonetic symbols at TypeIt.org, [127] and an IPA Chart keyboard by Weston Ruter also at GitHub. [128] In April 2019, Google's Gboard for Android added an IPA keyboard to its platform. [129] [130] For iOS there are multiple free keyboard layouts available, such as the IPA Phonetic Keyboard. [131]
A caronKARR-ən. or háček, is a diacritic mark placed over certain letters in the orthography of some languages, to indicate a change of the related letter's pronunciation. Typographers tend to use the term caron, while linguists prefer the Czech word háček.
Pharyngealization is a secondary articulation of consonants or vowels by which the pharynx or epiglottis is constricted during the articulation of the sound.
Labialization is a secondary articulatory feature of sounds in some languages. Labialized sounds involve the lips while the remainder of the oral cavity produces another sound. The term is normally restricted to consonants. When vowels involve the lips, they are called rounded.
The voiced alveolar lateral fricative is a type of consonantal sound, used in some spoken languages. The symbol in the International Phonetic Alphabet that represents voiced dental, alveolar, and postalveolar lateral fricatives is ⟨ɮ⟩, and the equivalent X-SAMPA symbol is K\
.
In phonetics, a flap or tap is a type of consonantal sound, which is produced with a single contraction of the muscles so that one articulator is thrown against another.
Voice or voicing is a term used in phonetics and phonology to characterize speech sounds. Speech sounds can be described as either voiceless or voiced.
Americanist phonetic notation, also known as the North American Phonetic Alphabet (NAPA), the Americanist Phonetic Alphabet or the American Phonetic Alphabet (APA), is a system of phonetic notation originally developed by European and American anthropologists and language scientists for the phonetic and phonemic transcription of indigenous languages of the Americas and for languages of Europe. It is still commonly used by linguists working on, among others, Slavic, Uralic, Semitic languages and for the languages of the Caucasus, of India, and of much of Africa; however, Uralicists commonly use a variant known as the Uralic Phonetic Alphabet.
The International Phonetic Alphabet was created soon after the International Phonetic Association was established in the late 19th century. It was intended as an international system of phonetic transcription for oral languages, originally for pedagogical purposes. The Association was established in Paris in 1886 by French and British language teachers led by Paul Passy. The prototype of the alphabet appeared in Phonetic Teachers' Association (1888b). The Association based their alphabet upon the Romic alphabet of Henry Sweet, which in turn was based on the Phonotypic Alphabet of Isaac Pitman and the Palæotype of Alexander John Ellis.
Unicode has subscripted and superscripted versions of a number of characters including a full set of Arabic numerals. These characters allow any polynomial, chemical and certain other equations to be represented in plain text without using any form of markup like HTML or TeX.
Finno-Ugric transcription (FUT) or the Uralic Phonetic Alphabet (UPA) is a phonetic transcription or notational system used predominantly for the transcription and reconstruction of Uralic languages. It was first published in 1901 by Eemil Nestor Setälä, a Finnish linguist; it was somewhat modified in the 1970s.
In phonetics, secondary articulation occurs when the articulation of a consonant is equivalent to the combined articulations of two or three simpler consonants, at least one of which is an approximant. The secondary articulation of such co-articulated consonants is the approximant-like articulation. It "colors" the primary articulation rather than obscuring it. Maledo (2011) defines secondary articulation as the superimposition of lesser stricture upon a primary articulation.
The Extensions to the International Phonetic Alphabet for Disordered Speech, commonly abbreviated extIPA, are a set of letters and diacritics devised by the International Clinical Phonetics and Linguistics Association to augment the International Phonetic Alphabet for the phonetic transcription of disordered speech. Some of the symbols are used for transcribing features of normal speech in IPA transcription, and are accepted as such by the International Phonetic Association.
Unicode supports several phonetic scripts and notation systems through its existing scripts and the addition of extra blocks with phonetic characters. These phonetic characters are derived from an existing script, usually Latin, Greek or Cyrillic. Apart from the International Phonetic Alphabet (IPA), extensions to the IPA and obsolete and nonstandard IPA symbols, these blocks also contain characters from the Uralic Phonetic Alphabet and the Americanist Phonetic Alphabet.
The International Phonetic Alphabet (IPA) possesses a variety of obsolete and nonstandard symbols. Throughout the history of the IPA, characters representing phonetic values have been modified or completely replaced. An example is ⟨ɷ⟩ for standard. Several symbols indicating secondary articulation have been dropped altogether, with the idea that they should be indicated with diacritics: for is one. In addition, the rare voiceless implosive series has been dropped.
Tone letters are letters that represent the tones of a language, most commonly in languages with contour tones.
In phonetics and phonology, relative articulation is description of the manner and place of articulation of a speech sound relative to some reference point. Typically, the comparison is made with a default, unmarked articulation of the same phoneme in a neutral sound environment. For example, the English velar consonant is fronted before the vowel compared to articulation of before other vowels. This fronting is called palatalization.
Teuthonista is a phonetic transcription system used predominantly for the transcription of (High) German dialects. It is very similar to other Central European transcription systems from the early 20th century. The base characters are mostly based on the Latin alphabet, which can be modified by various diacritics.
IPA numbers are a legacy system of coding the symbols of the International Phonetic Alphabet. They were the organizational basis for XSAMPA and the IPA Extensions block of Unicode.
IPA Braille is the modern standard Braille encoding of the International Phonetic Alphabet (IPA), as recognized by the International Council on English Braille.
Dania is the traditional linguistic transcription system used in Denmark to describe the Danish language. It was invented by Danish linguist Otto Jespersen and published in 1890 in the Dania, Tidsskrift for folkemål og folkeminder magazine from which the system was named.