Diacritic

Last updated
Latin letter A with multiple diacritics Latin letter A.svg
Latin letter A with multiple diacritics

A diacritic (also diacritical mark, diacritical point, diacritical sign, or accent) is a glyph added to a letter or to a basic glyph. The term derives from the Ancient Greek διακριτικός (diakritikós, "distinguishing"), from διακρίνω (diakrínō, "to distinguish"). The word diacritic is a noun, though it is sometimes used in an attributive sense, whereas diacritical is only an adjective. Some diacritics, such as the acute ó, grave ò, and circumflex ô (all shown above an 'o'), are often called accents. Diacritics may appear above or below a letter or in some other position such as within the letter or between two letters.

Contents

The main use of diacritics in Latin script is to change the sound-values of the letters to which they are added. Historically, English has used the diaeresis diacritic to indicate the correct pronunciation of ambiguous words, such as "coöperate", without which the <oo> letter sequence could be misinterpreted to be pronounced /ˈkuːpəreɪt/. Other examples are the acute and grave accents, which can indicate that a vowel is to be pronounced differently than is normal in that position, for example not reduced to /ə/ or silent as in the case of the two uses of the letter e in the noun résumé (as opposed to the verb resume) and the help sometimes provided in the pronunciation of some words such as doggèd, learnèd, blessèd, and especially words pronounced differently than normal in poetry (for example movèd, breathèd).

Most other words with diacritics in English are borrowings from languages such as French to better preserve the spelling, such as the diaeresis on naïve and Noël, the acute from café, the circumflex in the word crêpe, and the cedille in façade. All these diacritics, however, are frequently omitted in writing, and English is the only major modern European language that does not have diacritics in common usage. [a]

In Latin-script alphabets in other languages diacritics may distinguish between homonyms, such as the French ("there") versus la ("the"), which are both pronounced /la/. In Gaelic type, a dot over a consonant indicates lenition of the consonant in question. In other writing systems, diacritics may perform other functions. Vowel pointing systems, namely the Arabic harakat and the Hebrew niqqud systems, indicate vowels that are not conveyed by the basic alphabet. The Indic virama (etc.) and the Arabic sukūn (ـْـ) mark the absence of vowels. Cantillation marks indicate prosody. Other uses include the Early Cyrillic titlo stroke (◌҃) and the Hebrew gershayim (״), which, respectively, mark abbreviations or acronyms, and Greek diacritical marks, which showed that letters of the alphabet were being used as numerals. In Vietnamese and the Hanyu Pinyin official romanization system for Mandarin in China, diacritics are used to mark the tones of the syllables in which the marked vowels occur.

In orthography and collation, a letter modified by a diacritic may be treated either as a new, distinct letter or as a letter–diacritic combination. This varies from language to language and may vary from case to case within a language.

In some cases, letters are used as "in-line diacritics", with the same function as ancillary glyphs, in that they modify the sound of the letter preceding them, as in the case of the "h" in the English pronunciation of "sh" and "th". [2] Such letter combinations are sometimes even collated as a single distinct letter. For example, the spelling sch was traditionally often treated as a separate letter in German. Words with that spelling were listed after all other words spelled with s in card catalogs in the Vienna public libraries, for example (before digitization).

Types

Among the types of diacritic used in alphabets based on the Latin script are:

The tilde, dot, comma, titlo, apostrophe, bar, and colon are sometimes diacritical marks, but also have other uses.

Not all diacritics occur adjacent to the letter they modify. In the Wali language of Ghana, for example, an apostrophe indicates a change of vowel quality, but occurs at the beginning of the word, as in the dialects ’Bulengee and ’Dolimi. Because of vowel harmony, all vowels in a word are affected, so the scope of the diacritic is the entire word. In abugida scripts, like those used to write Hindi and Thai, diacritics indicate vowels, and may occur above, below, before, after, or around the consonant letter they modify.

The tittle (dot) on the letter i or the letter j, of the Latin alphabet originated as a diacritic to clearly distinguish i from the minims (downstrokes) of adjacent letters. It first appeared in the 11th century in the sequence ii (as in ingeníí), then spread to i adjacent to m, n, u, and finally to all lowercase is. The j, originally a variant of i, inherited the tittle. The shape of the diacritic developed from initially resembling today's acute accent to a long flourish by the 15th century. With the advent of Roman type it was reduced to the round dot we have today. [3]

Several languages of eastern Europe use diacritics on both consonants and vowels, whereas in western Europe digraphs are more often used to change consonant sounds. Most languages in Europe use diacritics on vowels, aside from English where there are typically none (with some exceptions).

Diacritics specific to non-Latin alphabets

Arabic

Greek

These diacritics are used in addition to the acute, grave, and circumflex accents and the diaeresis:

Hebrew

Genesis 1:9 "And God said, Let the waters be collected".
Letters in black, niqqud in red, cantillation in blue Example of biblical Hebrew trope.svg
Genesis 1:9 "And God said, Let the waters be collected".
Letters in black, niqqud in red, cantillation in blue

(Cantillation marks do not generally render correctly; refer to Hebrew cantillation#Names and shapes of the ta'amim for a complete table together with instructions for how to maximize the possibility of viewing them in a web browser.)

Korean

Hangul, the Korean syllabary Hunmin jeong-eum.jpg
Hangul, the Korean syllabary

The diacritics and  , known as Bangjeom (방점; 傍點), were used to mark pitch accents in Hangul for Middle Korean. They were written to the left of a syllable in vertical writing and above a syllable in horizontal writing.

Sanskrit and Indic

Devanagari scripts (from Brahmic family) compound letters, which are vowels combined with consonants, have diacritics. Here, k
(k) is shown with vowel diacritics. That is: aa, i, e, u, au ', H, etc. Devanagari matras.svg
Devanagari scripts (from Brahmic family) compound letters, which are vowels combined with consonants, have diacritics. Here, (k) is shown with vowel diacritics. That is: ा, ि, े, ु, ौ ़, ः, etc.

Syriac

In addition to the above vowel marks, transliteration of Syriac sometimes includes ə, or superscript e (or often nothing at all) to represent an original Aramaic schwa that became lost later on at some point in the development of Syriac. [4] Some transliteration schemes find its inclusion necessary for showing spirantization or for historical reasons. [5] [6]

Non-alphabetic scripts

Some non-alphabetic scripts also employ symbols that function essentially as diacritics.

Alphabetization or collation

Different languages use different rules to put diacritic characters in alphabetical order. For example, French and Portuguese treat letters with diacritical marks the same as the underlying letter for purposes of ordering and dictionaries. The Scandinavian languages and the Finnish language, by contrast, treat the characters with diacritics å, ä, and ö as distinct letters of the alphabet, and sort them after z. Usually ä (a-umlaut) and ö (o-umlaut) [used in Swedish and Finnish] are sorted as equivalent to æ (ash) and ø (o-slash) [used in Danish and Norwegian]. Also, aa, when used as an alternative spelling to å, is sorted as such. Other letters modified by diacritics are treated as variants of the underlying letter, with the exception that ü is frequently sorted as y.

Languages that treat accented letters as variants of the underlying letter usually alphabetize words with such symbols immediately after similar unmarked words. For instance, in German where two words differ only by an umlaut, the word without it is sorted first in German dictionaries (e.g. schon and then schön, or fallen and then fällen). However, when names are concerned (e.g. in phone books or in author catalogues in libraries), umlauts are often treated as combinations of the vowel with a suffixed e; Austrian phone books now treat characters with umlauts as separate letters (immediately following the underlying vowel).

In Spanish, the grapheme ñ is considered a distinct letter, different from n and collated between n and o, as it denotes a different sound from that of a plain n. But the accented vowels á, é, í, ó, ú are not separated from the unaccented vowels a, e, i, o, u, as the acute accent in Spanish only modifies stress within the word or denotes a distinction between homonyms, and does not modify the sound of a letter.

For a comprehensive list of the collating orders in various languages, see Collating sequence.

Generation with computers

German keyboard with umlaut letters Germanic umlaut on keyboard.jpg
German keyboard with umlaut letters

Modern computer technology was developed mostly in countries that speak Western European languages (particularly English), and many early binary encodings were developed with a bias favoring Englisha language written without diacritical marks. With computer memory and computer storage at premium, early character sets were limited to the Latin alphabet, the ten digits and a few punctuation marks and conventional symbols. The American Standard Code for Information Interchange (ASCII), first published in 1963, encoded just 95 printable characters. It included just four free-standing diacriticsacute, grave, circumflex and tildewhich were to be used by backspacing and overprinting the base letter. The ISO/IEC 646 standard (1967) defined national variations that replace some American graphemes with precomposed characters (such as é, è and ë), according to languagebut remained limited to 95 printable characters.

Unicode was conceived to solve this problem by assigning every known character its own code; if this code is known, most modern computer systems provide a method to input it. For historical reasons, almost all the letter-with-accent combinations used in European languages were given unique code points and these are called precomposed characters. For other languages, it is usually necessary to use a combining character diacritic together with the desired base letter. Unfortunately, even as of 2024, many applications and web browsers remain unable to operate the combining diacritic concept properly.

Depending on the keyboard layout and keyboard mapping, it is more or less easy to enter letters with diacritics on computers and typewriters. Keyboards used in countries where letters with diacritics are the norm, have keys engraved with the relevant symbols. In other cases, such as when the US international or UK extended mappings are used, the accented letter is created by first pressing the key with the diacritic mark, followed by the letter to place it on. This method is known as the dead key technique, as it produces no output of its own but modifies the output of the key pressed after it.

Languages with letters containing diacritics

The following languages have letters with diacritics that are orthographically distinct from those without diacritics.

Latin script

Baltic

  • Latvian has the following letters: ā , ē , ī , ū , č , ģ , ķ , ļ , ņ , š , ž
  • Lithuanian. In general usage, where letters appear with the caron (č, š and ž), they are considered as separate letters from c, s or z and collated separately; letters with the ogonek ( ą , ę , į and ų ), the macron ( ū ) and the overdot ( ė ) are considered as separate letters as well, but not given a unique collation order.

Celtic

  • Welsh uses the circumflex, diaeresis, acute, and grave accents on its seven vowels a, e, i, o, u, w, y (hence the composites â, ê, î, ô, û, ŵ, ŷ, ä, ë, ï, ö, ü, , ÿ, á, é, í, ó, ú, , ý, à, è, ì, ò, ù, , ). However all except the circumflex (which is used as a macron) are fairly rare.
  • Following spelling reforms since the 1970s, Scottish Gaelic uses graves only, which can be used on any vowel ( à , è , ì , ò , ù ). Formerly acute accents could be used on á, ó and é, which were used to indicate a specific vowel quality. With the elimination of these accents, the new orthography relies on the reader having prior knowledge of pronunciation of a given word.
  • Manx uses the cedilla diacritic ç combined with h to give the digraph çh (pronounced /tʃ/) to mark the distinction between it and the digraph ch (pronounced /h/ or /x/). Other diacritics used in Manx included the circumflex and diaeresis, as in â, ê, ï, etc. to mark the distinction between two similarly spelled words but with slightly differing pronunciation.
  • Irish uses only acute accents to mark long vowels, following the 1948 spelling reform. Lenition is indicated using an overdot in Gaelic type ( ċ ,,, ġ , , , , ); in Roman type, a suffixed h is used. Thus, a ṁáṫair is equivalent to a mháthair.
  • Breton does not have a single orthography (spelling system), but uses diacritics for a number of purposes. The diaeresis is used to mark that two vowels are pronounced separately and not as a diphthong/digraph. The circumflex is used to mark long vowels, but usually only when the vowel length is not predictable by phonology. Nasalization of vowels may be marked with a tilde, or following the vowel with the letter ñ. The plural suffix -où is used as a unified spelling to represent a suffix with a number of pronunciations in different dialects, and to distinguish this suffix from the digraph ou which is pronounced as /u:/. An apostrophe is used to distinguish c'h, pronounced /x/ as the digraph ch is used in other Celtic languages, from the French-influenced digraph ch, pronounced /ʃ/.

Finno-Ugric

  • Estonian has a distinct letter õ , which contains a tilde. Estonian vowels with double-dot diacritics ä, ö, ü are similar to German, but these are also distinct letters, unlike German umlauted letters. All four have their own place in the alphabet, between w and x. Carons in š or ž appear only in foreign proper names and loanwords. Also these are distinct letters, placed in the alphabet between s and t.
  • Finnish uses double-dotted vowels (ä and ö). As in Swedish and Estonian, these are regarded as individual letters, rather than 'vowel + diacritic' combinations (as happens in German). It also uses the characters å, š and ž in foreign names and loanwords. In the Finnish and Swedish alphabets, å, ä and ö collate as separate letters after z, the others as variants of their base letter.
  • Hungarian uses the double-dot, the acute and double acute diacritics (the last is unique to Hungarian): (ö, ü), (á, é, í, ó, ú) and (ő, ű). The acute accent indicates the long form of a vowel (in case of i/í, o/ó, u/ú) while the double acute performs the same function for ö and ü. The acute accent can also indicate a different sound (more open, as in case of a/á, e/é). Both long and short forms of the vowels are listed separately in the Hungarian alphabet, but members of the pairs a/á, e/é, i/í, o/ó, ö/ ő , u/ú and ü/ ű are collated in dictionaries as the same letter.
  • Livonian has the following letters: ā, ä, ǟ , , ē, ī, ļ, ņ, ō, ȯ , ȱ , õ , ȭ , ŗ, š, ț, ū, ž.

Germanic

  • German uses the two-dots diacritic (German : umlaut ): letters ä , ö , ü , used to indicate the fronting of back vowels (see umlaut (linguistics)).
  • Dutch uses acute, circumflex, grave and two-dots diacritics with most vowels and cedilla with c, as in French. This results in á , à , ä , é , è , ê , ë , í , î , ï , ó , ô , ö , ú , û , ü and ç . This is mostly on words (and names) originating from French (like crème, café, gêne, façade). The acute accent is also used to stress the vowel (like één). The two-dots diacritic is used as a linguistic diaeresis (a vowel hiatus) that splits the two vowels, e.g., reële, reünie, coördinatie), rather than to indicate a linguistic umlaut as used in German.
  • Afrikaans uses 16 additional vowel forms, both uppercase and lowercase: á , ä , é , è , ê , ë , í , î , ï , ó , ô , ö , ú , û , ü , ý .
  • Faroese uses acutes and some additional letters. All are considered separate letters and have their own place in the alphabet: á , í , ó , ú , ý and ø .
  • Icelandic uses acutes and other additional letters. All are considered separate letters, and have their own place in the alphabet: á , é , í , ó , ú , ý and ö .
  • Danish and Norwegian use additional characters like the o-slash ø and the a-overring å . These letters come after z and æ in the order ø, å. Historically, the å has developed from a ligature by writing a small superscript a over a lowercase a; if an å character is unavailable, some Scandinavian languages allow the substitution of a doubled a, thus aa. The Scandinavian languages collate these letters after z, but have different national collation standards.
  • Swedish uses a-diaeresis ( ä ) and o-diaeresis ( ö ) in the place of ash (æ) and slashed o ( ø ) in addition to the a-overring (å). Historically, the two-dots diacritic for the Swedish letters ä and ö developed from a small Gothic e written above the letters. These letters are collated after z, in the order å, ä, ö.

Romance

  • In Asturian, Galician and Spanish, the character ñ is a letter and collated between n and o.
  • Asturian uses an underdot: (lower case, ), and (lower case ) [7]
  • Catalan uses the acute accent é, í, ó, ú, the grave accent à, è, ò, the diaeresis ï, ü, the cedilla ç, and the interpunct l·l.
  • In Valencian, the circumflex â, ê, î, ô, û may also be used.
  • Corsican uses the following in its alphabet: À/à, È/è, Ì/ì, Ò/ò, Ù/ù.
  • French uses four diacritics, appearing on vowels (circumflex, acute, grave, diaeresis) and the cedilla appearing in ç.
  • Italian uses two diacritics, appearing on vowels (acute, grave)
  • Leonese: could use ñ or nn .
  • Portuguese uses a tilde with the vowels a and o and a cedilla with c.
  • Romanian uses a breve on the letter a ( ă ) to indicate the sound schwa /ə/, as well as a circumflex over the letters a ( â ) and i ( î ) for the sound /ɨ/. Romanian also writes a comma below the letters s ( ș ) and t ( ț ) to represent the sounds /ʃ/ and /t͡s/, respectively. These characters are collated after their non-diacritic equivalent.
  • Spanish uses acute accents (á, é, í, ó, ú) to indicate stress falling on a different syllable than the one it would fall on based on default rules, and to distinguish certain one-syllable homonyms (e.g. el (masculine singular definite article) and él [he]). The acute accent is also used to break up sequences of vowels that would normally be pronounced as a diphthong into two syllables, as in the word reír. Diaeresis is used on u only, to distinguish the combinations gue, gui/ge/,/gi/ from güe, güi/gwe/,/gwi/, e.g. vergüenza, lingüística. The tilde on ñ is not considered a diacritic as ñ is considered a distinct letter from n, not a mutated form of it.

Slavic

  • Gaj's Latin alphabet, used in Croatian and latinized Serbian, has the symbols č , ć , đ , š and ž , which are considered separate letters and are listed as such in dictionaries and other contexts in which words are listed according to alphabetical order. It also has one digraph including a diacritic, , which is also alphabetized independently, and follows d and precedes đ in the alphabetical order.
  • The Czech alphabet uses the acute (lowercase á é í ó ú ý, uppercase Á É Í Ó Ú Ý), caron (lowercase č ď ě ň ř š ť ž, uppercase Č Ď Ě Ň Ř Š Ť Ž), and for one letter (lowercase ů, uppercase Ů) the ring. (In ď and ť the caron is modified to look rather like an apostrophe.) Letter with caron are considered separate letters, whereas vowels are considered only as longer variants of the unaccented letters. Acute does not affect alphabetical order, letters with caron are ordered after original counterparts.
  • Polish has the following letters: ą ć ę ł ń ó ś ź ż. These are considered to be separate letters: each of them is placed in the alphabet immediately after its Latin counterpart (e.g. ą between a and b), ź and ż are placed after z in that order.
  • The Serbian Cyrillic alphabet has no diacritics, instead it has a grapheme (glyph) for every letter of its Latin counterpart (including Latin letters with diacritics and the digraphs dž, lj and nj ).
  • The Slovak alphabet uses the acute (lowercase á é í ó ú ý ĺ ŕ, uppercase Á É Í Ó Ú Ý Ĺ Ŕ), caron (lowercase č ď ľ ň š ť ž , uppercase Č Ď Ľ Ň Š Ť Ž ), umlaut (ä Ä) and circumflex accent (ô Ô). All of those are considered separate letters and are placed directly after the original counterpart in the alphabet. [8]
  • The basic Slovenian alphabet has the symbols č , š , and ž , which are considered separate letters and are listed as such in dictionaries and other contexts in which words are listed according to alphabetical order. Letters with a caron are placed right after the letters as written without the diacritic. The letter đ ('d with bar') may be used in non-transliterated foreign words, particularly names, and is placed after č and before d.

Turkic

  • Azerbaijani includes the distinct Turkish alphabet letters Ç, Ğ, I, İ, Ö, Ş and Ü.
  • Crimean Tatar includes the distinct Turkish alphabet letters Ç, Ğ, I, İ, Ö, Ş and Ü. Unlike Turkish, Crimean Tatar also has the letter Ñ.
  • Gagauz includes the distinct Turkish alphabet letters Ç, Ğ, I, İ, Ö and Ü. Unlike Turkish, Gagauz also has the letters Ä, Ê Ș and Ț. Ș and Ț are derived from the Romanian alphabet for the same sounds. Sometime the Turkish Ş may be used instead of Ș.
  • Turkish uses a G with a breve ( Ğ ), two letters with two dots ( Ö and Ü , representing two rounded front vowels), two letters with a cedilla ( Ç and Ş , representing the affricate /tʃ/ and the fricative /ʃ/), and also possesses a dotted capital İ (and a dotless lowercase ı representing a high unrounded back vowel). In Turkish each of these are separate letters, rather than versions of other letters, where dotted capital İ and lower case i are the same letter, as are dotless capital I and lowercase ı. Typographically, Ç and Ş are sometimes rendered with an underdot, as in . The new Azerbaijani, Crimean Tatar, and Gagauz alphabets are based on the Turkish alphabet and its same diacriticized letters, with some additions.
  • Turkmen includes the distinct Turkish alphabet letters Ç, Ö, Ş and Ü. In addition, Turkmen uses A with diaeresis ( Ä ) to represent /æ/, N with caron ( Ň ) to represent the velar nasal /ŋ/, Y with acute ( Ý ) to represent the palatal approximant /j/, and Z with caron ( Ž ) to represent /ʒ/.

Other

  • Albanian has two special letters Ç and Ë upper and lowercase. They are placed next to the most similar letters in the alphabet, c and e correspondingly.
  • Esperanto has the symbols ŭ , ĉ, ĝ, ĥ, ĵ and ŝ , which are included in the alphabet, and considered separate letters.
  • Filipino also has the character ñ as a letter and is collated between n and o.
  • Modern Greenlandic does not use any diacritics, although ø and å are used to spell loanwords, especially from Danish and English. [9] [10] From 1851 until 1973, Greenlandic was written in an alphabet invented by Samuel Kleinschmidt, where long vowels and geminate consonants were indicated by diacritics on vowels (in the case of consonant gemination, the diacritics were placed on the vowel preceding the affected consonant). For example, the name Kalaallit Nunaat was spelled Kalâdlit Nunât. This scheme uses the circumflex (◌̂) to indicate a long vowel (e.g. ât, ît, ût; modern: aat, iit, uut), an acute accent (◌́) to indicate gemination of the following consonant: (i.e. ák, ík, úk; modern: akk, ikk, ukk) and, finally, a tilde (◌̃) or a grave accent (◌̀), depending on the author, indicates vowel length and gemination of the following consonant (e.g. ãt/àt, ĩt/ìt, ũt/ùt; modern: aatt, iitt, uutt). ê, ô, used only before r, q, are now written ee, oo in Greenlandic.
  • Hawaiian uses the kahakō (macron) over vowels, although there is some disagreement over considering them as individual letters. The kahakō over a vowel can completely change the meaning of a word that is spelled the same but without the kahakō.
  • Kurdish uses the symbols Ç, Ê, Î, Ş and Û with other 26 standard Latin alphabet symbols.
  • Lakota alphabet uses the caron for the letters č, ȟ, ǧ, š, and ž. It also uses the acute accent for stressed vowels á, é, í, ó, ú, áŋ, íŋ, úŋ.
  • Malay uses some diacritics such as á, ā, ç, í, ñ, ó, š, ú. Uses of diacritics was continued until late 19th century except ā and ē.
  • Maltese uses a C, G, and Z with a dot over them (Ċ, Ġ, Ż), and also has an H with an extra horizontal bar. For uppercase H, the extra bar is written slightly above the usual bar. For lowercase H, the extra bar is written crossing the vertical, like a t, and not touching the lower part (Ħ, ħ). The above characters are considered separate letters. The letter 'c' without a dot has fallen out of use due to redundancy. 'Ċ' is pronounced like the English 'ch' and 'k' is used as a hard c as in 'cat'. 'Ż' is pronounced just like the English 'Z' as in 'Zebra', while 'Z' is used to make the sound of 'ts' in English (like 'tsunami' or 'maths'). 'Ġ' is used as a soft 'G' like in 'geometry', while the 'G' sounds like a hard 'G' like in 'log'. The digraph 'għ' (called għajn after the Arabic letter name ʻayn for غ) is considered separate, and sometimes ordered after 'g', whilst in other volumes it is placed between 'n' and 'o' (the Latin letter 'o' originally evolved from the shape of Phoenician ʻayin, which was traditionally collated after Phoenician nūn).
  • The romanization of Syriac uses the altered letters of. Ā, Č, , Ē, Ë, Ġ, , Ō, Š, , , Ū, Ž alongside the 26 standard Latin alphabet symbols. [11]
  • Vietnamese uses the horn diacritic for the letters ơ and ư; the circumflex for the letters â, ê, and ô; the breve for the letter ă; and a bar through the letter đ. Separately, it also has á, à, ả, ã and ạ, the five tones used for vowels besides the flat tone 'a'.

Cyrillic letters

  • Belarusian and Uzbek Cyrillic have a letter ў .
  • Belarusian, Bulgarian, Russian and Ukrainian have the letter й .
  • Belarusian and Russian have the letter ё . In Russian, this letter is usually replaced by е , although it has a different pronunciation. The use of е instead of ё does not affect the pronunciation. Ё is always used in children's books and in dictionaries. A minimal pair is все (vs'e, "everybody" pl.) and всё (vs'o, "everything" n. sg.). In Belarusian the replacement by е is a mistake; in Russian, it is permissible to use either е or ё for ё but the former is more common in everyday writing (as opposed to instructional or juvenile writing).
  • The Cyrillic Ukrainian alphabet has the letters ґ , й and ї . Ukrainian Latynka has many more.
  • Macedonian has the letters ќ and ѓ .
  • In Bulgarian and Macedonian the possessive pronoun ѝ (ì, "her") is spelled with a grave accent in order to distinguish it from the conjunction и (i, "and").
  • The acute accent ◌́ above any vowel in Cyrillic alphabets is used in dictionaries, books for children and foreign learners to indicate the word stress, it also can be used for disambiguation of similarly spelled words with different lexical stresses.

Diacritics that do not produce new letters

Blackboard used in class at Harvard shows students' efforts at placing the u and acute accent diacritic used in Spanish orthography. Spanish orthography.jpg
Blackboard used in class at Harvard shows students' efforts at placing the ü and acute accent diacritic used in Spanish orthography.

English

English is one of the few European languages that does not have many words that contain diacritical marks. Instead, digraphs are the main way the Modern English alphabet adapts the Latin to its phonemes. Exceptions are unassimilated foreign loanwords, including borrowings from French (and, increasingly, Spanish, like jalapeño and piñata); however, the diacritic is also sometimes omitted from such words. Loanwords that frequently appear with the diacritic in English include café, résumé or resumé (a usage that helps distinguish it from the verb resume), soufflé, and naïveté (see English terms with diacritical marks ). In older practice (and even among some orthographically conservative modern writers), one may see examples such as élite, mêlée and rôle.

English speakers and writers once used the diaeresis more often than now in words such as coöperation (from Fr. coopération), zoölogy (from Grk. zoologia), and seeër (now more commonly see-er or simply seer) as a way of indicating that adjacent vowels belonged to separate syllables, but this practice has become far less common. The New Yorker magazine is a major publication that continues to use the diaeresis in place of a hyphen for clarity and economy of space. [12]

A few English words, often when used out of context, especially in isolation, can only be distinguished from other words of the same spelling by using a diacritic or modified letter. These include exposé, lamé, maté, öre, øre, résumé and rosé. In a few words, diacritics that did not exist in the original have been added for disambiguation, as in maté (from Sp. and Port. mate), saké (the standard Romanization of the Japanese has no accent mark), and Malé (from Dhivehi މާލެ), to clearly distinguish them from the English words mate, sake, and male.

The acute and grave accents are occasionally used in poetry and lyrics: the acute to indicate stress overtly where it might be ambiguous (rébel vs. rebél) or nonstandard for metrical reasons (caléndar), the grave to indicate that an ordinarily silent or elided syllable is pronounced (warnèd,parlìament).

In certain personal names such as Renée and Zoë , often two spellings exist, and the person's own preference will be known only to those close to them. Even when the name of a person is spelled with a diacritic, like Charlotte Brontë , this may be dropped in English-language articles, and even in official documents such as passports, due either to carelessness, the typist not knowing how to enter letters with diacritical marks, or technical reasons (California, for example, does not allow[ clarification needed ] names with diacritics, as the computer system cannot process such characters). They also appear in some worldwide company names and/or trademarks, such as Nestlé and Citroën .

Other languages

The following languages have letter-diacritic combinations that are not considered independent letters.

Transliteration

Several languages that are not written with the Roman alphabet are transliterated, or romanized, using diacritics. Examples:

Limits

Orthographic

Possibly the greatest number of combining diacritics required to compose a valid character in any Unicode language is 8, for the "well-known grapheme cluster in Tibetan and Ranjana scripts" or HAKṢHMALAWARAYAṀ. [14]

It consists of

  1. U+0F67TIBETAN LETTER HA
  2. U+0F90TIBETAN SUBJOINED LETTER KA
  3. U+0FB5TIBETAN SUBJOINED LETTER SSA
  4. U+0FA8TIBETAN SUBJOINED LETTER MA
  5. U+0FB3TIBETAN SUBJOINED LETTER LA
  6. U+0FBATIBETAN SUBJOINED LETTER FIXED-FORM WA
  7. U+0FBCTIBETAN SUBJOINED LETTER FIXED-FORM RA
  8. U+0FBBTIBETAN SUBJOINED LETTER FIXED-FORM YA
  9. U+0F82TIBETAN SIGN NYI ZLA NAA DA

An example of the rendering, which may be broken depending on the browser used:

ཧྐྵྨླྺྼྻྂ

Unorthographic/ornamental

Some users have explored the limits of rendering in web browsers and other software by "decorating" words with excessive nonsensical diacritics per character to produce so-called Zalgo text.

List of diacritics in Unicode

Diacritics for Latin script in Unicode:

Diacritics in Unicode for Latin script
CharacterCharacter name
Unicode code point
Mark General category Script
̀
  • COMBINING GRAVE ACCENT
  • U+0300
GraveMn: Mark, nonspacingInherited
́
  • COMBINING ACUTE ACCENT
  • U+0301
AcuteMn: Mark, nonspacingInherited
̂
  • COMBINING CIRCUMFLEX ACCENT
  • U+0302
CircumflexMn: Mark, nonspacingInherited
̃
  • COMBINING TILDE
  • U+0303
TildeMn: Mark, nonspacingInherited
̄
  • COMBINING MACRON
  • U+0304
MacronMn: Mark, nonspacingInherited
̅
  • COMBINING OVERLINE
  • U+0305
OverlineMn: Mark, nonspacingInherited
̆
  • COMBINING BREVE
  • U+0306
BreveMn: Mark, nonspacingInherited
̇
  • COMBINING DOT ABOVE
  • U+0307
DotMn: Mark, nonspacingInherited
̈
  • COMBINING DIAERESIS
  • U+0308
DiaeresisMn: Mark, nonspacingInherited
̉
  • COMBINING HOOK ABOVE
  • U+0309
HookMn: Mark, nonspacingInherited
̊
  • COMBINING RING ABOVE
  • U+030A
RingMn: Mark, nonspacingInherited
̋
  • COMBINING DOUBLE ACUTE ACCENT
  • U+030B
Double acuteMn: Mark, nonspacingInherited
̌
  • COMBINING CARON
  • U+030C
CaronMn: Mark, nonspacingInherited
̍
  • COMBINING VERTICAL LINE ABOVE
  • U+030D
Vertical lineMn: Mark, nonspacingInherited
̎
  • COMBINING DOUBLE VERTICAL LINE ABOVE
  • U+030E
Double vertical lineMn: Mark, nonspacingInherited
̏
  • COMBINING DOUBLE GRAVE ACCENT
  • U+030F
Double graveMn: Mark, nonspacingInherited
̐
  • COMBINING CANDRABINDU
  • U+0310
CandrabinduMn: Mark, nonspacingInherited
̑
  • COMBINING INVERTED BREVE
  • U+0311
Inverted breveMn: Mark, nonspacingInherited
̒
  • COMBINING TURNED COMMA ABOVE
  • U+0312
Turned commaMn: Mark, nonspacingInherited
̓
  • COMBINING COMMA ABOVE
  • U+0313
CommaMn: Mark, nonspacingInherited
̔
  • COMBINING REVERSED COMMA ABOVE
  • U+0314
Reversed commaMn: Mark, nonspacingInherited
̕
  • COMBINING COMMA ABOVE RIGHT
  • U+0315
Comma rightMn: Mark, nonspacingInherited
̖
  • COMBINING GRAVE ACCENT BELOW
  • U+0316
GraveMn: Mark, nonspacingInherited
̗
  • COMBINING ACUTE ACCENT BELOW
  • U+0317
AcuteMn: Mark, nonspacingInherited
̘
  • COMBINING LEFT TACK BELOW
  • U+0318
Left tackMn: Mark, nonspacingInherited
̙
  • COMBINING RIGHT TACK BELOW
  • U+0319
Right tackMn: Mark, nonspacingInherited
̚
  • COMBINING LEFT ANGLE ABOVE
  • U+031A
Left angleMn: Mark, nonspacingInherited
̛
  • COMBINING HORN
  • U+031B
HornMn: Mark, nonspacingInherited
̜
  • COMBINING LEFT HALF RING BELOW
  • U+031C
Left half ringMn: Mark, nonspacingInherited
̝
  • COMBINING UP TACK BELOW
  • U+031D
Up tackMn: Mark, nonspacingInherited
̞
  • COMBINING DOWN TACK BELOW
  • U+031E
Down tackMn: Mark, nonspacingInherited
̟
  • COMBINING PLUS SIGN BELOW
  • U+031F
Plus signMn: Mark, nonspacingInherited
̠
  • COMBINING MINUS SIGN BELOW
  • U+0320
Minus signMn: Mark, nonspacingInherited
̡
  • COMBINING PALATALIZED HOOK BELOW
  • U+0321
Palatalized hookMn: Mark, nonspacingInherited
̢
  • COMBINING RETROFLEX HOOK BELOW
  • U+0322
Retroflex hookMn: Mark, nonspacingInherited
̣
  • COMBINING DOT BELOW
  • U+0323
DotMn: Mark, nonspacingInherited
̤
  • COMBINING DIAERESIS BELOW
  • U+0324
DiaeresisMn: Mark, nonspacingInherited
̥
  • COMBINING RING BELOW
  • U+0325
RingMn: Mark, nonspacingInherited
̦
  • COMBINING COMMA BELOW
  • U+0326
CommaMn: Mark, nonspacingInherited
̧
  • COMBINING CEDILLA
  • U+0327
CedillaMn: Mark, nonspacingInherited
̨
  • COMBINING OGONEK
  • U+0328
OgonekMn: Mark, nonspacingInherited
̩
  • COMBINING VERTICAL LINE BELOW
  • U+0329
Vertical lineMn: Mark, nonspacingInherited
̪
  • COMBINING BRIDGE BELOW
  • U+032A
BridgeMn: Mark, nonspacingInherited
̫
  • COMBINING INVERTED DOUBLE ARCH BELOW
  • U+032B
Double archMn: Mark, nonspacingInherited
̬
  • COMBINING CARON BELOW
  • U+032C
CaronMn: Mark, nonspacingInherited
̭
  • COMBINING CIRCUMFLEX ACCENT BELOW
  • U+032D
CircumflexMn: Mark, nonspacingInherited
̮
  • COMBINING BREVE BELOW
  • U+032E
BreveMn: Mark, nonspacingInherited
̯
  • COMBINING INVERTED BREVE BELOW
  • U+032F
Inverted breveMn: Mark, nonspacingInherited
̰
  • COMBINING TILDE BELOW
  • U+0330
TildeMn: Mark, nonspacingInherited
̱
  • COMBINING MACRON BELOW
  • U+0331
MacronMn: Mark, nonspacingInherited
̲
  • COMBINING LOW LINE
  • U+0332
Low lineMn: Mark, nonspacingInherited
̳
  • COMBINING DOUBLE LOW LINE
  • U+0333
Double low lineMn: Mark, nonspacingInherited
̴
  • COMBINING TILDE OVERLAY
  • U+0334
Tilde overlayMn: Mark, nonspacingInherited
̵
  • COMBINING SHORT STROKE OVERLAY
  • U+0335
Short stroke overlayMn: Mark, nonspacingInherited
̶
  • COMBINING LONG STROKE OVERLAY
  • U+0336
Long stroke overlayMn: Mark, nonspacingInherited
̷
  • COMBINING SHORT SOLIDUS OVERLAY
  • U+0337
Short solidus overlayMn: Mark, nonspacingInherited
̸
  • COMBINING LONG SOLIDUS OVERLAY
  • U+0338
Long solidus overlayMn: Mark, nonspacingInherited
̹
  • COMBINING RIGHT HALF RING BELOW
  • U+0339
Right half ringMn: Mark, nonspacingInherited
̺
  • COMBINING INVERTED BRIDGE BELOW
  • U+033A
Inverted bridgeMn: Mark, nonspacingInherited
̻
  • COMBINING SQUARE BELOW
  • U+033B
SquareMn: Mark, nonspacingInherited
̼
  • COMBINING SEAGULL BELOW
  • U+033C
SeagullMn: Mark, nonspacingInherited
̽
  • COMBINING X ABOVE
  • U+033D
XMn: Mark, nonspacingInherited
̾
  • COMBINING VERTICAL TILDE
  • U+033E
Vertical tildeMn: Mark, nonspacingInherited
̿
  • COMBINING DOUBLE OVERLINE
  • U+033F
Double overlineMn: Mark, nonspacingInherited
̀
  • COMBINING GRAVE TONE MARK
  • U+0340
Grave toneMn: Mark, nonspacingInherited
́
  • COMBINING ACUTE TONE MARK
  • U+0341
Acute toneMn: Mark, nonspacingInherited
͆
  • COMBINING BRIDGE ABOVE
  • U+0346
BridgeMn: Mark, nonspacingInherited
͇
  • COMBINING EQUALS SIGN BELOW
  • U+0347
Equals signMn: Mark, nonspacingInherited
͈
  • COMBINING DOUBLE VERTICAL LINE BELOW
  • U+0348
Double vertical lineMn: Mark, nonspacingInherited
͉
  • COMBINING LEFT ANGLE BELOW
  • U+0349
Left angleMn: Mark, nonspacingInherited
͊
  • COMBINING NOT TILDE ABOVE
  • U+034A
Not tildeMn: Mark, nonspacingInherited
͋
  • COMBINING HOMOTHETIC ABOVE
  • U+034B
HomotheticMn: Mark, nonspacingInherited
͌
  • COMBINING ALMOST EQUAL TO ABOVE
  • U+034C
Almost equal toMn: Mark, nonspacingInherited
͍
  • COMBINING LEFT RIGHT ARROW BELOW
  • U+034D
Left right arrowMn: Mark, nonspacingInherited
͎
  • COMBINING UPWARDS ARROW BELOW
  • U+034E
Upwards arrowMn: Mark, nonspacingInherited
͐
  • COMBINING RIGHT ARROWHEAD ABOVE
  • U+0350
Right arrowheadMn: Mark, nonspacingInherited
͑
  • COMBINING LEFT HALF RING ABOVE
  • U+0351
Left half ringMn: Mark, nonspacingInherited
͒
  • COMBINING FERMATA
  • U+0352
FermataMn: Mark, nonspacingInherited
͓
  • COMBINING X BELOW
  • U+0353
XMn: Mark, nonspacingInherited
͔
  • COMBINING LEFT ARROWHEAD BELOW
  • U+0354
Left arrowheadMn: Mark, nonspacingInherited
͕
  • COMBINING RIGHT ARROWHEAD BELOW
  • U+0355
Right arrowheadMn: Mark, nonspacingInherited
͖
  • COMBINING RIGHT ARROWHEAD AND UP ARROWHEAD BELOW
  • U+0356
Right arrowhead and up arrowheadMn: Mark, nonspacingInherited
͗
  • COMBINING RIGHT HALF RING ABOVE
  • U+0357
Right half ringMn: Mark, nonspacingInherited
͘
  • COMBINING DOT ABOVE RIGHT
  • U+0358
Dot rightMn: Mark, nonspacingInherited
͙
  • COMBINING ASTERISK BELOW
  • U+0359
AsteriskMn: Mark, nonspacingInherited
͚
  • COMBINING DOUBLE RING BELOW
  • U+035A
Double ringMn: Mark, nonspacingInherited
͛
  • COMBINING ZIGZAG ABOVE
  • U+035B
ZigzagMn: Mark, nonspacingInherited
͜
  • COMBINING DOUBLE BREVE BELOW
  • U+035C
Double breveMn: Mark, nonspacingInherited
͝
  • COMBINING DOUBLE BREVE
  • U+035D
Double breveMn: Mark, nonspacingInherited
͞
  • COMBINING DOUBLE MACRON
  • U+035E
Double macronMn: Mark, nonspacingInherited
͟
  • COMBINING DOUBLE MACRON BELOW
  • U+035F
Double macronMn: Mark, nonspacingInherited
͠
  • COMBINING DOUBLE TILDE
  • U+0360
Double tildeMn: Mark, nonspacingInherited
͡
  • COMBINING DOUBLE INVERTED BREVE
  • U+0361
Double inverted breveMn: Mark, nonspacingInherited
͢
  • COMBINING DOUBLE RIGHTWARDS ARROW BELOW
  • U+0362
Double rightwards arrowMn: Mark, nonspacingInherited
ͣ
  • COMBINING LATIN SMALL LETTER A
  • U+0363
Latin small letter aMn: Mark, nonspacingInherited
ͤ
  • COMBINING LATIN SMALL LETTER E
  • U+0364
Latin small letter eMn: Mark, nonspacingInherited
ͥ
  • COMBINING LATIN SMALL LETTER I
  • U+0365
Latin small letter iMn: Mark, nonspacingInherited
ͦ
  • COMBINING LATIN SMALL LETTER O
  • U+0366
Latin small letter oMn: Mark, nonspacingInherited
ͧ
  • COMBINING LATIN SMALL LETTER U
  • U+0367
Latin small letter uMn: Mark, nonspacingInherited
ͨ
  • COMBINING LATIN SMALL LETTER C
  • U+0368
Latin small letter cMn: Mark, nonspacingInherited
ͩ
  • COMBINING LATIN SMALL LETTER D
  • U+0369
Latin small letter dMn: Mark, nonspacingInherited
ͪ
  • COMBINING LATIN SMALL LETTER H
  • U+036A
Latin small letter hMn: Mark, nonspacingInherited
ͫ
  • COMBINING LATIN SMALL LETTER M
  • U+036B
Latin small letter mMn: Mark, nonspacingInherited
ͬ
  • COMBINING LATIN SMALL LETTER R
  • U+036C
Latin small letter rMn: Mark, nonspacingInherited
ͭ
  • COMBINING LATIN SMALL LETTER T
  • U+036D
Latin small letter tMn: Mark, nonspacingInherited
ͮ
  • COMBINING LATIN SMALL LETTER V
  • U+036E
Latin small letter vMn: Mark, nonspacingInherited
ͯ
  • COMBINING LATIN SMALL LETTER X
  • U+036F
Latin small letter xMn: Mark, nonspacingInherited
  • COMBINING DOUBLED CIRCUMFLEX ACCENT
  • U+1AB0
Doubled circumflexMn: Mark, nonspacingInherited
  • COMBINING DIAERESIS-RING
  • U+1AB1
Diaeresis-ringMn: Mark, nonspacingInherited
  • COMBINING INFINITY
  • U+1AB2
InfinityMn: Mark, nonspacingInherited
  • COMBINING DOWNWARDS ARROW
  • U+1AB3
Downwards arrowMn: Mark, nonspacingInherited
  • COMBINING TRIPLE DOT
  • U+1AB4
Triple dotMn: Mark, nonspacingInherited
  • COMBINING X-X BELOW
  • U+1AB5
X-xMn: Mark, nonspacingInherited
  • COMBINING WIGGLY LINE BELOW
  • U+1AB6
Wiggly lineMn: Mark, nonspacingInherited
  • COMBINING OPEN MARK BELOW
  • U+1AB7
Open markMn: Mark, nonspacingInherited
  • COMBINING DOUBLE OPEN MARK BELOW
  • U+1AB8
Double open markMn: Mark, nonspacingInherited
  • COMBINING LIGHT CENTRALIZATION STROKE BELOW
  • U+1AB9
Light centralization strokeMn: Mark, nonspacingInherited
  • COMBINING STRONG CENTRALIZATION STROKE BELOW
  • U+1ABA
Strong centralization strokeMn: Mark, nonspacingInherited
  • COMBINING PARENTHESES ABOVE
  • U+1ABB
ParenthesesMn: Mark, nonspacingInherited
  • COMBINING DOUBLE PARENTHESES ABOVE
  • U+1ABC
Double parenthesesMn: Mark, nonspacingInherited
  • COMBINING PARENTHESES BELOW
  • U+1ABD
ParenthesesMn: Mark, nonspacingInherited
ᪿ
  • COMBINING LATIN SMALL LETTER W BELOW
  • U+1ABF
Latin small letter wMn: Mark, nonspacingInherited
  • COMBINING LATIN SMALL LETTER TURNED W BELOW
  • U+1AC0
Latin small letter turned wMn: Mark, nonspacingInherited
  • COMBINING DOTTED GRAVE ACCENT
  • U+1DC0
Dotted graveMn: Mark, nonspacingInherited
  • COMBINING DOTTED ACUTE ACCENT
  • U+1DC1
Dotted acuteMn: Mark, nonspacingInherited
  • COMBINING SNAKE BELOW
  • U+1DC2
SnakeMn: Mark, nonspacingInherited
  • COMBINING SUSPENSION MARK
  • U+1DC3
Suspension markMn: Mark, nonspacingInherited
  • COMBINING MACRON-ACUTE
  • U+1DC4
Macron-acuteMn: Mark, nonspacingInherited
  • COMBINING GRAVE-MACRON
  • U+1DC5
Grave-macronMn: Mark, nonspacingInherited
  • COMBINING MACRON-GRAVE
  • U+1DC6
Macron-graveMn: Mark, nonspacingInherited
  • COMBINING ACUTE-MACRON
  • U+1DC7
Acute-macronMn: Mark, nonspacingInherited
  • COMBINING GRAVE-ACUTE-GRAVE
  • U+1DC8
Grave-acute-graveMn: Mark, nonspacingInherited
  • COMBINING ACUTE-GRAVE-ACUTE
  • U+1DC9
Acute-grave-acuteMn: Mark, nonspacingInherited
  • COMBINING LATIN SMALL LETTER R BELOW
  • U+1DCA
Latin small letter rMn: Mark, nonspacingInherited
  • COMBINING BREVE-MACRON
  • U+1DCB
Breve-macronMn: Mark, nonspacingInherited
  • COMBINING MACRON-BREVE
  • U+1DCC
Macron-breveMn: Mark, nonspacingInherited
  • COMBINING DOUBLE CIRCUMFLEX ABOVE
  • U+1DCD
Double circumflexMn: Mark, nonspacingInherited
  • COMBINING OGONEK ABOVE
  • U+1DCE
OgonekMn: Mark, nonspacingInherited
  • COMBINING ZIGZAG BELOW
  • U+1DCF
ZigzagMn: Mark, nonspacingInherited
  • COMBINING IS BELOW
  • U+1DD0
IsMn: Mark, nonspacingInherited
  • COMBINING UR ABOVE
  • U+1DD1
UrMn: Mark, nonspacingInherited
  • COMBINING US ABOVE
  • U+1DD2
UsMn: Mark, nonspacingInherited
  • COMBINING LATIN SMALL LETTER FLATTENED OPEN A ABOVE
  • U+1DD3
Latin small letter flattened open aMn: Mark, nonspacingInherited
  • COMBINING LATIN SMALL LETTER AE
  • U+1DD4
Latin small letter aeMn: Mark, nonspacingInherited
  • COMBINING LATIN SMALL LETTER AO
  • U+1DD5
Latin small letter aoMn: Mark, nonspacingInherited
  • COMBINING LATIN SMALL LETTER AV
  • U+1DD6
Latin small letter avMn: Mark, nonspacingInherited
  • COMBINING LATIN SMALL LETTER C CEDILLA
  • U+1DD7
Latin small letter c cedillaMn: Mark, nonspacingInherited
  • COMBINING LATIN SMALL LETTER INSULAR D
  • U+1DD8
Latin small letter insular dMn: Mark, nonspacingInherited
  • COMBINING LATIN SMALL LETTER ETH
  • U+1DD9
Latin small letter ethMn: Mark, nonspacingInherited
  • COMBINING LATIN SMALL LETTER G
  • U+1DDA
Latin small letter gMn: Mark, nonspacingInherited
  • COMBINING LATIN LETTER SMALL CAPITAL G
  • U+1DDB
Latin letter small capital gMn: Mark, nonspacingInherited
  • COMBINING LATIN SMALL LETTER K
  • U+1DDC
Latin small letter kMn: Mark, nonspacingInherited
  • COMBINING LATIN SMALL LETTER L
  • U+1DDD
Latin small letter lMn: Mark, nonspacingInherited
  • COMBINING LATIN LETTER SMALL CAPITAL L
  • U+1DDE
Latin letter small capital lMn: Mark, nonspacingInherited
  • COMBINING LATIN LETTER SMALL CAPITAL M
  • U+1DDF
Latin letter small capital mMn: Mark, nonspacingInherited
  • COMBINING LATIN SMALL LETTER N
  • U+1DE0
Latin small letter nMn: Mark, nonspacingInherited
  • COMBINING LATIN LETTER SMALL CAPITAL N
  • U+1DE1
Latin letter small capital nMn: Mark, nonspacingInherited
  • COMBINING LATIN LETTER SMALL CAPITAL R
  • U+1DE2
Latin letter small capital rMn: Mark, nonspacingInherited
  • COMBINING LATIN SMALL LETTER R ROTUNDA
  • U+1DE3
Latin small letter r rotundaMn: Mark, nonspacingInherited
  • COMBINING LATIN SMALL LETTER S
  • U+1DE4
Latin small letter sMn: Mark, nonspacingInherited
  • COMBINING LATIN SMALL LETTER LONG S
  • U+1DE5
Latin small letter long sMn: Mark, nonspacingInherited
  • COMBINING LATIN SMALL LETTER Z
  • U+1DE6
Latin small letter zMn: Mark, nonspacingInherited
  • COMBINING LATIN SMALL LETTER ALPHA
  • U+1DE7
Latin small letter alphaMn: Mark, nonspacingInherited
  • COMBINING LATIN SMALL LETTER B
  • U+1DE8
Latin small letter bMn: Mark, nonspacingInherited
  • COMBINING LATIN SMALL LETTER BETA
  • U+1DE9
Latin small letter betaMn: Mark, nonspacingInherited
  • COMBINING LATIN SMALL LETTER SCHWA
  • U+1DEA
Latin small letter schwaMn: Mark, nonspacingInherited
  • COMBINING LATIN SMALL LETTER F
  • U+1DEB
Latin small letter fMn: Mark, nonspacingInherited
  • COMBINING LATIN SMALL LETTER L WITH DOUBLE MIDDLE TILDE
  • U+1DEC
Latin small letter l with double middle tildeMn: Mark, nonspacingInherited
  • COMBINING LATIN SMALL LETTER O WITH LIGHT CENTRALIZATION STROKE
  • U+1DED
Latin small letter o with light centralization strokeMn: Mark, nonspacingInherited
  • COMBINING LATIN SMALL LETTER P
  • U+1DEE
Latin small letter pMn: Mark, nonspacingInherited
  • COMBINING LATIN SMALL LETTER ESH
  • U+1DEF
Latin small letter eshMn: Mark, nonspacingInherited
  • COMBINING LATIN SMALL LETTER U WITH LIGHT CENTRALIZATION STROKE
  • U+1DF0
Latin small letter u with light centralization strokeMn: Mark, nonspacingInherited
  • COMBINING LATIN SMALL LETTER W
  • U+1DF1
Latin small letter wMn: Mark, nonspacingInherited
  • COMBINING LATIN SMALL LETTER A WITH DIAERESIS
  • U+1DF2
Latin small letter a with diaeresisMn: Mark, nonspacingInherited
  • COMBINING LATIN SMALL LETTER O WITH DIAERESIS
  • U+1DF3
Latin small letter o with diaeresisMn: Mark, nonspacingInherited
  • COMBINING LATIN SMALL LETTER U WITH DIAERESIS
  • U+1DF4
Latin small letter u with diaeresisMn: Mark, nonspacingInherited
  • COMBINING UP TACK ABOVE
  • U+1DF5
Up tackMn: Mark, nonspacingInherited
  • COMBINING DOT ABOVE LEFT
  • U+1DF8
Dot leftMn: Mark, nonspacingInherited
  • COMBINING WIDE INVERTED BRIDGE BELOW
  • U+1DF9
Wide inverted bridgeMn: Mark, nonspacingInherited
  • COMBINING DELETION MARK
  • U+1DFB
Deletion markMn: Mark, nonspacingInherited
  • COMBINING DOUBLE INVERTED BREVE BELOW
  • U+1DFC
Double inverted breveMn: Mark, nonspacingInherited
  • COMBINING ALMOST EQUAL TO BELOW
  • U+1DFD
Almost equal toMn: Mark, nonspacingInherited
  • COMBINING LEFT ARROWHEAD ABOVE
  • U+1DFE
Left arrowheadMn: Mark, nonspacingInherited
᷿
  • COMBINING RIGHT ARROWHEAD AND DOWN ARROWHEAD BELOW
  • U+1DFF
Right arrowhead and down arrowheadMn: Mark, nonspacingInherited
  • COMBINING LEFT HARPOON ABOVE
  • U+20D0
Left harpoonMn: Mark, nonspacingInherited
  • COMBINING RIGHT HARPOON ABOVE
  • U+20D1
Right harpoonMn: Mark, nonspacingInherited
  • COMBINING LONG VERTICAL LINE OVERLAY
  • U+20D2
Long vertical line overlayMn: Mark, nonspacingInherited
  • COMBINING SHORT VERTICAL LINE OVERLAY
  • U+20D3
Short vertical line overlayMn: Mark, nonspacingInherited
  • COMBINING ANTICLOCKWISE ARROW ABOVE
  • U+20D4
Anticlockwise arrowMn: Mark, nonspacingInherited
  • COMBINING CLOCKWISE ARROW ABOVE
  • U+20D5
Clockwise arrowMn: Mark, nonspacingInherited
  • COMBINING LEFT ARROW ABOVE
  • U+20D6
Left arrowMn: Mark, nonspacingInherited
  • COMBINING RIGHT ARROW ABOVE
  • U+20D7
Right arrowMn: Mark, nonspacingInherited
  • COMBINING RING OVERLAY
  • U+20D8
Ring overlayMn: Mark, nonspacingInherited
  • COMBINING CLOCKWISE RING OVERLAY
  • U+20D9
Clockwise ring overlayMn: Mark, nonspacingInherited
  • COMBINING ANTICLOCKWISE RING OVERLAY
  • U+20DA
Anticlockwise ring overlayMn: Mark, nonspacingInherited
  • COMBINING THREE DOTS ABOVE
  • U+20DB
Three dotsMn: Mark, nonspacingInherited
  • COMBINING FOUR DOTS ABOVE
  • U+20DC
Four dotsMn: Mark, nonspacingInherited
  • COMBINING LEFT RIGHT ARROW ABOVE
  • U+20E1
Left right arrowMn: Mark, nonspacingInherited
  • COMBINING REVERSE SOLIDUS OVERLAY
  • U+20E5
Reverse solidus overlayMn: Mark, nonspacingInherited
  • COMBINING DOUBLE VERTICAL STROKE OVERLAY
  • U+20E6
Double vertical stroke overlayMn: Mark, nonspacingInherited
  • COMBINING ANNUITY SYMBOL
  • U+20E7
Annuity symbolMn: Mark, nonspacingInherited
  • COMBINING TRIPLE UNDERDOT
  • U+20E8
Triple underdotMn: Mark, nonspacingInherited
  • COMBINING WIDE BRIDGE ABOVE
  • U+20E9
Wide bridgeMn: Mark, nonspacingInherited
  • COMBINING LEFTWARDS ARROW OVERLAY
  • U+20EA
Leftwards arrow overlayMn: Mark, nonspacingInherited
  • COMBINING LONG DOUBLE SOLIDUS OVERLAY
  • U+20EB
Long double solidus overlayMn: Mark, nonspacingInherited
  • COMBINING RIGHTWARDS HARPOON WITH BARB DOWNWARDS
  • U+20EC
Rightwards harpoon with barb downwardsMn: Mark, nonspacingInherited
  • COMBINING LEFTWARDS HARPOON WITH BARB DOWNWARDS
  • U+20ED
Leftwards harpoon with barb downwardsMn: Mark, nonspacingInherited
  • COMBINING LEFT ARROW BELOW
  • U+20EE
Left arrowMn: Mark, nonspacingInherited
  • COMBINING RIGHT ARROW BELOW
  • U+20EF
Right arrowMn: Mark, nonspacingInherited
  • COMBINING ASTERISK ABOVE
  • U+20F0
AsteriskMn: Mark, nonspacingInherited
  • COMBINING LIGATURE LEFT HALF
  • U+FE20
Ligature left halfMn: Mark, nonspacingInherited
  • COMBINING LIGATURE RIGHT HALF
  • U+FE21
Ligature right halfMn: Mark, nonspacingInherited
  • COMBINING DOUBLE TILDE LEFT HALF
  • U+FE22
Double tilde left halfMn: Mark, nonspacingInherited
  • COMBINING DOUBLE TILDE RIGHT HALF
  • U+FE23
Double tilde right halfMn: Mark, nonspacingInherited
  • COMBINING MACRON LEFT HALF
  • U+FE24
Macron left halfMn: Mark, nonspacingInherited
  • COMBINING MACRON RIGHT HALF
  • U+FE25
Macron right halfMn: Mark, nonspacingInherited
  • COMBINING CONJOINING MACRON
  • U+FE26
Conjoining macronMn: Mark, nonspacingInherited
  • COMBINING LIGATURE LEFT HALF BELOW
  • U+FE27
Ligature left halfMn: Mark, nonspacingInherited
  • COMBINING LIGATURE RIGHT HALF BELOW
  • U+FE28
Ligature right halfMn: Mark, nonspacingInherited
  • COMBINING TILDE LEFT HALF BELOW
  • U+FE29
Tilde left halfMn: Mark, nonspacingInherited
  • COMBINING TILDE RIGHT HALF BELOW
  • U+FE2A
Tilde right halfMn: Mark, nonspacingInherited
  • COMBINING MACRON LEFT HALF BELOW
  • U+FE2B
Macron left halfMn: Mark, nonspacingInherited
  • COMBINING MACRON RIGHT HALF BELOW
  • U+FE2C
Macron right halfMn: Mark, nonspacingInherited
  • COMBINING CONJOINING MACRON BELOW
  • U+FE2D
Conjoining macronMn: Mark, nonspacingInherited

See also

Notes

  1. The New Yorker is reported as being unique in its continuing usage of them. [1]

References

  1. Baum, Dan (16 December 2010). "The New Yorker's odd mark — the diaeresis". dscriber. Archived from the original on 16 December 2010. Among the many mysteries of The New Yorker is that funny little umlaut over words like coöperate and reëlect. The New Yorker seems to be the only publication on the planet that uses it, and I always found it a little pretentious until I did some research. Turns out, it's not an umlaut. It's a diaeresis.
  2. Sweet, Henry (1877). A Handbook of Phonetics. Cambridge: Cambridge University Press. pp. 174–175. Even letters with accents and diacritics [...] being only cast for a few founts, act practically as new letters. [...] We may consider the h in sh and th simply as a diacritic written for convenience on a line with the letter it modifies.
  3. Oxford English Dictionary
  4. Nestle, Eberhard (1888). Syrische Grammatik mit Litteratur, Chrestomathie und Glossar. Berlin: H. Reuther's Verlagsbuchhandlung. [translated to English as Syriac grammar with bibliography, chrestomathy and glossary, by R. S. Kennedy. London: Williams & Norgate 1889].
  5. Coakley, J. F. (2002). Robinson's Paradigms and Exercises in Syriac Grammar (5th ed.). Oxford University Press. ISBN   978-0-19-926129-1.
  6. Michaelis, Ioannis Davidis (1784). Grammatica Syriaca.
  7. Gramática de la Llingua Asturiana (PDF) (3rd ed.). Academia de la Llingua Asturiana. 2001. section 1.2. ISBN   84-8168-310-8. Archived from the original (PDF) on 2011-05-25. Retrieved 2011-06-07.
  8. http://www.juls.savba.sk/ediela/psp2000/psp.pdf page 12, section I.2
  9. Grønlands sprognævn (1992)
  10. Petersen (1990)
  11. S.P. Brock, "An Introduction to Syriac Studies", in J.H. Eaton (Ed.,), Horizons in Semitic Studies (1980)
  12. Norris, Mary (26 April 2012). "The Curse of the Diaeresis". The New Yorker. Retrieved 18 April 2014.
  13. van Geloven, Sander (2012). Diakritische tekens in het Nederlands (in Dutch). Utrecht: Hellebaard. Archived from the original on 2013-10-29.
  14. Steele, Shawn (2010-01-25). "Most combining characters in a Unicode glyph/character/whatever". Microsoft. Archived from the original on 2019-05-16. Retrieved 2019-11-25.