This article may rely excessively on sources too closely associated with the subject , potentially preventing the article from being verifiable and neutral.(November 2024) |
Michael Everson (born January 1963) is an American and Irish linguist, script encoder, typesetter, type designer and publisher. He runs a publishing company called Evertype, through which he has published over one hundred books since 2006.
His central area of expertise is with writing systems of the world, specifically in the representation of these systems in formats for computer and digital media. In 2003 Rick McGowan said he was "probably the world's leading expert in the computer encoding of scripts" [1] for his work to add a wide variety of scripts and characters to the Universal Character Set. Since 1993, he has written over two hundred proposals [2] which have added thousands of characters to ISO/IEC 10646 and the Unicode standard; as of 2003, he was credited as the leading contributor of Unicode proposals. [1]
This section of a biography of a living person needs additional citations for verification .(August 2018) |
Everson was born in Norristown, Pennsylvania, and moved to Tucson, Arizona, at the age of 12. His interest in the works of J. R. R. Tolkien led him to study Old English and then other Germanic languages. He read German, Spanish, and French for his B.A. at the University of Arizona (1985), and the History of Religions and Indo-European linguistics for his M.A. at the University of California, Los Angeles (1988)[ citation needed ].
In 1989, a former professor, Dr. Marija Gimbutas, asked him to read a paper [3] on Basque mythology at an Indo-Europeanist Conference held in Ireland; shortly thereafter he moved to Dublin, where he studied as a Fulbright Scholar in the Faculty of Celtic Studies, University College Dublin (1991). [4] He became a naturalized Irish citizen in 2000, although he retains American citizenship. [1]
Everson is active in supporting minority-language communities, especially in the fields of character encoding standardization and internationalization. In addition to being one of the primary contributing editors of the Unicode Standard, he is also a contributing editor to ISO/IEC 10646, registrar for ISO 15924, [5] and subtag reviewer for BCP 47. He has contributed to the encoding of many scripts and characters in those standards, receiving the Unicode Bulldog Award in 2000 [6] for his technical contributions to the development and promotion of the Unicode Standard. In 2004, Everson was appointed convenor of ISO TC46/WG3 (Conversion of Written Languages), which is responsible for transliteration standards.
Everson is one of the co-editors (along with Rick McGowan, Ken Whistler, and V.S. Umamaheswaran) of the Unicode roadmaps that detail actual and proposed allocations for current and future Unicode scripts and blocks. [7]
On July 1, 2012, Everson was appointed to the Volapük Academy by the Cifal, Brian R. Bishop, for his work in Volapük publishing. [8]
Everson has been actively involved in the encoding of many scripts [9] in the Unicode and ISO/IEC 10646 standards, including Avestan, Balinese, Bamum, Bassa Vah, Batak, Braille, Brāhmī, Buginese, Buhid, Unified Canadian Aboriginal Syllabics, Carian, Cham, Cherokee, Coptic, Cuneiform, Cypriot, Deseret, Duployan, Egyptian hieroglyphs, Elbasan, Ethiopic, Georgian, Glagolitic, Gothic, Hanunóo, Imperial Aramaic, Inscriptional Pahlavi, Inscriptional Parthian, Javanese, Kayah Li, Khmer, Lepcha, Limbu, Linear A, Linear B, Lycian, Lydian, Mandaic, Manichaean, Meitei Mayek, Mongolian, Myanmar, Nabataean, New Tai Lue, N'Ko, Ogham, Ol Chiki, Old Hungarian, Old Italic, Old North Arabian, Old Persian, Old South Arabian, Old Turkic, Osmanya, Palmyrene, Phaistos Disc, Phoenician, Rejang, Runic, Samaritan, Saurashtra, Shavian, Sinhala, Sundanese, Tagalog, Tagbanwa, Tai Le, Tai Tham, Thaana, Tibetan, Ugaritic, Vai, and Yi, as well as many characters belonging to the Latin, Greek, Cyrillic, and Arabic scripts.
Everson authored or co-authored proposals for many symbol characters for encoding into Unicode and ISO/IEC 10646. Among those proposals submitted to ISO/IEC JTC 1/SC 2/WG 2 that have been accepted and encoded: N2586R (U+267E♾PERMANENT PAPER SIGN and four other miscellaneous symbols [10] admitted into Unicode 4.1), N3727 (the 26 Regional Indicator Symbols used in pairs to generate national flags in emoji contexts; [11] adopted into Unicode 6.0), and N4783R2 (chess notation symbols [12] encoded into Unicode 11.0).
Among proposals that have not yet been approved for encoding: N1866 (an early proposal for encoding Blissymbols into the Supplementary Multilingual Plane of Unicode; [13] still listed in the SMP roadmap as of Unicode 15.0 [14] although no further action had been taken on it for years).
Everson, along with Doug Ewell, Rebecca Bettencourt, Ricardo Bánffy, Eduardo Marín Silva, Elias Mårtenson, Mark Shoulson, Shawn Steele, and Rebecca Turner, is a contributor to the Terminals Working Group researching obscure characters found in legacy character sets used by home computers (or "microcomputers"), terminals, and other legacy devices made from the mid-1970s until the mid-1980s; thanks to their proposal L2/19-025, 212 graphic characters for compatibility with MSX, Commodore 64, and other microcomputers of the era, as well as Teletext, were encoded in the Symbols for Legacy Computing block, [15] while 731 additional characters from L2/21-235 have been accepted for a future version of the standard. [16]
In 1995 he designed a Unicode font, Everson Mono, a monospaced typeface with more than 4,800 characters. This font was the third Unicode-encoded font to contain a large number of characters from many character blocks, after Lucida Sans Unicode and Unihan font (both 1993). In 2007 he was commissioned by the International Association of Coptic Studies to create a standard free Unicode 5.1 font for Coptic, Antinoou, using the Sahidic style. [17]
Together with John Cowan, he is also responsible for the ConScript Unicode Registry, a project to coordinate the mapping of artificial scripts into the Unicode Private Use Area. Among the scripts "encoded" in the CSUR, Shavian and Deseret were eventually formally adopted into Unicode; two other conscripts under consideration are Tolkien's scripts of Tengwar and Cirth.
Everson has also created locale and language information for many languages, from support for the Irish language and the other Celtic languages to the minority Languages of Finland. [18] In 2000, together with Trond Trosterud, he co-authored Software localization into Nynorsk Norwegian, a report commissioned by the Norwegian Language Council. In 2003 he was commissioned by the United Nations Development Programme to prepare a report [19] on the computer locale requirements for the major languages of Afghanistan (Pashto, Dari, and Uzbek), co-authored by Roozbeh Pournader, which was endorsed by the Ministry of Communications of the Afghan Transitional Islamic Administration. [20] More recently, UNESCO's Initiative B@bel [21] funded Everson's work to encode the N'Ko and Balinese scripts. [22]
In 2007 he co-authored a proposal for a new standard written form of Cornish, called Kernowek Standard. [23] Following the publication of the Standard Written Form in 2008, Everson and a group of other users examined the specification and implemented a set of modifications to it, publishing a formal specification in 2012. [24]
As of March 2014 [update] Everson operates a publishing company, Evertype, through which he has published a total of 295 books. [25] These include a wide range of titles by various authors and editors, with Everson himself as co-author of one, editor of several, and having adapted or revised several more. He also designed fonts for several. [25]
Everson has a particular interest in Gaelic typeface design, and does a considerable amount of work typesetting books in Irish for publication by Evertype. [26]
Another project consists of his publications of translations of Alice's Adventures in Wonderland in many languages, amongst which are minority languages and constructed languages. [27] Translations are available in Cornish, Esperanto, French, German, Hawaiian, Irish, Italian, Jèrriais, Ladino, Latin, Lingua Franca Nova, Lingwa de planeta, Low German, Manx, Mennonite Low German, Borain Picard, Sambahsa, Scots, Shavian transliteration, Swedish, Ulster Scots and Welsh, with several other translations being prepared.[ citation needed ]
ISO/IEC 8859-15:1999, Information technology — 8-bit single-byte coded graphic character sets — Part 15: Latin alphabet No. 9, is part of the ISO/IEC 8859 series of ASCII-based standard character encodings, first edition published in 1999. It is informally referred to as Latin-9. It is similar to ISO 8859-1, and thus also intended for “Western European” languages, but replaces some less common symbols with the euro sign and some letters that were deemed necessary.
The Coptic script is the script used for writing the Coptic language, the most recent development of Egyptian. The repertoire of glyphs is based on the uncial Greek alphabet, augmented by letters borrowed from the Egyptian Demotic. It was the first alphabetic script used for the Egyptian language. There are several Coptic alphabets, as the script varies greatly among the various dialects and eras of the Coptic language.
A constructed writing system or a neography is a writing system specifically created by an individual or group, rather than having evolved as part of a language or culture like a natural script. Some are designed for use with constructed languages, although several of them are used in linguistic experimentation or for other more practical ends in existing languages. Prominent examples of constructed scripts include Korean Hangul and Tengwar.
The ConScript Unicode Registry is a volunteer project to coordinate the assignment of code points in the Unicode Private Use Areas (PUA) for the encoding of artificial scripts, such as those for constructed languages. It was founded by John Cowan and was maintained by him and Michael Everson. It is not affiliated with the Unicode Consortium.
ISO 15924, Codes for the representation of names of scripts, is an international standard defining codes for writing systems or scripts. Each script is given both a four-letter code and a numeric code.
Kra is a glyph formerly used to write the Kalaallisut language of Greenland and is now only found in Inuttitut, a distinct Inuktitut dialect. It is visually similar to a Latin small capital letter K, a Greek letter Kappa: κ, or a Cyrillic small letter Ka: к.
L, or l, is the twelfth letter of the Latin alphabet, used in the modern English alphabet, the alphabets of other western European languages and others worldwide. Its name in English is el, plural els.
A Unicode font is a computer font that maps glyphs to code points defined in the Unicode Standard. The vast majority of modern computer fonts use Unicode mappings, even those fonts which only include glyphs for a single writing system, or even only support the basic Latin alphabet. Fonts which support a wide range of Unicode scripts and Unicode symbols are sometimes referred to as "pan-Unicode fonts", although as the maximum number of glyphs that can be defined in a TrueType font is restricted to 65,535, it is not possible for a single font to provide individual glyphs for all defined Unicode characters. This article lists some widely used Unicode fonts that support a comparatively large number and broad range of Unicode characters.
Everson Mono is a monospaced humanist sans serif Unicode font whose development by Michael Everson began in 1995. At first, Everson Mono was a collection of 8-bit fonts containing glyphs for tables in ISO/IEC 10646; at that time, it was not easy to edit cmaps to have true Unicode indices, and there were very few applications which could do anything with a font so encoded in any case. The original "Everson Mono" had a MacRoman character set, and other character sets were provided as separate files named with suffixes: "Everson Mono Latin B", "Everson Mono Currency", "Everson Mono Armenian" and so on. A range of fonts with the character set of the ISO/IEC 8859 series were also made. A single Unicode font file incorporating most or all of the characters from all of the previous separate Everson Mono files was named "Everson Mono Unicode" in 2003, but since 2008 the single large font has been named simply "Everson Mono". At present, there are regular, italic, bold, and bold-italic styles.
The Klingon scripts are fictional alphabetic scripts used in the Star Trek movies and television shows to write the Klingon language.
The Universal Coded Character Set is a standard set of characters defined by the international standard ISO/IEC 10646, Information technology — Universal Coded Character Set (UCS), which is the basis of many character encodings, improving as characters from previously unrepresented typing systems are added.
The regional indicator symbols are a set of 26 alphabetic Unicode characters (A–Z) intended to be used to encode ISO 3166-1 alpha-2 two-letter country codes in a way that allows optional special treatment.
Runic is a Unicode block containing runic characters. It was introduced in Unicode 3.0 (1999), with eight additional characters introduced in Unicode 7.0 (2014). The original encoding of runes in UCS was based on the recommendations of the "ISO Runes Project" submitted in 1997.
Phaistos Disc is a Unicode block containing the characters found on the undeciphered Phaistos Disc artefact.
Newa is a Unicode block containing characters from the Newa alphabet, which is used to write Nepal Bhasa.
ISO 5426 is a character set developed by ISO, similar to ISO/IEC 6937. It was first published in 1983.
ISO-IR-197 is an 8-bit, single-byte character encoding which was designed for the Sámi languages. It is a modification of ISO 8859-1, replacing certain punctuation and symbol characters with additional letters used in certain Sámi orthographies. FreeDOS calls it code page 59187.
The Khmer keyboard includes several keyboard layouts for Khmer script.