Michael Everson

Everson in 2011

Michael Everson (born January 1963) is an American and Irish linguist, script encoder, typesetter, type designer and publisher. He runs a publishing company called Evertype, through which he has published over a hundred books since 2006.

His central area of expertise is the writing systems of the world, specifically their representation in formats for computer and digital media. In 2003 Rick McGowan said he was "probably the world's leading expert in the computer encoding of scripts" [1] for his work to add a wide variety of scripts and characters to the Universal Character Set. Since 1993, he has written over two hundred proposals [2] which have added thousands of characters to ISO/IEC 10646 and the Unicode standard; as of 2003, he was credited as the leading contributor of Unicode proposals. [1]

Life

Everson was born in Norristown, Pennsylvania, and moved to Tucson, Arizona, at the age of 12. His interest in the works of J. R. R. Tolkien led him to study Old English and then other Germanic languages. He read German, Spanish, and French for his B.A. at the University of Arizona (1985), and the History of Religions and Indo-European linguistics for his M.A. at the University of California, Los Angeles (1988). [citation needed]

In 1989, a former professor, Dr. Marija Gimbutas, asked him to read a paper [3] on Basque mythology at an Indo-Europeanist Conference held in Ireland; shortly thereafter he moved to Dublin, where he studied as a Fulbright Scholar in the Faculty of Celtic Studies, University College Dublin (1991). [4] He became a naturalized Irish citizen in 2000, although he retains American citizenship. [citation needed]

Work

Everson is active in supporting minority-language communities, especially in the fields of character encoding standardization and internationalization. In addition to being one of the primary contributing editors of the Unicode Standard, he is also a contributing editor to ISO/IEC 10646, registrar for ISO 15924, [5] and subtag reviewer for BCP 47. He has contributed to the encoding of many scripts and characters in those standards, receiving the Unicode Bulldog Award in 2000 [6] for his technical contributions to the development and promotion of the Unicode Standard. In 2004, Everson was appointed convenor of ISO TC46/WG3 (Conversion of Written Languages), which is responsible for transliteration standards.

Everson is one of the co-editors (along with Rick McGowan, Ken Whistler, and V.S. Umamaheswaran) of the Unicode roadmaps that detail actual and proposed allocations for current and future Unicode scripts and blocks. [7]

On July 1, 2012, Everson was appointed to the Volapük Academy by the Cifal, Brian R. Bishop, for his work in Volapük publishing. [8]

Encoding of scripts

Everson has been actively involved in the encoding of many scripts [9] in the Unicode and ISO/IEC 10646 standards, including Avestan, Balinese, Bamum, Bassa Vah, Batak, Braille, Brāhmī, Buginese, Buhid, Unified Canadian Aboriginal Syllabics, Carian, Cham, Cherokee, Coptic, Cuneiform, Cypriot, Deseret, Duployan, Egyptian hieroglyphs, Elbasan, Ethiopic, Georgian, Glagolitic, Gothic, Hanunóo, Imperial Aramaic, Inscriptional Pahlavi, Inscriptional Parthian, Javanese, Kayah Li, Khmer, Lepcha, Limbu, Linear A, Linear B, Lycian, Lydian, Mandaic, Manichaean, Meitei Mayek, Mongolian, Myanmar, Nabataean, New Tai Lue, N'Ko, Ogham, Ol Chiki, Old Hungarian, Old Italic, Old North Arabian, Old Persian, Old South Arabian, Old Turkic, Osmanya, Palmyrene, Phaistos Disc, Phoenician, Rejang, Runic, Samaritan, Saurashtra, Shavian, Sinhala, Sundanese, Tagalog, Tagbanwa, Tai Le, Tai Tham, Thaana, Tibetan, Ugaritic, Vai, and Yi, as well as many characters belonging to the Latin, Greek, Cyrillic, and Arabic scripts.

Encoding of symbols

Everson authored or co-authored proposals for many symbol characters for encoding into Unicode and ISO/IEC 10646. Among those proposals submitted to ISO/IEC JTC 1/SC 2/WG 2 that have been accepted and encoded: N2586R (U+267E PERMANENT PAPER SIGN and four other miscellaneous symbols [10] admitted into Unicode 4.1), N3727 (the 26 Regional Indicator Symbols used in pairs to generate national flags in emoji contexts; [11] adopted into Unicode 6.0), and N4783R2 (chess notation symbols [12] encoded into Unicode 11.0).
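
The pairing of Regional Indicator Symbols can be illustrated with a short Python sketch. It assumes only that the 26 symbols occupy the contiguous range U+1F1E6 to U+1F1FF in alphabetical order; the function name `flag_sequence` is hypothetical and chosen for illustration. Whether a given pair is actually drawn as a flag depends on the font and platform, so the sketch produces the code point pair and nothing more.

```python
# Regional Indicator Symbols: U+1F1E6 (letter A) through U+1F1FF (letter Z).
REGIONAL_INDICATOR_A = 0x1F1E6


def flag_sequence(alpha2: str) -> str:
    """Map a two-letter code such as "IE" to its pair of regional indicators.

    Hypothetical helper: it does not check that the code is an assigned
    ISO 3166-1 alpha-2 code, only that it is two ASCII letters.
    """
    if len(alpha2) != 2 or not (alpha2.isascii() and alpha2.isalpha()):
        raise ValueError(f"expected two ASCII letters, got {alpha2!r}")
    return "".join(
        chr(REGIONAL_INDICATOR_A + ord(ch) - ord("A")) for ch in alpha2.upper()
    )


if __name__ == "__main__":
    for code in ("IE", "US"):
        seq = flag_sequence(code)
        # Print the code points; rendering as a flag is up to the display.
        print(code, " ".join(f"U+{ord(ch):04X}" for ch in seq))
```

A renderer that supports emoji flags may display each pair as a single flag glyph; one that does not typically falls back to showing two boxed letters.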

Among proposals that have not yet been approved for encoding: N1866 (an early proposal for encoding Blissymbols into the Supplementary Multilingual Plane of Unicode; [13] still listed in the SMP roadmap as of Unicode 15.0 [14] although no further action had been taken on it for years).

Everson, along with Doug Ewell, Rebecca Bettencourt, Ricardo Bánffy, Eduardo Marín Silva, Elias Mårtenson, Mark Shoulson, Shawn Steele, and Rebecca Turner, is a contributor to the Terminals Working Group, which researches obscure characters found in legacy character sets used by home computers (or "microcomputers"), terminals, and other legacy devices made from the mid-1970s until the mid-1980s. As a result of the group's proposal L2/19-025, 212 graphic characters for compatibility with MSX, Commodore 64, and other microcomputers of the era, as well as Teletext, were encoded in the Symbols for Legacy Computing block, [15] while 731 additional characters from L2/21-235 have been accepted for a future version of the standard. [16]

Font development

In 1995 he designed a Unicode font, Everson Mono, a monospaced typeface with more than 4,800 characters. This font was the third Unicode-encoded font to contain a large number of characters from many character blocks, after Lucida Sans Unicode and Unihan font (both 1993). In 2007 he was commissioned by the International Association of Coptic Studies to create a standard free Unicode 5.1 font for Coptic, Antinoou, using the Sahidic style. [17]

ConScript Unicode Registry

Together with John Cowan, he is also responsible for the ConScript Unicode Registry, a project to coordinate the mapping of artificial scripts into the Unicode Private Use Area. Among the scripts "encoded" in the CSUR, Shavian and Deseret were eventually formally adopted into Unicode; two other conscripts under consideration are Tolkien's scripts of Tengwar and Cirth.
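
As a rough illustration of what mapping a script into private-use space involves, the following Python sketch tests whether a code point lies in one of Unicode's private-use ranges. The three ranges are those defined by the Unicode Standard; the function name `is_private_use` is hypothetical, and the sketch makes no claim about which code points the registry assigns to any particular script.

```python
import unicodedata

# Private-use ranges defined by the Unicode Standard.
PRIVATE_USE_RANGES = (
    (0xE000, 0xF8FF),      # Private Use Area (Basic Multilingual Plane)
    (0xF0000, 0xFFFFD),    # Supplementary Private Use Area-A (plane 15)
    (0x100000, 0x10FFFD),  # Supplementary Private Use Area-B (plane 16)
)


def is_private_use(code_point: int) -> bool:
    """Return True if code_point falls in a private-use range."""
    return any(low <= code_point <= high for low, high in PRIVATE_USE_RANGES)


if __name__ == "__main__":
    for cp in (0x0041, 0xE000, 0xF8FF, 0x10FFFD):
        # General category "Co" marks private-use code points.
        category = unicodedata.category(chr(cp))
        print(f"U+{cp:04X} private use: {is_private_use(cp)} (category {category})")
```

Because the standard assigns no meaning to these code points, text that uses them is interpretable only by parties who share the same private agreement, which is the gap a registry of this kind is meant to fill.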

Language and locale information

Everson has also created locale and language information for many languages, from support for the Irish language and the other Celtic languages to the minority languages of Finland. [18] In 2000, together with Trond Trosterud, he co-authored Software localization into Nynorsk Norwegian, a report commissioned by the Norwegian Language Council. In 2003 he was commissioned by the United Nations Development Programme to prepare a report [19] on the computer locale requirements for the major languages of Afghanistan (Pashto, Dari, and Uzbek), co-authored by Roozbeh Pournader, which was endorsed by the Ministry of Communications of the Afghan Transitional Islamic Administration. [20] More recently, UNESCO's Initiative B@bel [21] funded Everson's work to encode the N'Ko and Balinese scripts. [22]

Work on a standard for Cornish

In 2007 he co-authored a proposal for a new standard written form of Cornish, called Kernowek Standard. [23] Following the publication of the Standard Written Form in 2008, Everson and a group of other users examined the specification and implemented a set of modifications to it, publishing a formal specification in 2012. [24]

Publishing at Evertype

As of March 2014 Everson operates a publishing company, Evertype, through which he has published a total of 295 books. [25] These include a wide range of titles by various authors and editors; Everson himself co-authored one, edited several, and adapted or revised several more. He also designed fonts for several. [25]

Everson has a particular interest in Gaelic typeface design, and does a considerable amount of work typesetting books in Irish for publication by Evertype. [26]

Another project consists of his publications of translations of Alice's Adventures in Wonderland in many languages, amongst which are minority languages and constructed languages. [27] Translations are available in Cornish, Esperanto, French, German, Hawaiian, Irish, Italian, Jèrriais, Ladino, Latin, Lingua Franca Nova, Lingwa de planeta, Low German, Manx, Mennonite Low German, Borain Picard, Sambahsa, Scots, Shavian transliteration, Swedish, Ulster Scots and Welsh, with several other translations being prepared. [citation needed]

References

  1. Erard, Michael (September 25, 2003). "For the World's ABC's, He Makes 1's and 0's". The New York Times. Retrieved May 23, 2008.
  2. Everson, Michael (January 27, 2007). "Papers formally submitted to the Unicode Technical Committee and ISO/IEC JTC1/SC2/WG2". Evertype.
  3. "Tenacity in religion, myth, and folklore". Evertype.com. Retrieved March 11, 2015.
  4. "About Michael Everson". Evertype.com. Retrieved November 28, 2010.
  5. "ISO 15924 Registration Authority". ISO, Unicode, Inc., & Evertype. 2004. Retrieved March 11, 2015.
  6. "The Bulldog Award". Unicode, Inc. Retrieved March 11, 2015.
  7. "Roadmaps to Unicode". unicode.org.
  8. Archived January 23, 2013, at the Wayback Machine
  9. Everson, Michael (March 19, 2009). "Papers formally submitted to the Unicode Technical Committee and ISO/IEC JTC1/SC2/WG2 (Universal Character Set)". Evertype. Retrieved March 11, 2015.
  10. "Proposal to encode five miscellaneous symbols in the UCS" (PDF). unicode.org. September 4, 2003.
  11. "Proposal to encode Regional Indicator Symbols in the UCS" (PDF). unicode.org. September 18, 2009.
  12. "Proposal to encode symbols for chess notation" (PDF). unicode.org. January 26, 2017.
  13. "Encoding Blissymbolics in Plane 1 of the UCS" (PDF). unicode.org. September 10, 1998.
  14. "Roadmap to the SMP". unicode.org. April 9, 2023.
  15. "Proposal to add characters from legacy computers and teletext to the UCS" (PDF). unicode.org. January 4, 2019.
  16. "Approved Minutes of UTC Meeting 170". unicode.org. January 25, 2022.
  17. "Antinoou - A standard font for Coptic". Evertype. Retrieved March 11, 2015.
  18. Everson, Michael (March 14, 1997). "Sami locales". Evertype. Retrieved March 11, 2015.
  19. Everson, Michael; Pournader, Roozbeh (July 29, 2003). "Computer Locale Requirements for Afghanistan" (PDF). Evertype.
  20. Lepage, Marc (April–June 2003). "Afghans beat language obstacle to entering digital age" (PDF). Poverty Alleviation Initiatives. United Nations Economic and Social Commission for Asia and the Pacific. 13 (2). Archived from the original (PDF) on October 12, 2006.
  21. "UNESCO B@bel Initiative". Portal.unesco.org. Archived from the original on March 13, 2005. Retrieved November 21, 2011.
  22. "Development of a Unicode standard for the West African Language N'ko". Multilingualism in Cyberspace. UNESCO. November 12, 2004. Archived from the original on January 7, 2016. Retrieved March 11, 2015.
  23. Archived September 27, 2007, at the Wayback Machine
  24. Williams, Nicholas. 2012. Desky Kernowek: A Complete Guide to Cornish. Cathair na Mart: Evertype. ISBN 978-1-904808-99-2
  25. "Evertype complete catalogue". Evertype.com. Retrieved December 26, 2019.
  26. Everson, Michael (October 5, 2006). "Books typeset by Michael Everson". Evertype. Retrieved March 11, 2015.
  27. "Evertype Publications - Wonderland and Carrolliana". Evertype.com. Retrieved March 11, 2015.