ConScript Unicode Registry

Last updated

The ConScript Unicode Registry is a volunteer project to coordinate the assignment of code points in the Unicode Private Use Areas (PUA) for the encoding of artificial scripts, such as those for constructed languages. [1] It was founded by John Cowan and was maintained by him and Michael Everson. It is not affiliated with the Unicode Consortium. [2]

Contents

History

The ConScript Unicode Registry is a volunteer project [3] that was founded by John Cowan [4] in the early 1990s. It is a joint project of John Cowan and Michael Everson. [4]

Historically, it was hosted on both Cowan and Everson's websites (branded as the North American and European sites, respectively); in 2002, the site was transitioned to be hosted exclusively on Everson's site. [5] [6]

Since 2008, the ConScript Unicode Registry has been largely unmaintained; in 2008, Cowan explained that Everson was too busy to continue maintaining the project. [7] Due to this inactivity, Rebecca Bettencourt founded the Under-ConScript Unicode Registry, aiming to coordinate code points for constructed languages until they can be formally added to the ConScript Unicode Registry. [8] Scripts added to the Under-ConScript Unicode Registry include Sitelen Pona (for Toki Pona) [9] and Cirth. [10]

Scripts

The CSUR and UCSUR include the following scripts: [1] [11] [8]

  Withdrawn
  Allocated only in the UCSUR
Scripts in the ConScript Unicode Registry [1] [11] and Under-ConScript Unicode Registry [8]
Writing systemCreator(s)Code rangeNotes
Aiha Ursula K. Le Guin F8A0–F8CF
AlzetjanHerman MillerE550–E57F
AmlinMarnanel ThurmanE6D0−E6EF
Amman-IarDavid BellE2A0–E2CF
AthMorioka HiroyukiF4C0−F4EF
aUI John W. WeilgartE280–E29F
AurebeshStephen CraneE890−E8DF
Braille Louis Braille EB60−EB9FThe Braille Extended block includes all permutations of all Braille patterns as specified in the original 1829 specification, but not otherwise encoded in the Braille Patterns Unicode block. [12]
Cirth J. R. R. Tolkien E080–E0FFUnicode proposal made in September 1997 [13]
No action taken but Cirth appears in the Roadmap to the SMP [14]
Cistercian numerals EBA0−EBDF
CylenianDanae DekkerEC00−EC2F
DeiniDana NutterED00−ED3F
DeraniHoemaıF16B0−F16DF
Deseret E830–E88FWithdrawn after inclusion in Unicode. Use the corresponding Unicode block (U+10400U+1044F).
D'niRichard A. WatsonE830−E88F
Engsvanyáli M. A. R. Barker E100–E14F
Ewellic Doug EwellE680–E6CF
Ferengi Timothy MillerE600–E62F
Gargoyle Herman MillerE5C0–E5DF
Glaitha-ARebecca G. BettencourtE900−E97F
Glaitha-BRebecca G. BettencourtE980−E9FF
GraflectAaron PatersonEC70−ECEF
GrawlixesMort WalkerEDF0−EDFF
IlianoreJeff SmithE1B0–E1CF
IranicAli Hossein MohammadiED80−EDAF
Kazat ?AkkorouHerman MillerE430–E44F
KazvaradHerman MillerE450–E46F
KelwathiHerman MillerE4F0–E4FF
KinyaMaurizio M. GavoliE150–E1AF
Kinya

llables

Maurizio M. GavoliF0000–F0E6F
Klingon Astra Image CorporationF8D0–F8FFUnicode proposal made in 1997 [15]
Formally rejected by Unicode in 2001 [16] [17]
Lapointe Hexadecimal NumeralsBoby LapointeEBE0−EBEF
LhenaziArthaey AngosiiEA00−EA9F
Martin Hexadecimal NumeralsBruce Alan MartinEBF0−EBFF
MizarianHerman MillerE300–E33F
MonofonSteve T. BellE800–E82F
NijiNorbert LindenbergED40−ED7F
NísklôzHerman MillerE400–E42F
OlaetyanHerman MillerE3B0–E3FF
Ophidian E5E0–E5FF
Orokin Digital Extremes EB00−EB3F
Phaistos Disc E6D0–E6FFWithdrawn after inclusion in Unicode. Use the corresponding Unicode block (U+101D0U+101FF).
PiktoJohn E. WilliamsF0E70–F16A4
RøzhxhHerman MillerE490–E4BF
RynnanHerman MillerE520–E54F
SadalianWong Sadale Cho ChingF2000−F267F
SaklorHerman MillerE500–E51F
SarkaiHerman MillerE360–E37F
SerivelnaHerman MillerE4C0–E4EF
Seussian Latin Extensions Theodor Geisel E630–E64F
Shavian Ronald Kingsley Read E700–E72FWithdrawn after inclusion in Unicode. Use the corresponding Unicode block (U+10450U+1047F).
SemtogQuefei HuangF1B00−F1C3F
Sitelen Pona Sonja LangF1900−F19FF
Sitelen Pona RadicalsSonja Lang et al.F1C80−F1C9F
Solresol Jean François Sudre E770–E77F
SsûrakiHerman MillerE5A0–E5BF
Standard Galactic Tom Hall EB40−EB5F
StreichTommaso DonnarummaE2D0–E2DF
SyaiBrian LilburnE1D0–E1FF
SylabicaMarcin KowalczykEC30−EC6F
SyrrinDanae DekkerEC00−EC2F
TassaruneseNarylisEDB0−EDDF
TelarassoHerman MillerE580–E59F
Tengwar J. R. R. Tolkien E000–E07FUnicode proposal made in September 1997. [18] No action taken but Tengwar appears in the Roadmap to the SMP. [14]
ThelwikHerman MillerE380–E3AF
Titi PulaÐoom EpictooþF1C40−F1C7F
TonalJohn W. NystromE8E0−E8FF
Unifon John MaloneE740–E76FIn UCSUR, the block Unifon Extended was added in the code range U+E6F0–U+E73F. [19]
Verdurian Mark RosenfelderE200–E26F
Visible Speech Alexander Melville Bell E780–E7FF
WanyaMax Dominik WeberEAA0−EAFF
Whitaker Hexadecimal NumeralsRonald O. WhitakerECF0-ECFF
XaîniPaul BlakeE2E0–E2FF
ZarkhándHerman MillerE470–E48F
ZeseJack EisenmannEDE0−EDEF
Zírí:NkaHerman MillerE340–E35F

Font support

Some fonts support ConScript Unicode specified code points:

See also

Related Research Articles

<span class="mw-page-title-main">Cirth</span> Artificial script in the fantasy works of J. R. R. Tolkien

The Cirth is a semi‑artificial script, based on real‑life runic alphabets, one of several scripts invented by J. R. R. Tolkien for the constructed languages he devised and used in his works. Cirth is written with a capital letter when referring to the writing system; the letters themselves can be called cirth.

<span class="mw-page-title-main">Michael Everson</span> American-Irish type designer (born 1963)

Michael Everson is an American and Irish linguist, script encoder, typesetter, type designer and publisher. He runs a publishing company called Evertype, through which he has published over one hundred books since 2006.

The Ideographic Research Group (IRG), formerly called the Ideographic Rapporteur Group, is a subgroup of Working Group 2 (WG2) of ISO/IEC JTC1 Subcommittee 2 (SC2), which is the committee responsible for developing the Universal Coded Character Set. IRG is tasked with preparing and reviewing sets of CJK unified ideographs for eventual inclusion in both ISO/IEC 10646 and The Unicode Standard. The IRG is composed of representatives from national standards bodies from China, Japan, South Korea, Vietnam, and other regions that have historically used Chinese characters, as well as experts from liaison organizations such as the SAT Daizōkyō Text Database Committee (SAT), Taipei Computer Association (TCA), and the Unicode Technical Committee (UTC). The group holds two meetings every year lasting 4-5 days each, subsequently reporting its activities to its parent ISO/IEC JTC 1/SC 2 (SC2/WG2) committee.

Runic is a Unicode block containing runic characters. It was introduced in Unicode 3.0 (1999), with eight additional characters introduced in Unicode 7.0 (2014). The original encoding of runes in UCS was based on the recommendations of the "ISO Runes Project" submitted in 1997.

Optical Character Recognition is a Unicode block containing signal characters for OCR and MICR standards.

Phaistos Disc is a Unicode block containing the characters found on the undeciphered Phaistos Disc artefact.

Meroitic Hieroglyphs is a Unicode block formal hieroglyphic containing characters for writing the Meroitic language.

Tamil All Character Encoding (TACE16) is a scheme for encoding the Tamil script in the Private Use Area of Unicode, implementing a syllabary-based character model differing from the modified-ISCII model used by Unicode's existing Tamil implementation.

Caucasian Albanian is a Unicode block containing characters used by the Caucasian Albanian peoples of Azerbaijan and Dagestan for writing Northeast Caucasian languages.

Khudawadi is a Unicode block containing characters of the Khudabadi script used by some Sindhis in India for writing the Sindhi language.

Mahajani is a Unicode block containing characters historically used for writing Punjabi and Marwari.

Mende Kikakui is a Unicode block containing the Kikakui characters for writing the Mende language.

Modi is a Unicode block containing the Modi alphabet characters for writing the Marathi language.

Nabataean is a Unicode block containing characters for writing the ancient Nabataean language.

Old Permic is a Unicode block containing Old Permic characters for writing the Komi language.

Siddham is a Unicode block containing characters for the historical, Brahmi-derived Siddham script used for writing Sanskrit between the years c. 550 – c. 1200.

Multani is a Unicode block containing characters used for writing the Multani alphabet, a Brahmic script used in the Multan region of Punjab and in northern Sindh in Pakistan. The script is now obsolete, but was historically used to write the Saraiki language.

Osage is a Unicode block containing characters from the Osage alphabet, which was devised in 2006 for writing the Osage language spoken by the Osage people of Oklahoma, United States.

<span class="mw-page-title-main">Zanabazar square script</span> Abugida developed by the monk and scholar Zanabazar

Zanabazar's square script is a horizontal Mongolian square script, an abugida developed by the monk and scholar Zanabazar based on the Tibetan alphabet to write Mongolian. It can also be used to write Tibetan language and Sanskrit as a geometric typeface.

CJK Unified Ideographs Extension I is a Unicode block comprising CJK Unified Ideographs included in drafts of an amendment to China's GB 18030 standard circulated in 2022 and 2023, which were fast-tracked into Unicode in 2023.

References

  1. 1 2 3 "ConScript Unicode Registry". Evertype.com. Archived from the original on 2015-06-22. Retrieved 2015-06-20.
  2. "FAQ - Private-Use Characters, Noncharacters, and Sentinels". Unicode Consortium. Retrieved 7 August 2023. As an example, see the ConScript Unicode Registry—a private group unaffiliated with the Unicode Consortium—which has extensive tables listing private-use character definitions for various unencoded scripts.
  3. Scott, Michael Lee (2009). Programming Language Pragmatics (3rd ed.). Morgan Kaufmann. p. 295. ISBN   978-0-12-374514-9. It also sets aside a "private use area" for such artificial [constructed] languages as Klingon, Tengwar, and Cirth [Tolkien Elvish]. Allocation of this private space is coordinated by a volunteer organization known as the ConScript Unicode Registry.
  4. 1 2 Cowan, John; Everson, Michael (2023-04-27). "ConScript Unicode Registry". ConScript Unicode Registry. Evertype. Retrieved 2023-08-09. The Registry is the joint project of John Cowan and Michael Everson. John originated the Registry and did most of the work of collecting information and writing preliminary proposals; Michael has been reviewing and revising these proposals into final registrations.
  5. "Wayback Machine comparison (2002-01-13 vs 2002-06-15)". ConScript Unicode Registry.
  6. Cowan, John. "John Cowan". Chester County InterLink. Archived from the original on 2002-07-24.
  7. Cowan, John (2008-02-04). "Re: CSUR help". KreativeKorp.
  8. 1 2 3 Bettencourt, Rebecca G. "Under-ConScript Unicode Registry". Archived from the original on 2015-07-18. Retrieved 2015-07-21.
  9. jan Ke Tami (2022-09-01). "nasin Juniko ala la, nasin UCSUR li lon" (PDF). Lipu Tenpo (in Toki Pona) (moku): 2–3. ISSN   2752-4639.
  10. 1 2 "GNU Unifont". Unifoundry.com. Archived from the original on 2013-11-12. Retrieved 2015-07-24.
  11. 1 2 Everson, Michael (21 March 2008). "Roadmap to the ConScript Unicode Registry". Evertype. Archived from the original on 17 July 2021. Retrieved 30 July 2021.
  12. User:Vanisaac. "Braille Extended" (PDF). Retrieved 2024-01-14.
  13. Everson, Michael (1997-09-18). "N1642: Proposal to encode Cirth in Plane 1 of ISO/IEC 10646-2". Working Group Document, ISO/IEC JTC1/SC2/WG2 and UTC. Archived from the original on 2015-07-29. Retrieved 2015-07-20.
  14. 1 2 "Roadmap to the SMP". Unicode.org. 2015-06-03. Archived from the original on 2015-06-23. Retrieved 2015-06-20.
  15. Everson, Michael (1997-09-18). "N1643: Proposal to encode Klingon in Plane 1 of ISO/IEC 10646-2". Working Group Document, ISO/IEC JTC1/SC2/WG2 and UTC. Archived from the original on 2015-07-29. Retrieved 2015-07-20.
  16. "N2370: Unicode Consortium Liaison Report". The Unicode Consortium. 2001-10-10. Archived from the original on 2015-07-31. Retrieved 2015-07-20.
  17. "Archive of Notices of Non-Approval". The Unicode Consortium. Archived from the original on 2015-07-29. Retrieved 2015-07-20.
  18. Everson, Michael (1997-09-18). "N1641: Proposal to encode Tengwar in Plane 1 of ISO/IEC 10646-2". Working Group Document, ISO/IEC JTC1/SC2/WG2 and UTC. Archived from the original on 2015-07-29. Retrieved 2015-07-20.
  19. "Unifon Extended: U+E6F0 - U+E73F". KreativeKorp. Retrieved 2024-01-14.
  20. Bettencourt, Rebecca. "Constructium". Kreative Korporation. Archived from the original on 13 July 2011. Retrieved 28 June 2017.
  21. Bettencourt, Rebecca. "Fairfax". Kreative Korporation. Archived from the original on 23 October 2021. Retrieved 5 November 2021.
  22. "Horta is a remarkably intelligent and sensitive typeface". Christ Trek Fonts. Archived from the original on 2019-10-31. Retrieved 2019-10-31.
  23. "Kurinto Font Folio". Archived from the original on 2020-07-26. Retrieved 2020-07-27.
  24. Niwatori, Shiroi. "にしき的フォント" (in Japanese). Retrieved 2024-01-14.