Adlam (Unicode block)

Adlam
Range: U+1E900..U+1E95F (96 code points)
Plane: SMP
Scripts: Adlam
Assigned: 88 code points
Unused: 8 reserved code points
Unicode version history:
9.0 (2016): 87 (+87)
12.0 (2019): 88 (+1)
Unicode documentation: Code chart ∣ Web page
Note: [1] [2]

Adlam is a Unicode block containing characters from the Adlam script, an alphabetic script devised during the late 1980s for writing the Fula language in Guinea, Nigeria, Liberia, and other nearby countries.
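
Because the block's range, U+1E900..U+1E95F, lies in the Supplementary Multilingual Plane (above U+FFFF), each Adlam character takes four bytes in UTF-8 and a surrogate pair in UTF-16. The following is a minimal sketch using only Python built-ins; the variable names are illustrative and not taken from any Unicode documentation.

```python
# First code point of the Adlam block (U+1E900).
first = chr(0x1E900)

# Encode it in UTF-8 and in big-endian UTF-16 (no byte order mark).
utf8 = first.encode("utf-8")
utf16 = first.encode("utf-16-be")

# Split the four UTF-16 bytes into two 16-bit code units (a surrogate pair).
high = int.from_bytes(utf16[:2], "big")
low = int.from_bytes(utf16[2:], "big")

print(len(utf8), utf8.hex())   # 4 f09ea480
print(hex(high), hex(low))     # 0xd83a 0xdd00
```

Software that assumes one 16-bit unit per character will split these pairs, which is a common source of broken rendering for scripts outside the Basic Multilingual Plane.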

History

In June 2016, Adlam was added to the Unicode Standard with the release of version 9.0. [3]

In October 2017, Google released a Noto font that supports the block, Noto Sans Adlam, although it did not handle prenasalized consonants properly. [4]

On 3 October 2018, Microsoft released an updated Ebrima font with support for the Adlam alphabet to Windows Insiders as part of the Windows 10 version 1903 feature update, starting with build 18252. [5]

Characters

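Adlam [1] [2]
Official Unicode Consortium code chart (PDF)
          0 1 2 3 4 5 6 7 8 9 A B C D E F
U+1E90x   𞤀 𞤁 𞤂 𞤃 𞤄 𞤅 𞤆 𞤇 𞤈 𞤉 𞤊 𞤋 𞤌 𞤍 𞤎 𞤏
U+1E91x   𞤐 𞤑 𞤒 𞤓 𞤔 𞤕 𞤖 𞤗 𞤘 𞤙 𞤚 𞤛 𞤜 𞤝 𞤞 𞤟
U+1E92x   𞤠 𞤡 𞤢 𞤣 𞤤 𞤥 𞤦 𞤧 𞤨 𞤩 𞤪 𞤫 𞤬 𞤭 𞤮 𞤯
U+1E93x   𞤰 𞤱 𞤲 𞤳 𞤴 𞤵 𞤶 𞤷 𞤸 𞤹 𞤺 𞤻 𞤼 𞤽 𞤾 𞤿
U+1E94x   𞥀 𞥁 𞥂 𞥃 ◌𞥄 ◌𞥅 ◌𞥆 ◌𞥇 ◌𞥈 ◌𞥉 ◌𞥊 𞥋 · · · ·
U+1E95x   𞥐 𞥑 𞥒 𞥓 𞥔 𞥕 𞥖 𞥗 𞥘 𞥙 · · · · 𞥞 𞥟
Notes
1. As of Unicode version 15.0
2. A middle dot (·) marks an unassigned code point; combining marks are shown on a dotted circle (◌)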
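
The range and counts given for the block can be checked with a short sketch. The assigned sub-ranges below are taken from the code points listed for versions 9.0 and 12.0 in this article; the names `BLOCK`, `ASSIGNED`, `in_adlam_block`, and `is_assigned` are illustrative helpers, not part of any library.

```python
# The Adlam block: U+1E900..U+1E95F (inclusive).
BLOCK = range(0x1E900, 0x1E95F + 1)

# Assigned sub-ranges as of Unicode 12.0 (inclusive bounds).
ASSIGNED = [(0x1E900, 0x1E94B), (0x1E950, 0x1E959), (0x1E95E, 0x1E95F)]


def in_adlam_block(ch: str) -> bool:
    """Return True if the single character ch lies in the Adlam block."""
    return ord(ch) in BLOCK


def is_assigned(cp: int) -> bool:
    """Return True if code point cp falls in one of the assigned sub-ranges."""
    return any(lo <= cp <= hi for lo, hi in ASSIGNED)


assigned = sum(is_assigned(cp) for cp in BLOCK)
unassigned = [f"U+{cp:04X}" for cp in BLOCK if not is_assigned(cp)]

print(len(BLOCK), assigned, len(unassigned))  # 96 88 8
print(unassigned)
```

Hard-coding the ranges keeps the result independent of the Unicode version bundled with the interpreter; a lookup through `unicodedata` would instead reflect whatever version that Python build ships.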

History

The following Unicode-related documents record the purpose and process of defining specific characters in the Adlam block:

Version | Final code points [a] | Count | L2 ID | WG2 ID | Document
9.0 | U+1E900..1E94A, 1E950..1E959, 1E95E..1E95F | 87 | L2/13-191 | N4488 | Everson, Michael (2013-10-28), Preliminary proposal to encode the Adlam script in the SMP of the UCS
 | | | L2/13-210 | | Anderson, Deborah; Whistler, Ken; McGowan, Rick; Pournader, Roozbeh (2013-10-31), "2", Recommendations to UTC #137 November 2013 on Script Proposals
 | | | L2/14-219R | N4628 | Everson, Michael (2014-09-23), Proposal for encoding the Adlam script in the SMP of the UCS
 | | | L2/14-268R | | Anderson, Deborah; Whistler, Ken; McGowan, Rick; Pournader, Roozbeh; Iancu, Laurențiu; Glass, Andrew; Constable, Peter; Suignard, Michel (2014-10-27), "9. Adlam", Recommendations to UTC #141 October 2014 on Script Proposals
 | | | L2/14-250 | | Moore, Lisa (2014-11-10), "Consensus 141-C20", UTC #141 Minutes, Accept 87 Adlam characters at U+1E900..U+1E95F in block Adlam U+1E900..U+1E95F, with properties as given in L2/14-219R for encoding in a future version of the standard.
 | | | L2/16-052 | N4603 (pdf, doc) | Umamaheswaran, V. S. (2015-09-01), "10.2.6", Unconfirmed minutes of WG 2 meeting 63
 | | | L2/19-070 | | Patel, Neil; Jamra, Mark (2019-01-19), Replacement of Adlam Reference Font in Codesheet to Updated Design
 | | | L2/19-119R | N5076 | Patel, Neil; Jamra, Mark; Cornelius, Craig; Barry, Ibrahima; Barry, Abdoulaye (2019-04-19), Replacement of Adlam Reference Font in Codesheet to Updated Design
 | | | L2/19-173 | | Anderson, Deborah; et al. (2019-04-29), "5. Adlam", Recommendations to UTC #159 April-May 2019 on Script Proposals
 | | | L2/19-122 | | Moore, Lisa (2019-05-08), "C.14", UTC #159 Minutes
12.0 | U+1E94B | 1 | L2/18-282 | | Barry, Abdoulaye; Barry, Ibrahima; Constable, Peter; Glass, Andrew (2018-08-26), Proposal to encode ADLAM LETTER APOSTROPHE for ADLaM script
 | | | L2/18-274 | | McGowan, Rick (2018-09-14), "Feedback on an Encoding Proposal L2/18-282", Comments on Public Review Issues (July 24 - Sept 14, 2018)
 | | | L2/18-300 | | Anderson, Deborah; et al. (2018-09-14), "3. Adlam", Recommendations to UTC #157 on Script Proposals
 | | | L2/18-282R | N5022 | Barry, Abdoulaye; Barry, Ibrahima; Constable, Peter; Glass, Andrew (2018-11-17), Proposal to encode ADLAM NASALIZATION MARK for ADLaM script
 | | | L2/18-272 | | Moore, Lisa (2018-10-29), "C.7", UTC #157 Minutes
  a. Proposed code points and character names may differ from final code points and names

References

  1. "Unicode character database". The Unicode Standard. Retrieved 2023-07-26.
  2. "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2023-07-26.
  3. Waddell, Kaveh (2016-11-16). "The Alphabet That Will Save a People From Disappearing". The Atlantic.
  4. "Updates". Google Noto Fonts.
  5. "Announcing Windows 10 Insider Preview Build 18252". Microsoft. 2018-10-03.