NKo (Unicode block)

NKo
Range: U+07C0..U+07FF (64 code points)
Plane: BMP
Scripts: Nko
Major alphabets: Manden
Assigned: 62 code points
Unused: 2 reserved code points
Unicode version history:
5.0 (2006): 59 (+59)
11.0 (2018): 62 (+3)
Chart: Code chart
Note: [1] [2]

NKo is a Unicode block containing characters for the Manding languages of West Africa, including Bamanan, Jula, Maninka, Mandinka, and a common literary language, Kangbe, also called N'Ko.

N'Ko became part of Unicode with version 5.0 in July 2006. With Unicode 11.0 in June 2018, three additional characters were added: a combining mark for abbreviated units of measure and two currency symbols.
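The block range and counts quoted above (U+07C0..U+07FF, 64 code points, 59 assigned in version 5.0 plus 3 in version 11.0) can be checked with a short sketch. This is only an illustration: the names `NKO_FIRST`, `NKO_LAST`, `in_nko_block` and `assigned_in_block` are hypothetical helpers, and the assigned count it reports depends on the Unicode version bundled with the local Python build, which may differ from the figures in this article.

```python
import unicodedata

NKO_FIRST = 0x07C0  # first code point of the NKo block
NKO_LAST = 0x07FF   # last code point of the NKo block


def in_nko_block(ch: str) -> bool:
    """Return True if the single character ch lies in U+07C0..U+07FF."""
    return NKO_FIRST <= ord(ch) <= NKO_LAST


def assigned_in_block() -> list:
    """Code points in the block that the local Unicode database treats as assigned."""
    return [
        cp
        for cp in range(NKO_FIRST, NKO_LAST + 1)
        if unicodedata.category(chr(cp)) != "Cn"  # Cn marks an unassigned code point
    ]


# Size of the block follows directly from its two endpoints.
block_size = NKO_LAST - NKO_FIRST + 1

# Report the bundled Unicode version next to the count it yields.
print(block_size, unicodedata.unidata_version, len(assigned_in_block()))
```

A range test is enough for block membership because Unicode blocks are contiguous; whether a given code point is assigned is a separate question, which is why the second helper consults the character database instead.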

NKo [1] [2]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+07Cx߀߁߂߃߄߅߆߇߈߉ߊߋߌߍߎߏ
U+07Dxߐߑߒߓߔߕߖߗߘߙߚߛߜߝߞߟ
U+07Exߠߡߢߣߤߥߦߧߨߩߪ߫߬߭߮߯
U+07Fx߲߰߱߳ߴߵ߶߷߸߹ߺ߽߾߿
Notes
1. As of Unicode version 15.0
2. Grey areas indicate non-assigned code points
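The chart is read by combining a row label such as U+07Cx with a column digit from 0 to F. A minimal sketch of that arithmetic follows; the function names `chart_cell` and `code_point_at` are hypothetical and chosen only for this illustration.

```python
def chart_cell(cp: int) -> tuple:
    """Map a code point to its (row label, column digit) in a 16-column chart."""
    row = f"U+{cp >> 4:03X}x"  # drop the last hex digit to get the row label
    col = f"{cp & 0xF:X}"      # the last hex digit selects the column
    return row, col


def code_point_at(row_label: str, col: str) -> int:
    """Inverse mapping: rebuild the code point from a row label and column digit."""
    base = int(row_label[2:5], 16) << 4  # "U+07Cx" -> 0x07C0
    return base + int(col, 16)


row, col = chart_cell(0x07CA)
print(row, col, hex(code_point_at(row, col)))
```

The three-digit row format is an assumption that holds for this block; charts for code points above U+FFFF use longer row labels.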

History

The following Unicode-related documents record the purpose and process of defining specific characters in the NKo block:

| Version | Final code points [a] | Count | L2 ID | WG2 ID | Document |
|---|---|---|---|---|---|
| 5.0 | U+07C0..07FA | 59 | L2/03-265 | | Anderson, Deborah (2003-08-17), Letters in support of encoding N'Ko |
| | | | L2/03-414 | | Three letters in support of N'ko encoding, 2003-11-01 |
| | | | L2/04-172 | N2765 | Everson, Michael; Doumbouya, Mamady; Diané, Baba Mamadi; Jammeh, Karamo Kaba (2004-06-08), Proposal to add the N'Ko script to the BMP of the UCS |
| | | | L2/04-283 | N2833 | Everson, Michael (2004-06-23), Revisions to the N'Ko script for the PDAM code chart |
| | | | L2/05-006 | N2898 | Doumbouya, Mamady (2005-01-11), Re: N'Ko Proposal in Amendment 2 |
| | | | L2/05-010 | | Yergeau, François; Andries, Patrick (2005-01-20), Re: N'Ko Proposal in Amendment 2 |
| | | | L2/05-017 | N2914 | Doumbouya, Mamady (2005-01-21), Documents showing old and new N'Ko letters |
| | | | | N2932 (htm, doc) | Doumbouya, Mamady (2005-03-12), Encoding of N'Ko in ISO/IEC 10646 |
| | | | L2/05-026 | | Moore, Lisa (2005-05-16), "Consensus 102-C1", UTC #102 Minutes, Use the block and long script name "NKo" (without apostrophe) as the name for N'Ko. |
| | | | L2/05-169 | N2949 | Yergeau, François; Andries, Patrick (2005-07-12), For a Correct Encoding of N'ko |
| | | | L2/05-248 | N2982 | Yergeau, François; Andries, Patrick (2005-09-04), Comments on 2977 (Alleged parallel between 3 N'Ko Glyphs & Latin Long S) |
| | | | L2/05-270 | | Whistler, Ken (2005-09-21), "E. N'Ko Name Changes", WG2 Consent Docket (Sophia Antipolis) |
| | | | L2/05-279 | | Moore, Lisa (2005-11-10), "Consensus 105-C29", UTC #105 Minutes |
| | | | | N2953 (pdf, doc) | Umamaheswaran, V. S. (2006-02-16), "7.2.2", Unconfirmed minutes of WG 2 meeting 47, Sophia Antipolis, France; 2005-09-12/15 |
| | | | L2/05-237 | N2977 | Everson, Michael; Doumbouya, Mamady; Diané, Baba Mamadi; Jammeh, Karamo Kaba (2005-08-26), Clarification on the identity and use of three N'Ko letters |
| 11.0 | U+07FD..07FF | 3 | L2/15-338 | N4706 | Everson, Michael (2015-12-19), Proposal to encode four N'Ko characters in the BMP of the UCS |
| | | | L2/16-037 | | Anderson, Deborah; Whistler, Ken; McGowan, Rick; Pournader, Roozbeh; Glass, Andrew; Iancu, Laurențiu (2016-01-22), "11", Recommendations to UTC #146 January 2016 on Script Proposals |
| | | | L2/16-004 | | Moore, Lisa (2016-02-01), "C.8", UTC #146 Minutes |

  a. Proposed code points and character names may differ from final code points and names

References

  1. "Unicode character database". The Unicode Standard. Retrieved 2016-07-09.
  2. "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2016-07-09.