Dives Akuru (Unicode block)

Dives Akuru
Range: U+11900..U+1195F (96 code points)
Plane: SMP
Scripts: Dives Akuru
Assigned: 72 code points
Unused: 24 reserved code points
Unicode version history: 13.0 (2020), 72 (+72)
Note: [1] [2]

Dives Akuru is a Unicode block containing characters from the Dhives Akuru script, which was used to write the Maldivian language until the 20th century.
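As a minimal sketch of how the block's range can be used in practice, the following Python snippet tests whether a single character falls inside U+11900..U+1195F and derives the block size from the two endpoints. The names `BLOCK_START`, `BLOCK_END`, and `in_dives_akuru_block` are hypothetical and chosen for illustration only. Membership in the block does not imply that a code point is assigned, since 24 code points in the range are reserved.

```python
# Endpoints of the Dives Akuru block, as given in the block summary.
BLOCK_START = 0x11900
BLOCK_END = 0x1195F


def in_dives_akuru_block(ch: str) -> bool:
    """Return True if the single character ch lies in the block range.

    This checks the range only; reserved (unassigned) code points
    inside the block also return True.
    """
    return BLOCK_START <= ord(ch) <= BLOCK_END


# Both endpoints are inclusive, hence the +1.
block_size = BLOCK_END - BLOCK_START + 1

print(block_size)
print(in_dives_akuru_block(chr(0x11900)))
print(in_dives_akuru_block("A"))
```

Here `ord` expects a string of exactly one code point, so longer strings would need to be checked character by character.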

Block

Dives Akuru [1] [2]
Official Unicode Consortium code chart (PDF)
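         0 1 2 3 4 5 6 7 8 9 A B C D E F
U+1190x  𑤀 𑤁 𑤂 𑤃 𑤄 𑤅 𑤆 - - 𑤉 - - 𑤌 𑤍 𑤎 𑤏
U+1191x  𑤐 𑤑 𑤒 𑤓 - 𑤕 𑤖 - 𑤘 𑤙 𑤚 𑤛 𑤜 𑤝 𑤞 𑤟
U+1192x  𑤠 𑤡 𑤢 𑤣 𑤤 𑤥 𑤦 𑤧 𑤨 𑤩 𑤪 𑤫 𑤬 𑤭 𑤮 𑤯
U+1193x  𑤰 𑤱 𑤲 𑤳 𑤴 𑤵 - 𑤷 𑤸 - - 𑤻 𑤼 𑤽 𑤾 𑤿
U+1194x  𑥀 𑥁 𑥂 𑥃 𑥄 𑥅 𑥆 - - - - - - - - -
U+1195x  𑥐 𑥑 𑥒 𑥓 𑥔 𑥕 𑥖 𑥗 𑥘 𑥙 - - - - - -
Notes
1. ^ As of Unicode version 14.0
2. ^ A hyphen (-) marks a non-assigned code point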

History

The following Unicode-related documents record the purpose and process of defining specific characters in the Dives Akuru block:

Version: 13.0
Final code points [a]: U+11900..11906, 11909, 1190C..11913, 11915..11916, 11918..11935, 11937..11938, 1193B..11946, 11950..11959
Count: 72

Documents (L2 ID, WG2 ID: document):
L2/09-191: Pandey, Anshuman (2009-05-02), Preliminary Proposal to Encode the Dhivehi Script in ISO/IEC 10646
L2/10-213, N3848: Pandey, Anshuman (2010-06-30), Preliminary Proposal to Encode Dhives Akuru in ISO/IEC 10646
L2/17-292: Pandey, Anshuman (2017-10-06), Proposal to encode Divehi in Unicode
L2/17-384: Anderson, Deborah; Whistler, Ken; Pournader, Roozbeh; Moore, Lisa; Liang, Hai (2017-10-22), "8. Divehi (Dhives Akuru)", Recommendations to UTC #153 October 2017 on Script Proposals
L2/17-417R: Pandey, Anshuman (2017-12-31), Proposal to encode Dives Akuru in Unicode
L2/18-016R, N4929: Pandey, Anshuman (2018-01-23), Proposal to encode Dives Akuru in Unicode
L2/18-039: Anderson, Deborah; Whistler, Ken; Pournader, Roozbeh; Moore, Lisa; Liang, Hai; Cook, Richard (2018-01-19), "10", Recommendations to UTC #154 January 2018 on Script Proposals
L2/18-007: Moore, Lisa (2018-03-19), "D.7", UTC #154 Minutes
N5020: Umamaheswaran, V. S. (2019-01-11), "10.2.1 Dives Akuru script", Unconfirmed minutes of WG 2 meeting 67

  a. Proposed code points and character names may differ from the final code points and names.
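
The final code point list for version 13.0 can be checked against the counts stated for the block. The sketch below, in Python, parses that list and sums the sizes of the ranges; the helper name `parse_ranges` and the variable names are hypothetical, and the range string is copied from the history table rather than read from the Unicode Character Database.

```python
# Final code points for Unicode 13.0, copied from the history table.
ASSIGNED_SPEC = (
    "11900..11906, 11909, 1190C..11913, 11915..11916, "
    "11918..11935, 11937..11938, 1193B..11946, 11950..11959"
)


def parse_ranges(spec: str) -> list[tuple[int, int]]:
    """Parse 'A..B, C, ...' hex ranges into inclusive (low, high) pairs."""
    ranges = []
    for part in spec.split(","):
        part = part.strip()
        if ".." in part:
            low, high = part.split("..")
        else:
            # A single code point is a range of length one.
            low = high = part
        ranges.append((int(low, 16), int(high, 16)))
    return ranges


ranges = parse_ranges(ASSIGNED_SPEC)

# Each range is inclusive at both ends, hence the +1.
assigned = sum(high - low + 1 for low, high in ranges)
block_size = 0x1195F - 0x11900 + 1
unused = block_size - assigned

print(assigned, unused)
```

The eight ranges contribute 7, 1, 8, 2, 30, 2, 12, and 10 code points respectively, which agrees with the 72 assigned and 24 reserved code points stated for the block.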

References

  1. "Unicode character database". The Unicode Standard. Retrieved 2020-03-11.
  2. "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2020-03-11.