Devanagari Extended | |
---|---|
Range | U+A8E0..U+A8FF (32 code points) |
Plane | BMP |
Scripts | Devanagari |
Symbol sets | Samaveda cantillation marks nasalization marks |
Assigned | 32 code points |
Unused | 0 reserved code points |
Unicode version history | |
5.2 (2009) | 28 (+28) |
8.0 (2015) | 30 (+2) |
11.0 (2018) | 32 (+2) |
Unicode documentation | |
Code chart ∣ Web page | |
Note: [1] [2] |
Devanagari Extended is a Unicode block containing cantilation marks for writing the Samaveda, and nasalization marks for the Devanagari script.
Devanagari Extended [1] Official Unicode Consortium code chart (PDF) | ||||||||||||||||
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
U+A8Ex | ꣠ | ꣡ | ꣢ | ꣣ | ꣤ | ꣥ | ꣦ | ꣧ | ꣨ | ꣩ | ꣪ | ꣫ | ꣬ | ꣭ | ꣮ | ꣯ |
U+A8Fx | ꣰ | ꣱ | ꣲ | ꣳ | ꣴ | ꣵ | ꣶ | ꣷ | ꣸ | ꣹ | ꣺ | ꣻ | ꣼ | ꣽ | ꣾ | ꣿ |
Notes
|
The following Unicode-related documents record the purpose and process of defining specific characters in the Devanagari Extended block:
Version | Final code points [lower-alpha 1] | Count | L2 ID | WG2 ID | Document |
---|---|---|---|---|---|
5.2 | U+A8E0..A8FB | 28 | L2/00-144 | Everson, Michael (2000-04-22), Vedic Accents (untitled, undated early draft) | |
L2/00-178 | Everson, Michael (2000-04-22), Encoding Vedic Accents | ||||
L2/00-155 | Moore, Lisa (2000-05-31), Comments on Encoding Vedic Accents Proposal | ||||
L2/03-066 | New Proposal for Vedic Characters and Symbols, 2003-02-26 | ||||
L2/03-067 | Joshi, R. K. (2003-02-27), Vedic Code Set; a draft | ||||
L2/03-102 | Vikas, Om (2003-03-04), Unicode Standard for Indic Scripts | ||||
L2/03-101.2 | Proposed Changes in Indic Scripts [Devanagari document], 2003-03-04 | ||||
L2/04-398 | Proposal to encode Vedic accents, etc., 2004-11-12 | ||||
L2/05-063 | Vikas, Om (2005-02-07), "Awaiting Updates-Devanagari", Issues in Representation of Indic Scripts in Unicode | ||||
L2/05-070 | McGowan, Rick (2005-02-09), Indic ad hoc report | ||||
L2/05-026 | Moore, Lisa (2005-05-16), "Scripts - Indic (C.12)", UTC #102 Minutes | ||||
L2/05-228 | Muller, Eric (2005-08-11), Analysis of a TDIL proposal for Vedic | ||||
L2/06-185 | Proposal for Encoding of Vedic Characters & Symbols in Unicode, 2006-05-10 | ||||
L2/06-384 | Lata, Swaran (2006-10-25), Letter from T. N. Dharmadhikari in support of Vedic repertoire | ||||
L2/07-060 | Scharf, Peter (2007-02-02), Vedic Unicode Workshop Report | ||||
L2/07-095R | N3235R | Everson, Michael; Scharf, Peter; Angot, Michel; Chandrashekar, R.; Hyman, Malcolm; Rosenfield, Susan; Sastry, B. V. Venkatakrishna; Witzel, Michael (2007-04-13), Proposal to encode characters for Vedic Sanskrit in the BMP of the UCS | |||
L2/07-230 | N3290 | Everson, Michael; Scharf, Peter; Angot, Michel; Chandrashekar, R.; Hyman, Malcolm; Rosenfield, Susan; Sastry, B. V. Venkatakrishna; Witzel, Michael (2007-07-26), Revised proposal to encode characters for Vedic Sanskrit in the BMP of the UCS | |||
L2/07-254 | Bhushan, E. K. Bharat (2007-08-01), Unicode for Vedic Sanskrit (letter to Mark Davis) | ||||
L2/07-262 | Scharf, Peter (2007-08-07), Outline of the development of WG2/n3290 = L2/07-230 | ||||
L2/07-271 | Scharf, Peter (2007-08-08), Comparison of proposed characters in Lata 2006 (L2/06-185) with Scharf and Everson WG2/n3290 (L2/07-230) | ||||
L2/07-272 | Muller, Eric (2007-08-10), "11", Report of the South Asia subcommittee | ||||
L2/07-396 | Joshi, R. K.; Irani, Alka (2007-10-10), Proposal for Encoding of Vaidika Character and Symbols in Unicode | ||||
L2/07-386 | Joshi, R. K. (2007-10-17), Comments on Comparison of proposed characters in Lata 2006 (L2/06-185) with Scharf and Everson WG2/n3290 (L2/07-230) | ||||
L2/07-388 | Joshi, R. K. (2007-10-17), Following observations have been made with reference to the document No. L2/07-095 dated: 2007-04-13 | ||||
L2/07-343 | N3366 | Everson, Michael; Scharf, Peter; Angot, Michel; Chandrashekar, R.; Hyman, Malcolm; Rosenfield, Susan; Sastry, B. V. Venkatakrishna; Witzel, Michael (2007-10-18), Proposal to encode 55 characters for Vedic Sanskrit in the BMP of the UCS | |||
L2/07-394 | Scharf, Peter (2007-10-18), Significant differences between L2/07-230 and L2/07-343 | ||||
L2/07-395 | Joshi, R. K.; et al. (2007-10-18), Request for feedback on draft proposal for encoding Vaidika characters and symbols in Unicode | ||||
L2/07-397 | Joshi, R. K.; et al. (2007-10-18), Vaidika Vowels and Consonants | ||||
L2/07-400 | Scharf, Peter (2007-10-18), Comments on R. K. Joshi's documents L2/07-386 and L2/07-388 | ||||
L2/07-401 | Scharf, Peter (2007-10-18), Equivalences between L2/07-396 and L2/07-343 | ||||
L2/07-345 | Moore, Lisa (2007-10-25), "Consensus 113-C19", UTC #113 Minutes | ||||
L2/08-035 | Scharf, Peter; Rosenfield, Susan J. (2008-01-22), Vedic Revisions 2008 Jan 14, Revisions to N3366 = L2/07-343 | ||||
L2/08-042 | Joshi, R. K. (2008-01-23), Proposal for Encoding of Vaidika Characters & Symbols in Unicode | ||||
L2/08-043 | Joshi, R. K. (2008-01-28), Vaidika Extensions A & B (Vedic evidence to accompany L2/08-042) | ||||
L2/08-092 | N3385 | Everson, Michael; Scharf, Peter (2008-01-31), Comparison between two Vedic proposals of January 2008 | |||
L2/08-097 | Amendments to L2/08-042, L2/08-043, 2008-01-31 | ||||
L2/08-050R | N3383R | Everson, Michael; Scharf, Peter (2008-03-06), Summary proposal to encode characters for Vedic in the BMP of the UCS | |||
L2/08-096 | Joshi, R. K. (2008-02-02), Comparison of L2/08-042 and L2/08-050 | ||||
L2/08-110 | Muller, Eric (2008-02-08), South Asia Subcommittee Report | ||||
L2/08-003 | Moore, Lisa (2008-02-14), "Vedic", UTC #114 Minutes | ||||
L2/08-176 | N3456R | Anderson, Deborah (2008-04-18), Summary of Vedic Characters based on N3385, N3383R, and the Unicode Pipeline | |||
L2/08-196 | Proposal for Encoding of Vaidika Sanskrit Characters & Symbols in the BMP of UCS, 2008-05-05 | ||||
L2/08-216 | Scharf, Peter (2008-05-08), Comments on L2/08-196 regarding the encoding of Sanskrit and Vedic | ||||
L2/08-294 | Scharf, Peter (2008-08-06), Placement of characters in Vedic, Devanagari, and Devanagari Extended blocks | ||||
L2/08-317 | Muller, Eric (2008-08-11), "1.2, 1.3", South Asia Subcommittee Report | ||||
L2/08-318 | N3453 (pdf, doc) | Umamaheswaran, V. S. (2008-08-13), "M52.18", Unconfirmed minutes of WG 2 meeting 52 | |||
L2/08-253R2 | Moore, Lisa (2008-08-19), "Vedic (B.15.2, E.1)", UTC #116 Minutes | ||||
L2/08-273R3 | N3488R3 | Everson, Michael; Scharf, Peter (2008-08-21), Proposal to encode two characters for Vedic in the UCS | |||
L2/08-412 | N3553 (pdf, doc) | Umamaheswaran, V. S. (2008-11-05), "M53.03", Unconfirmed minutes of WG 2 meeting 53 | |||
L2/09-067 | Lata, Swaran (2009-01-28), Encoding of Vaidika Sanskrit Characters & Symbols in the BMP of UCS | ||||
8.0 | U+A8FC | 1 | L2/12-147 | Anderson, Deborah; McGowan, Rick; Whistler, Ken (2012-04-25), "VIII. DEVANAGARI", Review of Indic-related L2 documents and Recommendations to the UTC | |
L2/12-123R | N4260 | Pandey, Anshuman (2012-05-03), Proposal to Encode the Sign SIDDHAM for Devanagari | |||
L2/12-343R2 | Moore, Lisa (2012-12-04), "Consensus 133-C13", UTC #133 Minutes | ||||
N4353 (pdf, doc) | "M60.17", Unconfirmed minutes of WG 2 meeting 60, 2013-05-23 | ||||
U+A8FD | 1 | L2/13-056 | N4408 | Pandey, Anshuman (2013-04-25), Proposal to Encode the Sign JAIN OM for Devanagari in ISO/IEC 10646 | |
L2/13-086 | Anderson, Deborah; McGowan, Rick; Whistler, Ken; Pournader, Roozbeh (2013-04-26), "8", Recommendations to UTC on Script Proposals | ||||
L2/13-058 | Moore, Lisa (2013-06-12), "D.2", UTC #135 Minutes | ||||
N4403 (pdf, doc) | Umamaheswaran, V. S. (2014-01-28), "10.3.9 Jain Om for Devanagar", Unconfirmed minutes of WG 2 meeting 61, Holiday Inn, Vilnius, Lithuania; 2013-06-10/14 | ||||
11.0 | U+A8FE..A8FF | 2 | L2/15-335 | Pandey, Anshuman (2015-12-07), Proposal to encode the Devanagari letter and vowel sign AY | |
L2/16-037 | Anderson, Deborah; Whistler, Ken; McGowan, Rick; Pournader, Roozbeh; Glass, Andrew; Iancu, Laurențiu (2016-01-22), "2. Devanagari", Recommendations to UTC #146 January 2016 on Script Proposals | ||||
L2/16-004 | Moore, Lisa (2016-02-01), "D.7", UTC #146 Minutes | ||||
|
Geometric Shapes is a Unicode block of 96 symbols at code point range U+25A0–25FF.
Latin Extended-C is a Unicode block containing Latin characters for Uighur New Script, the Uralic Phonetic Alphabet, Shona, Claudian Latin and the Swedish Dialect Alphabet.
Specials is a short Unicode block of characters allocated at the very end of the Basic Multilingual Plane, at U+FFF0–FFFF. Of these 16 code points, five have been assigned since Unicode 3.0:
Enclosed Alphanumerics is a Unicode block of typographical symbols of an alphanumeric within a circle, a bracket or other not-closed enclosure, or ending in a full stop.
Latin Extended Additional is a Unicode block.
Greek and Coptic is the Unicode block for representing modern (monotonic) Greek. It was originally also used for writing Coptic, using the similar Greek letters in addition to the uniquely Coptic additions. Beginning with version 4.1 of the Unicode Standard, a separate Coptic block has been included in Unicode, allowing for mixed Greek/Coptic text that is stylistically contrastive, as is convention in scholarly works. Writing polytonic Greek requires the use of combining characters or the precomposed vowel + tone characters in the Greek Extended character block.
Greek Extended is a Unicode block containing the accented vowels necessary for writing polytonic Greek. The regular, unaccented Greek characters as well as the characters with tonos and diaeresis can be found in the Greek and Coptic block. Greek Extended was encoded in version 1.1 of the Unicode Standard. As an alternative to Greek Extended, combining characters can be used to represent the tones and breath marks of polytonic Greek.
Devanagari is a Unicode block containing characters for writing languages such as Hindi, Marathi, Bodo, Maithili, Sindhi, Nepali, and Sanskrit, among others. In its original incarnation, the code points U+0900..U+0954 were a direct copy of the characters A0-F4 from the 1988 ISCII standard. The Bengali, Gurmukhi, Gujarati, Oriya, Tamil, Telugu, Kannada, and Malayalam blocks were similarly all based on their ISCII encodings.
Bengali Unicode block contains characters for the Bengali, Assamese, Bishnupriya Manipuri, Daphla, Garo, Hallam, Khasi, Mizo, Munda, Naga, Riang, and Santali languages. In its original incarnation, the code points U+0981..U+09CD were a direct copy of the Bengali characters A1-ED from the 1988 ISCII standard, as well as several Assamese ISCII characters in the U+09F0 column. The Devanagari, Gurmukhi, Gujarati, Oriya, Tamil, Telugu, Kannada, and Malayalam blocks were similarly all based on ISCII encodings.
Gurmukhi is a Unicode block containing characters for the Punjabi language, in the Gurmukhi script. In its original incarnation, the code points U+0A02..U+0A4C were a direct copy of the Gurmukhi characters A2-EC from the 1988 ISCII standard. The Devanagari, Bengali, Gujarati, Oriya, Tamil, Telugu, Kannada, and Malayalam blocks were similarly all based on their ISCII encodings.
Gujarati is a Unicode block containing characters for writing the Gujarati language. In its original incarnation, the code points U+0A81..U+0AD0 were a direct copy of the Gujarati characters A1-F0 from the 1988 ISCII standard. The Devanagari, Bengali, Gurmukhi, Oriya, Tamil, Telugu, Kannada, and Malayalam blocks were similarly all based on their ISCII encodings.
Oriya is a Unicode block containing characters for the Odia, Khondi and Santali languages of the state of Odisha in India. In its original incarnation, the code points U+0B01..U+0B4D were a direct copy of the Odia characters A1-ED from the 1988 ISCII standard. The Devanagari, Bengali, Gurmukhi, Gujarati, Tamil, Telugu, Kannada, and Malayalam blocks were similarly all based on their ISCII encodings.
Tamil is a Unicode block containing characters for the Tamil, and Saurashtra languages of Tamil Nadu India, Sri Lanka, Singapore, and Malaysia. In its original incarnation, the code points U+0B82..U+0BCD were a direct copy of the Tamil characters A2-ED from the 1988 ISCII standard. The Devanagari, Bengali, Gurmukhi, Gujarati, Oriya, Telugu, Kannada, and Malayalam blocks were similarly all based on their ISCII encodings.
Telugu is a Unicode block containing characters for the Telugu, Gondi, and Lambadi languages of Indian states of Andhra Pradesh and Telangana. In its original incarnation, the code points U+0C01..U+0C4D were a direct copy of the Telugu characters A1-ED from the 1988 ISCII standard. The Devanagari, Bengali, Gurmukhi, Gujarati, Oriya, Tamil, Kannada, and Malayalam blocks were similarly all based on their ISCII encodings.
Kannada is a Unicode block containing characters for the Kannada, Sanskrit, Konkani, Sankethi, Havyaka, Tulu and Kodava languages. In its original incarnation, the code points U+0C82..U+0CCD were a direct copy of the Kannada characters A2-ED from the 1988 ISCII standard. The Devanagari, Bengali, Gurmukhi, Gujarati, Oriya, Tamil, Telugu, and Malayalam blocks were similarly all based on their ISCII encodings.
Malayalam is a Unicode block containing characters of the Malayalam script. In its original incarnation, the code points U+0D02..U+0D4D were a direct copy of the Malayalam characters A2-ED from the 1988 ISCII standard. The Devanagari, Bengali, Gurmukhi, Gujarati, Oriya, Tamil, Telugu, and Kannada blocks were similarly all based on their ISCII encodings.
Tibetan is a Unicode block containing characters for the Tibetan, Dzongkha, and other languages of China, Bhutan, Nepal, Mongolia, northern India, eastern Pakistan and Russia.
Vedic Extensions is a Unicode block containing characters for representing tones and other vedic symbols in Devanagari and other Indic scripts. Related symbols are defined in two other blocks: Devanagari (U+0900–U+097F) and Devanagari Extended (U+A8E0–U+A8FF).
Combining Diacritical Marks Extended is a Unicode block containing diacritical marks used in German dialectology (Teuthonista).
Devanagari Extended-A is a Unicode block containing characters for auspicious signs from Indian inscriptions and manuscripts from the 11th century onward.