Devanagari Extended

Range	U+A8E0..U+A8FF (32 code points)
Plane	BMP
Scripts	Devanagari
Symbol sets	Samaveda cantillation marks; nasalization marks
Assigned	32 code points
Unused	0 reserved code points
Unicode version history
5.2 (2009)	28 (+28)
8.0 (2015)	30 (+2)
11.0 (2018)	32 (+2)
Unicode documentation
	Code chart ∣ Web page
	Note: [1][2]

Last updated June 26, 2023

Devanagari Extended is a Unicode block containing cantillation marks for writing the Samaveda and nasalization marks for the Devanagari script.
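As a minimal sketch of how the block's range can be used in practice, the following Python snippet tests whether a character falls inside U+A8E0..U+A8FF. The names `BLOCK_START`, `BLOCK_END`, `BLOCK_SIZE`, and `in_devanagari_extended` are illustrative choices, not part of any Unicode library.

```python
# Block boundaries taken from the range stated for this block: U+A8E0..U+A8FF.
BLOCK_START = 0xA8E0
BLOCK_END = 0xA8FF  # inclusive


def in_devanagari_extended(ch: str) -> bool:
    """Return True if the single character ch lies in the Devanagari Extended block."""
    if len(ch) != 1:
        raise ValueError("expected a single character")
    return BLOCK_START <= ord(ch) <= BLOCK_END


# The inclusive range spans 32 code points, matching the count given for the block.
BLOCK_SIZE = BLOCK_END - BLOCK_START + 1
```

A range check like this only identifies block membership; it says nothing about whether a given font or rendering engine supports the character.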

Block

Devanagari Extended [1]
Official Unicode Consortium code chart (PDF)
	0	1	2	3	4	5	6	7	8	9	A	B	C	D	E	F
U+A8Ex	꣠	꣡	꣢	꣣	꣤	꣥	꣦	꣧	꣨	꣩	꣪	꣫	꣬	꣭	꣮	꣯
U+A8Fx	꣰	꣱	ꣲ	ꣳ	ꣴ	ꣵ	ꣶ	ꣷ	꣸	꣹	꣺	ꣻ	꣼	ꣽ	ꣾ	ꣿ
Notes
1. As of Unicode version 15.0
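The two-row, sixteen-column layout of the chart can be regenerated programmatically. The sketch below uses Python's standard `unicodedata` module to look up character names; the helper names `chart_rows` and `describe` are hypothetical, and the names returned depend on the Unicode version bundled with the local Python build, so the lookup falls back to a placeholder for characters that build does not know.

```python
import unicodedata

BLOCK_START, BLOCK_END = 0xA8E0, 0xA8FF  # inclusive range of the block


def chart_rows():
    """Yield (row_label, [16 characters]) for each 16-column row of the chart."""
    for row_start in range(BLOCK_START, BLOCK_END + 1, 16):
        # Drop the last hex digit to form a label such as "U+A8Ex".
        label = f"U+{row_start >> 4:03X}x"
        yield label, [chr(cp) for cp in range(row_start, row_start + 16)]


def describe(ch):
    """Return 'U+XXXX NAME', with a placeholder when the local Unicode
    database predates the character."""
    return f"U+{ord(ch):04X} {unicodedata.name(ch, '<no name in local database>')}"


if __name__ == "__main__":
    for label, chars in chart_rows():
        print(label, *(describe(c) for c in chars), sep="\n  ")
```

Running the script lists each row label followed by its sixteen code points and names, which can be compared against the official code chart.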

History

The following Unicode-related documents record the purpose and process of defining specific characters in the Devanagari Extended block:

Version	Final code points [a]	Count	L2 ID	WG2 ID	Document
5.2	U+A8E0..A8FB	28	L2/00-144		Everson, Michael (2000-04-22), Vedic Accents (untitled, undated early draft)
			L2/00-178		Everson, Michael (2000-04-22), Encoding Vedic Accents
			L2/00-155		Moore, Lisa (2000-05-31), Comments on Encoding Vedic Accents Proposal
			L2/03-066		New Proposal for Vedic Characters and Symbols, 2003-02-26
			L2/03-067		Joshi, R. K. (2003-02-27), Vedic Code Set; a draft
			L2/03-102		Vikas, Om (2003-03-04), Unicode Standard for Indic Scripts
			L2/03-101.2		Proposed Changes in Indic Scripts [Devanagari document], 2003-03-04
			L2/04-398		Proposal to encode Vedic accents, etc., 2004-11-12
			L2/05-063		Vikas, Om (2005-02-07), "Awaiting Updates-Devanagari", Issues in Representation of Indic Scripts in Unicode
			L2/05-070		McGowan, Rick (2005-02-09), Indic ad hoc report
			L2/05-026		Moore, Lisa (2005-05-16), "Scripts - Indic (C.12)", UTC #102 Minutes
			L2/05-228		Muller, Eric (2005-08-11), Analysis of a TDIL proposal for Vedic
			L2/06-185		Proposal for Encoding of Vedic Characters & Symbols in Unicode, 2006-05-10
			L2/06-384		Lata, Swaran (2006-10-25), Letter from T. N. Dharmadhikari in support of Vedic repertoire
			L2/07-060		Scharf, Peter (2007-02-02), Vedic Unicode Workshop Report
			L2/07-095R	N3235R	Everson, Michael; Scharf, Peter; Angot, Michel; Chandrashekar, R.; Hyman, Malcolm; Rosenfield, Susan; Sastry, B. V. Venkatakrishna; Witzel, Michael (2007-04-13), Proposal to encode characters for Vedic Sanskrit in the BMP of the UCS
			L2/07-230	N3290	Everson, Michael; Scharf, Peter; Angot, Michel; Chandrashekar, R.; Hyman, Malcolm; Rosenfield, Susan; Sastry, B. V. Venkatakrishna; Witzel, Michael (2007-07-26), Revised proposal to encode characters for Vedic Sanskrit in the BMP of the UCS
			L2/07-254		Bhushan, E. K. Bharat (2007-08-01), Unicode for Vedic Sanskrit (letter to Mark Davis)
			L2/07-262		Scharf, Peter (2007-08-07), Outline of the development of WG2/n3290 = L2/07-230
			L2/07-271		Scharf, Peter (2007-08-08), Comparison of proposed characters in Lata 2006 (L2/06-185) with Scharf and Everson WG2/n3290 (L2/07-230)
			L2/07-272		Muller, Eric (2007-08-10), "11", Report of the South Asia subcommittee
			L2/07-396		Joshi, R. K.; Irani, Alka (2007-10-10), Proposal for Encoding of Vaidika Character and Symbols in Unicode
			L2/07-386		Joshi, R. K. (2007-10-17), Comments on Comparison of proposed characters in Lata 2006 (L2/06-185) with Scharf and Everson WG2/n3290 (L2/07-230)
			L2/07-388		Joshi, R. K. (2007-10-17), Following observations have been made with reference to the document No. L2/07-095 dated: 2007-04-13
			L2/07-343	N3366	Everson, Michael; Scharf, Peter; Angot, Michel; Chandrashekar, R.; Hyman, Malcolm; Rosenfield, Susan; Sastry, B. V. Venkatakrishna; Witzel, Michael (2007-10-18), Proposal to encode 55 characters for Vedic Sanskrit in the BMP of the UCS
			L2/07-394		Scharf, Peter (2007-10-18), Significant differences between L2/07-230 and L2/07-343
			L2/07-395		Joshi, R. K.; et al. (2007-10-18), Request for feedback on draft proposal for encoding Vaidika characters and symbols in Unicode
			L2/07-397		Joshi, R. K.; et al. (2007-10-18), Vaidika Vowels and Consonants
			L2/07-400		Scharf, Peter (2007-10-18), Comments on R. K. Joshi's documents L2/07-386 and L2/07-388
			L2/07-401		Scharf, Peter (2007-10-18), Equivalences between L2/07-396 and L2/07-343
			L2/07-345		Moore, Lisa (2007-10-25), "Consensus 113-C19", UTC #113 Minutes
			L2/08-035		Scharf, Peter; Rosenfield, Susan J. (2008-01-22), Vedic Revisions 2008 Jan 14, Revisions to N3366 = L2/07-343
			L2/08-042		Joshi, R. K. (2008-01-23), Proposal for Encoding of Vaidika Characters & Symbols in Unicode
			L2/08-043		Joshi, R. K. (2008-01-28), Vaidika Extensions A & B (Vedic evidence to accompany L2/08-042)
			L2/08-092	N3385	Everson, Michael; Scharf, Peter (2008-01-31), Comparison between two Vedic proposals of January 2008
			L2/08-097		Amendments to L2/08-042, L2/08-043, 2008-01-31
			L2/08-050R	N3383R	Everson, Michael; Scharf, Peter (2008-03-06), Summary proposal to encode characters for Vedic in the BMP of the UCS
			L2/08-096		Joshi, R. K. (2008-02-02), Comparison of L2/08-042 and L2/08-050
			L2/08-110		Muller, Eric (2008-02-08), South Asia Subcommittee Report
			L2/08-003		Moore, Lisa (2008-02-14), "Vedic", UTC #114 Minutes
			L2/08-176	N3456R	Anderson, Deborah (2008-04-18), Summary of Vedic Characters based on N3385, N3383R, and the Unicode Pipeline
			L2/08-196		Proposal for Encoding of Vaidika Sanskrit Characters & Symbols in the BMP of UCS, 2008-05-05
			L2/08-216		Scharf, Peter (2008-05-08), Comments on L2/08-196 regarding the encoding of Sanskrit and Vedic
			L2/08-294		Scharf, Peter (2008-08-06), Placement of characters in Vedic, Devanagari, and Devanagari Extended blocks
			L2/08-317		Muller, Eric (2008-08-11), "1.2, 1.3", South Asia Subcommittee Report
			L2/08-318	N3453 (pdf, doc)	Umamaheswaran, V. S. (2008-08-13), "M52.18", Unconfirmed minutes of WG 2 meeting 52
			L2/08-253R2		Moore, Lisa (2008-08-19), "Vedic (B.15.2, E.1)", UTC #116 Minutes
			L2/08-273R3	N3488R3	Everson, Michael; Scharf, Peter (2008-08-21), Proposal to encode two characters for Vedic in the UCS
			L2/08-412	N3553 (pdf, doc)	Umamaheswaran, V. S. (2008-11-05), "M53.03", Unconfirmed minutes of WG 2 meeting 53
			L2/09-067		Lata, Swaran (2009-01-28), Encoding of Vaidika Sanskrit Characters & Symbols in the BMP of UCS
8.0	U+A8FC	1	L2/12-147		Anderson, Deborah; McGowan, Rick; Whistler, Ken (2012-04-25), "VIII. DEVANAGARI", Review of Indic-related L2 documents and Recommendations to the UTC
			L2/12-123R	N4260	Pandey, Anshuman (2012-05-03), Proposal to Encode the Sign SIDDHAM for Devanagari
			L2/12-343R2		Moore, Lisa (2012-12-04), "Consensus 133-C13", UTC #133 Minutes
				N4353 (pdf, doc)	"M60.17", Unconfirmed minutes of WG 2 meeting 60, 2013-05-23
	U+A8FD	1	L2/13-056	N4408	Pandey, Anshuman (2013-04-25), Proposal to Encode the Sign JAIN OM for Devanagari in ISO/IEC 10646
			L2/13-086		Anderson, Deborah; McGowan, Rick; Whistler, Ken; Pournader, Roozbeh (2013-04-26), "8", Recommendations to UTC on Script Proposals
			L2/13-058		Moore, Lisa (2013-06-12), "D.2", UTC #135 Minutes
				N4403 (pdf, doc)	Umamaheswaran, V. S. (2014-01-28), "10.3.9 Jain Om for Devanagar", Unconfirmed minutes of WG 2 meeting 61, Holiday Inn, Vilnius, Lithuania; 2013-06-10/14
11.0	U+A8FE..A8FF	2	L2/15-335		Pandey, Anshuman (2015-12-07), Proposal to encode the Devanagari letter and vowel sign AY
			L2/16-037		Anderson, Deborah; Whistler, Ken; McGowan, Rick; Pournader, Roozbeh; Glass, Andrew; Iancu, Laurențiu (2016-01-22), "2. Devanagari", Recommendations to UTC #146 January 2016 on Script Proposals
			L2/16-004		Moore, Lisa (2016-02-01), "D.7", UTC #146 Minutes
[a] Proposed code points and character names may differ from final code points and names.

References

1. "Unicode character database". The Unicode Standard. Retrieved 2016-07-09.
2. "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2016-07-09.

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.
