Arabic Extended-A

Arabic Extended-A
Arabic Extended-A
Range	U+08A0..U+08FF; (96 code points)
Plane	BMP
Scripts	Arabic (95 char.); Common (1 char.)
Major alphabets	African languages; Bashkir; Belarusian; Berber; Chechen; Crimean Tatar; Hindko; Lak; Philippine languages; Punjabi; Rohingya; Tatar; Arwi
Assigned	96 code points
Unused	0 reserved code points
Unicode version history
6.1 (2012)	39 (+39)
7.0 (2014)	47 (+8)
8.0 (2015)	50 (+3)
9.0 (2016)	73 (+23)
11.0 (2018)	74 (+1)
13.0 (2020)	84 (+10)
14.0 (2021)	96 (+12)
Unicode documentation
	Code chart ∣ Web page
	Note:

Last updated September 13, 2023

Arabic Extended-A is a Unicode block encoding Qur'anic annotations and letter variants used for various non-Arabic languages.^[3]

Block

Arabic Extended-A ^[1] Official Unicode Consortium code chart (PDF)
	0	1	2	3	4	5	6	7	8	9	A	B	C	D	E	F
U+08Ax	ࢠ	ࢡ	ࢢ	ࢣ	ࢤ	ࢥ	ࢦ	ࢧ	ࢨ	ࢩ	ࢪ	ࢫ	ࢬ	ࢭ	ࢮ	ࢯ
U+08Bx	ࢰ	ࢱ	ࢲ	ࢳ	ࢴ	ࢵ	ࢶ	ࢷ	ࢸ	ࢹ	ࢺ	ࢻ	ࢼ	ࢽ	ࢾ	ࢿ
U+08Cx	ࣀ	ࣁ	ࣂ	ࣃ	ࣄ	ࣅ	ࣆ	ࣇ	ࣈ	ࣉ	࣊	࣋	࣌	࣍	࣎	࣏
U+08Dx	࣐	࣑	࣒	࣓	ࣔ	ࣕ	ࣖ	ࣗ	ࣘ	ࣙ	ࣚ	ࣛ	ࣜ	ࣝ	ࣞ	ࣟ
U+08Ex	࣠	࣡	࣢	ࣣ	ࣤ	ࣥ	ࣦ	ࣧ	ࣨ	ࣩ	࣪	࣫	࣬	࣭	࣮	࣯
U+08Fx	ࣰ	ࣱ	ࣲ	ࣳ	ࣴ	ࣵ	ࣶ	ࣷ	ࣸ	ࣹ	ࣺ	ࣻ	ࣼ	ࣽ	ࣾ	ࣿ
Notes 1. ^ As of Unicode version 15.0

History

The following Unicode-related documents record the purpose and process of defining specific characters in the Arabic Extended-A block:

Version	Final code points^{[lower-alpha 1]}	Count	L2 ID	WG2 ID	Document
6.1	U+08A0, 08A2..08AC, 08E4..08EF, 08F4..08FE	35	L2/10-288R	N3882	Priest, Lorna; Hosken, Martin (2010-08-12), Proposal to add Arabic script characters for African and Asian languages
			L2/10-221		Moore, Lisa (2010-08-23), "C.11.2", UTC #124 / L2 #221 Minutes
				N3903 (pdf, doc)	"M57.10", Unconfirmed minutes of WG2 meeting 57, 2011-03-31
	U+08F0..08F3	4	L2/01-325		Milo, Thomas (2001-08-14), Three kinds of variation in Arabic script (sukun and jazm)
			L2/02-275		Kew, Jonathan (2002-08-02), Proposal for Koranic alternate marks
			L2/09-335R		Moore, Lisa (2009-11-10), "C.31", UTC #121 / L2 #218 Minutes
			L2/10-094	N3816	King Fahd Glorious Quran Printing Complex (2010-03-21), Proposal to change some combining Arabic characters for Quranic representation
			L2/09-419R	N3791	Pournader, Roozbeh (2010-03-30), Proposal to encode four combining Arabic characters for Koranic use
			L2/10-155		Schmitt, Arno; Hosny, Khaled (2010-05-03), Reservations on L2/09-419 proposal
			L2/10-221		Moore, Lisa (2010-08-23), "C.11.1", UTC #124 / L2 #221 Minutes
				N3803 (pdf, doc)	"M56.08j", Unconfirmed minutes of WG 2 meeting no. 56, 2010-09-24
			L2/10-371	N3920	Lazrek, Azzeddine (2010-09-30), Comments on Resolutions from Meeting 56 (and the 3rd edition) concerning Koranic characters
7.0	U+08A1	1	L2/10-288R	N3882	Priest, Lorna; Hosken, Martin (2010-08-12), Proposal to add Arabic script characters for African and Asian languages
			L2/10-442R	N3988	Priest, Lorna; Hosken, Martin (2010-10-29), Proposal to add ARABIC LETTER BEH WITH HAMZA ABOVE
				N3903 (pdf, doc)	"M57.10", Unconfirmed minutes of WG2 meeting 57, 2011-03-31
				N4103	"11.2.6 Arabic Letter BEH WITH HAMZA ABOVE", Unconfirmed minutes of WG 2 meeting 58, 2012-01-03
	U+08AD..08B1	5	L2/11-138		Yevlampiev, Ilya; Pentzlin, Karl; Joomagueldinov, Nurlan (2011-04-28), Proposal to encode Arabic characters used for Bashkir, Belarusian, Crimean Tatar, Karachay, Karakalpak, and Tatar languages
			L2/11-190		Pournader, Roozbeh (2011-05-12), Proposed codepoints and properties for characters in L2/11-138
			L2/11-201	N4065	Pournader, Roozbeh; Anderson, Deborah (2011-05-12), Proposed codepoints and properties for characters in "Proposal to encode Arabic characters used for Bashkir, Belarusian, Crimean Tatar, Karachay, Karakalpak, and Tatar languages"
			L2/11-116		Moore, Lisa (2011-05-17), "C.17.1.1", UTC #127 / L2 #224 Minutes
			L2/11-209	N4071, N4072	Yevlampiev, Ilya; Pentzlin, Karl; Joomagueldinov, Nurlan (2011-05-20), Revised Proposal to encode Arabic characters used for Bashkir, Belarusian, Crimean Tatar, and Tatar languages
				N4103	"11.12 Arabic characters used for Bashkir, Belarusian, Crimean Tatar, and Tatar languages", Unconfirmed minutes of WG 2 meeting 58, 2012-01-03
	U+08B2	1	L2/12-181R	N4271	Priest, Lorna A. (2012-05-09), Proposal to add ARABIC LETTER ZAIN WITH INVERTED V ABOVE
	U+08B2	1	L2/12-112		Moore, Lisa (2012-05-17), "C.19", UTC #131 / L2 #228 Minutes
	U+08FF	1	L2/11-033R	N3989	Priest, Lorna (2011-02-10), Proposal to add Arabic Mark Sideways Noon Ghunna
			L2/11-016		Moore, Lisa (2011-02-15), "C.2.2", UTC #126 / L2 #223 Minutes
				N4103	"11.2.5 Arabic Mark SIDEWAYS NOON GHUNNA", Unconfirmed minutes of WG 2 meeting 58, 2012-01-03
8.0	U+08B3..08B4, 08E3	3	L2/08-429		Buhari, Seyed (2008-12-04), Additional Arabic letters needed for Arwi Script
			L2/09-143		Buhari, Seyed (2009-04-15), Additional Arabic letters needed for Arwi Script
			L2/13-165		Anderson, Deborah; Whistler, Ken; Pournader, Roozbeh (2013-07-25), "2", Recommendations to UTC on Script Proposals
			L2/13-132		Moore, Lisa (2013-07-29), "Consensus 136-C22", UTC #136 Minutes
			L2/13-168		Ganesan, Naga (2013-07-29), Comment on L2/13-130: Writing classical Tamil in Arabic script
			L2/13-130R	N4474	Pournader, Roozbeh (2013-08-19), Proposal to encode three Arabic characters for Arwi
				N4553 (pdf, doc)	Umamaheswaran, V. S. (2014-09-16), "M62.04a", Minutes of WG 2 meeting 62 Adobe, San Jose, CA, USA
			L2/22-221		Nasrullah, Febri Muhammad (2022-09-11), On ARABIC LETTER KAF WITH DOT BELOW [Affects U+08B4]
			L2/22-248		Anderson, Deborah; et al. (2022-10-31), "3 Arabic", Recommendations to UTC #173 October 2022 on Script Proposals
			L2/22-241		Constable, Peter (2022-11-09), "D.1.5 Arabic: Kaf with dot below", Approved Minutes of UTC Meeting 173, Add an annotation to U+08B4
9.0	U+08B6..08BA	5	L2/13-178		Banafunzi, Hamid; Banafunzi, Marghani; Nuur, Maxamed (2013-08-31), Proposal to encode five Arabic script characters for the Bravanese (Chimiini)
			L2/13-210		Anderson, Deborah; Whistler, Ken; McGowan, Rick; Pournader, Roozbeh (2013-10-31), "4", Recommendations to UTC #137 November 2013 on Script Proposals
			L2/13-223R	N4498	Pournader, Roozbeh (2013-11-06), Proposal to encode four Arabic characters for Bravanese
			L2/13-200		Moore, Lisa (2013-11-18), "C.3.1.1", UTC #137 Minutes
				N4553 (pdf, doc)	Umamaheswaran, V. S. (2014-09-16), "M62.09c, M62.09i", Minutes of WG 2 meeting 62 Adobe, San Jose, CA, USA
			L2/14-293R		Pournader, Roozbeh; Afshar, Shervin (2014-11-01), Proposal to Encode Arabic Letter Teh with Small Teh Above for Bravanese
			L2/14-250		Moore, Lisa (2014-11-10), "C.3.3", UTC #141 Minutes
	U+08BB..08BD	3	L2/14-104		Evans, Lorna (2014-04-29), Supporting the Warsh orthography for Arabic script
			L2/14-129		Anderson, Deborah; Whistler, Ken; McGowan, Rick; Pournader, Roozbeh (2014-05-02), "8", Recommendations to UTC #139 May 2014 on Script Proposals
			L2/14-170		Anderson, Deborah; Whistler, Ken; McGowan, Rick; Pournader, Roozbeh; Iancu, Laurențiu (2014-07-28), "18", Recommendations to UTC #140 August 2014 on Script Proposals
			L2/14-207R		Pournader, Roozbeh (2014-08-09), Implications of the Unicode Arabic model for the Warsh orthography
			L2/14-211	N4597	Evans, Lorna (2014-08-15), Proposal to encode Warsh-based Arabic script characters
			L2/14-177		Moore, Lisa (2014-10-17), "Warsh Orthography (C.3.2)", UTC #140 Minutes
			L2/16-052	N4603 (pdf, doc)	Umamaheswaran, V. S. (2015-09-01), "M63.03h", Unconfirmed minutes of WG 2 meeting 63
	U+08D4	1	L2/14-148		Shaikh, Lateef Sagar (2014-06-26), Proposal to encode Quranic mark Ar-Rbaa used in Quran published in Pakistan
			L2/14-170		Anderson, Deborah; Whistler, Ken; McGowan, Rick; Pournader, Roozbeh; Iancu, Laurențiu (2014-07-28), "7", Recommendations to UTC #140 August 2014 on Script Proposals
				N4592	Shaikh, Lateef Sagar (2014-08-11), Proposal to encode Quranic mark Ar-Rub used in Quran published in Pakistan
			L2/14-177		Moore, Lisa (2014-10-17), "Quranic mark Ar-Rbaa (C.3.1.1)", UTC #140 Minutes
			L2/16-052	N4603 (pdf, doc)	Umamaheswaran, V. S. (2015-09-01), "M63.03g", Unconfirmed minutes of WG 2 meeting 63
	U+08D5..08E2	14	L2/14-095		Shaikh, Lateef Sagar (2014-04-24), Proposal to encode Quranic marks used in Quran published in Pakistan
			L2/14-129		Anderson, Deborah; Whistler, Ken; McGowan, Rick; Pournader, Roozbeh (2014-05-02), "7", Recommendations to UTC #139 May 2014 on Script Proposals
			L2/14-100		Moore, Lisa (2014-05-13), "C.3.1.3", UTC #139 Minutes
			L2/14-105R	N4589	Pournader, Roozbeh (2014-07-27), Proposal to encode fourteen Pakistani Quranic marks
			L2/16-052	N4603 (pdf, doc)	Umamaheswaran, V. S. (2015-09-01), "M63.02b", Unconfirmed minutes of WG 2 meeting 63
			L2/15-254		Moore, Lisa (2015-11-16), "Consensus 145-C16", UTC #145 Minutes, Change the name of U+08E2 to ARABIC DISPUTED END OF AYAH
				N4739	"M64.03d", Unconfirmed minutes of WG 2 meeting 64, 2016-08-31
11.0	U+08D3	1	L2/16-044		Abudena, Mussa A. A. (2015-11-29), Proposal to encode Quranic marks used in Quran published in Libya with Commentary
			L2/15-329		Abudena, Mussa A. A. (2015-12-02), Proposal to encode Quranic marks used in Quran published in Libya
			L2/16-037		Anderson, Deborah; Whistler, Ken; McGowan, Rick; Pournader, Roozbeh; Glass, Andrew; Iancu, Laurențiu (2016-01-22), "15. Arabic", Recommendations to UTC #146 January 2016 on Script Proposals
			L2/16-056		Shaikh, Lateef Sagar (2016-02-16), Proposal to encode Al-Dani Quranic marks used in Quran published in Libya
			L2/16-100		Abudena, Mussa A. A. (2016-04-27), Comments on L2/16-056 Proposal to encode AlDani Quranic Marks
			L2/16-102		Anderson, Deborah (2016-05-01), "#10", Consolidated Comments by Mansour, Evans, and Abudena on Al-Dani Quranic Marks (L2/16-056)
			L2/16-153		Abudena, Mussa A. A. (2016-05-04), Types of Quran scripts
			L2/16-156		Anderson, Deborah; Whistler, Ken; Pournader, Roozbeh; Glass, Andrew; Iancu, Laurențiu (2016-05-06), "5. Arabic", Recommendations to UTC #147 May 2016 on Script Proposals
			L2/16-121		Moore, Lisa (2016-05-20), "C.10. Arabic", UTC #147 Minutes
			L2/16-268		Lazrek, Azzeddine (2016-09-26), Suggestions on some Al-Dani Quranic Marks proposition
13.0	U+08BE..08C2	5	L2/18-032	N4961	Shaikh, Lateef Sagar (2018-01-17), Proposal to include Hindko alphabets
			L2/18-168		Anderson, Deborah; Whistler, Ken; Pournader, Roozbeh; Moore, Lisa; Liang, Hai; Chapman, Chris; Cook, Richard (2018-04-28), "21. Hindko", Recommendations to UTC #155 April-May 2018 on Script Proposals
			L2/18-174	N4962	Anderson, Deborah; Pournader, Roozbeh (2018-05-02), Hindko additions for Arabic
			L2/18-115		Moore, Lisa (2018-05-09), "C.3.1", UTC #155 Minutes
				N5020 (pdf, doc)	Umamaheswaran, V. S. (2019-01-11), "10.3.12", Unconfirmed minutes of WG 2 meeting 67
	U+08C3..08C4	2	L2/18-094	N4959	Evans, Lorna; Warren-Rothlin, Andy (2018-04-26), Proposal to encode additional Arabic script characters for Hausa
			L2/18-168		Anderson, Deborah; Whistler, Ken; Pournader, Roozbeh; Moore, Lisa; Liang, Hai; Chapman, Chris; Cook, Richard (2018-04-28), "19. Arabic for Hausa", Recommendations to UTC #155 April-May 2018 on Script Proposals
			L2/18-115		Moore, Lisa (2018-05-09), "C.3.3", UTC #155 Minutes
				N5020 (pdf, doc)	Umamaheswaran, V. S. (2019-01-11), "10.3.11", Unconfirmed minutes of WG 2 meeting 67
	U+08C5..08C6	2	L2/19-118R	N5049	Patel, Neil; Riley, Charles; MacLean, Jesus (2019-03-25), Proposal to add Arabic letter JEEM WITH THREE DOTS ABOVE and JEEM WITH THREE DOTS BELOW
			L2/19-173		Anderson, Deborah; et al. (2019-04-29), "ARABIC LETTER JEEM WITH THREE DOTS", Recommendations to UTC #159 April-May 2019 on Script Proposals
			L2/19-122		Moore, Lisa (2019-05-08), "C.9.4", UTC #159 Minutes
				N5122	"M68.10", Unconfirmed minutes of WG 2 meeting 68, 2019-12-31
	U+08C7	1	L2/19-111R		Evans, Lorna; Malik, M. G. Abbas (2019-05-01), Proposal to encode ARABIC LETTER LAM WITH SMALL ARABIC LETTER TAH ABOVE in the UCS
			L2/19-173		Anderson, Deborah; et al. (2019-04-29), "ARABIC LETTER LAM WITH SMALL ARABIC LETTER TAH ABOVE", Recommendations to UTC #159 April-May 2019 on Script Proposals
			L2/19-122		Moore, Lisa (2019-05-08), "C.9.3", UTC #159 Minutes
14.0	U+08B5	1	L2/19-313		van Putten, Marijn; Pournader, Roozbeh (2019-09-29), Proposal to encode an Arabic qaf with dot below
			L2/19-343		Anderson, Deborah; Whistler, Ken; Pournader, Roozbeh; Moore, Lisa; Liang, Hai (2019-10-06), "b. Arabic qaf with dot below", Recommendations to UTC #161 October 2019 on Script Proposals
			L2/19-323		Moore, Lisa (2019-10-01), "C.6.2", UTC #161 Minutes
	U+08C8	1	L2/19-077		Shaikh, Lateef Sagar (2019-03-03), Proposal to include Balti alphabet
			L2/19-173		Anderson, Deborah; et al. (2019-04-29), "c. Balti", Recommendations to UTC #159 April-May 2019 on Script Proposals
			L2/19-252		Shaikh, Lateef Sagar (2019-05-28), Proposal to include Balti alphabet ARABIC LETTER GRAF
			L2/19-286		Anderson, Deborah; Whistler, Ken; Pournader, Roozbeh; Moore, Lisa; Liang, Hai (2019-07-22), "6. Arabic", Recommendations to UTC #160 July 2019 on Script Proposals
			L2/19-270		Moore, Lisa (2019-10-07), "C.9.1", UTC #160 Minutes
			L2/21-069R		Scherer, Markus; Davis, Mark; Freytag, Asmus; Chapman, Christopher; Whistler, Ken; Constable, Peter (2021-04-26), "PRI428i: U+08C8 ArabicShaping name", UTC #167 properties feedback & recommendations
			L2/21-066		Moore, Lisa (2021-05-05), "Action Item 167-A71", UTC #167 Minutes
	U+08C9..08D2	10	L2/19-306	N5142	Pournader, Roozbeh; Anderson, Deborah (2019-09-29), Arabic additions for Quranic orthographies
			L2/19-343		Anderson, Deborah; Whistler, Ken; Pournader, Roozbeh; Moore, Lisa; Liang, Hai (2019-10-06), "a. Additions for Quranic orthographies", Recommendations to UTC #161 October 2019 on Script Proposals
			L2/19-323		Moore, Lisa (2019-10-01), "Consensus 161-C4", UTC #161 Minutes
			L2/20-105		Anderson, Deborah; Whistler, Ken; Pournader, Roozbeh; Moore, Lisa; Constable, Peter; Liang, Hai (2020-04-20), "3f. Comments on L2/19-306", Recommendations to UTC #163 April 2020 on Script Proposals
			L2/20-102		Moore, Lisa (2020-05-06), "Consensus 163-C15", UTC #163 Minutes, The UTC changes the name for U+08D2 from ARABIC ROUND DOT INSIDE LARGE CIRCLE BELOW to ARABIC LARGE ROUND DOT INSIDE CIRCLE BELOW
↑ Proposed code points and characters names may differ from final code points and names

Related Research Articles

Unicode, formally The Unicode Standard, is a text encoding standard maintained by the Unicode Consortium designed to support the use of text written in all of the world's major writing systems. Version 15.1 of the standard defines 149813 characters and 161 scripts used in various ordinary, literary, academic, and technical contexts. Many common characters, including numerals, punctuation, and other symbols, are unified within the standard and are not treated as specific to any given writing system. Unicode encodes thousands of emoji, with the continued development thereof conducted by the Consortium as a part of the standard. Moreover, the widespread adoption of Unicode was in large part responsible for the initial popularization of emoji outside of Japan. Unicode is ultimately capable of encoding more than 1.1 million characters.

Arial Unicode MS is a TrueType font and the extended version of the font Arial. Compared to Arial, it includes higher line height, omits kerning pairs and adds enough glyphs to cover a large subset of Unicode 2.1—thus supporting most Microsoft code pages, but also requiring much more storage space. It also adds Ideographic layout tables, but unlike Arial, it mandates no smoothing in the 14–18 point range, and contains Roman (upright) glyphs only; there is no oblique (italic) version. Arial Unicode MS was previously distributed with Microsoft Office, but this ended in 2016 version. It is bundled with Mac OS X v10.5 and later. It may also be purchased separately from Ascender Corporation, who licenses the font from Microsoft.

<span class="mw-page-title-main">Gentium</span> Typeface

Gentium is a Unicode serif typeface designed by Victor Gaultney. Gentium fonts are free and open source software, and are released under the SIL Open Font License (OFL), which permits modification and redistribution. Gentium has wide support for languages using the Latin, Greek, and Cyrillic alphabets, and the International Phonetic Alphabet (IPA). Gentium Plus variants released since November 2010 now include over 5,500 glyphs and advanced typographic features through OpenType and formerly Graphite.

There are Unicode typefaces which are open-source and designed to contain glyphs of all Unicode characters, or at least a broad selection of Unicode scripts. There are also numerous projects aimed at providing only a certain script, such as the Arabeyes Arabic font. The advantage of targeting only some scripts with a font was that certain Unicode characters should be rendered differently depending on which language they are used in, and that a font that only includes the characters a certain user needs will be much smaller in file size compared to one with many glyphs. Unicode fonts in modern formats such as OpenType can in theory cover multiple languages by including multiple glyphs per character, though very few actually cover more than one language's forms of the unified Han characters.

Doulos SIL is a serif typeface developed by SIL International, very similar to Times or Times New Roman. Unlike Times New Roman, Doulos only has a single face, Regular. The goal of its design according to the SIL International website is to "provide a single Unicode-based font family that would contain a comprehensive inventory of glyphs needed for almost any Roman- or Cyrillic-based writing system, whether used for phonetic or orthographic needs." Along with Charis SIL and Gentium, it is licensed under the SIL Open Font License (OFL). This font has a cousin specially designed for numbered musical notation named Doulos SIL Cipher.

In Unicode, a Private Use Area (PUA) is a range of code points that, by definition, will not be assigned characters by the Unicode Consortium. Three private use areas are defined: one in the Basic Multilingual Plane, and one each in, and nearly covering, planes 15 and 16. The code points in these areas cannot be considered as standardized characters in Unicode itself. They are intentionally left undefined so that third parties may define their own characters without conflicting with Unicode Consortium assignments. Under the Unicode Stability Policy, the Private Use Areas will remain allocated for that purpose in all future Unicode versions.

A fallback font is a reserve typeface containing symbols for as many Unicode characters as possible. When a display system encounters a character that is not part of the repertoire of any of the other available fonts, a symbol from a fallback font is used instead. Typically, a fallback font will contain symbols representative of the various types of Unicode characters.

Charis SIL is a transitional serif typeface developed by SIL International based on Bitstream Charter, one of the first fonts designed for laser printers. The font offers four family members: roman, bold, italic, and bold italic.

The SIL Open Font License is one of the major open font licenses, which allows embedding, or "bundling", of the font in commercially sold products.

<span class="mw-page-title-main">GNU FreeFont</span> Font family

GNU FreeFont is a family of free OpenType, TrueType and WOFF vector fonts, implementing as much of the Universal Character Set (UCS) as possible, aside from the very large CJK Asian character set. The project was initiated in 2002 by Primož Peterlin and is now maintained by Steve White.

<span class="mw-page-title-main">Microsoft Sans Serif</span> Neo-grotesque sans-serif typeface

Microsoft Sans Serif is a sans-serif typeface introduced with early Microsoft Windows versions. It is the successor of MS Sans Serif, formerly Helv, a proportional bitmap font introduced in Windows 1.0. Both typefaces are very similar in design to Arial and Helvetica. The typeface was designed to match the MS Sans bitmap included in the early releases of Microsoft Windows.

In Unicode, the Sumero-Akkadian Cuneiform script is covered in three blocks in the Supplementary Multilingual Plane (SMP):

Many scripts in Unicode, such as Arabic, have special orthographic rules that require certain combinations of letterforms to be combined into special ligature forms. In English, the common ampersand (&) developed from a ligature in which the handwritten Latin letters e and t were combined. The rules governing ligature formation in Arabic can be quite complex, requiring special script-shaping technologies such as the Arabic Calligraphic Engine by Thomas Milo's DecoType.

GNU Unifont is a free Unicode bitmap font created by Roman Czyborra. The main Unifont covers all of the Basic Multilingual Plane (BMP). The "upper" companion covers significant parts of the Supplementary Multilingual Plane (SMP). The "Unifont JP" companion contains Japanese kanji present in the JIS X 0213 character set.

Arabic is a Unicode block, containing the standard letters and the most common diacritics of the Arabic script, and the Arabic-Indic digits.

Noto is a font family comprising over 100 individual fonts, which are together designed to cover all the scripts encoded in the Unicode standard. As of October 2016, Noto fonts cover all 93 scripts defined in Unicode version 6.1, although fewer than 30,000 of the nearly 75,000 CJK unified ideographs in version 6.0 are covered. In total, Noto fonts cover nearly 64,000 characters, which is under half of the 149,186 characters defined in Unicode 15.0.

Scheherazade New, formerly Scheherazade, is a traditional Naskh styled font for Arabic script created by SIL, freely available under the Open Font License. It supports a wide range of Arabic-based writing system encoded in Unicode. The font offers two family members: regular and bold.

IBM Plex is an open source typeface superfamily conceptually designed and developed by Mike Abbink at IBM in collaboration with Bold Monday to reflect the design principles of IBM and to be used for all brand material across the company internationally. Plex replaces Helvetica as the IBM corporate typeface after more than fifty years, freeing the company from extensive license payments in the process.

Andika is a sans-serif typeface developed by SIL International for the Latin, Greek and Cyrillic scripts. It is designed for literacy programs and beginning readers, but also has support for IPA transcription and a large number of diacritics. The font offers four family members: roman, bold, italic and bold italic.

References

↑ "Unicode character database". The Unicode Standard. Retrieved 2023-07-26.
↑ "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2023-07-26.
↑ The Unicode Consortium. The Unicode Standard, Version 6.0.0, (Mountain View, CA: The Unicode Consortium, 2011. ISBN 978-1-936213-01-6), Chapter 8

External links

Scheherazade, an extended Arabic script font designed by SIL International, distributed under the SIL Open Font License (OFL)
Harmattan, an extended Arabic script font designed by SIL International for West Africa, distributed under the SIL Open Font License (OFL)

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.

[final-4] Proposed code points and characters names may differ from final code points and names

[1] "Unicode character database". The Unicode Standard. Retrieved 2023-07-26.

[2] "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2023-07-26.

[3] The Unicode Consortium. The Unicode Standard, Version 6.0.0, (Mountain View, CA: The Unicode Consortium, 2011. ISBN 978-1-936213-01-6), Chapter 8

[1]

[2]

[3]

[1]

[lower-alpha 1]

Arabic Extended-A
Range	U+08A0..U+08FF (96 code points)
Plane	BMP
Scripts	Arabic (95 char.) Common (1 char.)
Major alphabets	African languages Bashkir Belarusian Berber Chechen Crimean Tatar Hindko Lak Philippine languages Punjabi Rohingya Tatar Arwi
Assigned	96 code points
Unused	0 reserved code points
Unicode version history

6.1 (2012)	39 (+39)
7.0 (2014)	47 (+8)
8.0 (2015)	50 (+3)
9.0 (2016)	73 (+23)
11.0 (2018)	74 (+1)
13.0 (2020)	84 (+10)
14.0 (2021)	96 (+12)

Unicode documentation
Code chart ∣ Web page
Note: ^[1]^[2]