Arabic (Unicode block)

Last updated
Arabic
RangeU+0600..U+06FF
(256 code points)
Plane BMP
Scripts Arabic (238 char.)
Common (6 char.)
Inherited (12 char.)
Major alphabets Arabic
Kurdish
Pashto
Persian
Urdu
Sindhi
Assigned256 code points
Unused0 reserved code points
1 deprecated
Source standards ISO 8859-6
Unicode version history
1.0.0 (1991)169 (+169)
1.1 (1993)194 (+25)
3.0 (1999)206 (+12)
3.2 (2002)208 (+2)
4.0 (2003)227 (+19)
4.1 (2005)235 (+8)
5.1 (2008)250 (+15)
6.0 (2010)252 (+2)
6.1 (2012)253 (+1)
6.3 (2013)254 (+1)
7.0 (2014)255 (+1)
14.0 (2021)256 (+1)
Unicode documentation
Code chart ∣ Web page
Note: [1] [2]
Unicode block Arabic.jpg UCB Arabic.png
Unicode block Arabic.jpg

Arabic is a Unicode block, containing the standard letters and the most common diacritics of the Arabic script, and the Arabic-Indic digits. [3]

Contents

Block

Arabic [1] [2]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+060x ؀  ؁  ؂  ؃  ؄  ؅ ؆؇؈؉؊؋،؍؎؏
U+061xؘؙؚؐؑؒؓؔؕؖؗ؛ ALM ؝؞؟
U+062xؠءآأؤإئابةتثجحخد
U+063xذرزسشصضطظعغػؼؽؾؿ
U+064xـفقكلمنهوىيًٌٍَُ
U+065xِّْٕٖٜٟٓٔٗ٘ٙٚٛٝٞ
U+066x٠١٢٣٤٥٦٧٨٩٪٫٬٭ٮٯ
U+067xٰٱٲٳٴٵٶٷٸٹٺٻټٽپٿ
U+068xڀځڂڃڄڅچڇڈډڊڋڌڍڎڏ
U+069xڐڑڒړڔڕږڗژڙښڛڜڝڞڟ
U+06Axڠڡڢڣڤڥڦڧڨکڪګڬڭڮگ
U+06Bxڰڱڲڳڴڵڶڷڸڹںڻڼڽھڿ
U+06Cxۀہۂۃۄۅۆۇۈۉۊۋیۍێۏ
U+06Dxېۑےۓ۔ەۖۗۘۙۚۛۜ ۝ ۞۟
U+06Exۣ۠ۡۢۤۥۦۧۨ۩۪ۭ۫۬ۮۯ
U+06Fx۰۱۲۳۴۵۶۷۸۹ۺۻۼ۽۾ۿ
Notes
1. ^ As of Unicode version 15.1
2. ^ Unicode code point U+0673 is deprecated as of Unicode version 6.0

History

The following Unicode-related documents record the purpose and process of defining specific characters in the Arabic block:

Version Final code points [lower-alpha 1] Count L2  ID WG2  IDDocument
1.0.0U+060C, 061B, 061F, 0621..063A, 0640..0652, 0660..066C, 0670..06B7, 06BA..06BE, 06C0..06CE, 06D0..06D5, 06F0..06F9169(to be determined)
L2/00-115R2 Moore, Lisa (2000-08-08), "Arabic Thousands Separator", Minutes Of UTC Meeting #83
L2/01-184R Moore, Lisa (2001-06-18), "Arabic Cursive Joining", Minutes from the UTC/L2 meeting
L2/01-270 Hosken, Martin (2001-06-19), How U+06D5 works in Uighur, Some technical information collected
L2/01-295R Moore, Lisa (2001-11-06), "Properties - Joining Behavior of U+06D5", Minutes from the UTC/L2 meeting #88
L2/04-290 Karlsson, Kent (2004-07-16), Updating the Arabic Shaping normative data
L2/04-419 Davis, Mark (2004-11-18), ArabicShaping suggestion e-mail
L2/09-146 Pournader, Roozbeh (2009-04-15), Moving dots and Arabic script shaping: Farsi Yeh's and Jawi Nya
L2/09-104 Moore, Lisa (2009-05-20), "Motion 119-M2", UTC #119 / L2 #216 Minutes, Deprecate U+0673 ARABIC LETTER ALEF WITH WAVY HAMZA BELOW.
L2/09-335R Moore, Lisa (2009-11-10), "Deprecation of U+0673 ARABIC LETTER ALEF WITH WAVY HAMZA BELOW (B.10.1)", UTC #121 / L2 #218 Minutes
L2/10-045 Allawi, Adil (2010-01-27), Proposal for changes to ArabicShaping.txt to allow machine generation of Arabic fonts and glyphs
L2/10-168 Mansour, Kamal (2010-05-04), Problems with the joining behavior of Arabic Letter Yeh Barree (U+06D2)
L2/10-108 Moore, Lisa (2010-05-19), "Action item 123-A49", UTC #123 / L2 #220 Minutes, Update section 8.2 to explain how to deal with the need for representing "medial" of Yeh Barree in text.
L2/11-092 Pournader, Roozbeh (2011-03-08), Changes to schematic names of Arabic letters
L2/11-206 N4066 Proposing to Supplement with the Script and Character of Chaghatay Language, 2011-04-25
N4067 Proposal to Encode Special Scripts and Characters in UCS for Uighur language, 2011-05-15
L2/11-245 N4113 Aalto, Tero (2011-06-08), Ad hoc report on Uighur
N4103 "11.11 Additional Characters for Uighur and Chaghatay Languages", Unconfirmed minutes of WG 2 meeting 58, 2012-01-03
L2/12-063 N4218 Proposal to add a Named UCS Sequence Identifier UYGHUR LETTERS, 2012-02-02
L2/12-101 N4231 Pournader, Roozbeh; Anderson, Deborah (2012-02-09), Comments on N4218 Proposal to add a Named UCS Sequence Identifier UYGHUR LETTERS
N4253 (pdf, doc)"10.3.3 NUSI-s for Uyghur Letters", Unconfirmed minutes of WG 2 meeting 59, 2012-09-12
L2/12-381 Pournader, Roozbeh (2012-11-03), Initial and medial forms of Arabic Letter Noon Ghunna
L2/12-343R2 Moore, Lisa (2012-12-04), "Consensus 133-C29", UTC #133 Minutes, Accept 9 named sequences...
L2/13-119 Pournader, Roozbeh (2013-05-08), Dot positioning of U+06A3 Arabic Letter Feh with Dot Below
L2/13-058 Moore, Lisa (2013-06-12), "B.13.4", UTC #135 Minutes
N4463 Silamu, Wushour; Anderson, Deborah; Constable, Peter (2013-06-28), User Guidelines for Uyghur, Kazakh, Kyrgyz, and Chagatai
L2/13-226 Milo, Thomas (2013-11-26), Arabic Amphibious Characters
N4403 (pdf, doc)Umamaheswaran, V. S. (2014-01-28), "10.4.6 Draft UTN on User Guidelines for Uyghur, Kazakh, Kyrgyz and Chagatai", Unconfirmed minutes of WG 2 meeting 61, Holiday Inn, Vilnius, Lithuania; 2013-06-10/14
L2/14-109 Milo, Thomas (2014-05-01), Koranic and Classic orthography in Unicode and computer typography
L2/14-136 Pournader, Roozbeh (2014-05-08), The right hehs for Arabic script orthographies of Sorani Kurdish and Uighur
L2/14-100 Moore, Lisa (2014-05-13), "C.3.5", UTC #139 Minutes
L2/20-289 N5155 Evans, Lorna Priest (2020-12-07), Request for glyph changes and annotations for Kazakh, Kyrgyz, and Uyghur [Affects U+0626, U+0674-0678, U+06C5, and U+06C7]
L2/21-016R Anderson, Deborah; Whistler, Ken; Pournader, Roozbeh; Moore, Lisa; Liang, Hai (2021-01-14), "11a. Glyph changes and annotations for Kazakh, Kyrgyz, and Uyghur", Recommendations to UTC #166 January 2021 on Script Proposals
L2/21-009 Moore, Lisa (2021-01-27), "B.1 — 11a. Glyph changes and annotations for Kazakh, Kyrgyz, and Uyghur", UTC #166 Minutes
L2/21-050 N5160 Chinese comments on WG2 N5155, 2021-02-02
L2/21-098 N5162 Constable, Peter (2021-04-09), Response to China NB comments on WG2 N5155 (UTC document L2/21-050)
L2/21-073 Anderson, Deborah; Whistler, Ken; Pournader, Roozbeh; Moore, Lisa; Liang, Hai (2021-04-23), "4b. China comments on WG2 N5155", Recommendations to UTC #167 April 2021 on Script Proposals
1.1U+066D, 06D6..06ED25(to be determined)
L2/01-428 Kew, Jonathan (2001-11-01), Request for clarification regarding U+06DD ARABIC END OF AYAH and other Arabic enclosing marks
L2/01-405R Moore, Lisa (2001-12-12), "Arabic Enclosing Marks", Minutes from the UTC/L2 meeting in Mountain View, November 6-9, 2001
L2/03-112 Pournader, Roozbeh (2003-03-05), New Arabic controls and Arabic joining
L2/05-150 Freytag, Asmus (2005-05-05), Arabic errata
L2/05-151 Milo, Thomas (2005-05-12), Annotations to the printing of the 1924 Azhar Qur'an
L2/05-203 McGowan, Rick (2005-08-04), Public Review Issue #73: Representative Glyphs for Arabic Characters U+06DF, U+06E0, and U+06E1
L2/05-231 Mansour, Kamal (2005-08-11), Regarding the proposed changes for the representative glyphs for 06DF, 06E0, and 06E1
N2956 Freytag, Asmus (2005-08-12), "Representative Glyphs for Arabic Characters U+06DF, U+06E0, and U+06E1", Unicode Consortium Liaison Report for WG2 Meeting #47
L2/05-180 Moore, Lisa (2005-08-17), "Consensus 104-C8", UTC #104 Minutes, Change the representative glyphs for three Arabic characters: U+06DF, U+06E0, U+06E1.
L2/05-108R Moore, Lisa (2005-08-26), "Arabic Glyph Errata (B.21)", UTC #103 Minutes
N2953 (pdf, doc)Umamaheswaran, V. S. (2006-02-16), "M47.16 (Miscellaneous glyph defects)", Unconfirmed minutes of WG 2 meeting 47, Sophia Antipolis, France; 2005-09-12/15
L2/06-324R2 Moore, Lisa (2006-11-29), "B.14.2", UTC #109 Minutes
L2/09-358R Pournader, Roozbeh (2009-10-28), Discussion document for polishing Koranic support in Unicode
L2/10-209 Pournader, Roozbeh (2010-06-07), Public Review Issue #171: Changing the properties of U+06DE from a combining mark to a spacing symbol
L2/10-221 Moore, Lisa (2010-08-23), "Consensus 124-C13", UTC #124 / L2 #221 Minutes, Change the general category of U+06DE to from "Me" to "So" and bidi class from "NSM" to "ON", and linebreak property from "CN" to "AL" and remove the dotted circle from the glyph, for Unicode 6.0.
3.0U+0653..06553L2/97-130McGowan, R. (1997-02-24), The Unicode draft proposal for Syriac character encoding
L2/98-051Nelson, Paul; Kiraz, George (1998-02-23), Supporting letters for Encoding Syriac
L2/98-052Nelson, Paul; Kiraz, George (1998-02-23), Examples of Syriac
L2/98-070 Aliprand, Joan; Winkler, Arnold, "3.A.2. item a. Syriac script", Minutes of the joint UTC and L2 meeting from the meeting in Cupertino, February 25-27, 1998
L2/98-069 Nelson, Paul; Kiraz, George (1998-02-27), Presentation to support the coding of Syriac
L2/98-050 N1718Nelson, Paul; Kiraz, George; Hasso, Sargon (1998-03-06), Proposal to Encode Syriac in ISO/IEC 10646
L2/98-156Kiraz, George, Syriac: Unicode character properties
L2/98-158 Aliprand, Joan; Winkler, Arnold (1998-05-26), "Character Properties for Syriac Script", Draft Minutes – UTC #76 & NCITS Subgroup L2 #173 joint meeting, Tredyffrin, Pennsylvania, April 20-22, 1998
L2/98-286 N1703 Umamaheswaran, V. S.; Ksar, Mike (1998-07-02), "8.23", Unconfirmed Meeting Minutes, WG 2 Meeting #34, Redmond, WA, USA; 1998-03-16--20
N1837, N1837-Ireland Summary of Voting/Table of Replies - Amendment 27 - Syriac, 1998-08-27
L2/98-322 N1907 ISO/IEC 10646-1/FPDAM. 27, AMENDMENT 27: Syriac, 1998-10-22
N1906 Paterson, Bruce; Everson, Michael (1998-10-22), Disposition of Comments - FPDAM27 - Syriac
L2/99-010 N1903 (pdf, html, doc)Umamaheswaran, V. S. (1998-12-30), "6.7.10", Minutes of WG 2 meeting 35, London, U.K.; 1998-09-21--25
U+06B8..06B9, 06BF, 06CF, 06FA..06FE9N1573Additional Arabic characters (chiefly from ISO 11822), 1997-06-19
L2/97-288 N1603 Umamaheswaran, V. S. (1997-10-24), "8.24.6", Unconfirmed Meeting Minutes, WG 2 Meeting # 33, Heraklion, Crete, Greece, 20 June – 4 July 1997
L2/98-004R N1681Text of ISO 10646 – AMD 18 for PDAM registration and FPDAM ballot, 1997-12-22
L2/98-318 N1894 Revised text of 10646-1/FPDAM 18, AMENDMENT 18: Symbols and Others, 1998-10-22
L2/20-288 Evans, Lorna Priest (2020-12-07), "U+06FE ARABIC SIGN SINDHI POSTPOSITION MEN", Request for annotations for Sindhi and Behdini Kurdish
L2/21-016R Anderson, Deborah; Whistler, Ken; Pournader, Roozbeh; Moore, Lisa; Liang, Hai (2021-01-14), "11b. Sindhi and Behdini Kurdish", Recommendations to UTC #166 January 2021 on Script Proposals
L2/21-009 Moore, Lisa (2021-01-27), "B.1 — 11b. Sindhi and Behdini Kurdish", UTC #166 Minutes
3.2U+066E..066F2 L2/00-354 Davis, Mark; Mansour, Kamal (2000-10-12), Proposal For Addition To Arabic repertoire
L2/00-324 Moore, Lisa (2001-01-29), "Motion 85-M6", Minutes from UTC #85, San Diego
L2/01-150 N2357 Proposal to encode two Arabic characters to the UCS, 2001-04-04
L2/01-344 N2353 (pdf, doc)Umamaheswaran, V. S. (2001-09-09), "7.15", Minutes from SC2/WG2 meeting #40 -- Mountain View, April 2001
4.0U+0600..0602, 060D..060E, 0610..0614, 0656..065813 L2/00-135 Nelson, Paul; Farhan, Ashhar; Hisam, Arif; Hisam, Kashif; Clews, John (2000-04-07), Proposal to Add Urdu Epethit and Abbreviation Diacritics to the Arabic Block
L2/01-303 Vikas, Om (2001-07-26), Letter from the Government from India on "Draft for Unicode Standard for Indian Scripts"
L2/01-304 Feedback on Unicode Standard 3.0, 2001-08-02
L2/01-305 McGowan, Rick (2001-08-08), Draft UTC Response to L2/01-304, "Feedback on Unicode Standard 3.0"
L2/01-425 N2483 Kew, Jonathan (2001-11-01), Proposal to add Arabic-script honorifics and other marks
L2/01-426 Kew, Jonathan (2001-11-01), Proposal to add Arabic-script honorifics and other marks, Appendix: Examples of usage
L2/01-428 Kew, Jonathan (2001-11-01), Request for clarification regarding U+06DD ARABIC END OF AYAH and other Arabic enclosing marks
L2/01-439 Milo, Tom (2001-11-02), Arabic Year-sign examples
L2/01-430R McGowan, Rick (2001-11-20), UTC Response to L2/01-304, "Feedback on Unicode Standard 3.0"
L2/02-061 N2482 Kew, Jonathan (2002-01-29), Bidi committee consensus on Arabic additions from L2/01-425
L2/02-227 N2487 Proposal to add 16 Arabic characters, 2002-05-21
L2/02-070 Moore, Lisa (2002-08-26), "Scripts and New Characters - Arabic", Minutes for UTC #90
L2/03-102 Vikas, Om (2003-03-04), Unicode Standard for Indic Scripts
L2/03-101.10 Proposed Changes in Indic Scripts [Urdu, Sindhi, and Kashmiri document], 2003-03-04
L2/03-112 Pournader, Roozbeh (2003-03-05), New Arabic controls and Arabic joining
L2/04-196 N2653 (pdf, doc)Umamaheswaran, V. S. (2004-06-04), "a-3", Unconfirmed minutes of WG 2 meeting 44
L2/06-332 Esfahbod, Behdad; Pournader, Roozbeh (2006-10-15), Proposal to change the Bidi category of five Arabic characters from AL to AN
L2/06-372 Lata, Swaran (2006-11-04), Issues Pertinent to Kashmiri
L2/06-324R2 Moore, Lisa (2006-11-29), "Action item 109-A6, Consensus 109-C15", UTC #109 Minutes
L2/15-183R Pournader, Roozbeh (2015-07-28), Candidate characters for Grapheme_Cluster_Break=Prepend
L2/15-187 Moore, Lisa (2015-08-11), "Consensus 144-C6", UTC #144 Minutes, Change the Grapheme_Cluster_Break property of the 12 characters listed in L2/15-183R to "Prepend" for Unicode 9.0.
U+0603, 060F, 06153 L2/02-005 Hussain, Sarmad; Afzal, Muhammad (2001-12-18), Urdu Computing Standards (Charts and Exhibits)
L2/02-006 (pdf, doc) N2413-1 Zia, Khaver (2002-01-10), Towards Unicode Standard for Urdu
L2/02-003 N2413-2 Afzal, Muhammad; Hussain, Sarmad (2001-12-28), Urdu Computing Standards: Development of Urdu Zabta Takhti (UZT) 1.01
L2/02-004 N2413-3 Hussain, Sarmad; Afzal, Muhammad (2001-12-28), Urdu Computing Standards: Urdu Zabta Takhti (UZT) 1.01
L2/02-163 N2413-4 (pdf, doc)Proposal to add Marks and Digits in Arabic Code Block (for Urdu), 2002-04-30
L2/02-011R Kew, Jonathan (2002-01-12), Comments on L2/02-006: Towards Unicode Standard for Urdu
L2/02-197 Freytag, Asmus (2002-05-01), Urdu Feedback from Bidi Committee
L2/02-166R2 Moore, Lisa (2002-08-09), "Motion 91-M3", UTC #91 Minutes
L2/02-372 N2453 (pdf, doc)Umamaheswaran, V. S. (2002-10-30), "7.9 Urdu contribution", Unconfirmed minutes of WG 2 meeting 42
L2/03-034 Nelson, Paul; Ross, Fiona; Holloway, Tim; Hudson, John (2003-02-10), Proposal to change character properties of ARABIC SIGN SAFHA (U+0603)
L2/04-196 N2653 (pdf, doc)Umamaheswaran, V. S. (2004-06-04), "a-3", Unconfirmed minutes of WG 2 meeting 44
U+06EE..06EF, 06FF3 L2/01-427 N2481 Kew, Jonathan (2001-11-01), Proposal to add Parkari letters to Arabic block
L2/01-405R Moore, Lisa (2001-12-12), "Motion 89-M3", Minutes from the UTC/L2 meeting in Mountain View, November 6-9, 2001
L2/02-227 N2487 Proposal to add 16 Arabic characters, 2002-05-21
4.1U+060B1 N2523 Everson, Michael (2002-11-20), Proposal to encode the AFGHANI SIGN in the UCS
L2/03-330 N2640 Everson, Michael (2003-10-01), Revised proposal to encode the AFGHANI SIGN in the UCS
U+061E, 065A..065C4L2/98-274Davis, Mark; Mansour, Kamal (1998-07-28), Proposed Arabic Script Additions for Minority Languages
L2/98-409 Davis, Mark; Mansour, Kamal (1998-12-01), Proposal to add 25 Arabic characters to the BMP
L2/98-419 (pdf, doc)Aliprand, Joan (1999-02-05), "Additional Arabic characters", Approved Minutes -- UTC #78 & NCITS Subgroup L2 # 175 Joint Meeting, San Jose, CA -- December 1-4, 1998
L2/02-021 Davis, Mark; Mansour, Kamal (2002-01-17), Proposal To Amend Arabic repertoire
L2/03-154 Kew, Jonathan; Mansour, Kamal; Davis, Mark (2003-05-16), Proposal to encode productive Arabic-script modifier marks
L2/03-168 Kew, Jonathan (2003-06-02), Proposal to encode Arabic-script letters for African languages
L2/03-210 Kew, Jonathan (2003-06-12), Draft chart showing UTC #95 additions to Arabic blocks
L2/03-223 N2598 Kew, Jonathan (2003-07-10), Proposal to encode additional Arabic-script characters
U+06591 L2/03-133R N2581R2 Everson, Michael; Pournader, Roozbeh (2003-05-29), Proposal to encode the ARABIC ZWARAKAY in the UCS
U+065D..065E2 L2/04-025R N2723 Kew, Jonathan (2004-03-15), Proposal to encode Additional Arabic script characters
5.1U+0606..060A5 L2/05-318 Lazrek, Azzeddine (2005-10-24), Proposals for Unicode Consortium [Arabic mathematical symbols]
L2/05-320 Lazrek, Azzeddine (2005-07-10), Arabic Mathematical Diverse Symbols, Additional characters proposed to Unicode
L2/06-125 N3086-1, N3086 Lazrek, Azzeddine (2006-03-30), Diverse Arabic Mathematical Symbols
L2/06-108 Moore, Lisa (2006-05-25), "C.16", UTC #107 Minutes
N3103 (pdf, doc)Umamaheswaran, V. S. (2006-08-25), "8.14", Unconfirmed minutes of WG 2 meeting 48, Mountain View, CA, USA; 2006-04-24/27
N3153 (pdf, doc)Umamaheswaran, V. S. (2007-02-16), "M49.7", Unconfirmed minutes of WG 2 meeting 49 AIST, Akihabara, Tokyo, Japan; 2006-09-25/29
U+0616, 063B..063F6 L2/06-345R N3180R Everson, Michael; Pournader, Roozbeh; Sarbar, Elnaz (2006-10-24), Proposal to encode eight Arabic characters for Persian and Azerbaijani in the UCS
L2/06-324R2 Moore, Lisa (2006-11-29), "C.12", UTC #109 Minutes
L2/07-221 Hallissy, Bob (2007-07-19), Shaping behavior of Arabic characters based on Farsi Yeh [2007.07.19]
L2/07-268 N3253 (pdf, doc)Umamaheswaran, V. S. (2007-07-26), "M50.15", Unconfirmed minutes of WG 2 meeting 50, Frankfurt-am-Main, Germany; 2007-04-24/27
L2/22-104 Pournader, Roozbeh (2022-05-19), Fixing the name and glyph for U+0616
L2/22-128 Anderson, Deborah; Whistler, Ken; Pournader, Roozbeh; Constable, Peter (2022-07-20), "4a ARABIC SMALL HIGH LIGATURE ALEF WITH LAM WITH YEH", Recommendations to UTC #172 July 2022 on Script Proposals
L2/22-121 Constable, Peter (2022-08-01), "Consensus 172-C2", Draft Minutes of UTC Meeting 172, Create a formal name alias type correction for U+0616 ARABIC SMALL HIGH LIGATURE ALEF WITH LAM WITH YEH, with the value ARABIC SMALL HIGH LIGATURE ALEF WITH YEH BARREE.
U+0617..061A4 L2/06-358R N3185R Everson, Michael; Pournader, Roozbeh (2006-11-01), Proposal to encode four Qur'anic Arabic characters in the UCS
L2/06-324R2 Moore, Lisa (2006-11-29), "Consensus 109-C29", UTC #109 Minutes
L2/07-268 N3253 (pdf, doc)Umamaheswaran, V. S. (2007-07-26), "M50.14", Unconfirmed minutes of WG 2 meeting 50, Frankfurt-am-Main, Germany; 2007-04-24/27
6.0U+0620, 065F2L2/98-274Davis, Mark; Mansour, Kamal (1998-07-28), Proposed Arabic Script Additions for Minority Languages
L2/98-409 Davis, Mark; Mansour, Kamal (1998-12-01), Proposal to add 25 Arabic characters to the BMP
L2/98-419 (pdf, doc)Aliprand, Joan (1999-02-05), "Additional Arabic characters", Approved Minutes -- UTC #78 & NCITS Subgroup L2 # 175 Joint Meeting, San Jose, CA -- December 1-4, 1998
L2/02-021 Davis, Mark; Mansour, Kamal (2002-01-17), Proposal To Amend Arabic repertoire
L2/09-406 N3686-I Proposal to add one character in the Arabic block for representation of Kashmiri and annotation of existing characters, 2008-10-24
L2/09-176 Aazim, Muzaffar; Mansour, Kamal; Pournader, Roozbeh (2009-04-30), Proposal to add two Kashmiri characters and one annotation to the Arabic block
L2/09-215 N3673 Pournader, Roozbeh; Anderson, Deborah (2009-05-14), Proposal to add two Kashmiri characters
L2/09-104 Moore, Lisa (2009-05-20), "B.15.11.4", UTC #119 / L2 #216 Minutes
N3703 (pdf, doc)Umamaheswaran, V. S. (2010-04-13), "M55.8", Unconfirmed minutes of WG 2 meeting no. 55, Tokyo 2009-10-26/30
L2/10-169 Lata, Swaran (2010-05-06), Comments on the Proposed Arabic Letter Kashmiri Yeh
6.1U+06041 L2/09-335R Moore, Lisa (2009-11-10), "C.11", UTC #121 / L2 #218 Minutes
L2/09-144R3 N3734 Pandey, Anshuman (2009-11-20), Proposal to Encode the Samvat Date Sign for Arabic
N3803 (pdf, doc)"M56.08a", Unconfirmed minutes of WG 2 meeting no. 56, 2010-09-24
6.3U+061C1 L2/03-159 Kew, Jonathan (2003-05-28), Proposal to encode Arabic triple dot punctuation mark
L2/11-005 Allouche, Matitiahu; Mohie, Mohamed (2011-01-16), Proposal to encode an Arabic-Letter Mark (ALM)
L2/11-016 Moore, Lisa (2011-02-15), "Scripts and Symbols — Arabic letter mark", UTC #126 / L2 #223 Minutes
L2/11-278 Allouche, Matitiahu; Mohie, Mohamed (2011-07-17), Proposal to encode an Arabic-Letter Mark (ALM)
L2/11-397 Edberg, Peter (2011-10-25), Proposed addition of AL MARK and LEVEL DIRECTION MARK (PRI #205 background)
L2/11-398 Edberg, Peter (2011-10-25), Accumulated Feedback on PRI #205 (moderated)
L2/11-330 N4181 Anderson, Deborah (2011-11-04), Proposed Additions to ISO/IEC 10646
L2/11-353 Moore, Lisa (2011-11-30), "B.11.18", UTC #129 / L2 #226 Minutes
L2/11-432R N4180 (pdf, doc)Allouche, Matitiahu; Mohie, Mohamed (2012-02-15), Proposal to encode the Arabic Letter Mark (ALM)
N4253 (pdf, doc)"M59.16c", Unconfirmed minutes of WG 2 meeting 59, 2012-09-12
L2/13-040 Pournader, Roozbeh; Lanin, Aharon (2013-01-29), Fasttracking Arabic Letter Mark (ALM)
L2/13-011 Moore, Lisa (2013-02-04), "Consensus 134-C14", UTC #134 Minutes
L2/13-240 Davis, Mark (2013-12-12), Reconciling Script and Script_Extensions
L2/16-306 Constable, Peter (2016-10-28), Script property of Arabic Letter Mark and interaction with digit substitution mechanisms
L2/17-016 Moore, Lisa (2017-02-08), "Consensus 150-C24", UTC #150 Minutes, Change the Script property of U+061C from Common to Arabic, and change Script_Extensions from Default to Arabic, Syriac, and Thaana, for Unicode 10.0.
7.0U+06051 L2/09-163R Pandey, Anshuman (2009-09-15), Proposal to Encode Coptic Numerals in ISO/IEC 10646
L2/10-114 N3786 Pandey, Anshuman (2010-04-10), Towards an Encoding for Coptic Numbers in the UCS
L2/10-206R N3843R Pandey, Anshuman (2010-06-21), Final Proposal to Encode Coptic Numbers
L2/10-421R N3958R Pandey, Anshuman (2010-11-01), Request to Rename 'Coptic Numbers' to 'Coptic Epact Numerals'
L2/11-062R N3990 Pandey, Anshuman (2011-02-14), Final Proposal to Encode Coptic Epact Numbers
N3903 (pdf, doc)"M57.16", Unconfirmed minutes of WG2 meeting 57, 2011-03-31
N4103 "T.6. Arabic", Unconfirmed minutes of WG 2 meeting 58, 2012-01-03
14.0U+061D1 L2/20-245 Hosny, Khaled; Pournader, Roozbeh (2020-09-09), Proposal to encode three Arabic symbols
L2/20-250 Anderson, Deborah; Whistler, Ken; Pournader, Roozbeh; Moore, Lisa; Constable, Peter; Liang, Hai (2020-10-01), "5a. Three Symbols", Recommendations to UTC #165 October 2020 on Script Proposals
L2/20-237 Moore, Lisa (2020-10-27), "Consensus 165-C15", UTC #165 Minutes
  1. Proposed code points and characters names may differ from final code points and names

Related Research Articles

A Unicode block is one of several contiguous ranges of numeric character codes of the Unicode character set that are defined by the Unicode Consortium for administrative and documentation purposes. Typically, proposals such as the addition of new glyphs are discussed and evaluated by considering the relevant block or blocks as a whole.

Control Pictures is a Unicode block containing characters for graphically representing the C0 control codes, and other control characters. Its block name in Unicode 1.0 was Pictures for Control Codes.

Specials is a short Unicode block of characters allocated at the very end of the Basic Multilingual Plane, at U+FFF0–FFFF. Of these 16 code points, five have been assigned since Unicode 3.0:

The Basic Latin Unicode block, sometimes informally called C0 Controls and Basic Latin, is the first block of the Unicode standard, and the only block which is encoded in one byte in UTF-8. The block contains all the letters and control codes of the ASCII encoding. It ranges from U+0000 to U+007F, contains 128 characters and includes the C0 controls, ASCII punctuation and symbols, ASCII digits, both the uppercase and lowercase of the English alphabet and a control character.

The Latin-1 Supplement is the second Unicode block in the Unicode standard. It encodes the upper range of ISO 8859-1: 80 (U+0080) - FF (U+00FF). C1 Controls (0080–009F) are not graphic. This block ranges from U+0080 to U+00FF, contains 128 characters and includes the C1 controls, Latin-1 punctuation and symbols, 30 pairs of majuscule and minuscule accented Latin characters and 2 mathematical operators.

Latin Extended-A is a Unicode block and is the third block of the Unicode standard. It encodes Latin letters from the Latin ISO character sets other than Latin-1 and also legacy characters from the ISO 6937 standard.

Alphabetic Presentation Forms is a Unicode block containing standard ligatures for the Latin, Armenian, and Hebrew scripts.

Enclosed Alphanumerics is a Unicode block of typographical symbols of an alphanumeric within a circle, a bracket or other not-closed enclosure, or ending in a full stop.

CJK Unified Ideographs Extension-A is a Unicode block containing rare Han ideographs submitted to the Ideographic Research Group between 1992 and 1998, plus ten ideographs added in Unicode 13.0 which had previously been mistakenly unified with others.

CJK Symbols and Punctuation is a Unicode block containing symbols and punctuation used for writing the Chinese, Japanese and Korean languages. It also contains one Chinese character.

Enclosed Alphanumeric Supplement is a Unicode block consisting of Latin alphabet characters and Arabic numerals enclosed in circles, ovals or boxes, used for a variety of purposes. It is encoded in the range U+1F100–U+1F1FF in the Supplementary Multilingual Plane.

Arabic Presentation Forms-A is a Unicode block encoding contextual forms and ligatures of letter variants needed for Persian, Urdu, Sindhi and Central Asian languages. This block also allocates 32 noncharacters in Unicode, designed specifically for internal use.

Arabic Extended-A is a Unicode block encoding Qur'anic annotations and letter variants used for various non-Arabic languages.

Arabic Presentation Forms-B is a Unicode block encoding spacing forms of Arabic diacritics, and contextual letter forms. The special codepoint ZWNBSP is also here, which is only meant for a byte order mark. The block name in Unicode 1.0 was Basic Glyphs for Arabic Language; its characters were re-ordered in the process of merging with ISO 10646 in Unicode 1.0.1 and 1.1.

Cherokee is a Unicode block containing the syllabic characters for writing the Cherokee language. When Cherokee was first added to Unicode in version 3.0 it was treated as a unicameral alphabet, but in version 8.0 it was redefined as a bicameral script. The Cherokee block contains all the uppercase letters plus six lowercase letters. The Cherokee Supplement block, added in version 8.0, contains the rest of the lowercase letters. For backwards compatibility, the Unicode case folding algorithm—which usually converts a string to lowercase characters—maps Cherokee characters to uppercase.

CJK Unified Ideographs Extension C is a Unicode block containing rare and historic CJK ideographs for Chinese, Japanese, Korean, and Vietnamese submitted to the Ideographic Research Group between 2002 and 2006, plus five "urgently needed" characters added in Unicode versions 14.0 and 15.0, some of which had previously been mistakenly unified with other characters.

Dingbats is a Unicode block containing dingbats. Most of its characters were taken from Zapf Dingbats; it was the Unicode block to have imported characters from a specific typeface; Unicode later adopted a policy that excluded symbols with "no demonstrated need or strong desire to exchange in plain text", and thus no further dingbat typefaces were encoded until Webdings and Wingdings were encoded in Version 7.0. Some ornaments are also an emoji, having optional presentation variants.

Cherokee Supplement is a Unicode block containing the syllabic characters for writing the Cherokee language. When Cherokee was first added to Unicode in version 3.0 it was treated as a unicameral alphabet, but in version 8.0 it was redefined as a bicameral script. The Cherokee Supplement block contains lowercase letters only, whereas the Cherokee block contains all the uppercase letters, together with six lowercase letters. For backwards compatibility, the Unicode case folding algorithm—which usually converts a string to lowercase characters—maps Cherokee characters to uppercase.

Arabic Extended-B is a Unicode block encoding Qur'anic annotations and letter variants used for various non-Arabic languages. The block also includes currency symbols and an abbreviation mark.

Arabic Extended-C is a Unicode block encoding Qur'anic marks used in Turkey.

References

  1. "Unicode character database". The Unicode Standard. Retrieved 2023-07-26.
  2. "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2023-07-26.
  3. The Unicode Consortium. The Unicode Standard, Version 6.0.0, (Mountain View, CA: The Unicode Consortium, 2011. ISBN   978-1-936213-01-6), Chapter 8