Over a thousand characters from the Latin script are encoded in the Unicode Standard, grouped in several basic and extended Latin blocks. The extended ranges contain mainly precomposed letters with diacritics, which can equivalently be encoded as a base letter followed by combining diacritics, as well as some ligatures and distinct letters used, for example, in the orthographies of various African languages (including click symbols in Latin Extended-B) and in the Vietnamese alphabet (Latin Extended Additional). Latin Extended-C contains additions for Uighur and the Claudian letters. Latin Extended-D comprises characters that are mostly of interest to medievalists. Latin Extended-E mostly comprises characters used for German dialectology (Teuthonista). [1] Latin Extended-F and -G contain characters for phonetic transcription.
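The equivalence between a precomposed letter and a base letter with combining diacritics can be illustrated with Unicode normalization. The following is a minimal sketch using Python's standard `unicodedata` module; the helper `code_points` and the choice of U+1EBF as the sample character are assumptions made for illustration only.

```python
import unicodedata

# U+1EBF LATIN SMALL LETTER E WITH CIRCUMFLEX AND ACUTE,
# a precomposed Vietnamese letter from Latin Extended Additional.
precomposed = "\u1ebf"

# NFD (canonical decomposition) splits it into a base letter
# followed by combining diacritics.
decomposed = unicodedata.normalize("NFD", precomposed)

# NFC (canonical composition) puts the sequence back together.
recomposed = unicodedata.normalize("NFC", decomposed)


def code_points(text):
    """Hypothetical helper: list the code points of a string as U+XXXX."""
    return [f"U+{ord(ch):04X}" for ch in text]


print(code_points(precomposed))   # ['U+1EBF']
print(code_points(decomposed))    # ['U+0065', 'U+0302', 'U+0301']
print(recomposed == precomposed)  # True
```

Both spellings are canonically equivalent, so text processing that compares strings usually normalizes them to one form first.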
As of version 16.0 of the Unicode Standard, 1,487 characters in the following 19 blocks are classified as belonging to the Latin script. [2]
In addition, a number of Latin-like characters are encoded in the Currency Symbols, Control Pictures, CJK Compatibility, Enclosed Alphanumerics, Enclosed CJK Letters and Months, Mathematical Alphanumeric Symbols, and Enclosed Alphanumeric Supplement blocks. Although they are graphically Latin letters, they have the script property Common and so do not belong to the Latin script in Unicode terms. Lisu also consists almost entirely of Latin forms, but uses its own script property.
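The graphical relationship between such characters and ordinary Latin letters can be seen through compatibility normalization. The sketch below uses only Python's standard `unicodedata` module, which does not expose the Script property itself, so the script classifications appear only as comments taken from the text and table of this article; the helper `describe` and the sample characters are illustrative assumptions.

```python
import unicodedata

# Sample characters that all look like a capital A.
samples = [
    "\u0041",      # Basic Latin (Latin script)
    "\uff21",      # Halfwidth and Fullwidth Forms (counted as Latin script)
    "\u24b6",      # Enclosed Alphanumerics (script property Common)
    "\U0001d400",  # Mathematical Alphanumeric Symbols (script property Common)
]


def describe(ch):
    """Hypothetical helper: return the character name and its NFKC folding."""
    return unicodedata.name(ch), unicodedata.normalize("NFKC", ch)


for ch in samples:
    name, folded = describe(ch)
    print(f"U+{ord(ch):04X}  {name}  ->  {folded!r}")
```

All four characters fold to the same Basic Latin letter under NFKC, which shows that neither a character's shape nor its compatibility mapping determines its script property. The property has to be looked up in the Unicode Character Database.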
In this table those characters with the Unicode script property of Latin are highlighted in colour, indicating the version of Unicode they were introduced in. Reserved code points (which may be assigned as characters at a future date) have a grey background. All characters that do not belong to the Latin script have a white background (and the version of Unicode they were introduced in is therefore not indicated).
Legend: Unicode version | |
---|---|
Unicode 1.0 | Unicode 6.1 |
Unicode 1.1 | Unicode 7.0 |
Unicode 2.0 | Unicode 8.0 |
Unicode 3.0 | Unicode 9.0 |
Unicode 3.2 | Unicode 11.0 |
Unicode 4.0 | Unicode 12.0 |
Unicode 4.1 | Unicode 13.0 |
Unicode 5.0 | Unicode 14.0 |
Unicode 5.1 | Unicode 15.0 |
Unicode 5.2 | Unicode 16.0 |
Unicode 6.0 | |
Reserved | Not Latin script |
U+ | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | Block | # |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0040 | @ | A | B | C | D | E | F | G | H | I | J | K | L | M | N | O | C0 Controls and Basic Latin 0000–007F (identical to ASCII) | 52 |
0050 | P | Q | R | S | T | U | V | W | X | Y | Z | [ | \ | ] | ^ | _ | ||
0060 | ` | a | b | c | d | e | f | g | h | i | j | k | l | m | n | o | ||
0070 | p | q | r | s | t | u | v | w | x | y | z | { | \| | } | ~ | DEL | ||
00A0 | NBSP | ¡ | ¢ | £ | ¤ | ¥ | ¦ | § | ¨ | © | ª | « | ¬ | SHY | ® | ¯ | C1 Controls and Latin-1 Supplement 0080–00FF (identical to ISO/IEC 8859-1) | 64 |
00B0 | ° | ± | ² | ³ | ´ | µ | ¶ | · | ¸ | ¹ | º | » | ¼ | ½ | ¾ | ¿ | ||
00C0 | À | Á | Â | Ã | Ä | Å | Æ | Ç | È | É | Ê | Ë | Ì | Í | Î | Ï | ||
00D0 | Ð | Ñ | Ò | Ó | Ô | Õ | Ö | × | Ø | Ù | Ú | Û | Ü | Ý | Þ | ß | ||
00E0 | à | á | â | ã | ä | å | æ | ç | è | é | ê | ë | ì | í | î | ï | ||
00F0 | ð | ñ | ò | ó | ô | õ | ö | ÷ | ø | ù | ú | û | ü | ý | þ | ÿ | ||
0100 | Ā | ā | Ă | ă | Ą | ą | Ć | ć | Ĉ | ĉ | Ċ | ċ | Č | č | Ď | ď | Latin Extended-A 0100–017F | 128 |
0110 | Đ | đ | Ē | ē | Ĕ | ĕ | Ė | ė | Ę | ę | Ě | ě | Ĝ | ĝ | Ğ | ğ | ||
0120 | Ġ | ġ | Ģ | ģ | Ĥ | ĥ | Ħ | ħ | Ĩ | ĩ | Ī | ī | Ĭ | ĭ | Į | į | ||
0130 | İ | ı | Ĳ | ĳ | Ĵ | ĵ | Ķ | ķ | ĸ | Ĺ | ĺ | Ļ | ļ | Ľ | ľ | Ŀ | ||
0140 | ŀ | Ł | ł | Ń | ń | Ņ | ņ | Ň | ň | ŉ | Ŋ | ŋ | Ō | ō | Ŏ | ŏ | ||
0150 | Ő | ő | Œ | œ | Ŕ | ŕ | Ŗ | ŗ | Ř | ř | Ś | ś | Ŝ | ŝ | Ş | ş | ||
0160 | Š | š | Ţ | ţ | Ť | ť | Ŧ | ŧ | Ũ | ũ | Ū | ū | Ŭ | ŭ | Ů | ů | ||
0170 | Ű | ű | Ų | ų | Ŵ | ŵ | Ŷ | ŷ | Ÿ | Ź | ź | Ż | ż | Ž | ž | ſ | ||
0180 | ƀ | Ɓ | Ƃ | ƃ | Ƅ | ƅ | Ɔ | Ƈ | ƈ | Ɖ | Ɗ | Ƌ | ƌ | ƍ | Ǝ | Ə | Latin Extended-B 0180–024F | 208 |
0190 | Ɛ | Ƒ | ƒ | Ɠ | Ɣ | ƕ | Ɩ | Ɨ | Ƙ | ƙ | ƚ | ƛ | Ɯ | Ɲ | ƞ | Ɵ | ||
01A0 | Ơ | ơ | Ƣ | ƣ | Ƥ | ƥ | Ʀ | Ƨ | ƨ | Ʃ | ƪ | ƫ | Ƭ | ƭ | Ʈ | Ư | ||
01B0 | ư | Ʊ | Ʋ | Ƴ | ƴ | Ƶ | ƶ | Ʒ | Ƹ | ƹ | ƺ | ƻ | Ƽ | ƽ | ƾ | ƿ | ||
01C0 | ǀ | ǁ | ǂ | ǃ | Ǆ | ǅ | ǆ | Ǉ | ǈ | ǉ | Ǌ | ǋ | ǌ | Ǎ | ǎ | Ǐ | ||
01D0 | ǐ | Ǒ | ǒ | Ǔ | ǔ | Ǖ | ǖ | Ǘ | ǘ | Ǚ | ǚ | Ǜ | ǜ | ǝ | Ǟ | ǟ | ||
01E0 | Ǡ | ǡ | Ǣ | ǣ | Ǥ | ǥ | Ǧ | ǧ | Ǩ | ǩ | Ǫ | ǫ | Ǭ | ǭ | Ǯ | ǯ | ||
01F0 | ǰ | Ǳ | ǲ | ǳ | Ǵ | ǵ | Ƕ | Ƿ | Ǹ | ǹ | Ǻ | ǻ | Ǽ | ǽ | Ǿ | ǿ | ||
0200 | Ȁ | ȁ | Ȃ | ȃ | Ȅ | ȅ | Ȇ | ȇ | Ȉ | ȉ | Ȋ | ȋ | Ȍ | ȍ | Ȏ | ȏ | ||
0210 | Ȑ | ȑ | Ȓ | ȓ | Ȕ | ȕ | Ȗ | ȗ | Ș | ș | Ț | ț | Ȝ | ȝ | Ȟ | ȟ | ||
0220 | Ƞ | ȡ | Ȣ | ȣ | Ȥ | ȥ | Ȧ | ȧ | Ȩ | ȩ | Ȫ | ȫ | Ȭ | ȭ | Ȯ | ȯ | ||
0230 | Ȱ | ȱ | Ȳ | ȳ | ȴ | ȵ | ȶ | ȷ | ȸ | ȹ | Ⱥ | Ȼ | ȼ | Ƚ | Ⱦ | ȿ | ||
0240 | ɀ | Ɂ | ɂ | Ƀ | Ʉ | Ʌ | Ɇ | ɇ | Ɉ | ɉ | Ɋ | ɋ | Ɍ | ɍ | Ɏ | ɏ | ||
0250 | ɐ | ɑ | ɒ | ɓ | ɔ | ɕ | ɖ | ɗ | ɘ | ə | ɚ | ɛ | ɜ | ɝ | ɞ | ɟ | IPA Extensions 0250–02AF | 96 |
0260 | ɠ | ɡ | ɢ | ɣ | ɤ | ɥ | ɦ | ɧ | ɨ | ɩ | ɪ | ɫ | ɬ | ɭ | ɮ | ɯ | ||
0270 | ɰ | ɱ | ɲ | ɳ | ɴ | ɵ | ɶ | ɷ | ɸ | ɹ | ɺ | ɻ | ɼ | ɽ | ɾ | ɿ | ||
0280 | ʀ | ʁ | ʂ | ʃ | ʄ | ʅ | ʆ | ʇ | ʈ | ʉ | ʊ | ʋ | ʌ | ʍ | ʎ | ʏ | ||
0290 | ʐ | ʑ | ʒ | ʓ | ʔ | ʕ | ʖ | ʗ | ʘ | ʙ | ʚ | ʛ | ʜ | ʝ | ʞ | ʟ | ||
02A0 | ʠ | ʡ | ʢ | ʣ | ʤ | ʥ | ʦ | ʧ | ʨ | ʩ | ʪ | ʫ | ʬ | ʭ | ʮ | ʯ | ||
02B0 | ʰ | ʱ | ʲ | ʳ | ʴ | ʵ | ʶ | ʷ | ʸ | ʹ | ʺ | ʻ | ʼ | ʽ | ʾ | ʿ | Spacing Modifier Letters 02B0–02FF | 14 |
02E0 | ˠ | ˡ | ˢ | ˣ | ˤ | ˥ | ˦ | ˧ | ˨ | ˩ | ˪ | ˫ | ˬ | ˭ | ˮ | ˯ | ||
1D00 | ᴀ | ᴁ | ᴂ | ᴃ | ᴄ | ᴅ | ᴆ | ᴇ | ᴈ | ᴉ | ᴊ | ᴋ | ᴌ | ᴍ | ᴎ | ᴏ | Phonetic Extensions 1D00–1D7F | 111 |
1D10 | ᴐ | ᴑ | ᴒ | ᴓ | ᴔ | ᴕ | ᴖ | ᴗ | ᴘ | ᴙ | ᴚ | ᴛ | ᴜ | ᴝ | ᴞ | ᴟ | ||
1D20 | ᴠ | ᴡ | ᴢ | ᴣ | ᴤ | ᴥ | ᴦ | ᴧ | ᴨ | ᴩ | ᴪ | ᴫ | ᴬ | ᴭ | ᴮ | ᴯ | ||
1D30 | ᴰ | ᴱ | ᴲ | ᴳ | ᴴ | ᴵ | ᴶ | ᴷ | ᴸ | ᴹ | ᴺ | ᴻ | ᴼ | ᴽ | ᴾ | ᴿ | ||
1D40 | ᵀ | ᵁ | ᵂ | ᵃ | ᵄ | ᵅ | ᵆ | ᵇ | ᵈ | ᵉ | ᵊ | ᵋ | ᵌ | ᵍ | ᵎ | ᵏ | ||
1D50 | ᵐ | ᵑ | ᵒ | ᵓ | ᵔ | ᵕ | ᵖ | ᵗ | ᵘ | ᵙ | ᵚ | ᵛ | ᵜ | ᵝ | ᵞ | ᵟ | ||
1D60 | ᵠ | ᵡ | ᵢ | ᵣ | ᵤ | ᵥ | ᵦ | ᵧ | ᵨ | ᵩ | ᵪ | ᵫ | ᵬ | ᵭ | ᵮ | ᵯ | ||
1D70 | ᵰ | ᵱ | ᵲ | ᵳ | ᵴ | ᵵ | ᵶ | ᵷ | ᵸ | ᵹ | ᵺ | ᵻ | ᵼ | ᵽ | ᵾ | ᵿ | ||
1D80 | ᶀ | ᶁ | ᶂ | ᶃ | ᶄ | ᶅ | ᶆ | ᶇ | ᶈ | ᶉ | ᶊ | ᶋ | ᶌ | ᶍ | ᶎ | ᶏ | Phonetic Extensions Supplement 1D80–1DBF | 63 |
1D90 | ᶐ | ᶑ | ᶒ | ᶓ | ᶔ | ᶕ | ᶖ | ᶗ | ᶘ | ᶙ | ᶚ | ᶛ | ᶜ | ᶝ | ᶞ | ᶟ | ||
1DA0 | ᶠ | ᶡ | ᶢ | ᶣ | ᶤ | ᶥ | ᶦ | ᶧ | ᶨ | ᶩ | ᶪ | ᶫ | ᶬ | ᶭ | ᶮ | ᶯ | ||
1DB0 | ᶰ | ᶱ | ᶲ | ᶳ | ᶴ | ᶵ | ᶶ | ᶷ | ᶸ | ᶹ | ᶺ | ᶻ | ᶼ | ᶽ | ᶾ | ᶿ | ||
1E00 | Ḁ | ḁ | Ḃ | ḃ | Ḅ | ḅ | Ḇ | ḇ | Ḉ | ḉ | Ḋ | ḋ | Ḍ | ḍ | Ḏ | ḏ | Latin Extended Additional 1E00–1EFF | 256 |
1E10 | Ḑ | ḑ | Ḓ | ḓ | Ḕ | ḕ | Ḗ | ḗ | Ḙ | ḙ | Ḛ | ḛ | Ḝ | ḝ | Ḟ | ḟ | ||
1E20 | Ḡ | ḡ | Ḣ | ḣ | Ḥ | ḥ | Ḧ | ḧ | Ḩ | ḩ | Ḫ | ḫ | Ḭ | ḭ | Ḯ | ḯ | ||
1E30 | Ḱ | ḱ | Ḳ | ḳ | Ḵ | ḵ | Ḷ | ḷ | Ḹ | ḹ | Ḻ | ḻ | Ḽ | ḽ | Ḿ | ḿ | ||
1E40 | Ṁ | ṁ | Ṃ | ṃ | Ṅ | ṅ | Ṇ | ṇ | Ṉ | ṉ | Ṋ | ṋ | Ṍ | ṍ | Ṏ | ṏ | ||
1E50 | Ṑ | ṑ | Ṓ | ṓ | Ṕ | ṕ | Ṗ | ṗ | Ṙ | ṙ | Ṛ | ṛ | Ṝ | ṝ | Ṟ | ṟ | ||
1E60 | Ṡ | ṡ | Ṣ | ṣ | Ṥ | ṥ | Ṧ | ṧ | Ṩ | ṩ | Ṫ | ṫ | Ṭ | ṭ | Ṯ | ṯ | ||
1E70 | Ṱ | ṱ | Ṳ | ṳ | Ṵ | ṵ | Ṷ | ṷ | Ṹ | ṹ | Ṻ | ṻ | Ṽ | ṽ | Ṿ | ṿ | ||
1E80 | Ẁ | ẁ | Ẃ | ẃ | Ẅ | ẅ | Ẇ | ẇ | Ẉ | ẉ | Ẋ | ẋ | Ẍ | ẍ | Ẏ | ẏ | ||
1E90 | Ẑ | ẑ | Ẓ | ẓ | Ẕ | ẕ | ẖ | ẗ | ẘ | ẙ | ẚ | ẛ | ẜ | ẝ | ẞ | ẟ | ||
1EA0 | Ạ | ạ | Ả | ả | Ấ | ấ | Ầ | ầ | Ẩ | ẩ | Ẫ | ẫ | Ậ | ậ | Ắ | ắ | ||
1EB0 | Ằ | ằ | Ẳ | ẳ | Ẵ | ẵ | Ặ | ặ | Ẹ | ẹ | Ẻ | ẻ | Ẽ | ẽ | Ế | ế | ||
1EC0 | Ề | ề | Ể | ể | Ễ | ễ | Ệ | ệ | Ỉ | ỉ | Ị | ị | Ọ | ọ | Ỏ | ỏ | ||
1ED0 | Ố | ố | Ồ | ồ | Ổ | ổ | Ỗ | ỗ | Ộ | ộ | Ớ | ớ | Ờ | ờ | Ở | ở | ||
1EE0 | Ỡ | ỡ | Ợ | ợ | Ụ | ụ | Ủ | ủ | Ứ | ứ | Ừ | ừ | Ử | ử | Ữ | ữ | ||
1EF0 | Ự | ự | Ỳ | ỳ | Ỵ | ỵ | Ỷ | ỷ | Ỹ | ỹ | Ỻ | ỻ | Ỽ | ỽ | Ỿ | ỿ | ||
2070 | ⁰ | ⁱ | | | ⁴ | ⁵ | ⁶ | ⁷ | ⁸ | ⁹ | ⁺ | ⁻ | ⁼ | ⁽ | ⁾ | ⁿ | Superscripts and Subscripts 2070–209F | 15 |
2090 | ₐ | ₑ | ₒ | ₓ | ₔ | ₕ | ₖ | ₗ | ₘ | ₙ | ₚ | ₛ | ₜ | |||||
2120 | ℠ | ℡ | ™ | ℣ | ℤ | ℥ | Ω | ℧ | ℨ | ℩ | K | Å | ℬ | ℭ | ℮ | ℯ | Letterlike symbols 2100–214F | 4 |
2130 | ℰ | ℱ | Ⅎ | ℳ | ℴ | ℵ | ℶ | ℷ | ℸ | ℹ | ℺ | ℻ | ℼ | ℽ | ℾ | ℿ | ||
2140 | ⅀ | ⅁ | ⅂ | ⅃ | ⅄ | ⅅ | ⅆ | ⅇ | ⅈ | ⅉ | ⅊ | ⅋ | ⅌ | ⅍ | ⅎ | ⅏ | ||
2160 | Ⅰ | Ⅱ | Ⅲ | Ⅳ | Ⅴ | Ⅵ | Ⅶ | Ⅷ | Ⅸ | Ⅹ | Ⅺ | Ⅻ | Ⅼ | Ⅽ | Ⅾ | Ⅿ | Number Forms 2150–218F | 41 |
2170 | ⅰ | ⅱ | ⅲ | ⅳ | ⅴ | ⅵ | ⅶ | ⅷ | ⅸ | ⅹ | ⅺ | ⅻ | ⅼ | ⅽ | ⅾ | ⅿ | ||
2180 | ↀ | ↁ | ↂ | Ↄ | ↄ | ↅ | ↆ | ↇ | ↈ | ↉ | ↊ | ↋ | ||||||
2C60 | Ⱡ | ⱡ | Ɫ | Ᵽ | Ɽ | ⱥ | ⱦ | Ⱨ | ⱨ | Ⱪ | ⱪ | Ⱬ | ⱬ | Ɑ | Ɱ | Ɐ | Latin Extended-C 2C60–2C7F | 32 |
2C70 | Ɒ | ⱱ | Ⱳ | ⱳ | ⱴ | Ⱶ | ⱶ | ⱷ | ⱸ | ⱹ | ⱺ | ⱻ | ⱼ | ⱽ | Ȿ | Ɀ | ||
A720 | ꜠ | ꜡ | Ꜣ | ꜣ | Ꜥ | ꜥ | Ꜧ | ꜧ | Ꜩ | ꜩ | Ꜫ | ꜫ | Ꜭ | ꜭ | Ꜯ | ꜯ | Latin Extended-D A720–A7FF | 194 |
A730 | ꜰ | ꜱ | Ꜳ | ꜳ | Ꜵ | ꜵ | Ꜷ | ꜷ | Ꜹ | ꜹ | Ꜻ | ꜻ | Ꜽ | ꜽ | Ꜿ | ꜿ | ||
A740 | Ꝁ | ꝁ | Ꝃ | ꝃ | Ꝅ | ꝅ | Ꝇ | ꝇ | Ꝉ | ꝉ | Ꝋ | ꝋ | Ꝍ | ꝍ | Ꝏ | ꝏ | ||
A750 | Ꝑ | ꝑ | Ꝓ | ꝓ | Ꝕ | ꝕ | Ꝗ | ꝗ | Ꝙ | ꝙ | Ꝛ | ꝛ | Ꝝ | ꝝ | Ꝟ | ꝟ | ||
A760 | Ꝡ | ꝡ | Ꝣ | ꝣ | Ꝥ | ꝥ | Ꝧ | ꝧ | Ꝩ | ꝩ | Ꝫ | ꝫ | Ꝭ | ꝭ | Ꝯ | ꝯ | ||
A770 | ꝰ | ꝱ | ꝲ | ꝳ | ꝴ | ꝵ | ꝶ | ꝷ | ꝸ | Ꝺ | ꝺ | Ꝼ | ꝼ | Ᵹ | Ꝿ | ꝿ | ||
A780 | Ꞁ | ꞁ | Ꞃ | ꞃ | Ꞅ | ꞅ | Ꞇ | ꞇ | ꞈ | ꞉ | ꞊ | Ꞌ | ꞌ | Ɥ | ꞎ | ꞏ | ||
A790 | Ꞑ | ꞑ | Ꞓ | ꞓ | ꞔ | ꞕ | Ꞗ | ꞗ | Ꞙ | ꞙ | Ꞛ | ꞛ | Ꞝ | ꞝ | Ꞟ | ꞟ | ||
A7A0 | Ꞡ | ꞡ | Ꞣ | ꞣ | Ꞥ | ꞥ | Ꞧ | ꞧ | Ꞩ | ꞩ | Ɦ | Ɜ | Ɡ | Ɬ | Ɪ | ꞯ | ||
A7B0 | Ʞ | Ʇ | Ʝ | Ꭓ | Ꞵ | ꞵ | Ꞷ | ꞷ | Ꞹ | ꞹ | Ꞻ | ꞻ | Ꞽ | ꞽ | Ꞿ | ꞿ | ||
A7C0 | Ꟁ | ꟁ | Ꟃ | ꟃ | Ꞔ | Ʂ | Ᶎ | Ꟈ | ꟈ | Ꟊ | ꟊ | | | | ||||
A7D0 | Ꟑ | ꟑ | | ꟓ | | ꟕ | Ꟗ | ꟗ | Ꟙ | ꟙ | | | | | | | ||
A7E0 | ||||||||||||||||||
A7F0 | | | ꟲ | ꟳ | ꟴ | Ꟶ | ꟶ | ꟷ | ꟸ | ꟹ | ꟺ | ꟻ | ꟼ | ꟽ | ꟾ | ꟿ | ||
AB30 | ꬰ | ꬱ | ꬲ | ꬳ | ꬴ | ꬵ | ꬶ | ꬷ | ꬸ | ꬹ | ꬺ | ꬻ | ꬼ | ꬽ | ꬾ | ꬿ | Latin Extended-E AB30–AB6F | 56 |
AB40 | ꭀ | ꭁ | ꭂ | ꭃ | ꭄ | ꭅ | ꭆ | ꭇ | ꭈ | ꭉ | ꭊ | ꭋ | ꭌ | ꭍ | ꭎ | ꭏ | ||
AB50 | ꭐ | ꭑ | ꭒ | ꭓ | ꭔ | ꭕ | ꭖ | ꭗ | ꭘ | ꭙ | ꭚ | ꭛ | ꭜ | ꭝ | ꭞ | ꭟ | ||
AB60 | ꭠ | ꭡ | ꭢ | ꭣ | ꭤ | ꭥ | ꭦ | ꭧ | ꭨ | ꭩ | ꭪ | ꭫ | ||||||
FB00 | ﬀ | ﬁ | ﬂ | ﬃ | ﬄ | ﬅ | ﬆ | | | | | | | | | | Alphabetic Presentation Forms FB00–FB4F | 7 |
FF20 | ＠ | Ａ | Ｂ | Ｃ | Ｄ | Ｅ | Ｆ | Ｇ | Ｈ | Ｉ | Ｊ | Ｋ | Ｌ | Ｍ | Ｎ | Ｏ | Halfwidth and Fullwidth Forms (fullwidth Latin letters) FF00–FFEF | 52 |
FF30 | Ｐ | Ｑ | Ｒ | Ｓ | Ｔ | Ｕ | Ｖ | Ｗ | Ｘ | Ｙ | Ｚ | ［ | ＼ | ］ | ＾ | ＿ | ||
FF40 | ｀ | ａ | ｂ | ｃ | ｄ | ｅ | ｆ | ｇ | ｈ | ｉ | ｊ | ｋ | ｌ | ｍ | ｎ | ｏ | ||
FF50 | ｐ | ｑ | ｒ | ｓ | ｔ | ｕ | ｖ | ｗ | ｘ | ｙ | ｚ | ｛ | ｜ | ｝ | ～ | ｟ | ||
10780 | 𐞀 | 𐞁 | 𐞂 | 𐞃 | 𐞄 | 𐞅 | | 𐞇 | 𐞈 | 𐞉 | 𐞊 | 𐞋 | 𐞌 | 𐞍 | 𐞎 | 𐞏 | Latin Extended-F 10780–107BF | 57 |
10790 | 𐞐 | 𐞑 | 𐞒 | 𐞓 | 𐞔 | 𐞕 | 𐞖 | 𐞗 | 𐞘 | 𐞙 | 𐞚 | 𐞛 | 𐞜 | 𐞝 | 𐞞 | 𐞟 | ||
107A0 | 𐞠 | 𐞡 | 𐞢 | 𐞣 | 𐞤 | 𐞥 | 𐞦 | 𐞧 | 𐞨 | 𐞩 | 𐞪 | 𐞫 | 𐞬 | 𐞭 | 𐞮 | 𐞯 | ||
107B0 | 𐞰 | | 𐞲 | 𐞳 | 𐞴 | 𐞵 | 𐞶 | 𐞷 | 𐞸 | 𐞹 | 𐞺 | | | | | | ||
1DF00 | 𝼀 | 𝼁 | 𝼂 | 𝼃 | 𝼄 | 𝼅 | 𝼆 | 𝼇 | 𝼈 | 𝼉 | 𝼊 | 𝼋 | 𝼌 | 𝼍 | 𝼎 | 𝼏 | Latin Extended-G 1DF00–1DFFF | 37 |
1DF10 | 𝼐 | 𝼑 | 𝼒 | 𝼓 | 𝼔 | 𝼕 | 𝼖 | 𝼗 | 𝼘 | 𝼙 | 𝼚 | 𝼛 | 𝼜 | 𝼝 | 𝼞 | |||
1DF20 | | | | | | 𝼥 | 𝼦 | 𝼧 | 𝼨 | 𝼩 | 𝼪 | | | | | | ||
Total characters | 1,487 |
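The total can be cross-checked by adding up the per-block counts from the last column of the table. The following is a small sketch of that arithmetic; the dictionary name `latin_counts` is an illustrative assumption, and the figures are copied from the table above.

```python
# Number of Latin-script characters per block, as listed in the table.
latin_counts = {
    "C0 Controls and Basic Latin": 52,
    "C1 Controls and Latin-1 Supplement": 64,
    "Latin Extended-A": 128,
    "Latin Extended-B": 208,
    "IPA Extensions": 96,
    "Spacing Modifier Letters": 14,
    "Phonetic Extensions": 111,
    "Phonetic Extensions Supplement": 63,
    "Latin Extended Additional": 256,
    "Superscripts and Subscripts": 15,
    "Letterlike Symbols": 4,
    "Number Forms": 41,
    "Latin Extended-C": 32,
    "Latin Extended-D": 194,
    "Latin Extended-E": 56,
    "Alphabetic Presentation Forms": 7,
    "Halfwidth and Fullwidth Forms": 52,
    "Latin Extended-F": 57,
    "Latin Extended-G": 37,
}

total = sum(latin_counts.values())

print(len(latin_counts))  # 19 blocks
print(total)              # 1487 characters
```

The sum agrees with the 1,487 characters in 19 blocks stated for Unicode 16.0.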