Alphabetic Presentation Forms | |
---|---|
Range | U+FB00..U+FB4F (80 code points) |
Plane | BMP |
Scripts | Armenian (5 char.) Hebrew (46 char.) Latin (7 char.) |
Assigned | 58 code points |
Unused | 22 reserved code points |
Unicode version history | |
1.0.1 (1992) | 1 (+1) |
1.1 (1993) | 57 (+56) |
3.0 (1999) | 58 (+1) |
Chart | |
Code chart | |
Note: Range was initially part of the Private Use Area in Unicode 1.0.0, [1] and removed from it in Unicode 1.0.1. One character was moved from the Hebrew block to the Alphabetic Presentation Forms block in version 1.0.1 during the process of unifying with ISO 10646. [2] [3] [4] Contents |
Alphabetic Presentation Forms is a Unicode block containing standard ligatures for the Latin, Armenian, and Hebrew scripts.
Alphabetic Presentation Forms [1] [2] Official Unicode Consortium code chart (PDF) | ||||||||||||||||
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
U+FB0x | ff | fi | fl | ffi | ffl | ſt | st | |||||||||
U+FB1x | ﬓ | ﬔ | ﬕ | ﬖ | ﬗ | יִ | ﬞ | ײַ | ||||||||
U+FB2x | ﬠ | ﬡ | ﬢ | ﬣ | ﬤ | ﬥ | ﬦ | ﬧ | ﬨ | ﬩ | שׁ | שׂ | שּׁ | שּׂ | אַ | אָ |
U+FB3x | אּ | בּ | גּ | דּ | הּ | וּ | זּ | טּ | יּ | ךּ | כּ | לּ | מּ | |||
U+FB4x | נּ | סּ | ףּ | פּ | צּ | קּ | רּ | שּ | תּ | וֹ | בֿ | כֿ | פֿ | ﭏ | ||
Notes |
The following Unicode-related documents record the purpose and process of defining specific characters in the Alphabetic Presentation Forms block:
Version | Final code points [lower-alpha 1] | Count | UTC ID | L2 ID | WG2 ID | Document |
---|---|---|---|---|---|---|
1.0.1 | U+FB1E | 1 | (to be determined) | |||
1.1 | U+FB00..FB06, FB13..FB17, FB1F..FB36, FB38..FB3C, FB3E, FB40..FB41, FB43..FB44, FB46..FB4F | 56 | (to be determined) | |||
3.0 | U+FB1D | 1 | X3L2/95-124 | N1364 | David, Mark (1995-09-29), Proposal for Inclusion of One Additional Character, HEBREW LETTER YOD WITH HIRIQ, in the Unicode/ISO 10646 Standard | |
UTC/1995-054 | "Additional Yiddish Letter", Unicode Technical Committee Meeting #66, Draft Minutes, 1995-09-29 | |||||
N1353 | Umamaheswaran, V. S.; Ksar, Mike (1996-06-25), "8.13", Draft minutes of WG2 Copenhagen Meeting # 30 | |||||
L2/98-004R | N1681 | Text of ISO 10646 - AMD 18 for PDAM registration and FPDAM ballot, 1997-12-22 | ||||
L2/98-318 | N1894 | Revised text of 10646-1/FPDAM 18, AMENDMENT 18: Symbols and Others, 1998-10-22 | ||||
L2/01-004 | Duerst, Martin (2000-12-20), The impact of normalization (proposal to add U+FB1D to the Composition Exclusion list) | |||||
L2/01-008 | Constable, Peter (2000-12-22), Argument to add U+FB1D to the Composition Exclusion list | |||||
L2/01-038 | Rosenne, Jonathan (2001-01-18), Add U+FB1D to the Composition Exclusion List | |||||
L2/01-012R | Moore, Lisa (2001-05-21), "Normalization - YOD WITH HIRIQ", Minutes UTC #86 in Mountain View, Jan 2001 | |||||
|
The Ugaritic writing system is a cuneiform abjad used from around either 1400 BCE or 1300 BCE for Ugaritic, an extinct Northwest Semitic language, and discovered in Ugarit, Syria, in 1928. It has 30 letters. Other languages were occasionally written in the Ugaritic script in the area around Ugarit, although not elsewhere.
Bitstream Cyberbit is a commercial serif Unicode font designed by Bitstream Inc. It is freeware for non-commercial uses. It was one of the first widely available fonts to support a large portion of the Unicode repertoire.
The internationalized domain name (IDN) homograph attack is a way a malicious party may deceive computer users about what remote system they are communicating with, by exploiting the fact that many different characters look alike. For example, a regular user of example.com may be lured to click a link where the Latin character "a" is replaced with the Cyrillic character "а".
In Unicode, a Private Use Area (PUA) is a range of code points that, by definition, will not be assigned characters by the Unicode Consortium. Three private use areas are defined: one in the Basic Multilingual Plane, and one each in, and nearly covering, planes 15 and 16. The code points in these areas cannot be considered as standardized characters in Unicode itself. They are intentionally left undefined so that third parties may define their own characters without conflicting with Unicode Consortium assignments. Under the Unicode Stability Policy, the Private Use Areas will remain allocated for that purpose in all future Unicode versions.
Over a thousand characters from the Latin script are encoded in the Unicode Standard, grouped in several basic and extended Latin blocks. The extended ranges contain mainly precomposed letters plus diacritics that are equivalently encoded with combining diacritics, as well as some ligatures and distinct letters, used for example in the orthographies of various African languages and the Vietnamese alphabet. Latin Extended-C contains additions for Uighur and the Claudian letters. Latin Extended-D comprises characters that are mostly of interest to medievalists. Latin Extended-E mostly comprises characters used for German dialectology (Teuthonista). Latin Extended-F and -G contain characters for phonetic transcription.
GNU FreeFont is a family of free OpenType, TrueType and WOFF vector fonts, implementing as much of the Universal Character Set (UCS) as possible, aside from the very large CJK Asian character set. The project was initiated in 2002 by Primož Peterlin and is now maintained by Steve White.
Many scripts in Unicode, such as Arabic, have special orthographic rules that require certain combinations of letterforms to be combined into special ligature forms.
The Basic Latin or C0 Controls and Basic Latin Unicode block is the first block of the Unicode standard, and the only block which is encoded in one byte in UTF-8. The block contains all the letters and control codes of the ASCII encoding. It ranges from U+0000 to U+007F, contains 128 characters and includes the C0 controls, ASCII punctuation and symbols, ASCII digits, both the uppercase and lowercase of the English alphabet and a control character.
The Latin-1 Supplement is the second Unicode block in the Unicode standard. It encodes the upper range of ISO 8859-1: 80 (U+0080) - FF (U+00FF). C1 Controls (0080–009F) are not graphic. This block ranges from U+0080 to U+00FF, contains 128 characters and includes the C1 controls, Latin-1 punctuation and symbols, 30 pairs of majuscule and minuscule accented Latin characters and 2 mathematical operators.
IPA Extensions is a block (U+0250–U+02AF) of the Unicode standard that contains full size letters used in the International Phonetic Alphabet (IPA). Both modern and historical characters are included, as well as former and proposed IPA signs and non-IPA phonetic letters. Additional characters employed for phonetics, like the palatalization sign, are encoded in the blocks Phonetic Extensions (1D00–1D7F) and Phonetic Extensions Supplement (1D80–1DBF). Diacritics are found in the Spacing Modifier Letters (02B0–02FF) and Combining Diacritical Marks (0300–036F) blocks. Its block name in Unicode 1.0 was Standard Phonetic.
The ISO basic Latin alphabet is an international standard for a Latin-script alphabet that consists of two sets of 26 letters, codified in various national and international standards and used widely in international communication. They are the same letters that comprise the current English alphabet. Since medieval times, they are also the same letters of the modern Latin alphabet. The order is also important for sorting words into alphabetical order.
The Unicode Standard assigns various properties to each Unicode character and code point.
Enclosed Alphanumeric Supplement is a Unicode block consisting of Latin alphabet characters and Arabic numerals enclosed in circles, ovals or boxes, used for a variety of purposes. It is encoded in the range U+1F100–U+1F1FF in the Supplementary Multilingual Plane.
Arabic Presentation Forms-A is a Unicode block encoding contextual forms and ligatures of letter variants needed for Persian, Urdu, Sindhi and Central Asian languages. This block also allocates 32 noncharacters in Unicode, designed specifically for internal use.
Arabic Presentation Forms-B is a Unicode block encoding spacing forms of Arabic diacritics, and contextual letter forms. The special codepoint ZWNBSP is also here, which is only meant for a byte order mark. The block name in Unicode 1.0 was Basic Glyphs for Arabic Language; its characters were re-ordered in the process of merging with ISO 10646 in Unicode 1.0.1 and 1.1.
Armenian is a Unicode block containing characters for writing the Armenian language, both the traditional Western Armenian and reformed Eastern Armenian orthographies. Five Armenian ligatures are encoded in the Alphabetic Presentation Forms block.
Hebrew is a Unicode block containing characters for writing the Hebrew, Yiddish, Ladino, and other Jewish diaspora languages.
Syriac is a Unicode block containing characters for all forms of the Syriac alphabet, including the Estrangela, Serto, Eastern Syriac, and the Christian Palestinian Aramaic variants. It is used in Literary Syriac, Neo-Aramaic, and Arabic among Syriac-speaking Christians. It was used historically to write Armenian, Persian, Ottoman Turkish, and Malayalam.
Phoenician is a Unicode block containing characters used across the Mediterranean world from the 12th century BCE to the 3rd century CE. The Phoenician alphabet was added to the Unicode Standard in July 2006 with the release of version 5.0. An alternative proposal to handle it as a font variation of Hebrew was turned down.
PragmataPro is a monospaced font family designed for programming, created by Fabrizio Schiavi. It is a narrow programming font designed for legibility. The font implements Unicode characters, including (polytonic) Greek, Cyrillic, Arabic, Hebrew and the APL codepoints. The font specifically implements ligatures for programming, such as multiple-character operators. The characters are hinted by hand.