Gothic (Unicode block)

Gothic
Range: U+10330..U+1034F (32 code points)
Plane: SMP
Scripts: Gothic
Major alphabets: Gothic
Assigned: 27 code points
Unused: 5 reserved code points
Unicode version history
3.1: 27 (+27)
Note: [1] [2]

Gothic is a Unicode block containing characters for writing the East Germanic Gothic language.

In Unicode, a block is defined as one contiguous range of code points. Blocks are uniquely named and do not overlap. Each block starts at a code point of the form hhh0 and ends at a code point of the form hhhF, where h is a hexadecimal digit. A block may include unassigned code points and noncharacters. Code points that do not belong to any named block, e.g. in the unassigned planes 4–13, have the value block="No_block".
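The alignment rule above can be illustrated with a minimal Python sketch. The function names `is_block_aligned` and `in_gothic_block` are hypothetical helpers chosen for this example; only the range U+10330..U+1034F comes from the article.

```python
# Range of the Gothic block, as given in the article.
GOTHIC_START = 0x10330
GOTHIC_END = 0x1034F


def is_block_aligned(start: int, end: int) -> bool:
    """Check the hhh0..hhhF shape: start ends in hex 0, end ends in hex F."""
    return start <= end and start % 16 == 0 and end % 16 == 15


def in_gothic_block(ch: str) -> bool:
    """Return True if the single character ch lies in the Gothic block.

    Membership depends only on the code point's position in the range,
    so unassigned code points inside the range also count as members.
    """
    return GOTHIC_START <= ord(ch) <= GOTHIC_END


block_size = GOTHIC_END - GOTHIC_START + 1
print(is_block_aligned(GOTHIC_START, GOTHIC_END), block_size)
print(in_gothic_block("\U00010330"), in_gothic_block("A"))
```

Because membership is a pure range test, a reserved code point such as U+1034F still belongs to the block even though no character is assigned to it.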

Gothic [1] [2]
Official Unicode Consortium code chart (PDF)
         0 1 2 3 4 5 6 7 8 9 A B C D E F
U+1033x  𐌰 𐌱 𐌲 𐌳 𐌴 𐌵 𐌶 𐌷 𐌸 𐌹 𐌺 𐌻 𐌼 𐌽 𐌾 𐌿
U+1034x  𐍀 𐍁 𐍂 𐍃 𐍄 𐍅 𐍆 𐍇 𐍈 𐍉 𐍊
Notes
1. ^ As of Unicode version 12.0
2. ^ Grey areas (here, the empty cells U+1034B..U+1034F) indicate non-assigned code points
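As a rough cross-check of the chart, the assigned code points can be listed with Python's standard `unicodedata` module. This is a sketch that assumes the interpreter's bundled Unicode database covers the Gothic block (added in version 3.1); `gothic_assigned` is a hypothetical helper name.

```python
import unicodedata

# Range of the Gothic block, as given in the article.
GOTHIC_START = 0x10330
GOTHIC_END = 0x1034F


def gothic_assigned():
    """Return (code point, name) pairs for named characters in the block."""
    pairs = []
    for cp in range(GOTHIC_START, GOTHIC_END + 1):
        # unicodedata.name returns the default (None here) when the
        # code point has no character name, i.e. it is unassigned.
        name = unicodedata.name(chr(cp), None)
        if name is not None:
            pairs.append((cp, name))
    return pairs


assigned = gothic_assigned()
block_size = GOTHIC_END - GOTHIC_START + 1
print("assigned:", len(assigned), "unassigned:", block_size - len(assigned))
for cp, name in assigned[:3]:
    print(f"U+{cp:04X} {name}")
```

With a Unicode database that matches the chart, the counts should agree with the 27 assigned and 5 reserved code points stated above.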

History

The following Unicode-related documents record the purpose and process of defining specific characters in the Gothic block:

Version | Final code points [a] | Count | L2 ID | WG2 ID | Document
3.1 | U+10330..1034A | 27 | L2/97-102 | | Jenkins, John H. (1997-05-21), Proposal to add Gothic to ISO/IEC 10646
 | | | L2/97-196 | N1581 | Jenkins, John; Everson, Michael (1997-05-27), Gothic proposal
 | | | L2/97-288 | N1603 | Umamaheswaran, V. S. (1997-10-24), "8.24.1", Unconfirmed Meeting Minutes, WG 2 Meeting # 33, Heraklion, Crete, Greece, 20 June - 4 July 1997
 | | | L2/97-255R | | Aliprand, Joan (1997-12-03), "Gothic", Approved Minutes - UTC #73 & L2 #170 joint meeting, Palo Alto, CA - August 4-5, 1997
 | | | L2/98-286 | N1703 | Umamaheswaran, V. S.; Ksar, Mike (1998-07-02), "8.20.2", Unconfirmed Meeting Minutes, WG 2 Meeting #34, Redmond, WA, USA; 1998-03-16--20
 | | | L2/00-353 | | Everson, Michael (2000-10-12), Gothic 10330--1034F (draft block description)
 | | | L2/00-358 | | Everson, Michael (2000-10-13), Gothic 10330--1034F (draft block description)
 | | | L2/05-068 | | Scherer, Markus (2005-02-08), Missing numeric value for Gothic Letter Nine Hundred
  a. Proposed code points and character names may differ from final code points and names.


References

  1. "Unicode character database". The Unicode Standard. Retrieved 2016-07-09.
  2. "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2016-07-09.