This article has multiple issues. Please help improve it or discuss these issues on the talk page . (Learn how and when to remove these messages)
|
Symbols for Legacy Computing Supplement | |
---|---|
Range | U+1CC00..U+1CEBF (704 code points) |
Plane | SMP |
Scripts | Common |
Assigned | 686 code points |
Unused | 18 reserved code points |
Unicode version history | |
16.0 (2024) | 686 (+686) |
Unicode documentation | |
Code chart ∣ Web page | |
Note: [1] [2] |
Symbols for Legacy Computing Supplement is a Unicode block containing additional graphic characters that were used for various home computers from the 1970s and 1980s. It includes characters from Amstrad CPC, Apple 8-bit, Kaypro CP/M, Ohio Scientific, Robotron KC, Sharp MZ computers, HP terminals, and TRS-80. It includes a set of semigraphics in the form of 230 "octant" characters, large images split into four "characters", and the "large type" characters used for building large text characters. [3]
Symbols for Legacy Computing Supplement [1] [2] Official Unicode Consortium code chart (PDF) | ||||||||||||||||
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
U+1CC0x | | | | | | | | | | | | | | | | |
U+1CC1x | | | | | | | | | | | | | | | | |
U+1CC2x | | | | | | | | | | | | | | | | |
U+1CC3x | | | | | | | | | | | | | | | | |
U+1CC4x | | | | | | | | | | | | | | | | |
U+1CC5x | | | | | | | | | | | | | | | | |
U+1CC6x | | | | | | | | | | | | | | | | |
U+1CC7x | | | | | | | | | | | | | | | | |
U+1CC8x | | | | | | | | | | | | | | | | |
U+1CC9x | | | | | | | | | | | | | | | | |
U+1CCAx | | | | | | | | | | | | | | | | |
U+1CCBx | | | | | | | | | | | | | | | | |
U+1CCCx | | | | | | | | | | | | | | | | |
U+1CCDx | | | | | | | | | | | | | | | | |
U+1CCEx | | | | | | | | | | | | | | | | |
U+1CCFx | | | | | | | | | | | ||||||
U+1CD0x | | | | | | | | | | | | | | | | |
U+1CD1x | | | | | | | | | | | | | | | | |
U+1CD2x | | | | | | | | | | | | | | | | |
U+1CD3x | | | | | | | | | | | | | | | | |
U+1CD4x | | | | | | | | | | | | | | | | |
U+1CD5x | | | | | | | | | | | | | | | | |
U+1CD6x | | | | | | | | | | | | | | | | |
U+1CD7x | | | | | | | | | | | | | | | | |
U+1CD8x | | | | | | | | | | | | | | | | |
U+1CD9x | | | | | | | | | | | | | | | | |
U+1CDAx | | | | | | | | | | | | | | | | |
U+1CDBx | | | | | | | | | | | | | | | | |
U+1CDCx | | | | | | | | | | | | | | | | |
U+1CDDx | | | | | | | | | | | | | | | | |
U+1CDEx | | | | | | | | | | | | | | | | |
U+1CDFx | | | | | | | | | | | | | | | | |
U+1CE0x | | | | | | | | | | | | | | | | |
U+1CE1x | | | | | | | | | | | | | | | | |
U+1CE2x | | | | | | | | | | | | | | | | |
U+1CE3x | | | | | | | | | | | | | | | | |
U+1CE4x | | | | | | | | | | | | | | | | |
U+1CE5x | | | | | | | | | | | | | | | | |
U+1CE6x | | | | | | | | | | | | | | | | |
U+1CE7x | | | | | | | | | | | | | | | | |
U+1CE8x | | | | | | | | | | | | | | | | |
U+1CE9x | | | | | | | | | | | | | | | | |
U+1CEAx | | | | | | | | | | | | | | | | |
U+1CEBx | | | | | ||||||||||||
Notes |
The following Unicode-related documents record the purpose and process of defining specific characters in the Symbols for Legacy Computing Supplement block:
Version | Final code points [lower-alpha 1] | Count | L2 ID | Document |
---|---|---|---|---|
16.0 | U+1CC00..1CCF9, 1CD00..1CEAF | 682 | L2/21-235 (full, no_attach, sources, mappings_zip) | Bettencourt, Rebecca; Ewell, Doug; Bánffy, Ricardo; Everson, Michael; Hietaniemi, Jarkko; Silva, Eduardo Marín; Mårtenson, Elias; Shoulson, Mark; Steele, Shawn; Turner, Rebecca (2021-12-20), Proposal to add further characters from legacy computers and teletext to the UCS |
L2/22-023 | Anderson, Deborah; Whistler, Ken; Pournader, Roozbeh; Constable, Peter (2022-01-22), "17 Legacy Computing Symbols", Recommendations to UTC #170 January 2022 on Script Proposals | |||
L2/21-235R (full, no_attach) | Bettencourt, Rebecca; Ewell, Doug; Bánffy, Ricardo; Everson, Michael; Hietaniemi, Jarkko; Silva, Eduardo Marín; Mårtenson, Elias; Shoulson, Mark; Steele, Shawn; Turner, Rebecca (2022-01-26), Proposal to add further characters from legacy computers and teletext to the UCS | |||
L2/22-016 | Constable, Peter (2022-04-21), "D.1 17 Legacy Computing Symbols", UTC #170 Minutes | |||
L2/23-012 | Anderson, Deborah; et al. (2023-01-17), "10 Legacy Computing Symbols", Recommendations to UTC #174 January 2023 on Script Proposals | |||
L2/23-005 | Constable, Peter (2023-02-01), "Consensus 174-C24", UTC #174 Minutes, The UTC accepts the name change for U+1CE35 from LARGE TYPE PIECE RAISED UPPER RIGHT ARC to LARGE TYPE PIECE RAISED UPPER LEFT ARC | |||
L2/23-164 | Anderson, Deborah; Kučera, Jan; Whistler, Ken; Pournader, Roozbeh; Constable, Peter (2023-07-21), "13 Symbols for Legacy Computers Supplement", Recommendations to UTC #176 July 2023 on Script Proposals | |||
L2/23-157 | Constable, Peter (2023-07-31), "Consensus 176-C37", UTC #176 Minutes, Change the name of U+1CE07 TOP RIGHT BLACK LEFT-POINTING SMALL TRIANGLE to TOP LEFT BLACK LEFT-POINTING SMALL TRIANGLE | |||
U+1CEB0..1CEB3 | 4 | L2/21-234 (full, no_attach, sources, mappings_zip) | Bettencourt, Rebecca; Ewell, Doug; Bánffy, Ricardo; Everson, Michael; Hietaniemi, Jarkko; Silva, Eduardo Marín; Mårtenson, Elias; Shoulson, Mark; Steele, Shawn; Turner, Rebecca (2021-12-20), Proposal to add characters from Smalltalk to the UCS | |
L2/22-023 | Anderson, Deborah; Whistler, Ken; Pournader, Roozbeh; Constable, Peter (2022-01-22), "20 Smalltalk", Recommendations to UTC #170 January 2022 on Script Proposals | |||
L2/22-016 | Constable, Peter (2022-04-21), "Consensus 170-C17", UTC #170 Minutes, UTC accepts ... 5 Smalltalk symbols | |||
|
The glyphs for Symbols for Legacy Computing Supplement block were added to Cascadia Code [4] , GNU Unifont [5] and Iosevka [6] fonts.
Unicode, formally The Unicode Standard, is a text encoding standard maintained by the Unicode Consortium designed to support the use of text in all of the world's writing systems that can be digitized. Version 16.0 of the standard defines 154998 characters and 168 scripts used in various ordinary, literary, academic, and technical contexts.
In computer and machine-based telecommunications terminology, a character is a unit of information that roughly corresponds to a grapheme, grapheme-like unit, or symbol, such as in an alphabet or syllabary in the written form of a natural language.
In typography, a dingbat is an ornament, specifically, a glyph used in typesetting, often employed to create box frames, or as a dinkus. Some of the dingbat symbols have been used as signature marks or used in bookbinding to order sections.
In digital typography, combining characters are characters that are intended to modify other characters. The most common combining characters in the Latin script are the combining diacritical marks.
The ATASCII character set, from ATARI Standard Code for Information Interchange, alternatively ATARI ASCII, is a character encoding used in the Atari 8-bit home computers. ATASCII is based on ASCII, but is not fully compatible with it.
Box-drawing characters, also known as line-drawing characters, are a form of semigraphics widely used in text user interfaces to draw various geometric frames and boxes. These characters are characterized by being designed to be connected horizontally and/or vertically with adjacent characters, which requires proper alignment. Box-drawing characters therefore typically only work well with monospaced fonts.
There are Unicode typefaces which are open-source and designed to contain glyphs of all Unicode characters, or at least a broad selection of Unicode scripts. There are also numerous projects aimed at providing only a certain script, such as the Arabeyes Arabic font. The advantage of targeting only some scripts with a font was that certain Unicode characters should be rendered differently depending on which language they are used in, and that a font that only includes the characters a certain user needs will be much smaller in file size compared to one with many glyphs. Unicode fonts in modern formats such as OpenType can in theory cover multiple languages by including multiple glyphs per character, though very few actually cover more than one language's forms of the unified Han characters.
A fallback font is a reserve typeface containing symbols for as many Unicode characters as possible. When a display system encounters a character that is not part of the repertoire of any of the other available fonts, a symbol from a fallback font is used instead. Typically, a fallback font will contain symbols representative of the various types of Unicode characters.
A Unicode font is a computer font that maps glyphs to code points defined in the Unicode Standard. The vast majority of modern computer fonts use Unicode mappings, even those fonts which only include glyphs for a single writing system, or even only support the basic Latin alphabet. Fonts which support a wide range of Unicode scripts and Unicode symbols are sometimes referred to as "pan-Unicode fonts", although as the maximum number of glyphs that can be defined in a TrueType font is restricted to 65,535, it is not possible for a single font to provide individual glyphs for all defined Unicode characters. This article lists some widely used Unicode fonts that support a comparatively large number and broad range of Unicode characters.
Geometric Shapes is a Unicode block of 96 symbols at code point range U+25A0–25FF.
In computing, a Unicode symbol is a Unicode character which is not part of a script used to write a natural language, but is nonetheless available for use as part of a text.
GNU FreeFont is a family of free OpenType, TrueType and WOFF vector fonts, implementing as much of the Universal Character Set (UCS) as possible, aside from the very large CJK Asian character set. The project was initiated in 2002 by Primož Peterlin and is now maintained by Steve White.
Block Elements is a Unicode block containing square block symbols of various fill and shading. Used along with block elements are box-drawing characters, shade characters, and terminal graphic characters. These can be used for filling regions of the screen and portraying drop shadows. Its block name in Unicode 1.0 was Blocks.
MouseText is a set of 32 graphical characters designed by Bruce Tognazzini and first implemented in the Apple IIc. They were then retrofitted to the Apple IIe forming part of the Enhanced IIe upgrade. A slightly revised version was then released with the Apple IIGS.
The Unicode Consortium and the ISO/IEC JTC 1/SC 2/WG 2 jointly collaborate on the list of the characters in the Universal Coded Character Set. The Universal Coded Character Set, most commonly called the Universal Character Set, is an international standard to map characters, discrete symbols used in natural language, mathematics, music, and other domains, to unique machine-readable data values. By creating this mapping, the UCS enables computer software vendors to interoperate, and transmit—interchange—UCS-encoded text strings from one to another. Because it is a universal map, it can be used to represent multiple languages at the same time. This avoids the confusion of using multiple legacy character encodings, which can result in the same sequence of codes having multiple interpretations depending on the character encoding in use, resulting in mojibake if the wrong one is chosen.
GNU Unifont is a free Unicode bitmap font created by Roman Czyborra. The main Unifont covers all of the Basic Multilingual Plane (BMP). The "upper" companion covers significant parts of the Supplementary Multilingual Plane (SMP). The "Unifont JP" companion contains Japanese kanji present in the JIS X 0213 character set.
Extended ASCII is a repertoire of character encodings that include the original 96 ASCII character set, plus up to 128 additional characters. There is no formal definition of "extended ASCII", and even use of the term is sometimes criticized, because it can be mistakenly interpreted to mean that the American National Standards Institute (ANSI) had updated its ANSI X3.4-1986 standard to include more characters, or that the term identifies a single unambiguous encoding, neither of which is the case.
The Unicode block Braille Patterns (U+2800..U+28FF) contains all 256 possible patterns of an 8-dot braille cell, thereby including the complete 6-dot cell range. In Unicode, a braille cell does not have a letter or meaning defined. For example, Unicode does not define U+2817⠗BRAILLE PATTERN DOTS-1235 to be "R".
DEC Special Graphics is a 7-bit character set developed by Digital Equipment Corporation. This was used very often to draw boxes on the VT100 video terminal and the many emulators, and used by bulletin board software. The designation escape sequence ESC ( 0
switched the codes for lower-case ASCII letters to draw this set, and the sequence ESC ( B
switched back. IBM calls it Code page 1090.
Symbols for Legacy Computing is a Unicode block containing graphic characters that were used for various home computers from the 1970s and 1980s and in Teletext broadcasting standards. It includes characters from the Amstrad CPC, MSX, Mattel Aquarius, RISC OS, MouseText, Atari ST, TRS-80 Color Computer, Oric, Texas Instruments TI-99/4A, TRS-80, Minitel, Teletext, ATASCII, PETSCII, ZX80, and ZX81 character sets. Semigraphics characters are also included in the form of new block-shaped characters, line-drawing characters, and 60 "sextant" characters.