Letterlike Symbols

Range: U+2100..U+214F (80 code points)
Plane: BMP
Scripts: Greek (1 char.), Latin (4 char.), Common (75 char.)
Symbol sets: Mathematics, abbreviations
Assigned: 80 code points
Unused: 0 reserved code points
Unicode version history:
1.0.0 (1991): 57 (+57)
3.0 (1999): 59 (+2)
3.2 (2002): 74 (+15)
4.0 (2003): 75 (+1)
4.1 (2005): 77 (+2)
5.0 (2006): 79 (+2)
5.1 (2008): 80 (+1)
Unicode documentation: Code chart ∣ Web page
Note: [1] [2]

Letterlike Symbols is a Unicode block containing 80 characters which are constructed mainly from the glyphs of one or more letters. In addition to this block, Unicode includes full styled mathematical alphabets, although Unicode does not explicitly categorize these characters as being "letterlike."
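As a rough check of the block's extent, the following sketch walks the range U+2100..U+214F and collects the character names known to Python's standard `unicodedata` module. The constant and function names are illustrative only, and the count it reports depends on the Unicode version bundled with the interpreter (all 80 code points have been assigned since Unicode 5.1).

```python
import unicodedata

# Inclusive bounds of the Letterlike Symbols block (illustrative constant names).
BLOCK_START, BLOCK_END = 0x2100, 0x214F


def letterlike_names():
    """Map each assigned code point in the block to its Unicode character name."""
    names = {}
    for cp in range(BLOCK_START, BLOCK_END + 1):
        # unicodedata.name returns the default (None) for unassigned code points.
        name = unicodedata.name(chr(cp), None)
        if name is not None:
            names[cp] = name
    return names


names = letterlike_names()
print(len(names), "assigned code points")
print(f"U+2116 {chr(0x2116)} {names[0x2116]}")
```

The formal names returned here are the uppercase Unicode character names, which may differ in wording from the descriptive names used in the table below.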

Symbols

Unicode Letterlike Symbols [3]
Code point | Char | Name
U+2100 | ℀ | Account of
U+2101 | ℁ | Addressed to the subject (i.e., care of)
U+2102 | ℂ | Double-struck capital C
U+2103 | ℃ | Degree Celsius
U+2104 | ℄ | Center line symbol
U+2105 | ℅ | Care of
U+2106 | ℆ | Cada una [4]
U+2107 | ℇ | Euler constant [5]
U+2108 | ℈ | Scruple
U+2109 | ℉ | Degree Fahrenheit
U+210A | ℊ | Script small G
U+210B | ℋ | Script capital H
U+210C | ℌ | Black-letter capital H
U+210D | ℍ | Double-struck capital H
U+210E | ℎ | Planck constant
U+210F | ℏ | Reduced Planck constant (Planck constant over 2π)
U+2110 | ℐ | Script capital I
U+2111 | ℑ | Black-letter capital I
U+2112 | ℒ | Script capital L
U+2113 | ℓ | Script small L (LaTeX: \ell)
U+2114 | ℔ | L B bar symbol
U+2115 | ℕ | Double-struck capital N
U+2116 | № | Numero sign
U+2117 | ℗ | Sound recording copyright symbol
U+2118 | ℘ | Script capital P (alias: Weierstrass elliptic function)
U+2119 | ℙ | Double-struck capital P
U+211A | ℚ | Double-struck capital Q
U+211B | ℛ | Script capital R
U+211C | ℜ | Black-letter capital R
U+211D | ℝ | Double-struck capital R
U+211E | ℞ | Prescription take
U+211F | ℟ | Response
U+2120 | ℠ | Service mark
U+2121 | ℡ | Telephone sign
U+2122 | ™ | Trademark sign
U+2123 | ℣ | Versicle
U+2124 | ℤ | Double-struck capital Z
U+2125 | ℥ | Ounce sign
U+2126 | Ω | Ohm sign
U+2127 | ℧ | Inverted ohm sign
U+2128 | ℨ | Black-letter capital Z
U+2129 | ℩ | Turned Greek small letter iota
U+212A | K | Kelvin sign
U+212B | Å | Ångström sign
U+212C | ℬ | Script capital B
U+212D | ℭ | Black-letter capital C
U+212E | ℮ | Estimated symbol
U+212F | ℯ | Script small E
U+2130 | ℰ | Script capital E
U+2131 | ℱ | Script capital F
U+2132 | Ⅎ | Turned capital F
U+2133 | ℳ | Script capital M
U+2134 | ℴ | Script small O
U+2135 | ℵ | Alef symbol
U+2136 | ℶ | Bet symbol
U+2137 | ℷ | Gimel symbol
U+2138 | ℸ | Dalet symbol
U+2139 | ℹ | Information source
U+213A | ℺ | Rotated capital Q
U+213B | ℻ | Fax sign
U+213C | ℼ | Double-struck small pi
U+213D | ℽ | Double-struck small gamma
U+213E | ℾ | Double-struck capital gamma
U+213F | ℿ | Double-struck capital pi
U+2140 | ⅀ | Double-struck n-ary summation
U+2141 | ⅁ | Turned sans-serif capital G
U+2142 | ⅂ | Turned sans-serif capital L
U+2143 | ⅃ | Reversed sans-serif capital L
U+2144 | ⅄ | Turned sans-serif capital Y
U+2145 | ⅅ | Double-struck italic capital D
U+2146 | ⅆ | Double-struck italic small D
U+2147 | ⅇ | Double-struck italic small E
U+2148 | ⅈ | Double-struck italic small I
U+2149 | ⅉ | Double-struck italic small J
U+214A | ⅊ | Property line
U+214B | ⅋ | Turned ampersand
U+214C | ⅌ | Per sign
U+214D | ⅍ | Aktieselskab
U+214E | ⅎ | Turned small F
U+214F | ⅏ | Symbol for Samaritan source

Glyph variants

Variation selectors may be used to specify chancery (U+FE00) vs roundhand (U+FE01) forms, if the font supports them:

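Code point | Plain | With FE00 (chancery) | With FE01 (roundhand)
U+212C | ℬ | U+212C U+FE00 | U+212C U+FE01
U+2130 | ℰ | U+2130 U+FE00 | U+2130 U+FE01
U+2131 | ℱ | U+2131 U+FE00 | U+2131 U+FE01
U+210B | ℋ | U+210B U+FE00 | U+210B U+FE01
U+2110 | ℐ | U+2110 U+FE00 | U+2110 U+FE01
U+2112 | ℒ | U+2112 U+FE00 | U+2112 U+FE01
U+2133 | ℳ | U+2133 U+FE00 | U+2133 U+FE01
U+211B | ℛ | U+211B U+FE00 | U+211B U+FE01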

The remaining script capitals are encoded in the Mathematical Alphanumeric Symbols block.
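A variation sequence is simply the base character followed by the selector. The sketch below builds the chancery and roundhand sequences for the eight script capitals in this block; the names `VS1`, `VS2`, `SCRIPT_CAPITALS`, and `script_variants` are chosen here for illustration. Whether the two forms actually look different depends entirely on the font and renderer.

```python
VS1 = "\uFE00"  # VARIATION SELECTOR-1, requests the chancery form
VS2 = "\uFE01"  # VARIATION SELECTOR-2, requests the roundhand form

# Script capitals B, E, F, H, I, L, M, R in the Letterlike Symbols block.
SCRIPT_CAPITALS = [0x212C, 0x2130, 0x2131, 0x210B, 0x2110, 0x2112, 0x2133, 0x211B]


def script_variants(code_point):
    """Return (plain, chancery, roundhand) strings for one script capital."""
    base = chr(code_point)
    return base, base + VS1, base + VS2


for cp in SCRIPT_CAPITALS:
    plain, chancery, roundhand = script_variants(cp)
    print(f"U+{cp:04X}", plain, chancery, roundhand)
```

A font without these variants should ignore the selector and show the plain glyph in all three columns.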

Block

Letterlike Symbols [1]
Official Unicode Consortium code chart (PDF)
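       0 1 2 3 4 5 6 7 8 9 A B C D E F
U+210x ℀ ℁ ℂ ℃ ℄ ℅ ℆ ℇ ℈ ℉ ℊ ℋ ℌ ℍ ℎ ℏ
U+211x ℐ ℑ ℒ ℓ ℔ ℕ № ℗ ℘ ℙ ℚ ℛ ℜ ℝ ℞ ℟
U+212x ℠ ℡ ™ ℣ ℤ ℥ Ω ℧ ℨ ℩ K Å ℬ ℭ ℮ ℯ
U+213x ℰ ℱ Ⅎ ℳ ℴ ℵ ℶ ℷ ℸ ℹ ℺ ℻ ℼ ℽ ℾ ℿ
U+214x ⅀ ⅁ ⅂ ⅃ ⅄ ⅅ ⅆ ⅇ ⅈ ⅉ ⅊ ⅋ ⅌ ⅍ ⅎ ⅏
Notes
1. As of Unicode version 15.1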

Emoji

The Letterlike Symbols block contains two emoji: U+2122 and U+2139. [6] [7]

The block has four standardized variants defined to specify emoji-style (U+FE0F VS16) or text presentation (U+FE0E VS15) for the two emoji, both of which default to a text presentation. [8]

Emoji variation sequences
Code point | U+2122 | U+2139
base code point | ™ | ℹ
base+VS15 (text) | U+2122 U+FE0E | U+2139 U+FE0E
base+VS16 (emoji) | U+2122 U+FE0F | U+2139 U+FE0F
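The four sequences can be produced by appending VS15 or VS16 to each base character. The following is a minimal sketch; `presentation_sequences` and `describe` are hypothetical helper names, not part of any library, and the rendered result depends on platform emoji support.

```python
VS15 = "\uFE0E"  # VARIATION SELECTOR-15, requests text presentation
VS16 = "\uFE0F"  # VARIATION SELECTOR-16, requests emoji presentation

# The two emoji in the Letterlike Symbols block.
EMOJI_BASES = (0x2122, 0x2139)


def presentation_sequences(code_point):
    """Return the text-style and emoji-style sequences for one base code point."""
    base = chr(code_point)
    return {"text": base + VS15, "emoji": base + VS16}


def describe(sequence):
    """Spell out a string as space-separated U+XXXX code points."""
    return " ".join(f"U+{ord(ch):04X}" for ch in sequence)


for cp in EMOJI_BASES:
    for style, seq in presentation_sequences(cp).items():
        print(style, describe(seq), seq)
```

Spelling out the code points alongside the rendered string helps when debugging, because the selectors themselves are invisible.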

History

The following Unicode-related documents record the purpose and process of defining specific characters in the Letterlike Symbols block:

References

  1. "Unicode character database". The Unicode Standard. Retrieved 2023-07-26.
  2. "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2023-07-26.
  3. Unicode chart (PDF)
  4. Spanish for "each one."
  5. It is unknown which constant this is supposed to be. Xerox standard XCCS 353/046 just says "Euler's."
  6. "UTR #51: Unicode Emoji". Unicode Consortium. 2023-09-05.
  7. "UCD: Emoji Data for UTR #51". Unicode Consortium. 2023-02-01.
  8. "UTS #51 Emoji Variation Sequences". The Unicode Consortium.