Mathematical Operators (Unicode block)

Last updated
Mathematical Operators
RangeU+2200..U+22FF
(256 code points)
Plane BMP
Scripts Common
Symbol setsMathematical symbols
Logic and Set operators
Relation symbols
Assigned256 code points
Unused0 reserved code points
Unicode version history
1.0.0 (1991)242 (+242)
3.2 (2002)256 (+14)
Unicode documentation
Code chart ∣ Web page
Note: [1] [2]

Mathematical Operators is a Unicode block containing characters for mathematical, logical, and set notation.

Contents

Notably absent are the plus sign (+), greater than sign (>) and less than sign (<), due to them already appearing in the Basic Latin Unicode block, and the plus-or-minus sign (±), multiplication sign (×) and obelus (÷), due to them already appearing in the Latin-1 Supplement block, although a distinct minus sign () is included, semantically different from the Basic Latin hyphen-minus (-).

Block

Mathematical Operators [1]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+220x
U+221x
U+222x
U+223x
U+224x
U+225x
U+226x
U+227x
U+228x
U+229x
U+22Ax
U+22Bx
U+22Cx
U+22Dx
U+22Ex
U+22Fx
Notes
1. ^ As of Unicode version 15.1

Variation sequences

The Mathematical Operators block has sixteen variation sequences defined for standardized variants. [3] [4] They use U+FE00 VARIATION SELECTOR-1 (VS01) to denote variant symbols (depending on the font):

Variation sequences
Base characterBase+VS01Description
U+2205 EMPTY SETzero with long diagonal stroke overlay form
U+2229 INTERSECTIONwith serifs
U+222A UNIONwith serifs
U+2268 LESS-THAN BUT NOT EQUAL TOwith vertical stroke
U+2269 GREATER-THAN BUT NOT EQUAL TOwith vertical stroke
U+2272 LESS-THAN OR EQUIVALENT TOfollowing the slant of the lower leg
U+2273 GREATER-THAN OR EQUIVALENT TOfollowing the slant of the lower leg
U+228A SUBSET OF WITH NOT EQUAL TOwith stroke through bottom members
U+228B SUPERSET OF WITH NOT EQUAL TOwith stroke through bottom members
U+2293 SQUARE CAPwith serifs
U+2294 SQUARE CUPwith serifs
U+2295 CIRCLED PLUSwith white rim
U+2297 CIRCLED TIMESwith white rim
U+229C CIRCLED EQUALSwith equal sign touching the circle
U+22DA LESS-THAN EQUAL TO OR GREATER-THANwith slanted equal
U+22DB GREATER-THAN EQUAL TO OR LESS-THANwith slanted equal

History

The following Unicode-related documents record the purpose and process of defining specific characters in the Mathematical Operators block:

Version Final code points [lower-alpha 1] Count UTC  ID L2  ID WG2  IDDocument
1.0.0U+2200..22F1242(to be determined)
UTC/1999-013 Karlsson, Kent (1999-05-27), Tildes and micro sign decompositions
L2/99-176R Moore, Lisa (1999-11-04), "Not Tilde", Minutes from the joint UTC/L2 meeting in Seattle, June 8-10, 1999
L2/00-115R2 Moore, Lisa (2000-08-08), "Motion 83-M21", Minutes Of UTC Meeting #83
L2/01-342 Suignard, Michel (2001-09-10), "T.9 B.1 List of combining characters/Variation selectors", Comments accompanying the US positive vote on the FPDAM 1 to ISO/IEC 10646-1:2001
L2/07-268 N3253 (pdf, doc)Umamaheswaran, V. S. (2007-07-26), "M50.7 (Math symbol glyph correction) [U+22C4]", Unconfirmed minutes of WG 2 meeting 50, Frankfurt-am-Main, Germany; 2007-04-24/27
L2/15-268 Beeton, Barbara; Freytag, Asmus; Iancu, Laurențiu; Sargent, Murray (2015-10-30), Proposal to Represent the Slashed Zero Variant of Empty Set
L2/15-254 Moore, Lisa (2015-11-16), "B.12.1.2 Proposal to Represent the Slashed Zero Variant of Empty Set", UTC #145 Minutes
3.2U+22F2..22FF14 L2/00-119 [lower-alpha 2] N2191R Whistler, Ken; Freytag, Asmus (2000-04-19), Encoding Additional Mathematical Symbols in Unicode
L2/00-234 N2203 (rtf, txt)Umamaheswaran, V. S. (2000-07-21), "8.18", Minutes from the SC2/WG2 meeting in Beijing, 2000-03-21 -- 24
L2/00-115R2 Moore, Lisa (2000-08-08), "Motion 83-M11", Minutes Of UTC Meeting #83
  1. Proposed code points and characters names may differ from final code points and names
  2. Refer to the history section of the Miscellaneous Mathematical Symbols-B block for additional math-related documents

See also

Related Research Articles

Letterlike Symbols is a Unicode block containing 80 characters which are constructed mainly from the glyphs of one or more letters. In addition to this block, Unicode includes full styled mathematical alphabets, although Unicode does not explicitly categorize these characters as being "letterlike."

Miscellaneous Technical is a Unicode block ranging from U+2300 to U+23FF, which contains various common symbols which are related to and used in the various technical, programming language, and academic professions. For example:

<span class="mw-page-title-main">Mathematical operators and symbols in Unicode</span>

The Unicode Standard encodes almost all standard characters used in mathematics. Unicode Technical Report #25 provides comprehensive information about the character repertoire, their properties, and guidelines for implementation. Mathematical operators and symbols are in multiple Unicode blocks. Some of these blocks are dedicated to, or primarily contain, mathematical characters while others are a mix of mathematical and non-mathematical characters. This article covers all Unicode characters with a derived property of "Math".

Supplemental Arrows-B is a Unicode block containing miscellaneous arrows, arrow tails, crossing arrows used in knot descriptions, curved arrows, and harpoons.

Supplemental Mathematical Operators is a Unicode block containing various mathematical symbols, including N-ary operators, summations and integrals, intersections and unions, logical and relational operators, and subset/superset relations.

Miscellaneous Symbols and Arrows is a Unicode block containing arrows and geometric shapes with various fills, astrological symbols, technical symbols, intonation marks, and others.

The Basic Latin Unicode block, sometimes informally called C0 Controls and Basic Latin, is the first block of the Unicode standard, and the only block which is encoded in one byte in UTF-8. The block contains all the letters and control codes of the ASCII encoding. It ranges from U+0000 to U+007F, contains 128 characters and includes the C0 controls, ASCII punctuation and symbols, ASCII digits, both the uppercase and lowercase of the English alphabet and a control character.

The Latin-1 Supplement is the second Unicode block in the Unicode standard. It encodes the upper range of ISO 8859-1: 80 (U+0080) - FF (U+00FF). C1 Controls (0080–009F) are not graphic. This block ranges from U+0080 to U+00FF, contains 128 characters and includes the C1 controls, Latin-1 punctuation and symbols, 30 pairs of majuscule and minuscule accented Latin characters and 2 mathematical operators.

Enclosed Alphanumerics is a Unicode block of typographical symbols of an alphanumeric within a circle, a bracket or other not-closed enclosure, or ending in a full stop.

CJK Symbols and Punctuation is a Unicode block containing symbols and punctuation used for writing the Chinese, Japanese and Korean languages. It also contains one Chinese character.

Enclosed Alphanumeric Supplement is a Unicode block consisting of Latin alphabet characters and Arabic numerals enclosed in circles, ovals or boxes, used for a variety of purposes. It is encoded in the range U+1F100–U+1F1FF in the Supplementary Multilingual Plane.

A variant form is a different glyph for a character, encoded in Unicode through the mechanism of variation sequences: sequences in Unicode that consist of a base character followed by a variation selector character.

Variation Selectors Supplement is a Unicode block containing additional Variation Selectors beyond those found in the Variation Selectors block.

Dingbats is a Unicode block containing dingbats. Most of its characters were taken from Zapf Dingbats; it was the Unicode block to have imported characters from a specific typeface; Unicode later adopted a policy that excluded symbols with "no demonstrated need or strong desire to exchange in plain text", and thus no further dingbat typefaces were encoded until Webdings and Wingdings were encoded in Version 7.0. Some ornaments are also an emoji, having optional presentation variants.

Arrows is a Unicode block containing line, curve, and semicircle symbols terminating in barbs or arrows.

Emoticons is a Unicode block containing emoticons or emoji. Most of them are intended as representations of faces, although some of them include hand gestures or non-human characters.

Halfwidth and Fullwidth Forms is the name of a Unicode block U+FF00–FFEF, provided so that older encodings containing both halfwidth and fullwidth characters can have lossless translation to/from Unicode. It is the second-to-last block of the Basic Multilingual Plane, followed only by the short Specials block at U+FFF0–FFFF. Its block name in Unicode 1.0 was Halfwidth and Fullwidth Variants.

Variation Selectors is a Unicode block containing 16 variation selectors used to specify a glyph variant for a preceding character. They are currently used to specify standardized variation sequences for mathematical symbols, emoji symbols, 'Phags-pa letters, and CJK unified ideographs corresponding to CJK compatibility ideographs. At present only standardized variation sequences with VS1, VS2, VS3, VS15 and VS16 have been defined; VS15 and VS16 are reserved to request that a character should be displayed as text or as an emoji respectively.

Egyptian Hieroglyph Format Controls is a Unicode block containing formatting characters that enable full formatting of quadrats for Egyptian hieroglyphs.

References

  1. "Unicode character database". The Unicode Standard. Retrieved 2023-07-26.
  2. "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2023-07-26.
  3. "Unicode Character Database: Standardized Variation Sequences". The Unicode Consortium.
  4. Whistler, Ken; Freytag, Asmus (2000-04-19), "Symbol variants defined using a Variation Selector", L2/00-119: Encoding Additional Mathematical Symbols in Unicode (PDF)