Currency Symbols (Unicode block)

Last updated
Currency Symbols
RangeU+20A0..U+20CF
(48 code points)
Plane BMP
Scripts Common
Symbol setsCurrency signs
Assigned33 code points
Unused15 reserved code points
Unicode version history
1.0.0 (1991)11 (+11)
2.0 (1996)12 (+1)
2.1 (1998)13 (+1)
3.0 (1999)16 (+3)
3.2 (2002)18 (+2)
4.1 (2005)22 (+4)
5.2 (2009)25 (+3)
6.0 (2010)26 (+1)
6.2 (2012)27 (+1)
7.0 (2014)30 (+3)
8.0 (2015)31 (+1)
10.0 (2017)32 (+1)
14.0 (2021)33 (+1)
Chart
Code chart
Note: [1] [2]

Currency Symbols is a Unicode block containing characters for representing unique monetary signs. Many currency signs can be found in other Unicode blocks, especially when the currency symbol is unique to a country that uses a script not generally used outside that country.

Contents

The display of Unicode currency symbols among various typefaces is inconsistent, more so than other characters in the repertoire. The French franc sign (U+20A3) is typically displayed as a struck-through F, but various versions of Garamond display it as an Fr ligature. The peseta sign (U+20A7), inherited from code page 437, is usually displayed as a Pts ligature, but Roboto displays it as a Pt ligature and Arial Unicode MS displays it as a partially struck-through P. The rupee sign (U+20A8) is usually displayed as an Rs digraph, but Microsoft Sans Serif uses the quantity-neutral "Rp" digraph instead.

Block

Currency Symbols [1] [2]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+20Ax
U+20Bx
U+20Cx
Notes
1. ^ As of Unicode version 14.0
2. ^ Grey areas indicate non-assigned code points

History

The following Unicode-related documents record the purpose and process of defining specific characters in the Currency Symbols block:

Related Research Articles

Unicode Character encoding standard

Unicode, formally the Unicode Standard, is an information technology standard for the consistent encoding, representation, and handling of text expressed in most of the world's writing systems. The standard, which is maintained by the Unicode Consortium, defines 144,697 characters covering 159 modern and historic scripts, as well as symbols, emoji, and non-visual control and formatting codes.

The symbol # is known variously in English-speaking regions as the number sign, hash, or pound sign. The symbol has historically been used for a wide range of purposes including the designation of an ordinal number and as a ligatured abbreviation for pounds avoirdupois – having been derived from the now-rare .

Ø is a letter used in the Danish, Norwegian, Faroese, and Southern Sámi languages. It is mostly used as a representation of mid front rounded vowels, such as and, except for Southern Sámi where it is used as an diphthong.

ß Letter of the Latin alphabet; used in German

In German orthography, the letter ß, called Eszett or scharfes S, represents the phoneme in Standard German when following long vowels and diphthongs. The name Eszett combines the names of the letters of ⟨s⟩ and ⟨z⟩ in German. The character's Unicode names in English are sharp s and eszett. The letter is only used in German, and can be replaced with ⟨ss⟩ if the character is unavailable or capitalized, though a capitalized version has existed officially since 2017. In the 20th century, it was completely replaced by ⟨ss⟩ in Swiss Standard German, while it remains part of the orthography of Standard German elsewhere.

Yen and yuan sign Latin symbol for CN and JP currencies

The yen and yuan sign, ¥, is a currency sign used for the Japanese yen and the Chinese yuan currencies when writing in Latin scripts. This monetary symbol resembles a Latin letter Y with a single or double horizontal stroke. The symbol is usually placed before the value it represents, for example: ¥50, or JP¥50 and CN¥50 when disambiguation is needed. When writing in Japanese and Chinese, the Japanese kanji and Chinese character is written following the amount, for example 50円 in Japan, and 50元 or 50圆 in China.

Ligature (writing) Glyph combining two or more letterforms in a single typeset or handwritten character

In writing and typography, a ligature occurs where two or more graphemes or letters are joined to form a single glyph. Examples are the characters æ and œ used in English and French, in which the letters 'a' and 'e' are joined for the first ligature and the letters 'o' and 'e' are joined for the second ligature. For stylistic and legibility reasons, 'f' and 'i' are often merged to create 'fi' ; the same is true of 's' and 't' to create 'st'. The common ampersand (&) developed from a ligature in which the handwritten Latin letters 'e' and 't' were combined.

IJ (digraph) Latin-script digraph

IJ is a digraph of the letters i and j. Occurring in the Dutch language, it is sometimes considered a ligature, or a letter in itself. In most fonts that have a separate character for ij, the two composing parts are not connected but are separate glyphs, which are sometimes slightly kerned.

The numero sign or numero symbol, ,, is a typographic abbreviation of the word number(s) indicating ordinal numeration, especially in names and titles. For example, using the numero sign, the written long-form of the address "Number 22 Acacia Avenue" is shortened to "№ 22 Acacia Ave", yet both forms are spoken long.

ArmSCII Set of obsolete single-byte character encodings

ArmSCII or ARMSCII is a set of obsolete single-byte character encodings for the Armenian alphabet defined by Armenian national standard 166–9. ArmSCII is an acronym for Armenian Standard Code for Information Interchange, similar to ASCII for the American standard. It has been superseded by the Unicode standard.

In Unicode, a Private Use Area (PUA) is a range of code points that, by definition, will not be assigned characters by the Unicode Consortium. Three private use areas are defined: one in the Basic Multilingual Plane, and one each in, and nearly covering, planes 15 and 16. The code points in these areas cannot be considered as standardized characters in Unicode itself. They are intentionally left undefined so that third parties may define their own characters without conflicting with Unicode Consortium assignments. Under the Unicode Stability Policy, the Private Use Areas will remain allocated for that purpose in all future Unicode versions.

Miscellaneous Technical is a Unicode block ranging from U+2300 to U+23FF, which contains various common symbols which are related to and used in the various technical, programming language, and academic professions. For example:

Over a thousand characters from the Latin script are encoded in the Unicode Standard, grouped in several basic and extended Latin blocks. The extended ranges contain mainly precomposed letters plus diacritics that are equivalently encoded with combining diacritics, as well as some ligatures and distinct letters, used for example in the orthographies of various African languages and the Vietnamese alphabet. Latin Extended-C contains additions for Uighur and the Claudian letters. Latin Extended-D comprises characters that are mostly of interest to medievalists. Latin Extended-E mostly comprises characters used for German dialectology (Teuthonista). Latin Extended-F and -G contain characters for phonetic transcription.

In Unicode and the UCS, a compatibility character is a character that is encoded solely to maintain round-trip convertibility with other, often older, standards. As the Unicode Glossary says:

A character that would not have been encoded except for compatibility and round-trip convertibility with other standards

Many scripts in Unicode, including Arabic and Devanāgarī, have special orthographic rules that require certain combinations of letterforms to be combined into special ligature forms.

The Latin-1 Supplement is the second Unicode block in the Unicode standard. It encodes the upper range of ISO 8859-1: 80 (U+0080) - FF (U+00FF). Controls C1 (0080–009F) are not graphic. This block ranges from U+0080 to U+00FF, contains 128 characters and includes the C1 controls, Latin-1 punctuation and symbols, 30 pairs of majuscule and minuscule accented Latin characters and 2 mathematical operators.

Latin Extended-B is the fourth block (0180-024F) of the Unicode Standard. It has been included since version 1.0, where it was only allocated to the code points 0180-01FF and contained 113 characters. During unification with ISO 10646 for version 1.1, the block range was extended by 80 code points and another 35 characters were assigned. In version 3.0 and later, the last 60 available code points in the block were assigned. Its block name in Unicode 1.0 was Extended Latin.

Unicode contains a number of characters that represent various cultural, political, and religious symbols.

The Unicode Standard assigns various properties to each Unicode character and code point.

The rupee sign “” is a currency sign used to represent the monetary unit of account in Pakistan, Sri Lanka, Nepal, Mauritius, Seychelles, and formerly in India. It resembles, and is often written as, the Latin character sequence "Rs", of which it is an orthographic ligature.

CJK Compatibility is a Unicode block containing square symbols encoded for compatibility with East Asian character sets. In Unicode 1.0, it was divided into two blocks, named CJK Squared Words (U+3300–U+337F) and CJK Squared Abbreviations (U+3380–U+33FF).

References

  1. "Unicode character database". The Unicode Standard. Retrieved 2016-07-09.
  2. "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2016-07-09.