Subscript and superscript

Example of subscript and superscript. In each example the first "2" is professionally designed, and is included as part of the glyph set; the second "2" is a manual approximation using a small version of the standard "2". The visual weight of the first "2" matches the other characters better. (The top typeface is Adobe Garamond Pro; the size of the subscript is about 62% of the original characters, dropped below the baseline by about 16%. The second typeface is Myriad Pro; the superscript is about 60% of the original characters, raised by about 44% above the baseline.)

A subscript or superscript is a character (such as a number or letter) that is set slightly below or above the normal line of type, respectively. It is usually smaller than the rest of the text. Subscripts appear at or below the baseline, while superscripts are above. Subscripts and superscripts are perhaps most often used in formulas, mathematical expressions, and specifications of chemical compounds and isotopes, but have many other uses as well.

In professional typography, subscript and superscript characters are not simply ordinary characters reduced in size; to keep them visually consistent with the rest of the font, typeface designers make them slightly heavier (i.e. medium or bold typography) than a reduced-size character would be. The vertical distance that sub- or superscripted text is moved from the original baseline varies by typeface and by use.

In typesetting, such types are traditionally called "superior" and "inferior" letters, figures, etc., or just "superiors" and "inferiors". In English, most nontechnical use of superiors is archaic. [1] Superior and inferior figures on the baseline are used for fractions and most other purposes, while lowered inferior figures are needed for chemical and mathematical subscripts. [2]

Uses

The four common locations of subscripts and superscripts. The typeface is Myriad Pro.

A single typeface may contain sub- and superscript glyphs at different positions for different uses. The four most common positions are listed here. Because each position is used in different contexts, not all alphanumerics may be available in all positions. For example, subscript letters on the baseline are quite rare, and many typefaces provide only a limited number of superscripted letters. Despite these differences, all reduced-size glyphs go by the same generic terms subscript and superscript, which are synonymous with the terms inferior letter (or number) and superior letter (or number), respectively. Most fonts that contain superscript and subscript glyphs give them a predetermined size and position that depend on the design of the font.

Subscripts that are dropped below the baseline

Perhaps the most familiar example of subscripts is in chemical formulas. For example, the chemical formula for glucose is C₆H₁₂O₆ (meaning that it is a molecule with 6 carbon atoms, 12 hydrogen atoms and 6 oxygen atoms). One of the most famous molecules, water, is known almost universally by its chemical formula: H₂O (meaning that it contains 2 hydrogen atoms and 1 oxygen atom).

A subscript is also used to distinguish between different versions of a subatomic particle. Thus electron, muon, and tau neutrinos are denoted ν_e, ν_μ and ν_τ. A particle may be distinguished by multiple subscripts, such as Ω_bbb for the triple bottom omega particle.

Similarly, subscripts are also used frequently in mathematics to define different versions of the same variable: for example, in an equation x_0 and x_f might indicate the initial and final value of x, while v_rocket and v_observer would stand for the velocities of a rocket and an observer. Commonly, variables with a zero in the subscript are referred to as the variable name followed by "nought" (e.g. v_0 would be read, "v-nought"). [3]

Subscripts are often used to refer to members of a mathematical sequence or set or elements of a vector. For example, in the sequence O = (45, 2, 800), O_3 refers to the third member of sequence O, which is 800.

Also in mathematics and computing, a subscript can be used to represent the radix, or base, of a written number, especially where multiple bases are used alongside each other. For example, comparing values in hexadecimal, denary, and octal one might write C_hex = 12_dec = 14_oct.
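The radix example can be checked mechanically. The following is a minimal Python sketch, not part of the original text; the helper name with_radix and its TeX-style output format are illustrative assumptions.

```python
# The three numerals from the example, each paired with its radix.
numerals = [("C", 16), ("12", 10), ("14", 8)]

# int(text, base) parses a numeral written in the given base.
values = [int(text, base) for text, base in numerals]


def with_radix(text: str, base: int) -> str:
    """Label a numeral with its base as a TeX-style subscript (hypothetical format)."""
    return f"{text}_{{{base}}}"


labels = [with_radix(text, base) for text, base in numerals]

# All three numerals denote the same value, twelve.
print(" = ".join(labels), "->", values)
```

Writing the base as a number (16, 10, 8) rather than as an abbreviation (hex, dec, oct) is only a choice made for this sketch; both conventions appear in practice.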

Subscripted numbers dropped below the baseline are also used for the denominators of stacked fractions, like this: 67/68.

Subscripts that are aligned with the baseline

The only common use of these subscripts is for the denominators of diagonal fractions, like ½, or the signs for percent %, permille ‰, and basis point ‱. Certain standard abbreviations are also composed as diagonal fractions, such as ℅ (care of), ℀ (account of), ℁ (addressed to the subject), or in Spanish ℆ (cada uno/una, "each one").

Superscripts that typically do not extend above the ascender line

These superscripts typically share a baseline with numerator digits, the top of which are aligned with the top of the full-height numerals of the base font; lowercase ascenders may extend above.

Ordinal indicators are sometimes written as superscripts (1ˢᵗ, 2ⁿᵈ, 3ʳᵈ, 4ᵗʰ, rather than 1st, 2nd, 3rd, 4th), although many English-language style guides recommend against this use. [4] Romance languages use a similar convention, such as 1ᵉʳ or 2ᵉ in French, or 4ª and 4º in Galician and Italian, or 4.ª and 4.º in Portuguese and Spanish.

In medieval manuscripts, many superscript as well as subscript signs were used to abbreviate text. From these developed modern diacritical marks (glyphs, or "accents" placed above or below the letter). Also, in early Middle High German, umlauts and other modifications to pronunciation would be indicated by superscript letters placed directly above the letter they modified. Thus the modern umlaut ü was written as uͤ. Both vowels and consonants were used in this way, as in ſheͨzze and boͮsen. [5] In modern typefaces, these letters are usually smaller than other superscripts, and their baseline is slightly above the base font's midline, making them extend no higher than a typical ordinal indicator.

Superscripts are used for the standard abbreviations for service mark (℠) and trademark (™). The signs for copyright © and registered trademark ® are also sometimes superscripted, depending on the typeface or house style.

On handwritten documents and signs, a monetary amount may be written with the cents value superscripted, as in $8⁰⁰ or 8€⁵⁰. Often the superscripted numbers are underlined: $8⁰⁰, 8€⁵⁰. The currency symbol itself may also be superscripted, as in $80 or 6¢. There is no rule as to whether these characters must be superscripted, made smaller than the numerals, or aligned to any particular guide line; that is left to the preference of the typographer.

Superscripts that typically extend above the ascender line

Both low and high superscripts can be used to indicate the presence of a footnote in a document, like this⁵ or thisˣⁱ. Any combination of characters can be used for this purpose; in technical writing footnotes are sometimes composed of letters and numbers together, like this.^{A.2} The choice of low or high alignment depends on taste, but high-set footnotes tend to be more common, as they stand out more from the text.

In mathematics, high superscripts are used for exponentiation to indicate that one number or variable is raised to the power of another number or variable. Thus y⁴ is y raised to the fourth power, 2ˣ is 2 raised to the power of x, and the famous equation E = mc² includes a term for the speed of light squared. This led over time to an "abuse of notation" whereby superscripts indicate iterative function composition, including derivatives. In an unrelated use, superscripts also indicate contravariant tensors in Ricci calculus.
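The two readings of a superscript on a function, as a power of its value and as repeated composition, give different results. The following Python sketch illustrates the difference; the names iterate and double are hypothetical and chosen only for this example.

```python
from typing import Callable


def iterate(f: Callable[[int], int], n: int) -> Callable[[int], int]:
    """Return the n-fold composition of f with itself (n = 0 gives the identity)."""
    def composed(x: int) -> int:
        for _ in range(n):
            x = f(x)
        return x
    return composed


def double(x: int) -> int:
    return 2 * x


# Exponentiation reading: the value double(3) is squared.
power = double(3) ** 2

# Composition reading: double is applied twice, double(double(3)).
repeated = iterate(double, 2)(3)
```

Here the exponentiation reading yields 36 while the composition reading yields 12, which is why the intended meaning of a superscript on a function has to be clear from context.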

The charges of ions and subatomic particles are also denoted by superscripts. Cl⁻ is a negatively charged chlorine atom, Pb⁴⁺ is an atom of lead with a positive charge of four, e⁻ is an electron, e⁺ is a positron, and μ⁺ is an antimuon.

Atomic isotopes are written using superscripts. In symbolic form, the number of nucleons is denoted as a superscripted prefix to the chemical symbol (for example ³He, ¹²C, ¹³C, ¹³¹I, and ²³⁸U). The letters m or f may follow the number to indicate metastable or fission isomers, as in ⁵⁸ᵐCo or ²⁴⁰ᶠPu.

Subscripts and superscripts can also be used together to give more specific information about nuclides. For example, ²³⁵₉₂U denotes an atom of uranium with 235 nucleons, 92 of which are protons. A chemical symbol can be completely surrounded: ¹⁴₆C²⁺₂ is a divalent cation of carbon with 14 nucleons, of which six are protons and eight are neutrons, and there are two atoms in this chemical compound.
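As a sketch of how such a fully surrounded symbol might be typeset in TeX math mode, together with the neutron count implied by its two prescripts: the empty group {} that carries the prescripts is a common idiom rather than something prescribed here, and the symbols A (nucleon number), Z (proton number) and N (neutron number) are labels assumed for this example.

```latex
\[
  {}^{14}_{6}\mathrm{C}^{2+}_{2},
  \qquad
  N = A - Z = 14 - 6 = 8
\]
```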

The numerators of stacked fractions (such as 34/35) usually use high-set superscripts, although some specially designed glyphs keep the top of the numerator aligned with the top of the full-height numerals.

Alignment examples

This image shows the four common locations for subscripts and superscripts, according to their typical uses. The typeface is Minion Pro, set in Adobe Illustrator. Note that the default superscripting algorithms of most word processors would set the "th" and "lle" too high, and the weight of all the subscript and superscript glyphs would be too light.

HTML subscripts and superscripts
Default subscript and superscript rendered in HTML for fonts in normal styles, and an example of possible collision of italic styles in HTML.

Another minor adjustment that renderers often omit is the horizontal positioning of superscripts and subscripts that do not lie on the baseline. Ideally the shift should follow the slant of the font, for example in italics; most renderers adjust the position only vertically and do not also shift it horizontally. This may create a collision with surrounding letters in the same italic font size; the figure shows an example of such a collision as rendered in HTML. To avoid this, it is often desirable to insert a small positive horizontal margin (or a thin space) to the left of the first superscript character, or a negative margin (or a tiny backspace) before a subscript. The problem is more critical with glyphs from fonts in oblique styles that are more slanted than those from fonts in italic styles, and some fonts reverse the direction of slanting, so there is no general solution unless the renderer takes into account the font metric that specifies the angle of slanting.

However, the same problem occurs more generally between spans of normal glyphs (neither superscript nor subscript) when slanting styles are mixed.
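The horizontal correction can be estimated with simple trigonometry: a glyph moved vertically by some amount in a font that leans forward by a given angle needs to move sideways by that amount times the tangent of the angle. The following Python sketch works under that assumption; the function name, the 12-degree slant and the 0.33 em offsets are hypothetical values, not measurements from any particular font or renderer.

```python
import math


def slant_shift(vertical_offset: float, slant_degrees: float) -> float:
    """Horizontal shift that keeps a raised or lowered glyph on the slanted stem line.

    vertical_offset: rise (positive) or drop (negative), in any unit such as em.
    slant_degrees:   forward lean of the font measured from the vertical, in degrees.
    """
    return vertical_offset * math.tan(math.radians(slant_degrees))


# Hypothetical italic with a 12-degree forward slant.
sup_dx = slant_shift(0.33, 12.0)    # positive: shift the superscript to the right
sub_dx = slant_shift(-0.33, 12.0)   # negative: shift the subscript to the left
```

A negative slant angle, as for a font that leans backward, reverses both signs, which matches the observation that no fixed margin works for every font.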

Software support

Desktop publishing

Many text editing and word processing programs have automatic subscripting and superscripting features, although these programs usually simply use ordinary characters reduced in size and moved up or down rather than separately designed subscript or superscript glyphs. Professional typesetting programs such as QuarkXPress or Adobe InDesign also have similar features for automatically converting regular type to subscript or superscript. These programs, however, may also offer native OpenType support for the special subscript and superscript glyphs included in many professional typeface packages (such as those shown in the image above).

Comparison of software support

The size and position columns give the default values for glyph transformation (non-professional glyphs).

| Software | OpenType support for professional glyphs? | Size | Subscript position | Superscript position | User-modifiable settings? | Superscript shortcut | Subscript shortcut |
|---|---|---|---|---|---|---|---|
| OpenOffice.org 3.3 | No | 58% | −33% | +33% | Yes | Ctrl+⇧ Shift+P | Ctrl+⇧ Shift+B |
| LibreOffice 5.3 | Yes [a] | 58% | −33% | +33% | Yes | Ctrl+⇧ Shift+P | Ctrl+⇧ Shift+B |
| Microsoft Word 2015 | Yes | 50% | −14.1% | +40% | Manual [b] | Ctrl+⇧ Shift+= | Ctrl+= |
| Adobe Illustrator CS3 | Yes | 58.3% | −33.3% | +33.3% | Yes | | |
| Adobe Photoshop CS3 | Ordinal indicators only | 58.3% | −33.3% | +33.3% | Manual [b] | Alt+Ctrl+⇧ Shift+= | Ctrl+⇧ Shift+= |
| LaTeX | Yes (using XeLaTeX or LuaTeX only) | ≈70% [c] | −14% [d] | +25% [d] | Manual [e] | | |

Notes:
  a. Available by typing a syntax consisting of font name, feature tag and its value into the Font Name area. [6]
  b. Default subscript and superscript options can be overcome by manually changing the font size and raising/lowering text.
  c. Dependent on math mode; differs for subsub- and supersuper-scripts.
  d. Nominal values; dependent on fontdimen parameters (16 and 17).
  e. Changing fontdimen values requires some skill; the textual commands can be modified to use the \raisebox command.
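To make the default values in the comparison concrete, the following Python sketch converts a size percentage and a position percentage into point measurements. It assumes that both percentages are relative to the base font size, which the comparison does not state explicitly; the function name synthesized_script and the 12 pt base size are hypothetical.

```python
from typing import Tuple


def synthesized_script(font_size_pt: float, size_pct: float, shift_pct: float) -> Tuple[float, float]:
    """Return (glyph size, baseline shift) in points for a scaled-and-shifted script.

    Assumption: both percentages are taken relative to the base font size.
    """
    return (font_size_pt * size_pct / 100.0, font_size_pt * shift_pct / 100.0)


# 12 pt text with the 58% / -33% / +33% defaults listed for OpenOffice.org and LibreOffice.
sub_size, sub_shift = synthesized_script(12.0, 58.0, -33.0)
sup_size, sup_shift = synthesized_script(12.0, 58.0, 33.0)
```

Under that assumption a 12 pt base font yields scripts of about 6.96 pt, moved about 3.96 pt below or above the baseline.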

HTML

In HTML and Wiki syntax, subscript text is produced by putting it inside the tags <sub> and </sub>. Similarly, superscripts are produced with <sup> and </sup>. [7] The exact size and position of the resulting characters vary by font and browser, but they are usually reduced to around 75% of the original size.
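As an illustration of these tags, the following Python sketch generates the markup for simple cases. The helper names formula_to_html and power_to_html are hypothetical, and the digit-matching rule is a deliberate simplification: it would wrongly subscript a leading coefficient such as the first 2 in 2H2O.

```python
import re


def formula_to_html(formula: str) -> str:
    """Wrap every run of digits in <sub>...</sub>, as in simple chemical formulas."""
    return re.sub(r"(\d+)", r"<sub>\1</sub>", formula)


def power_to_html(base: str, exponent: str) -> str:
    """Mark up an exponent with <sup>...</sup>."""
    return f"{base}<sup>{exponent}</sup>"


glucose = formula_to_html("C6H12O6")
water = formula_to_html("H2O")
fourth_power = power_to_html("y", "4")
```

The resulting strings, such as H<sub>2</sub>O, can be placed directly into an HTML document; how they are sized and positioned is then up to the browser and font.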

TeX

In TeX's mathematics mode (as used in MediaWiki), subscripts are typeset with the underscore, while superscripts are made with the caret. Thus $X_{ab}$ produces an X followed by the subscript "ab", and $X^{ab}$ produces an X followed by the superscript "ab".

In LaTeX text mode the math method above is inappropriate, as letters will be set in math italic; instead, n\textsuperscript{th} gives nᵗʰ and A\textsubscript{base} gives an A followed by the subscript "base". Textual subscripts are rare, so in LaTeX releases before 2015 \textsubscript was not built in and required the fixltx2e package; later releases provide it in the kernel. As in other systems, when using UTF-8 encoding, the masculine º and feminine ª ordinal indicators can be used as characters, with no need to use a command.

In line with its origin as a superscript circle, the degree symbol (°) can be composed in TeX math mode as a superscripted circle operator (∘): ^{\circ}.

Superscripts and subscripts of arbitrary height can be done with the \raisebox{<dimen>}{<text>} command: the first argument is the amount to raise, and the second is the text; a negative first argument will lower the text. In this case the text is not resized automatically, so a sizing command can be included, e.g. go\raisebox{1ex}{\large home}.

Unicode

Unicode defines subscript and superscript characters in several areas; in particular, it has a full set of superscript and subscript digits. Owing to the popularity of using these characters to make fractions, most modern fonts render most or all of these as cap height superscripts and baseline subscripts. The same font may align letters and numbers in different ways. Other than numbers, the set of super- and subscript letters and other symbols is incomplete and somewhat random, and many fonts do not contain them. Because of these inconsistencies, these glyphs may not be suitable for some purposes (see Uses, above).
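Because the digit sets are complete, converting ordinary digits to their Unicode superscript or subscript forms is a plain character mapping. The following Python sketch shows one way to do it; the function names are hypothetical. The superscript table is written out with escapes because superscript 1, 2 and 3 are encoded at U+00B9, U+00B2 and U+00B3, apart from the other superscript digits.

```python
# Superscript digits 0-9: U+2070, U+00B9, U+00B2, U+00B3, then U+2074 to U+2079.
SUPERSCRIPT_DIGITS = str.maketrans(
    "0123456789",
    "\u2070\u00b9\u00b2\u00b3\u2074\u2075\u2076\u2077\u2078\u2079",
)

# Subscript digits 0-9 are contiguous: U+2080 to U+2089.
SUBSCRIPT_DIGITS = str.maketrans(
    "0123456789",
    "".join(chr(0x2080 + d) for d in range(10)),
)


def to_superscript(text: str) -> str:
    """Replace ASCII digits with Unicode superscript digits; other characters are kept."""
    return text.translate(SUPERSCRIPT_DIGITS)


def to_subscript(text: str) -> str:
    """Replace ASCII digits with Unicode subscript digits; other characters are kept."""
    return text.translate(SUBSCRIPT_DIGITS)


water = "H" + to_subscript("2") + "O"
energy = "E = mc" + to_superscript("2")
```

Letters and most symbols pass through unchanged in this sketch, reflecting the incomplete coverage outside the digits.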

OpenType

Several advanced features of OpenType typefaces support professionally designed subscript and superscript glyphs. Exactly which glyphs are included varies by typeface; some have only basic support for numerals, while others contain a full set of letters, numerals, and punctuation. The glyphs are accessed by activating the subs or sups feature tag, which is possible when the software environment supports optional OpenType features. In addition, some typefaces place these glyphs in a Unicode Private Use Area.

References

  1. Bringhurst 2005, pp. 311–312.
  2. Bringhurst 2005, p. 309.
  3. "Your Head Will Spin: 'Naught,' 'Aught,' and 'Ought'". Retrieved 2020-11-21.
  4. "UCC EPU: Editing Word files for publication: Making the best of what Word provides". Academic and Collaborative Technologies Unit. Publish.ucc.ie. 2007. 2011-07-03. Archived from the original on 2014-01-03. Retrieved 2014-01-03.
  5. Marc Wilhelm Küster; Isabel Wojtovicz (14 September 2000). "Diacritics for medieval studies" (PDF). Archived from the original (PDF) on 2007-06-12. Retrieved 2008-03-31.
  6. "Opentype features now enabled? Documentation?". Ask LibreOffice. 18 November 2016. Archived from the original on 30 December 2016. Retrieved 18 May 2017.
  7. Pyatt, Elizabeth J. (27 September 2016). "Math Unicode Entities". Information Sciences and Technology. Pennsylvania State University. Retrieved 6 January 2021.

Bibliography