Thin space

Last updated
Spacing examples. The top row is unspaced, the middle row has a thin space between the words, and the bottom has a regular space. Example of thin space.svg
Spacing examples. The top row is unspaced, the middle row has a thin space between the words, and the bottom has a regular space.

In typography, a thin space is a space character whose width is usually 15 or 16 of an em. It is used to add a narrow space, such as between nested quotation marks or to separate glyphs that interfere with one another. It is not as narrow as the hair space. It is also used in the International System of Units and in many countries as a thousands separator when writing numbers in groups of three digits, in order to facilitate reading. [1] It also avoids the ambiguity of the comma, used as a thousands separator in many countries but as a decimal point in Europe.

In Unicode, thin space is encoded at U+2009THIN SPACE ( ,  ). Some text editors, such as IntelliJ IDEA and Android Studio, will display the character as its suggested abbreviation of "THSP". [2] Unicode's U+202FNARROW NO-BREAK SPACE is a non-breaking space with a width similar to that of the thin space. b:Unicode/Character reference/2000-2FFF#ref

In LaTeX and Plain TeX, \thinspace produces a narrow, non-breaking space. [3] [4] Inside and outside of math formulae in LaTeX, \, also produces a narrow, non-breaking space.

In all versions of LibreOffice and in some of Microsoft Word, the special characters and symbols dialog (often available via Insert > Symbol or Insert > Special Characters), has both the thin space and the narrow no-break space available for point-and-click insertion. In LibreOffice's Symbol dialog, there is an easy-to-find box field to narrow the searching; in Word's Symbol dialog, under font = "(normal text)", the characters are found in subset = "General Punctuation", Unicode character 2009 and nearby. Other word processing programs and in many Linux configurations, have ways of producing a thin space using keyboard shortcuts.

See also

Related Research Articles

In typography, the pilcrow () is a glyph used to identify a paragraph. In editorial production the pilcrow typographic character may also be known as the paragraph mark, the paragraph sign, the paragraph symbol, the paraph, and the blind P.

<span class="mw-page-title-main">Decimal separator</span> Numerical symbol

A decimal separator is a symbol that separates the integer part from the fractional part of a number written in decimal form. Different countries officially designate different symbols for use as the separator. The choice of symbol also affects the choice of symbol for the thousands separator used in digit grouping.

An interpunct·, also known as an interpoint, middle dot, middot, centered dot or centred dot, is a punctuation mark consisting of a vertically centered dot used for interword separation in Classical Latin. It appears in a variety of uses in some modern languages.

In writing, a space is a blank area that separates words, sentences, syllables and other written or printed glyphs (characters). Conventions for spacing vary among languages, and in some languages the spacing rules are complex. Inter-word spaces ease the reader's task of identifying words, and avoid outright ambiguities such as "now here" vs. "nowhere". They also provide convenient guides for where a human or program may start new lines.

The tilde is a grapheme ˜ or ~ with a number of uses. The name of the character came into English from Spanish tilde, which in turn came from the Latin titulus, meaning 'title' or 'superscription'. Its primary use is as a diacritic (accent) in combination with a base letter. Its freestanding form is used in modern texts mainly to indicate approximation.

<span class="mw-page-title-main">Zero-width non-joiner</span> Non-printing character that separates two normally joined characters

The zero-width non-joiner is a non-printing character used in the computerization of writing systems that make use of ligatures. When placed between two characters that would otherwise be connected into a ligature, a ZWNJ causes them to be printed in their final and initial forms, respectively. This is also an effect of a space character, but a ZWNJ is used when it is desirable to keep the characters closer together or to connect a word with its morpheme.

<span class="mw-page-title-main">Soft hyphen</span> Unicode character

In computing and typesetting, a soft hyphen or syllable hyphen, is a code point reserved in some coded character sets for the purpose of breaking words across lines by inserting visible hyphens if they fall on the line end but remain invisible within the line.

In word processing and digital typesetting, a non-breaking space, also called NBSP, required space, hard space, or fixed space, is a space character that prevents an automatic line break at its position. In some formats, including HTML, it also prevents consecutive whitespace characters from collapsing into a single space. Non-breaking space characters with other widths also exist.

The degree symbol or degree sign, °, is a glyph or symbol that is used, among other things, to represent degrees of arc, hours, degrees of temperature or alcohol proof. The symbol consists of a small superscript circle.

Line breaking, also known as word wrapping, is breaking a section of text into lines so that it will fit into the available width of a page, window or other display area. In text display, line wrap is continuing on a new line when a line is full, so that each line fits into the viewable window, allowing text to be read from top to bottom without any horizontal scrolling. Word wrap is the additional feature of most text editors, word processors, and web browsers, of breaking lines between words rather than within words, where possible. Word wrap makes it unnecessary to hard-code newline delimiters within paragraphs, and allows the display of text to adapt flexibly and dynamically to displays of varying sizes.

A whitespace character is a character data element that represents white space when text is rendered for display by a computer.

An overline, overscore, or overbar, is a typographical feature of a horizontal line drawn immediately above the text. In old mathematical notation, an overline was called a vinculum, a notation for grouping symbols which is expressed in modern notation by parentheses, though it persists for symbols under a radical sign. The original use in Ancient Greek was to indicate compositions of Greek letters as Greek numerals. In Latin, it indicates Roman numerals multiplied by a thousand and it forms medieval abbreviations (sigla). Marking one or more words with a continuous line above the characters is sometimes called overstriking, though overstriking generally refers to printing one character on top of an already-printed character.

<span class="mw-page-title-main">Universal Character Set characters</span> Complete list of the characters available on most computers

The Unicode Consortium and the ISO/IEC JTC 1/SC 2/WG 2 jointly collaborate on the list of the characters in the Universal Coded Character Set. The Universal Coded Character Set, most commonly called the Universal Character Set, is an international standard to map characters, discrete symbols used in natural language, mathematics, music, and other domains, to unique machine-readable data values. By creating this mapping, the UCS enables computer software vendors to interoperate, and transmit—interchange—UCS-encoded text strings from one to another. Because it is a universal map, it can be used to represent multiple languages at the same time. This avoids the confusion of using multiple legacy character encodings, which can result in the same sequence of codes having multiple interpretations depending on the character encoding in use, resulting in mojibake if the wrong one is chosen.

In Unicode and the UCS, a compatibility character is a character that is encoded solely to maintain round-trip convertibility with other, often older, standards. As the Unicode Glossary says:

A character that would not have been encoded except for compatibility and round-trip convertibility with other standards

Specials is a short Unicode block of characters allocated at the very end of the Basic Multilingual Plane, at U+FFF0–FFFF. Of these 16 code points, five have been assigned since Unicode 3.0:

The zero-width space (ZWSP) is a non-printing character used in computerized typesetting to indicate where the word boundaries are, without actually displaying a visible space in the rendered text. This enables text-processing systems for scripts that do not use explicit spacing to recognize where word boundaries are for the purpose of handling line breaks appropriately. Zero-width space is unicode character U+200B, and is located in the unicode General Punctuation block, and can be represented by numeric character references &#x200B; or &#8203;.

The full stop, period, or full point. is a punctuation mark used for several purposes, most often to mark the end of a declarative sentence.

The Unicode Standard assigns various properties to each Unicode character and code point.

Sentence spacing in digital media concerns the horizontal width of the space between sentences in computer- and web-based media. Digital media allow sentence spacing variations not possible with the typewriter. Most digital fonts permit the use of a variable space or a no-break space. Some modern font specifications, such as OpenType, have the ability to automatically add or reduce space after punctuation, and users may be able to choose sentence spacing variations.

General Punctuation is a Unicode block containing punctuation, spacing, and formatting characters for use with all scripts and writing systems. Included are the defined-width spaces, joining formats, directional formats, smart quotes, archaic and novel punctuation such as the interrobang, and invisible mathematical operators.

References

  1. "8th edition of the SI Brochure" (PDF). Bureau International des Poids et Mesures (BIPM). Retrieved 9 October 2019.
  2. Schneider, Marcel (January 13, 2020). "Proposal to extend support for abbreviations - For consideration by Unicode Technical Committee (20007-abbreviations.pdf)" (PDF).
  3. Knuth, Donald E. (1986) [Incorporates the final corrections made in 1996]. The TeXbook (PDF). Illustrations by Duane Bibby. Addison Wesley. pp. 5, 352. hdl:2027/mdp.49015000850066. ISBN   978-0-201-13447-6. LCCN   85-30845. OCLC   682395096. OL   7406778M. Archived (PDF) from the original on September 24, 2004.
  4. Braams, Johannes; et al. (October 1, 2015). The LaTeX 2ε Sources (PDF) (1.2 ed.). p. 79.