Space (punctuation)

Last updated
 
Space
U+0020 SPACE (Note: Representations here of a regular space are replaced with a no-break space)
See also
U+00A0  NO-BREAK SPACE ( ,  )
Other types of spaces

In writing, a space () is a blank area that separates words, sentences, syllables (in syllabification) and other written or printed glyphs (characters). Conventions for spacing vary among languages, and in some languages the spacing rules are complex.[ citation needed ] Inter-word spaces ease the reader's task of identifying words, and avoid outright ambiguities such as "now here" vs. "nowhere". They also provide convenient guides for where a human or program may start new lines.

Contents

Typesetting can use spaces of varying widths, just as it can use graphic characters of varying widths. Unlike graphic characters, typeset spaces are commonly stretched in order to align text. The typewriter, on the other hand, typically has only one width for all characters, including spaces. Following widespread acceptance of the typewriter, some typewriter conventions influenced typography and the design of printed works.[ citation needed ]

Computer representation of text facilitates getting around mechanical and physical limitations such as character widths in at least two ways:

Use in natural languages

Between words

Modern English uses a space to separate words, but not all languages follow this practice. Spaces were not used to separate words in Latin until roughly 600–800 AD. Ancient Hebrew and Arabic did use spaces partly to compensate in clarity for the lack of vowels. [1] The earliest Greek script also used interpuncts to divide words rather than spacing, although this practice was soon displaced by the scriptura continua .

Word spacing was later used by Irish and Anglo-Saxon scribes, beginning after the creation of the Carolingian minuscule by Alcuin of York and the scribes' adoption of it. Spacing would become standard in Renaissance Italy and France, and then Byzantium by the end of the 16th century; then entering into the Slavic languages in Cyrillic in the 17th century, and only in modern times entering modern Sanskrit. [2] [ dubious ]

CJK languages do not use spaces when dealing with text containing mostly Chinese characters and kana. In Japanese, spaces may occasionally be used to separate people's family names from given names, to denote omitted particles (especially the topic particle wa), and for certain literary or artistic effects. Modern Korean, however, has spaces as an essential part of its writing system (because of Western influence), given the phonetic nature of the hangul script that requires word dividers to avoid ambiguity, as opposed to Chinese characters which are mostly very distinguishable from each other. In Korean, spaces are used to separate chunks of nouns, nouns and particles, adjectives, and verbs; for certain compounds or phrases, spaces may be used or not, for example the phrase for "Republic of Korea" is usually spelled without spaces as 대한민국 rather than with a space as 대한 민국.

Runic texts use either an interpunct-like or a colon-like punctuation mark to separate words. There are two Unicode characters dedicated for this: U+16EBRUNIC SINGLE PUNCTUATION and U+16ECRUNIC MULTIPLE PUNCTUATION.

Between sentences

Languages with a Latin-derived alphabet have used various methods of sentence spacing since the advent of movable type in the 15th century.

There has been some controversy regarding the proper amount of sentence spacing in typeset material. The Elements of Typographic Style states that only a single word space is required for sentence spacing. [21] Psychological studies suggest "readers benefit from having two spaces after periods." [22]

Unit symbols and numbers

The International System of Units (SI) prescribes inserting a space between a number and a unit of measurement (the space being regarded as an implied multiplication sign) but never between a prefix and a base unit; a space (or a multiplication dot) should also be used between units in compound units. [23]

5.0 cm, not5.0cm or 5.0 c m or 5.0 cms
45 kg, not45kg or 45 k g or 45 kgs
32 °C, not32°C or 32° C
20 kN m or 20 kN⋅m, not20 kNm or 20 k Nm
π/2 rad, notπ/2rad or π / 2 rad
50 %, not50% or 50 percent (Note: % is not an SI unit, and many style guides do not follow this recommendation; note that 50% is used as adjective, e.g. to express concentration as in 50% acetic acid.)

The only exception to this rule is the traditional symbolic notation of angles: degree (e.g., 30°), minute of arc (e.g., 22′), and second of arc (e.g., 8″).

The SI also prescribes the use of a space [24] (often typographically a thin space) as a thousands separator where required. Both the point and the comma are reserved as decimal markers.

1 000 000 000 000 (thin space) or 1000000 not 1,000,000 or 1.000.000
1 000 000 000 000 (regular space which is significantly wider)

Sometimes a narrow non-breaking space or non-breaking space, respectively, is recommended (as in, for example, IEEE Standards [25] and IEC standards [26] ) to avoid the separation of units and values or parts of compounds units, due to automatic line wrap and word wrap.

Encoding

Unicode defines many variants of a single whitespace character, with various properties; the more commonly encountered variations include:

In URLs, spaces are percent encoded with its ASCII/UTF-8 representation %20.

Types of spaces

See also

Related Research Articles

Punctuation marks are marks indicating how a piece of written text should be read and, consequently, understood. The oldest known examples of punctuation marks were found in the Mesha Stele from the 9th century BC, consisting of points between the words and horizontal strokes between sections. The alphabet-based writing began with no spaces, no capitalization, no vowels, and with only a few punctuation marks, as it was mostly aimed at recording business transactions. Only with the Greek playwrights did the ends of sentences begin to be marked to help actors know when to make a pause during performances. Punctuation includes space between words and the other, historically or currently used, signs.

The colon, :, is a punctuation mark consisting of two equally sized dots aligned vertically. A colon often precedes an explanation, a list, or a quoted sentence. It is also used between hours and minutes in time, between certain elements in medical journal citations, between chapter and verse in Bible citations, and, in the US, for salutations in business letters and other formal letter writing.

In English writing, quotation marks or inverted commas, also known informally as quotes, talking marks, speech marks, quote marks, quotemarks or speechmarks, are punctuation marks placed on either side of a word or phrase in order to identify it as a quotation, direct speech or a literal title or name. Quotation marks may be used to indicate that the meaning of the word or phrase they surround should be taken to be different from that typically associated with it, and are often used in this way to express irony. They are also sometimes used to emphasise a word or phrase, although this is usually considered incorrect.

In punctuation, a word divider is a form of glyph which separates written words. In languages which use the Latin, Cyrillic, and Arabic alphabets, as well as other scripts of Europe and West Asia, the word divider is a blank space, or whitespace. This convention is spreading, along with other aspects of European punctuation, to Asia and Africa, where words are usually written without word separation.

An interpunct⟨·⟩, also known as an interpoint, middle dot, middot, centered dot or centred dot, is a punctuation mark consisting of a vertically centered dot used for interword separation in Classical Latin. It appears in a variety of uses in some modern languages and is present in Unicode as U+00B7·MIDDLE DOT.

<span class="mw-page-title-main">Emphasis (typography)</span> Typographical distinction

In typography, emphasis is the strengthening of words in a text with a font in a different style from the rest of the text, to highlight them. It is the equivalent of prosody stress in speech.

In word processing and digital typesetting, a non-breaking space, also called NBSP, required space, hard space, or fixed space, is a space character that prevents an automatic line break at its position. In some formats, including HTML, it also prevents consecutive whitespace characters from collapsing into a single space. Non-breaking space characters with other widths also exist.

Letter spacing, character spacing or tracking is an optically consistent typographical adjustment to the space between letters to change the visual density of a line or block of text. Letter spacing is distinct from kerning, which adjusts the spacing of particular pairs of adjacent characters such as "7." which would appear to be badly spaced if left unadjusted, and leading, the spacing between lines.

Line breaking, also known as word wrapping, is breaking a section of text into lines so that it will fit into the available width of a page, window or other display area. In text display, line wrap is continuing on a new line when a line is full, so that each line fits into the viewable window, allowing text to be read from top to bottom without any horizontal scrolling. Word wrap is the additional feature of most text editors, word processors, and web browsers, of breaking lines between words rather than within words, where possible. Word wrap makes it unnecessary to hard-code newline delimiters within paragraphs, and allows the display of text to adapt flexibly and dynamically to displays of varying sizes.

In typesetting and page layout, alignment or range is the setting of text flow or image placement relative to a page, column (measure), table cell, or tab.

A whitespace character is a character data element that represents white space when text is rendered for display by a computer.

Sentence spacing concerns how spaces are inserted between sentences in typeset text and is a matter of typographical convention. Since the introduction of movable-type printing in Europe, various sentence spacing conventions have been used in languages with a Latin alphabet. These include a normal word space, a single enlarged space, and two full spaces.

<span class="mw-page-title-main">River (typography)</span> Coincidental alignment of spaces in typesetting

In typography, rivers are gaps in typesetting which appear to run through a paragraph of text due to a coincidental alignment of spaces. Rivers can occur regardless of the spacing settings, but are most noticeable with wide inter-word spaces caused by full text justification or monospaced fonts. Rivers are less noticeable with proportional fonts, due to narrow spacing. Another cause of rivers is the close repetition of a long word or similar words at regular intervals, such as "maximization" with "minimization" or "optimization".

Writing systems that use Chinese characters also include various punctuation marks, derived from both Chinese and Western sources. Historically, judou annotations were often used to indicate the boundaries of sentences and clauses in text. The use of punctuation in written Chinese only became mandatory during the 20th century, due to Western influence. Unlike modern punctuation, judou marks were added by scholars for pedagogical purposes and were not viewed as integral to the text. Texts were therefore generally transmitted without judou. In most cases, this practice did not interfere with the interpretation of a text, although it occasionally resulted in ambiguity.

The dash is a punctuation mark consisting of a long horizontal line. It is similar in appearance to the hyphen but is longer and sometimes higher from the baseline. The most common versions are the en dash, generally longer than the hyphen but shorter than the minus sign; the em dash, longer than either the en dash or the minus sign; and the horizontal bar, whose length varies across typefaces but tends to be between those of the en and em dashes.

The full stop, period, or full point. is a punctuation mark used for several purposes, most often to mark the end of a declarative sentence.

<span class="mw-page-title-main">History of sentence spacing</span> Evolution of sentence spacing conventions from the introduction of movable type in Europe

The history of sentence spacing is the evolution of sentence spacing conventions from the introduction of movable type in Europe by Johannes Gutenberg to the present day.

Sentence spacing guidance is provided in many language and style guides. The majority of style guides that use a Latin-derived alphabet as a language base now prescribe or recommend the use of a single space after the concluding punctuation of a sentence.

Sentence spacing in digital media concerns the horizontal width of the space between sentences in computer- and web-based media. Digital media allow sentence spacing variations not possible with the typewriter. Most digital fonts permit the use of a variable space or a no-break space. Some modern font specifications, such as OpenType, have the ability to automatically add or reduce space after punctuation, and users may be able to choose sentence spacing variations.

The compound point is an obsolete typographical construction. Keith Houston reported that this form of punctuation doubling, which involved the comma dash (,—), the semicolon dash (;—), the colon dash, or "dog's bollocks" (:—), and less often the stop-dash (.—) arose in the seventeenth century, citing examples from as early as 1622. More traditionally, these paired forms of punctuation seem most often to have been called (generically) compound points and (specifically) semicolon dash, comma dash, colon dash, and point dash.

References

  1. Saenger 2000, p. 10: "the Semitic languages (Hebrew, Aramaic, Arabic, and Syriac), when written without vowels, were virtually always written with word separation in antiquity and continued to be so transcribed into modern times"
  2. Saenger, Paul. Space Between Words: The Origins of Silent Reading. Stanford University Press, 1997, 9–14.
  3. Einsohn, Amy (2006). "Punctuation, Eyeballing every mark". The Copyeditor's Handbook: A Guide for Book Publishing and Corporate Communications (2nd ed.). Berkeley, Los Angeles, London: University of California Press. p. 113. ISBN   9780520246881 . Retrieved 2010-04-25. If you are working on documents that will be printed without any intervention from a compositor (e.g., documents produced on the office laser printer), you will have to carefully scrutinize every piece of punctuation to be sure that the document contains the correct character (see table 5). You should also delete any extra wordspacing before and after punctuation marks. The conventions are: One space follows sentence-ending punctuation mark (period, question mark, or exclamation point). One space follows comma, colon, or semicolon ...
  4. Thomas A. Fine. "How many spaces at the end of a sentence? One or two?". Archived from the original on 2014-02-20. Retrieved 2013-08-08.
  5. Farhad Manjoo (2011-01-13). "Space Invaders: Why you should never, ever use two spaces after a period". Slate . Retrieved 2011-03-29.
  6. Heraclitus (1 November 2011). "Why two spaces after a period isn't wrong". Archived from the original on 26 July 2013. Retrieved 8 August 2013.
  7. Felici, James (2003). The Complete Manual of Typography: A Guide to Setting Perfect Type. Berkeley, CA: Peachpit Press. p. 80. ISBN   0321127307.;
  8. David Spencer (24 May 2011). "The Curious Misconception Surrounding Sentence Spacing". Type Desk. Matador. Archived from the original on 10 June 2011. Retrieved 27 May 2011.
  9. Bringhurst, Robert (2004). The Elements of Typographic Style (3 ed.). Washington and Vancouver: Hartley & Marks. p. 28. ISBN   0881792063. 2.1.4 Use a single word space between sentences. In the nineteenth century, which was a dark and inflationary age in typography and type design, many compositors were encouraged to stuff extra space between sentences. Generations of twentieth century typists were then taught to do the same, by hitting the spacebar twice after every period [full stop]. Your typing as well as your typesetting will benefit from unlearning this quaint Victorian habit. As a general rule, no more than a single space is required after a period, colon, or any other mark of punctuation
  10. Schriver, Karen (1997). Dynamics in Document Design: Creating Text for Readers (1 ed.). New York: Wiley. p. 502. ISBN   9780471306368. Use one space (not two) after these punctuation marks [sc. period, question mark, exclamation point, or colon], as the practice of using two spaces is just another holdover from using a typewriter.
  11. Strauss, Jane (2007). "Spacing with Punctuation". The Blue Book of Grammar and Punctuation: An Easy-to-Use Guide with Clear Rules, Real-World Examples, and Reproducible Quizzes (10 ed.). Jossey-Bass. p. 176. ISBN   9780470222683. Archived from the original on 2010-04-28. Retrieved 2010-04-25. Rule 1. With a computer, use only one space following periods, commas, semicolons, colons, exclamation points, question marks, and quotation marks. The space needed after these punctuation marks is proportioned automatically. With some typewriters and word processors, follow ending punctuation with two spaces when using a fixed-pitch font.
  12. "2.49 Leading and spacing". The GPO Style Manual (30 ed.). Washington: The U.S. Government Printing Office. 2008. p. 469. ISBN   9780160818127. Archived from the original (PDF) on 2008-08-31. Retrieved 2010-04-25. A single justified word space will be used between sentences. This applies to all types of composition.
  13. "FAQ: How many spaces should I leave after a period or other concluding mark of punctuation?". MLA Handbook for Writers of Research Papers (7 ed.). Modern Language Association. 2009. p. 292. ISBN   978-0-87352-297-7. Archived from the original on 2011-02-28. Retrieved 2010-04-25. Publications in the United States today usually have the same spacing after a punctuation mark as between words on the same line. Since word processors make available the same fonts used by typesetters for printed works, many writers, influenced by the look of typeset publications, now leave only one space after a concluding punctuation mark. In addition, most publishers' guidelines for preparing electronic manuscripts ask authors to type only the spaces that are to appear in print.
  14. "FAQ: How many spaces should I leave after a period or other concluding mark of punctuation?". The Chicago Manual of Style (7 ed.). University of Chicago Press. 2009. p. 292. ISBN   9780873522977. Archived from the original on 2011-02-28. Retrieved 2010-04-25. Publications in the United States today usually have the same spacing after a punctuation mark as between words on the same line. Since word processors make available the same fonts used by typesetters for printed works, many writers, influenced by the look of typeset publications, now leave only one space after a concluding punctuation mark. In addition, most publishers' guidelines for preparing electronic manuscripts ask authors to type only the spaces that are to appear in print.
  15. "The Chicago Manual of Style Online (Q&A: One Space or Two?)". University of Chicago Press. 2003. Archived from the original on 2009-01-05. Retrieved 2010-04-25. The view at CMOS is that there is no reason for two spaces after a period in published work. Some people, however—my colleagues included—prefer it, relegating this preference to their personal correspondence and notes. I've noticed in old American books printed in the few decades before and after the turn of the twentieth century (ca. 1870–1930 at least) that there seemed to be a trend in publishing to use extra space (sometimes quite a bit of it) after periods. And many people were taught to use that extra space in typing class (I was). But introducing two spaces after the period causes problems: (1) it is inefficient, requiring an extra keystroke for every sentence; (2) even if a program is set to automatically put an extra space after a period, such automation is never foolproof; (3) there is no proof that an extra space actually improves readability—as your comment suggests, it's probably just a matter of familiarity (Who knows? perhaps it's actually more efficient to read with less regard for sentences as individual units of thought—many centuries ago, for example in ancient Greece, there were no spaces even between words, and no punctuation); (4) two spaces are harder to control for than one in electronic documents (I find that the earmark of a document that imposes a two-space rule is a smattering of instances of both three spaces and one space after a period, and two spaces in the middle of sentences); and (5) two spaces can cause problems with line breaks in certain programs. So, in our efficient, modern world, I think there is no room for two spaces after a period. In the opinion of this particular copyeditor, this is a good thing.
  16. "Chapter 5. Manuscript Preparation and Sample Papers to be Submitted for Publication". Publication Manual of the American Psychological Association (aka APA Style) (5 ed.). Washington: American Psychological Association. 2001. pp.  439. ISBN   9781557987907 . Retrieved 2010-04-25. 5.11 Spacing and Punctuation: Space once after all punctuation as follows: after commas, colons, and semicolons; after punctuation marks at the end of sentences; after periods that separate parts of a reference citation; and after the periods of the initials in personal names (e.g., J. R. Zhang). Exception: Do not use space after internal periods in abbreviations (e.g., a.m, i.e., U.S.)
  17. Style Manual: for Authors, Editors and Printers (aka AGPS Style) (6 ed.). Stafford, Australia: Wiley Australia, The Commonwealth Government of Australia Printing Office. 2002. p. 117. ISBN   978-0-7016-3647-0. Archived from the original on 2015-03-26. Retrieved 2010-04-25. In typewritten (as distinct from typeset) material, it was customary to place two spaces after a colon, semicolon, full stop or other sentence closing punctuation. Programs for word processing and desktop publishing offer more sophisticated, variable spacing, so this practice of double spacing is now avoided because it can create distracting gaps on a page.
  18. Mergenthaler Linotype Company (1940). Linotype Keyboard Operation: Methods of Study and Procedures for Setting Various Kinds of Composition on the Linotype. Mergenthaler Linotype Company. cited in Mark Simonson (5 March 2004). "Double-spacing after Periods". Typophile. Archived from the original on 20 January 2010. Retrieved 5 April 2010.
  19. Eijkhout, Victor (2008), TeX by Topic, A TeXnician's Reference (PDF), Lulu, pp. 185–188[ permanent dead link ]
  20. Felici, James (2003). The Complete Manual of Typography: A Guide to Setting Perfect Type. Berkeley, CA: Peachpit Press. p. 80. ISBN   0-321-12730-7.; Fogarty, Mignon (2008). Grammar Girl's Quick and Dirty Tips for Better Writing (Quick and Dirty Tips). New York: Holt Paperbacks. p.  85. ISBN   978-0-8050-8831-1.; Straus, Jane (2009). The Blue Book of Grammar and Punctuation: An Easy-to-Use Guide with Clear Rules, Real-World Examples, and Reproducible Quizzes (10th ed.). San Francisco, CA: Jossey-Bass. p. 52. ISBN   978-0-470-22268-3.
  21. Bringhurst, Robert (2004). The Elements of Typographic Style (3.0 ed.). Washington and Vancouver: Hartley & Marks. pp. 28, 30. ISBN   0-88179-206-3.
  22. Hamblin, James (11 May 2018). "The Scientific Case for Two Spaces After a Period". The Atlantic . Archived from the original on 14 May 2018. Retrieved 14 May 2018.
  23. International Bureau of Weights and Measures (2019). The International System of Units (SI) (9th ed.). p. 149. Archived from the original on 2023-01-13. Retrieved 2023-01-13..
  24. The International System of Units (PDF) (9th ed.). International Bureau of Weights and Measures. 2019. p. 150. ISBN   978-92-822-2272-0. Archived from the original on 2021-10-18. Retrieved 2019-11-13.
  25. "SCC 14 Conventions for Metrication of IEEE Standards" (PDF). IEEE. 2017-10-31. Archived (PDF) from the original on 2019-07-23. Retrieved 2019-11-12.
  26. "Writing and formatting | IEC". International Electrotechnical Commission. 2022-07-18. Archived from the original on 2022-07-18. Retrieved 2022-07-18.

Further reading