Strikethrough

Last updated
An example of strikethrough Striked out text.png
An example of strikethrough

Strikethrough, or strikeout, is a typographical presentation of words with a horizontal line through their middle. Less common alternative forms of strikethrough are an X or a forward slash typed over each letter. [1]

Contents

In medieval manuscripts, where strikethrough appears as a red line over the otherwise black text, the purpose is to highlight the text, giving it emphasis. In modern texts, the purpose is the opposite; it indicates removed text. Contrary to censored or redacted texts, the words remain readable.

Appearance

Strikethrough primarily appears as a horizontal line across the middle of words. This appearance is the easiest to achieve by hand, as it only requires one continuous stroke with a pen, pencil, or other writing instruments. Less common forms of strikethrough include drawing an X (cross) or / (solidus) over each letter of the word. To strike long words or phrases, this method is cumbersome to produce by hand, but may be the preferred method for single letters, where the horizontal line does not produce the necessary visibility, and could be missed.

On personal computers, double strikethrough is an option in some word processors, spreadsheets, presentation software, and graphics software, e.g., in Microsoft Office, Collabora Online and LibreOffice. In Japan, double strikethrough is conventionally used because in complex kanji, a single strikethrough may be missed or confused with a stroke in the character. Otherwise, there is no generally agreed meaning of double strikethrough.

Uses

In modern prose, however, strikethrough is primarily used to mark text as a mistake or to be removed. Word processors capable of tracking changes use strikethrough to on deleted words.

Historically, however, strikethrough was not even the primary way to mark errors. Errors were more commonly marked by placing dots under letters to be ignored (Latin : punctum delens). [2] In medieval manuscripts such as the Domesday Book, "strikethrough" of text with red ink often functions as highlighting similar to modern underline. [3]

Computer representations

Word Processors

Wordstar supported the "strikeout" decoration since version 3.0 in 1982, although the functionality may have been present in earlier versions. [4] Wordstar was launched in 1978. It dominated the personal computer market as the most popular word processing program until 1985, when WordPerfect gained dominance. All word processors with functionality beyond basic editing include strikeout, though the prefer the term "strikethrough".

HTML and CSS

Early versions of the Hypertext Markup Language (HTML) specify the < strike > or < s > tags for rendering text with strikethrough. These tags, however, are purely decorative. In 1999, the HTML standard version 4.01 deprecated them in favor of the < del > tag, a semantic element for marking deleted text. Web browser and other user agents often render the contents of this tag with strikethrough. [5] [6] In the HTML5 draft, the <s> is redefined as a semantic tag that marks its text that as no longer correct. The <del> remains unchanged and still specifies deleted text. [7] [8]

Since HTML5 has done away with purely decorative strikethrough tags, the Cascading Style Sheets (CSS) language is in charge of decoration and formatting. A CSS user agent renders the text with strikethrough when the text-decoration CSS property has a line-through value. For example:

<spanstyle="text-decoration: line-through;">ABCD efghi</span>

...renders as:

ABCD efghi

The <s> and <del> tags already carry the aforementioned strikethrough definition, but also attach their semantic properties ("not longer correct" and "deleted") to the text.

Other markup symbols

Unicode

Combining characters

In plain text scenarios where markup cannot be used, Unicode offers a number of combining characters that achieve similar effects.

The "combining long stroke overlay" (U+0336) results in a stroke across text (may or may not be unbroken depending on the typeface used):

A̶B̶C̶D̶ ̶e̶f̶g̶h̶i̶

while the "combining short stroke overlay" (U+0335) results in individually struck out characters:

A̵B̵C̵D̵ ̵e̵f̵g̵h̵i̵

Similarly, the "combining short solidus overlay" (U+0337) results in diagonally struck out letters:

A̷B̷C̷D̷ ̷e̷f̷g̷h̷i̷

as does the "combining long solidus overlay" (U+0338), which produces longer diagonal strokes:

A̸B̸C̸D̸ ̸e̸f̸g̸h̸i̸

Specific struck-through characters

A number of characters that have the visual appearance of struck-through characters exist in Unicode, including:

These usually have specific functions (for example, in the Latin Extended-A character set) or representations and are not intended for general use. However, they are not precomposed characters and have neither canonical nor compatibility decompositions. This issue has created security considerations since "precomposed" characters like U+019F and sequences like U+004F U+0335 or U+004F U+0336 often cause visual confusion (compare ⟨Ɵ⟩, ⟨O̵⟩ and ⟨O̶⟩). Unicode has acknowledged this issue and has proposed a standardized method for counteraction. [11] [12]

For slashed letters in an orthography, unitary letters are provided by Unicode. The diacritics are used in generic applications, such as math operators which systematically use the solidus overlay to indicate negation.

Research

Since at least 2014, researchers in the area of optical character recognition have attempted to solve the problem of recognizing struck-out text in handwritten documents. [13] [14]

See also

Notes

    References

    1. "Strikethrough". eyemagazine. Retrieved 9 September 2024.
    2. Hackett, M.B. (1970). The original statutes of Cambridge University : the text and its history (Digitally printed version, re-issue. ed.). Cambridge: Cambridge Eng. University Press. p. 185. ISBN   9780521070768.
    3. See, e.g., this image from the British National Archives' Focus on the Domesday Book.
    4. "Exploring the WordStar file format". TechnicallyWeWrite.com. Retrieved 11 September 2024.
    5. 15.2.1 Font style elements: the TT, I, B, BIG, SMALL, STRIKE, S, and U elements, HTML 4.01 Specification: Alignment, font styles, and horizontal rules, W3C. 24 December 1999.
    6. 9.4 Marking document changes: The INS and DEL elements HTML 4.01 Specification: Text – Paragraphs, Lines, and Phrases, W3C. 24 December 1999.
    7. 4.5.5 The s element Archived 2015-08-01 at the Wayback Machine HTML5, W3C Last Call Working Draft. 17 June 2014.
    8. 4.6.2 The del element HTML5, W3C Last Call Working Draft 17 June 2014
    9. "GitHub Flavored Markdown".
    10. "Add formatting to your Google Chat messages - Computer - Google Chat Help". support.google.com. Retrieved 1 March 2024.
    11. The Unicode Consortium, The Unicode Standard, Chapter 2, Page 44, Non-decomposition of Overlaid Diacritics
    12. The Unicode Consortium, Unicode Technical Standard #39: Unicode Security Mechanisms, chapter Confusable Detection
    13. Adak, Chandranath; Chaudhuri, Bidyut B. (2014). "An Approach of Strike-Through Text Identification from Handwritten Documents". 2014 14th International Conference on Frontiers in Handwriting Recognition. pp. 643–648. doi:10.1109/ICFHR.2014.113. ISBN   978-1-4799-4334-0. S2CID   5364504.
    14. Chaudhuri, Bidyut B.; Adak, Chandranath (2017). "An approach for detecting and cleaning of struck-out handwritten text". Pattern Recognition. 61: 282–294. Bibcode:2017PatRe..61..282C. doi:10.1016/j.patcog.2016.07.032.