Strikethrough

Last updated
An example of strikethrough. Striked out text.png
An example of strikethrough.

Strikethrough is a typographical presentation of words with a horizontal line through their center, resulting in text like this. Contrary to censored or sanitized (redacted) texts, the words remain readable. This presentation signifies one of two meanings. In ink-written, typewritten, or other non-erasable text, the words are a mistake and not meant for inclusion. When used on a computer screen, however, it indicates deleted information, as popularized by Microsoft Word's revision and track changes features. [1]

Contents

Uses

Marking errors

Strikethrough is primarily used to mark text that is mistaken or to be removed. [lower-alpha 1]

Highlighting

In medieval manuscripts such as the Domesday Book, "strikethrough" of text with red ink often functions as highlighting similar to modern underline. [3]

Computer representations

HTML

The HTML presentational inline element for strikethrough is <strike> or <s>. This element was, however, deprecated in the 1999 HTML 4.01 standard, and replaced by the <del> tag, a semantic element representing deleted text, which user agents (typically web browsers) often render as a strikethrough. [4] [5]

In the HTML5 draft, there is no presentational element for strikethrough. However, there are two related semantic elements. Firstly, <s>, that is strikethrough in HTML 3 and 4, is redefined to mark text that is no longer correct, and secondly, <del> marks text that has been deleted, as it does in HTML 4.01. [6] [7]

Other markup symbols

  • BB Code is a markup language used on many web forums. The BB Code for strikethrough is [s] or [strike]. To end strikethrough when using [s], type [/s] (ex. [s]Birb[/s]).
  • GitHub flavored Markdown uses double tilde ~~ to wrap around text for strikethrough. [8]
  • By surrounding text in Google Chat with the ~ (tilde) character, text will be striked out. [9]

CSS

In cascading style sheets (CSS) strikethrough is controlled using the text-decoration property, and specified by the line-through value of that property. For example, <span style="text-decoration: line-through;">ABCD efghi</span> renders like this: ABCD efghi

To maintain backwards compatibility, the following can be added to the CSS: strike {text-decoration:line-through;} The example above could then be written like this: <strike>ABCD efghi</strike>, which is compatible with HTML 4. In HTML 5, this: <del>ABCD efghi</del> also produces the same result, although the use of CSS is preferred[ citation needed ] and the del tag carries a semantic interpretation not present in the purely stylistic s and strike tags.

Unicode

Combining characters

In plain text scenarios where markup cannot be used, Unicode offers a number of combining characters that achieve similar effects.

The "combining long stroke overlay" (U+0336) results in a stroke across the text (may or may not be unbroken depending on the typeface used):

A̶B̶C̶D̶ ̶e̶f̶g̶h̶i̶

while the "combining short stroke overlay" (U+0335) results in individually struck out characters:

A̵B̵C̵D̵ ̵e̵f̵g̵h̵i̵

Similarly, the "combining short solidus overlay" (U+0337) results in diagonally struck out letters:

A̷B̷C̷D̷ ̷e̷f̷g̷h̷i̷

as does the "combining long solidus overlay" (U+0338), which produces longer diagonal strokes:

A̸B̸C̸D̸ ̸e̸f̸g̸h̸i̸

Specific struck-through characters

A number of characters that have the visual appearance of struck-through characters exist in Unicode, including ⟨ƀ⟩, ⟨Đ⟩, ⟨Ð⟩, ⟨Ǥ⟩, ⟨Ħ⟩, ⟨Ɨ⟩, ⟨Ɉ⟩, ⟨Ł⟩, ⟨Ɵ⟩, ⟨⟩, ⟨Ŧ⟩, ⟨Ʉ⟩, ⟨Ƶ⟩, ⟨ƻ⟩, ⟨ʡ⟩, ⟨ʢ⟩, ⟨Ғ⟩, ⟨Ҟ⟩, ⟨Ұ⟩, and ⟨Ҍ⟩. These usually have specific functions (for example, in the Latin Extended-A character set) or representations and are not intended for general use. However, they are not precomposed characters and have neither canonical nor compatibility decompositions. This issue has created security considerations since "precomposed" characters like U+019F and sequences like U+004F U+0335 or U+004F U+0336 often cause visual confusion (compare ⟨Ɵ⟩, ⟨O̵⟩ and ⟨O̶⟩). Unicode has acknowledged this issue and has proposed a standardized method for counteraction. [10] [11]

For slashed letters in an orthography, unitary letters are provided by Unicode. The diacritics are used in generic applications, such as math operators which systematically use the solidus overlay to indicate negation.

Double/multiple strikethrough

Double strikethrough is an option in certain word processing applications such as Microsoft Word. There is no generally agreed meaning of double strikethrough, but it may be used as a second level of single strikethrough.

In Japan, double strikethrough is conventionally used (rather than single strikethrough) when striking out text. This is for added clarity, as in complex kanji a single strikethrough may be missed or confused with a stroke in the character.

Double, triple or multiple strikethrough may also (especially formerly) be used as a way of emphasising words.

Research

Since at least 2014, researchers in the area of optical character recognition have attempted to solve the problem of recognizing struck-out text in handwritten documents. [12] [13]

See also

Notes

  1. Historically, strikethrough was not the only or even primary way to mark errors. Errors were more commonly marked by placing dots under letters to be ignored (Latin: punctum delens). [2]

Related Research Articles

<span class="mw-page-title-main">HTML</span> HyperText Markup Language

HyperText Markup Language (HTML) is the standard markup language for documents designed to be displayed in a web browser. It defines the content and structure of web content. It is often assisted by technologies such as Cascading Style Sheets (CSS) and scripting languages such as JavaScript.

Mathematical Markup Language (MathML) is a mathematical markup language, an application of XML for describing mathematical notations and capturing both its structure and content, and is one of a number of mathematical markup languages. Its aim is to natively integrate mathematical formulae into World Wide Web pages and other documents. It is part of HTML5 and standardised by ISO/IEC since 2015.

The slash is the oblique slanting line punctuation mark /. It is also known as a stroke, a solidus, a forward slash and several other historical or technical names. Once used to mark periods and commas, the slash is now used to represent division and fractions, exclusive 'or' and inclusive 'or', and as a date separator.

An HTML element is a type of HTML document component, one of several types of HTML nodes. The first used version of HTML was written by Tim Berners-Lee in 1993 and there have since been many versions of HTML. The current de facto standard is governed by the industry group WHATWG and is known as the HTML Living Standard.

An underscore or underline is a line drawn under a segment of text. In proofreading, underscoring is a convention that says "set this text in italic type", traditionally used on manuscript or typescript as an instruction to the printer. Its use to add emphasis in modern finished documents is generally avoided.

In digital typography, combining characters are characters that are intended to modify other characters. The most common combining characters in the Latin script are the combining diacritical marks.

Web standards are the formal, non-proprietary standards and other technical specifications that define and describe aspects of the World Wide Web. In recent years, the term has been more frequently associated with the trend of endorsing a set of standardized best practices for building web sites, and a philosophy of web design and development that includes those methods.

In web development, "tag soup" is a pejorative for HTML written for a web page that is syntactically or structurally incorrect. Web browsers have historically treated structural or syntax errors in HTML leniently, so there has been little pressure for web developers to follow published standards. Therefore there is a need for all browser implementations to provide mechanisms to cope with the appearance of "tag soup", accepting and correcting for invalid syntax and structure where possible.

<span class="mw-page-title-main">W3C Markup Validation Service</span> Validator service by the World Wide Web Consortium

The Markup Validation Service is a validator by the World Wide Web Consortium (W3C) that allows Internet users to check pre-HTML5 HTML and XHTML documents for well-formed markup against a document type definition. Markup validation is an important step towards ensuring the technical quality of web pages. However, it is not a complete measure of web standards conformance. Though W3C validation is important for browser compatibility and site usability, it has not been confirmed what effect it has on search engine optimization.

<span class="mw-page-title-main">Slashed zero</span> Glyph variant of numeral 0 (zero) with slash

The dotted or slashed zero 0̷ is a representation of the Arabic digit "0" (zero) with a slash or a dot through it. This variant zero glyph is often used to distinguish the digit "zero" ("0") from the Latin script letter "O" anywhere that the distinction needs emphasis, particularly in encoding systems, scientific and engineering applications, computer programming, and telecommunications. It thus helps to differentiate characters that would otherwise be homoglyphs. It was commonly used during the punch card era, when programs were typically written out by hand, to avoid ambiguity when the character was later typed on a card punch.

Unicode has subscripted and superscripted versions of a number of characters including a full set of Arabic numerals. These characters allow any polynomial, chemical and certain other equations to be represented in plain text without using any form of markup like HTML or TeX.

In HTML, the standard markup language for documents designed to be displayed in a web browser, <div> and <span> tags are elements used to define parts of a document, so that they are identifiable when a unique classification is necessary. Where other HTML elements such as <p> (paragraph), <em> (emphasis), and so on, accurately represent the semantics of the content, the additional use of <span> and <div> tags leads to better accessibility for readers and easier maintainability for authors. Where no existing HTML element is applicable, <span> and <div> can valuably represent parts of a document so that HTML attributes such as class, id, lang, or dir can be applied.

In HTML and XHTML, the blockquote element defines "a section [within a document] that is quoted from another source". The syntax is

<span class="mw-page-title-main">HTML5</span> Fifth and previous version of HyperText Markup Language

HTML5 is a markup language used for structuring and presenting hypertext documents on the World Wide Web. It was the fifth and final major HTML version that is now a retired World Wide Web Consortium (W3C) recommendation. The current specification is known as the HTML Living Standard. It is maintained by the Web Hypertext Application Technology Working Group (WHATWG), a consortium of the major browser vendors.

Unicode equivalence is the specification by the Unicode character encoding standard that some sequences of code points represent essentially the same character. This feature was introduced in the standard to allow compatibility with preexisting standard character sets, which often included similar or identical characters.

In Unicode and the UCS, a compatibility character is a character that is encoded solely to maintain round-trip convertibility with other, often older, standards. As the Unicode Glossary says:

A character that would not have been encoded except for compatibility and round-trip convertibility with other standards

<span class="mw-page-title-main">Web typography</span> Publishing considerations for the Web

Web typography, like typography generally, is the design of pages – their layout and typeface choices. Unlike traditional print-based typography, pages intended for display on the World Wide Web have additional technical challenges and – given its ability to change the presentation dynamically – additional opportunities. Early web page designs were very simple due to technology limitations; modern designs use Cascading Style Sheets (CSS), JavaScript and other techniques to deliver the typographer's and the client's vision.

Extensible HyperText Markup Language (XHTML) is part of the family of XML markup languages which mirrors or extends versions of the widely used HyperText Markup Language (HTML), the language in which Web pages are formulated.

The Web platform is a collection of technologies developed as open standards by the World Wide Web Consortium and other standardization bodies such as the Web Hypertext Application Technology Working Group, the Unicode Consortium, the Internet Engineering Task Force, and Ecma International. It is the umbrella term introduced by the World Wide Web Consortium, and in 2011 it was defined as "a platform for innovation, consolidation and cost efficiencies" by W3C CEO Jeff Jaffe. Being built on The evergreen Web has allowed for the addition of new capabilities while addressing security and privacy risks. Additionally, developers are enabled to build interoperable content on a cohesive platform.

References

  1. "Strikethrough and the power of defaults". reproof. Retrieved 2 February 2022.
  2. Hackett, M.B. (1970). The original statutes of Cambridge University : the text and its history (Digitally printed version, re-issue. ed.). Cambridge: Cambridge Eng. University Press. p. 185. ISBN   9780521070768.
  3. See, e.g., this image from the British National Archives' Focus on the Domesday Book.
  4. 15.2.1 Font style elements: the TT, I, B, BIG, SMALL, STRIKE, S, and U elements, HTML 4.01 Specification: Alignment, font styles, and horizontal rules, W3C. 24 December 1999.
  5. 9.4 Marking document changes: The INS and DEL elements HTML 4.01 Specification: Text – Paragraphs, Lines, and Phrases, W3C. 24 December 1999.
  6. 4.5.5 The s element Archived 2015-08-01 at the Wayback Machine HTML5, W3C Last Call Working Draft. 17 June 2014.
  7. 4.6.2 The del element HTML5, W3C Last Call Working Draft 17 June 2014
  8. "GitHub Flavored Markdown".
  9. "Add formatting to your Google Chat messages - Computer - Google Chat Help". support.google.com. Retrieved 1 March 2024.
  10. The Unicode Consortium, The Unicode Standard, Chapter 2, Page 44, Non-decomposition of Overlaid Diacritics
  11. The Unicode Consortium, Unicode Technical Standard #39: Unicode Security Mechanisms, chapter Confusable Detection
  12. Adak, Chandranath; Chaudhuri, Bidyut B. (2014). "An Approach of Strike-Through Text Identification from Handwritten Documents". 2014 14th International Conference on Frontiers in Handwriting Recognition. pp. 643–648. doi:10.1109/ICFHR.2014.113. ISBN   978-1-4799-4334-0. S2CID   5364504.
  13. Chaudhuri, Bidyut B.; Adak, Chandranath (2017). "An approach for detecting and cleaning of struck-out handwritten text". Pattern Recognition. 61: 282–294. Bibcode:2017PatRe..61..282C. doi:10.1016/j.patcog.2016.07.032.