Syriac Abbreviation Mark

Last updated

The Syriac Abbreviation Mark is a Unicode Control character (U+070F) that forms part of the Syriac script block. In Syriac, words are sometimes written in an abbreviated form, omitting some of the last letters. In such cases, a special overline is drawn over some of the final letters of the abbreviated word. Traditionally, this overline has a point at each edge and a point in the middle, but sometimes a plain overline was used. Another use of this overline is to mark numbers: in Syriac numbers are written using numerical values which are assigned to letters (similarly to the Gematria system in Hebrew). The sequence of letters used to write the number are also marked by the overline.

To implement this special overline in Unicode, a special control character was conceived: the Syriac Abbreviation Mark (or SAM), described in section 9.3 of the Unicode Standard. [1] It is inserted in the stream of characters just before the first letter which should be marked with the overline (visually, to the right of it, since Syriac is a right-to-left script). OpenType also has stch “feature tag” to pave the way of proper rendering. The rendering engine should put an overline on all subsequent Syriac script letters, until the first character which is not a Syriac letter or diacritic. However, several computer environments do not render this character correctly, but show rather a substitution glyph where it appears in the character stream.

Here is an example containing the SAM, just after the Syriac letter Alaph (first letter to the right): ܐ܏ܒܓܕ

The image below shows the correct way this should be rendered: SyriacAbbreviation.jpg

See also

Related Research Articles

The ancient Aramaic alphabet was adapted by Arameans from the Phoenician alphabet and became a distinct script by the 8th century BC. It was used to write the Aramaic languages spoken by ancient Aramean pre-Christian tribes throughout the Fertile Crescent. It was also adopted by other peoples as their own alphabet when empires and their subjects underwent linguistic Aramaization during a language shift for governing purposes —a precursor to Arabization centuries later— including among Assyrians who permanently replaced their Akkadian language and its cuneiform script with Aramaic and its script, and among Jews, who adopted the Aramaic language as their vernacular and started using the Aramaic alphabet even for writing Hebrew, displacing the former Paleo-Hebrew alphabet.. The letters in the Aramaic alphabet all represent consonants, some of which are also used as matres lectionis to indicate long vowels.

<span class="mw-page-title-main">Arabic alphabet</span> Alphabets for Arabic and other languages

The Arabic alphabet, or Arabic abjad, is the Arabic script as it is codified for writing Arabic. It is written from right to left in a cursive style and includes 29 letters. Most letters have contextual letterforms.

A macron is a diacritical mark: it is a straight bar ¯ placed above a letter, usually a vowel. Its name derives from Ancient Greek μακρόν (makrón) "long", since it was originally used to mark long or heavy syllables in Greco-Roman metrics. It now more often marks a long vowel. In the International Phonetic Alphabet, the macron is used to indicate a mid-tone; the sign for a long vowel is instead a modified triangular colon ⟨ː⟩.

<span class="mw-page-title-main">Sinhala script</span> Abugida writing system used for the Sinhala language

The Sinhala script, also known as Sinhalese script, is a writing system used by the Sinhalese people and most Sri Lankans in Sri Lanka and elsewhere to write the Sinhala language as well as the liturgical languages Pali and Sanskrit. The Sinhalese Akṣara Mālāva, one of the Brahmic scripts, is a descendant of the Ancient Indian Brahmi script. It is also related to the Grantha script.

The Coptic alphabet is the script used for writing the Coptic language. The repertoire of glyphs is based on the Greek alphabet augmented by letters borrowed from the Egyptian Demotic and is the first alphabetic script used for the Egyptian language. There are several Coptic alphabets, as the Coptic writing system may vary greatly among the various dialects and subdialects of the Coptic language.

The tilde˜ or ~, is a grapheme with several uses. The name of the character came into English from Spanish, which in turn came from the Latin titulus, meaning "title" or "superscription". Its primary use is as a diacritic (accent) in combination with a base letter; but for historical reasons, it is also used in standalone form within a variety of contexts.

The Burmese alphabet is an abugida used for writing Burmese. It is ultimately adapted from a Brahmic script, either the Kadamba or Pallava alphabet of South India. The Burmese alphabet is also used for the liturgical languages of Pali and Sanskrit. In recent decades, other, related alphabets, such as Shan and modern Mon, have been restructured according to the standard of the Burmese alphabet

<span class="mw-page-title-main">Ligature (writing)</span> Glyph combining two or more letterforms in a single typeset or handwritten character

In writing and typography, a ligature occurs where two or more graphemes or letters are joined to form a single glyph. Examples are the characters æ and œ used in English and French, in which the letters 'a' and 'e' are joined for the first ligature and the letters 'o' and 'e' are joined for the second ligature. For stylistic and legibility reasons, 'f' and 'i' are often merged to create 'fi' ; the same is true of 's' and 't' to create 'st'. The common ampersand (&) developed from a ligature in which the handwritten Latin letters 'E' and 't' were combined.

<span class="mw-page-title-main">Syriac alphabet</span> Writing system

The Syriac alphabet is a writing system primarily used to write the Syriac language since the 1st century AD. It is one of the Semitic abjads descending from the Aramaic alphabet through the Palmyrene alphabet, and shares similarities with the Phoenician, Hebrew, Arabic and Sogdian, the precursor and a direct ancestor of the traditional Mongolian scripts.

<span class="mw-page-title-main">Sogdian alphabet</span> Alphabet for use with the Sogdian language of central Asia

The Sogdian alphabet was originally used for the Sogdian language, a language in the Iranian family used by the people of Sogdia. The alphabet is derived from Syriac, a descendant script of the Aramaic alphabet. The Sogdian alphabet is one of three scripts used to write the Sogdian language, the others being the Manichaean alphabet and the Syriac alphabet. It was used throughout Central Asia, from the edge of Iran in the west, to China in the east, from approximately 100–1200 A.D.

<span class="mw-page-title-main">Scribal abbreviation</span> Abbreviations used by ancient and medieval scribes

Scribal abbreviations or sigla are abbreviations used by ancient and medieval scribes writing in various languages, including Latin, Greek, Old English and Old Norse. In modern manuscript editing sigla are the symbols used to indicate the source manuscript and to identify the copyists of a work.

<span class="mw-page-title-main">Complex text layout</span> Neighbour-dependent grapheme positioning

Complex text layout (CTL) or complex text rendering is the typesetting of writing systems in which the shape or positioning of a grapheme depends on its relation to other graphemes. The term is used in the field of software internationalization, where each grapheme is a character.

An overline, overscore, or overbar, is a typographical feature of a horizontal line drawn immediately above the text. In old mathematical notation, an overline was called a vinculum, a notation for grouping symbols which is expressed in modern notation by parentheses, though it persists for symbols under a radical sign. The original use in Ancient Greek was to indicate compositions of Greek letters as Greek numerals. In Latin, it indicates Roman numerals multiplied by a thousand and it forms medieval abbreviations (sigla). Marking one or more words with a continuous line above the characters is sometimes called overstriking, though overstriking generally refers to printing one character on top of an already-printed character.

<span class="mw-page-title-main">Universal Character Set characters</span> Complete list of the characters available on most computers

The Unicode Consortium and the ISO/IEC JTC 1/SC 2/WG 2 jointly collaborate on the list of the characters in the Universal Coded Character Set. The Universal Coded Character Set, most commonly called the Universal Character Set, is an international standard to map characters, discrete symbols used in natural language, mathematics, music, and other domains, to unique machine-readable data values. By creating this mapping, the UCS enables computer software vendors to interoperate, and transmit—interchange—UCS-encoded text strings from one to another. Because it is a universal map, it can be used to represent multiple languages at the same time. This avoids the confusion of using multiple legacy character encodings, which can result in the same sequence of codes having multiple interpretations depending on the character encoding in use, resulting in mojibake if the wrong one is chosen.

<span class="mw-page-title-main">Subscript and superscript</span> A character set slightly below and above the normal line of type, respectively

A subscript or superscript is a character that is set slightly below or above the normal line of type, respectively. It is usually smaller than the rest of the text. Subscripts appear at or below the baseline, while superscripts are above. Subscripts and superscripts are perhaps most often used in formulas, mathematical expressions, and specifications of chemical compounds and isotopes, but have many other uses as well.

<span class="mw-page-title-main">Armenian alphabet</span> Alphabet used to write the Armenian language

The Armenian alphabet is an alphabetic writing system used to write Armenian. It was developed around 405 AD by Mesrop Mashtots, an Armenian linguist and ecclesiastical leader. The system originally had 36 letters; eventually, three more were adopted. The alphabet was also in wide use in the Ottoman Empire around the 18th and 19th centuries.

The Unicode Standard assigns various properties to each Unicode character and code point.

<span class="mw-page-title-main">Inscriptional Parthian</span>

Inscriptional Parthian is a script used to write Parthian language on coins of Parthia from the time of Arsaces I of Parthia. It was also used for inscriptions of Parthian and later Sassanian periods.

Psalter Pahlavi is a cursive abjad that was used for writing Middle Persian on paper; it is thus described as one of the Pahlavi scripts. It was written right to left, usually with spaces between words.

Suriyani Malayalam, also known as Karshoni, Syro-Malabarica or Syriac Malayalam, is a dialect of Malayalam written in a variant form of the Syriac alphabet which was popular among the Saint Thomas Christians of Kerala in India. It uses Malayalam grammar, the Maḏnḥāyā or "Eastern" Syriac script with special orthographic features, and vocabulary from Malayalam and East Syriac. This originated in the South Indian region of the Malabar Coast. Until the 19th century, the script was widely used by Syrian Christians in Kerala.

References

  1. "UNICODE 12.0 standard" (PDF). Unicode Consortium. p. 392. Retrieved 9 March 2020.