Morse code for non-Latin alphabets

Last updated

L-Telegraph1.png

This is a summary of the use of Morse code to represent alphabets other than Latin.

Contents

Greek

The Greek Morse code alphabet is very "similar" to the Latin alphabet. The "similarity" is based first on optical resemblance of each letter, a.k.a glyph, and then secondly on sound. Example, A both in Greek and English is the same glyph and sound (like a in word apple). The Greek B though, is identical to the English B as far as the glyph is concerned, even though the sound of Greek B is the same as the sound of English V (like the v in word victory). The B prevails to V. The Greek Morse code alphabet uses one extra letter for Greek letter Χ and no longer uses the codes for Latin letters "J", "U" and "V".

Greek
Code
ITUGreek
Code
ITUGreek
Code
ITU
Α
    
AΙ
    
IΡ
     
R
Β
      
BΚ
     
KΣ
     
S
Γ
     
GΛ
      
LΤ
   
T
Δ
     
DΜ
    
MΥ
      
Y
Ε
   
EΝ
    
NΦ
      
F
Ζ
      
ZΞ
      
XΧ
      
CH
Η
      
HΟ
     
OΨ
      
Q
Θ
      
CΠ
      
PΩ
     
W
Diphthongs (obsolete)
HY
      
VYI
      
JOY
     
U
OI
       
8EI
     
SAI
      
Ä
AY
      
ÜEY
      
Ö

The tonos is not transmitted in Morse code; the receiver can simply infer which vowels require one. The Greek diphthongs presented in the bottom three rows of the table are specified in old Greek Morse-code tables but they are never used in actual communication, the two vowels being sent separately.

Cyrillic

Cyrillic letters are represented using the representation of similar-sounding Latin letters (e.g. Б ⇒ B, В ⇒ W [German pronunciation], Г ⇒ G, Д ⇒ D, etc.). Cyrillic letters with no such Latin correspondence are assigned to Latin letters with no Cyrillic correspondence (e.g. Щ⇒Q). The same correspondence was later used to create Russian national character sets KOI-7 and KOI-8.

CyrillicCodeITUCyrillicCodeITUCyrillicCodeITUCyrillicCodeITU
А    AИ(Ukr. І)    IР     RШ      (CH)
Б      BЙ      JС     SЩ      Q
В     WК     KТ   TЬ(Bul. Ъ)      X
Г     GЛ      LУ     UЫ(Bul. Ь)      Y
Д     DМ    MФ      FЭ(Ukr. Є)       É
Е   EН    NХ      HЮ      (Ü)
Ж      VО     OЦ      CЯ      (Ä)
З      ZП      PЧ      (Ö)Ї       

The order and encoding shown uses the Russian national standard. The Bulgarian standard is the same except for the two letters (Ь, Ъ) given in parentheses: The Bulgarian language does not use Ы, while Ъ is frequent, but missing in Russian standard Morse.

The letter Ё (Yo) does not have an international Morse phonetic equivalent, with international Е used instead. Ukrainian Morse uses Є instead of Э, І instead of И, but also has И encoded as Ы, and has additional Ї (Yi).

Hebrew

Hebrew letters are mostly represented using the Morse representation of a similar-sounding Latin letter (e.g. "Bet" ב≡B); however the representation for several letters are from a Latin letter with a similar shape (e.g. "Tet" ט ≡U, while "Tav" ת≡T). Though Hebrew Morse code is transcribed from right to left, the table below is transcribed from left to right as per the Latin letters in the table.

LetterCodeITULetterCodeITU
א     A ל       L
ב       B מ     M
ג      G נ     N
ד      D ס       C
ה      O ע       J
ו    E פ       P
ז       Z צ      W
ח       H ק       Q
ט      U ר      R
י     I ש      S
כ      K ת    T

Arabic

LetterCodeITULetterCodeITULetterCodeITULetterCodeITU
ا     A ذ       Z ط      U ل       L
ب       B ر      R ظ       Y م     M
ت    T ز       Ö ع       Ä ن     N
ث       C س      S غ      G ه        É
ج       J ش       CH ف       F و      W
ح       H ص       X ق       Q ي     I
خ      O ض       V ك      K   E
د      D   

Kurdish

LetterIn LatinCodeLetterIn LatinCodeLetterIn LatinCodeLetterIn LatinCodeLetterIn LatinCode
اA    گG     ڵŁ       ڕŘ       وW     
بB      ھH      مM    سS     خX      
جC      ح       نN    شŞ       غ       
چÇ       -I    نٚŇ       تT   یY      
ڎĐ       یΠ      ۆO     وU     زZ      
ەE   ژJ      پP      ووÛ       ع'       
ێÊ       کK     قQ      ۊÜ      دD     
فF      لL      رR     ڤV      

See [1]

Persian

LetterCodeITULetterCodeITULetterCodeITULetterCodeITU
ا     A خ       X ص       Ä ک      K
ب       B د      D ض        É گ       Q
پ       P ذ       V ط      U ل       L
ت    T ر      R ظ       Y م     M
ث       C ز       Z ع      O ن     N
ج       J ژ      G غ       Ü و      W
چ       Ö س      S ف       F ه    E
ح       H ش       CH ق         : ی     I

See also fa:کد مورس

Devanagari

Devanagari is a left-to-right abugida (alphasyllabary) widely used in the Indian subcontinent. The following telegraph code table is adapted from one given by Ashok Kelkar, [2] where the Latin letters are encoded as per the International Morse code standard. Some variations on this code exist, [3] and there have been some attempts to introduce other telegraph codes either to improve efficiency or to apply to more Indian languages. [2] [4] [5] Proposals for a telegraph code suitable for multiple Indian languages have been made as early as 1948, shortly after independence. [6]

DevanagariLatin MorseDevanagariLatin MorseDevanagariLatin MorseDevanagariLatin Morse
ATAEAIA
िDTDEDID
UTUEUIU
FTFEFIF
OTOEOIO
Kक्TKEKख्IK
Gग्TGEGघ्IG
Cच्TCECछ्IC
Jज्TJEJझ्IJ
Äट्ठ्
Üड्ढ्
Wत्TWEWथ्IW
Zद्TZEZध्IZ
Nन्TNENIN
Pप्TPEPफ्IP
Bब्TBEBभ्IB
Mम्TMEMण्IM
Yय्TYEYळ्IY
Rर्TRERक्रIR
Lल्TLELञ्IL
Vव्TVEVङ्IV
Sस्TSESश्IS
Hह्THEHष्IH
Ö
क्षQक्ष्TQत्रEQत्र्IQ
ज्ञXज्ञ्TXश्रEXश्र्IX
मेंMMहैTMMमैंEMMहूँIMM

Chinese

See Chinese telegraph code.

Korean

See SKATS.

Japanese

See Wabun code.

Thai

See รหัสมอร์ส on the Thai Wikipedia

Related Research Articles

<span class="mw-page-title-main">Cyrillic script</span> Writing system used for various Eurasian languages

The Cyrillic script, Slavonic script or simply Slavic script is a writing system used for various languages across Eurasia. It is the designated national script in various Slavic, Turkic, Mongolic, Uralic, Caucasian and Iranic-speaking countries in Southeastern Europe, Eastern Europe, the Caucasus, Central Asia, North Asia, and East Asia, and used by many other minority languages.

<span class="mw-page-title-main">Russian alphabet</span> Alphabet that uses letters from the Cyrillic script

The Russian alphabet is the script used to write the Russian language. It comes from the Cyrillic script, which was devised in the 9th century for the first Slavic literary language, Old Slavonic. Initially an old variant of the Bulgarian alphabet, it became used in the Kievan Rusʹ since the 10th century to write what would become the modern Russian language.

<span class="mw-page-title-main">Romanian alphabet</span> Variant of the Latin alphabet

The Romanian alphabet is a variant of the Latin alphabet used for writing the Romanian language. It is a modification of the classical Latin alphabet and consists of 31 letters, five of which have been modified from their Latin originals for the phonetic requirements of the language:

<span class="mw-page-title-main">I (Cyrillic)</span> Letter of the Cyrillic script

The Cyrillic I is a letter used in almost all Cyrillic alphabets with the exception of Belarusian.

<span class="mw-page-title-main">Bulgarian alphabet</span> Writing system of the Bulgarian language

The Bulgarian Cyrillic alphabet is used to write the Bulgarian language. The Cyrillic alphabet was originally developed in the First Bulgarian Empire during the 9th – 10th century AD at the Preslav Literary School.

<span class="mw-page-title-main">Romanization of Russian</span> Romanization of the Russian alphabet

The romanization of the Russian language, aside from its primary use for including Russian names and words in text written in a Latin alphabet, is also essential for computer users to input Russian text who either do not have a keyboard or word processor set up for inputting Cyrillic, or else are not capable of typing rapidly using a native Russian keyboard layout (JCUKEN). In the latter case, they would type using a system of transliteration fitted for their keyboard layout, such as for English QWERTY keyboards, and then use an automated tool to convert the text into Cyrillic.

KOI (КОИ) is a family of several code pages for the Cyrillic script. The name stands for Kod obmena informatsiey which means "Code for Information Interchange".

<span class="mw-page-title-main">Yery</span> Cyrillic letter ⟨ы⟩

Yeru or Eru, usually called Y in modern Russian or Yery or Ery historically and in modern Church Slavonic, is a letter in the Cyrillic script. It represents the close central unrounded vowel after non-palatalised (hard) consonants in the Belarusian and Russian alphabets, and after any consonant in most of Rusyn standards, where it represents the unrounded close-mid back unrounded vowel sound.

<span class="mw-page-title-main">Ukrainian alphabet</span> Alphabet that uses letters from the Cyrillic script

The Ukrainian alphabet is the set of letters used to write Ukrainian, which is the official language of Ukraine. It is one of several national variations of the Cyrillic script. It comes from the Cyrillic script, which was devised in the 9th century for the first Slavic literary language, called Old Slavonic. In the 10th century, it became used in Kievan Rus' to write Old East Slavic, from which the Belarusian, Russian, Rusyn, and Ukrainian alphabets later evolved. The modern Ukrainian alphabet has 33 letters in total: 21 consonants, 1 semivowel, 10 vowels and 1 palatalization sign. Sometimes the apostrophe (') is also included, which has a phonetic meaning and is a mandatory sign in writing, but is not considered as a letter and is not included in the alphabet.

<span class="mw-page-title-main">Hard sign</span> Letter of the Cyrillic script

The letter Ъ ъ of the Cyrillic script is known as er goläm in the Bulgarian alphabet, as the hard sign in the modern Russian and Rusyn alphabets, as the debelo jer in pre-reform Serbian orthography, and as ayirish belgisi in the Uzbek Cyrillic alphabet. The letter is called back yer or back jer and yor or jor in the pre-reform Russian orthography, in Old East Slavic, and in Old Church Slavonic.

Unicode has subscripted and superscripted versions of a number of characters including a full set of Arabic numerals. These characters allow any polynomial, chemical and certain other equations to be represented in plain text without using any form of markup like HTML or TeX.

Unicode has a certain amount of duplication of characters. These are pairs of single Unicode code points that are canonically equivalent. The reason for this are compatibility issues with legacy systems.

<span class="mw-page-title-main">Kazakh alphabets</span> Alphabets used to write the Kazakh language

Three alphabets are used to write Kazakh: the Cyrillic, Latin and Arabic scripts. The Cyrillic script is used in Kazakhstan and Mongolia. An October 2017 Presidential Decree in Kazakhstan ordered that the transition from Cyrillic to a Latin script be completed by 2031. The Arabic script is used in Saudi Arabia, Iran, Afghanistan, and parts of China.

<span class="mw-page-title-main">Serbian Cyrillic alphabet</span> Official script of the Serbian language

The Serbian Cyrillic alphabet is a variation of the Cyrillic script used to write the Serbian language, updated in 1818 by the Serbian philologist and linguist Vuk Karadžić. It is one of the two alphabets used to write modern standard Serbian, the other being Gaj's Latin alphabet.

<span class="mw-page-title-main">Cyrillic alphabets</span> Related alphabets based on Cyrillic scripts

Numerous Cyrillic alphabets are based on the Cyrillic script. The early Cyrillic alphabet was developed in the 9th century AD and replaced the earlier Glagolitic script developed by the theologians Cyril and Methodius. It is the basis of alphabets used in various languages, past and present, Slavic origin, and non-Slavic languages influenced by Russian. As of 2011, around 252 million people in Eurasia use it as the official alphabet for their national languages. About half of them are in Russia. Cyrillic is one of the most-used writing systems in the world. The creator is Saint Clement of Ohrid from the Preslav literary school in the First Bulgarian Empire.

JCUKEN is the main Cyrillic keyboard layout for the Russian language in computers and typewriters. Earlier in Russia JIUKEN (ЙІУКЕН) layout was the main layout, but it was replaced by JCUKEN when the Russian alphabet reform of 1917 removed the letters Ѣ, І, Ѵ, and Ѳ. The letter Ъ had decreased in usage significantly after the reform.

The ISO basic Latin alphabet is an international standard for a Latin-script alphabet that consists of two sets of 26 letters, codified in various national and international standards and used widely in international communication. They are the same letters that comprise the current English alphabet. Since medieval times, they are also the same letters of the modern Latin alphabet. The order is also important for sorting words into alphabetical order.

<span class="mw-page-title-main">I with bowl</span> Letter of the Latin alphabet used for historical orthography of Jaꞑalif

Latin yeru or I with bowl is an additional letter of the Latin alphabet based on the Cyrillic soft sign. It was introduced in 1928 into the reformed Yañalif, and later into other alphabets for Soviet minority languages. The letter was designed specifically to represent the non-front close vowel sounds IPA:[ɨ] and IPA:[ɯ]. Thus, this letter corresponds to the letter ⟨I ı⟩ in modern Turkic alphabets, and the letter yery in Cyrillic.

The Komi language, a Uralic language spoken in the north-eastern part of European Russia, has been written in several different alphabets. Currently, Komi writing uses letters from the Cyrillic script. There have been five distinct stages in the history of Komi writing:

References

  1. https://tirsik.net/index.php?peyam=142083
  2. 1 2 Kelkar, Ashok R. (October 1967) [1969]. "Telegraph code for Marathi". In Poddar, Arabinda (ed.). Language and society in India. Proceedings of a seminar. Language and society in India. Simla, IN: Indian Institute of Advanced Study. pp. 520–524. OCLC   504343143.
  3. @avtansa (June 7, 2018). "Morse code for sending messages in Devanagari (Telegraphy) देवनागरी तार संकेत निर्देशिका #Hindi #Marathi #Sanskrit #Kashmiri #Bodo #Sindhi #Santhali #Dogri" (Tweet). Retrieved 2019-02-25 via Twitter.
  4. Ramakrishna, B.S.; Nair, K.K.; Chiplunkar, V.N.; Atal, B.S.; Rajaraman, V. (1957). "Statistical studies in some Indian languages with applications to communication engineering". IETE Journal of Research. 4: 25–35. doi:10.1080/03772063.1957.11486049.
  5. Narahari Pandit, S.N. (1961). "An alternative code for Indian languages". IETE Journal of Research. 7 (3): 145–148. doi:10.1080/03772063.1961.11486283.
  6. Joshi, R.D. (1948). "A new telegraphic code for the Indian languages". Journal of Scientific and Industrial Research. 7 (7): 301–307.