International Alphabet of Sanskrit Transliteration

Last updated

International Alphabet of Sanskrit Transliteration
IAST
Script type romanisation
Time period
19th century–present
Languages Sanskrit and other Indic Languages
 This article contains phonetic transcriptions in the International Phonetic Alphabet (IPA).For an introductory guide on IPA symbols, see Help:IPA.For the distinction between [ ], / / and  , see IPA § Brackets and transcription delimiters.

The International Alphabet of Sanskrit Transliteration (IAST) is a transliteration scheme that allows the lossless romanisation of Indic scripts as employed by Sanskrit and related Indic languages. It is based on a scheme that emerged during the 19th century from suggestions by Charles Trevelyan, William Jones, Monier Monier-Williams and other scholars, and formalised by the Transliteration Committee of the Geneva Oriental Congress, in September 1894. [1] [2] IAST makes it possible for the reader to read the Indic text unambiguously, exactly as if it were in the original Indic script. It is this faithfulness to the original scripts that accounts for its continuing popularity amongst scholars.

Contents

Usage

Scholars commonly use IAST in publications that cite textual material in Sanskrit, Pāḷi and other classical Indian languages.

IAST is also used for major e-text repositories such as SARIT, Muktabodha, GRETIL, and sanskritdocuments.org.

The IAST scheme represents more than a century of scholarly usage in books and journals on classical Indian studies. By contrast, the ISO 15919 standard for transliterating Indic scripts emerged in 2001 from the standards and library worlds. For the most part, ISO 15919 follows the IAST scheme, departing from it only in minor ways (e.g., ṃ/ṁ and ṛ/r̥)—see comparison below.

The Indian National Library at Kolkata romanization, intended for the romanisation of all Indic scripts, is an extension of IAST.

Inventory and conventions

The IAST letters are listed with their Devanagari equivalents and phonetic values in IPA, valid for Sanskrit, Hindi and other modern languages that use Devanagari script, but some phonological changes have occurred:

Vowels and codas
DevanāgarīTranscriptionIPACategory
aA ɐ monophthongs
āĀ
iI ɪ
īĪ
uU ʊ
ūŪ
syllabic liquids
r̩ː
l̩ː
eE e diphthongs
aiAiɐi̯
oO o
auAuɐu̯
anusvara
visarga
˜ chandrabindu [3]
' avagraha
Consonants
velars palatals retroflexes dentals labials Category

k  K

c  C

  

t  T

p  P
tenuis stops

kh  Kh

ch  Ch

ṭh  Ṭh

th  Th

ph  Ph
aspirated stops

g  G

j  J

  

d  D

b  B
voiced stops

gh  Gh

jh  Jh

ḍh  Ḍh

dh  Dh

bh  Bh
breathy-voiced stops

  

ñ  Ñ

  

n  N

m  M
nasal stops

h  H*

y  Y

r  R

l  L

v  V
approximants
 
ś  Ś

  

s  S
  sibilants
  
  
   approximants (Vedic) [2]

*H is actually glottal, not velar.

Some letters are modified with diacritics: Long vowels are marked with an overline. Vocalic (syllabic) consonants, retroflexes and ṣ (/ʂ~ɕ~ʃ/) have an underdot. One letter has an overdot: ṅ (/ŋ/). One has an acute accent: ś (/ʃ/). One letter has a line below: ḻ (/ɭ/) (Vedic).

Unlike ASCII-only romanizations such as ITRANS or Harvard-Kyoto, the diacritics used for IAST allow capitalization of proper names. The capital variants of letters never occurring word-initially (Ṇ Ṅ Ñ Ṝ Ḹ) are useful only when writing in all-caps and in Pāṇini contexts for which the convention is to typeset the IT sounds as capital letters.

Comparison with ISO 15919

For the most part, IAST is a subset of ISO 15919 that merges the retroflex (underdotted) liquids with the vocalic ones (ringed below) and the short close-mid vowels with the long ones. The following seven exceptions are from the ISO standard accommodating an extended repertoire of symbols to allow transliteration of Devanāgarī and other Indic scripts, as used for languages other than Sanskrit.

DevanāgarīIASTISO 15919Comment
ए / ेeē (e)ISO e generally represents short ए / ॆ, but optionally represents long ए / े in the Devanagari, Bengali, Gurmukhi, Gujarati, and Odia scripts.
ओ / ोoō (o)ISO o generally represents short ऒ / ॆ, but optionally represents long ओ / ो in the Devanagari, Bengali, Gurmukhi, Gujarati, and Odia scripts.
अं / ंISO represents Gurmukhi tippi .
ऋ / ृISO represents ड़ /ɽ/.
ॠ / ॄr̥̄for consistency with .
ऌ / ॢISO represents ळ /𝼈/.
ॡ / ॣl̥̄for consistency with .
Used in Vedic Sanskrit only and not found in the Classical variant

Computer input by alternative keyboard layout

The most convenient method of inputting romanized Sanskrit is by setting up an alternative keyboard layout. This allows one to hold a modifier key to type letters with diacritical marks. For example, alt+a = ā. How this is set up varies by operating system.

Linux/Unix and BSD desktop environments allow one to set up custom keyboard layouts and switch them by clicking a flag icon in the menu bar.

macOS One can use the pre-installed US International keyboard, or install Toshiya Unebe's Easy Unicode keyboard layout.

Microsoft Windows Windows also allows one to change keyboard layouts and set up additional custom keyboard mappings for IAST. This Pali keyboard installer [4] made by Microsoft Keyboard Layout Creator (MSKLC) supports IAST (works on Microsoft Windows up to at least version 10, can use Alt button on the right side of the keyboard instead of Ctrl+Alt combination).

Computer input by selection from a screen

Applet for character selection Gucharmap screenshot.png
Applet for character selection

Many systems provide a way to select Unicode characters visually. ISO/IEC 14755 refers to this as a screen-selection entry method.

Microsoft Windows has provided a Unicode version of the Character Map program (find it by hitting ⊞ Win+R then type charmap then hit ↵ Enter) since version NT 4.0 – appearing in the consumer edition since XP. This is limited to characters in the Basic Multilingual Plane (BMP). Characters are searchable by Unicode character name, and the table can be limited to a particular code block. More advanced third-party tools of the same type are also available (a notable freeware example is BabelMap).

macOS provides a "character palette" with much the same functionality, along with searching by related characters, glyph tables in a font, etc. It can be enabled in the input menu in the menu bar under System Preferences → International → Input Menu (or System Preferences → Language and Text → Input Sources) or can be viewed under Edit → Emoji & Symbols in many programs.

Equivalent tools – such as gucharmap (GNOME) or kcharselect (KDE) – exist on most Linux desktop environments.

Users of SCIM on Linux based platforms can also have the opportunity to install and use the sa-itrans-iast input handler which provides complete support for the ISO 15919 standard for the romanization of Indic languages as part of the m17n library.

Or user can use some Unicode characters in Latin-1 Supplement, Latin Extended-A, Latin Extended Additional and Combining Diarcritical Marks block to write IAST.

Font support

Only certain fonts support all the Latin Unicode characters essential for the transliteration of Indic scripts according to the IAST and ISO 15919 standards.

For example, the Arial, Tahoma and Times New Roman font packages that come with Microsoft Office 2007 and later versions also support precomposed Unicode characters like ā, ḍ, ḥ, ī, ḷ, ḹ, ḻ, ṃ, ñ, ṅ, ṇ, ṛ, ṝ, ṣ, ś, ṭ and ū, glyphs for some of which are only to be found in the Latin Extended Additional Unicode block.

Many other text fonts commonly used for book production may be lacking in support for one or more characters from this block. Accordingly, many academics working in the area of Sanskrit studies make use of free OpenType fonts such as FreeSerif or Gentium, both of which have complete support for the full repertoire of conjoined diacritics in the IAST character set. Released under the GNU FreeFont or SIL Open Font License, respectively, such fonts may be freely shared and do not require the person reading or editing a document to purchase proprietary software to make use of its associated fonts.

See also

Related Research Articles

<span class="mw-page-title-main">Devanagari</span> Script used to write Indian and Nepalese languages

Devanagari is an Indic script used in the northern Indian subcontinent. Also simply called Nāgari, it is a left-to-right abugida, based on the ancient Brāhmi script. It is one of the official scripts of the Republic of India and Nepal. It was developed and in regular use by the 7th century CE and achieved its modern form by 1000 CE. The Devanāgari script, composed of 48 primary characters, including 14 vowels and 34 consonants, is the fourth most widely adopted writing system in the world, being used for over 120 languages.

<span class="mw-page-title-main">Sinhala script</span> Abugida writing system

The Sinhala script, also known as Sinhalese script, is a writing system used by the Sinhalese people and most Sri Lankans in Sri Lanka and elsewhere to write the Sinhala language as well as the liturgical languages Pali and Sanskrit. The Sinhalese Akṣara Mālāva, one of the Brahmic scripts, is a descendant of the Ancient Indian Brahmi script. It is also related to the Grantha script.

<span class="mw-page-title-main">Kannada script</span> Abugida writing system of the Brahmic family

The Kannada script is an abugida of the Brahmic family, used to write Kannada, one of the Dravidian languages of South India especially in the state of Karnataka. It is one of the official scripts of the Indian Republic. Kannada script is also widely used for writing Sanskrit texts in Karnataka. Several minor languages, such as Tulu, Konkani, Kodava, Sanketi and Beary, also use alphabets based on the Kannada script. The Kannada and Telugu scripts share very high mutual intellegibility with each other, and are often considered to be regional variants of single script. Other scripts similar to Kannada script are Sinhala script, and Old Peguan script (used in Burma).

<span class="mw-page-title-main">Brahmic scripts</span> Family of abugida writing systems

The Brahmic scripts, also known as Indic scripts, are a family of abugida writing systems. They are used throughout the Indian subcontinent, Southeast Asia and parts of East Asia. They are descended from the Brahmi script of ancient India and are used by various languages in several language families in South, East and Southeast Asia: Indo-Aryan, Dravidian, Tibeto-Burman, Mongolic, Austroasiatic, Austronesian, and Tai. They were also the source of the dictionary order (gojūon) of Japanese kana.

The Thai script is the abugida used to write Thai, Southern Thai and many other languages spoken in Thailand. The Thai alphabet itself has 44 consonant symbols and 16 vowel symbols that combine into at least 32 vowel forms and four tone diacritics to create characters mostly representing syllables.

Malayalam script is a Brahmic script used commonly to write Malayalam, which is the principal language of Kerala, India, spoken by 45 million people in the world. It is a Dravidian language spoken in the Indian state of Kerala and the union territories of Lakshadweep and Puducherry by the Malayali people. It is one of the official scripts of the Indian Republic. Malayalam script is also widely used for writing Sanskrit texts in Kerala.

<span class="mw-page-title-main">Romanization</span> Transliteration or transcription to Latin letters

In linguistics, romanization is the conversion of text from a different writing system to the Roman (Latin) script, or a system for doing so. Methods of romanization include transliteration, for representing written text, and transcription, for representing the spoken word, and combinations of both. Transcription methods can be subdivided into phonemic transcription, which records the phonemes or units of semantic meaning in speech, and more strict phonetic transcription, which records speech sounds with precision.

Anusvara, also known as Bindu, is a symbol used in many Indic scripts to mark a type of nasal sound, typically transliterated ⟨ṃ⟩ or ⟨ṁ⟩ in standards like ISO 15919 and IAST. Depending on its location in the word and the language for which it is used, its exact pronunciation can vary. In the context of ancient Sanskrit, anusvara is the name of the particular nasal sound itself, regardless of written representation.

Devanagari is an Indic script used for many Indo-Aryan languages of North India and Nepal, including Hindi, Marathi and Nepali, which was the script used to write Classical Sanskrit. There are several somewhat similar methods of transliteration from Devanagari to the Roman script, including the influential and lossless IAST notation. Romanised Devanagari is also called Romanagari.

The National Library at Kolkata romanisation is a widely used transliteration scheme in dictionaries and grammars of Indic languages. This transliteration scheme is also known as (American) Library of Congress and is nearly identical to one of the possible ISO 15919 variants. The scheme is an extension of the IAST scheme that is used for transliteration of Sanskrit.

<span class="mw-page-title-main">Grantha script</span> South Indian script

The Grantha script was a classical South Indian Brahmic script, found particularly in Tamil Nadu and Kerala. Originating from the Pallava script, the Grantha script is related to Tamil and Vatteluttu scripts. The modern Malayalam script of Kerala is a direct descendant of the Grantha script. The Southeast Asian and Indonesian scripts such as Thai and Javanese respectively, as well as South Asian Tigalari and Sinhala scripts, are derived or closely related to Grantha through the early Pallava script. The Pallava script or Pallava Grantha, emerged in the 4th century CE and was used until the 7th century CE, in India. This early Grantha script was used to write Sanskrit texts, inscriptions on copper plates and stones of Hindu temples and monasteries. It was also used for classical Manipravalam – a language that is a blend of Sanskrit and Tamil. From it evolved Middle Grantha by the 7th century, and Transitional Grantha by about the 8th century, which remained in use until about the 14th century. Modern Grantha has been in use since the 14th century and into the modern era, to write classical texts in Sanskrit and Dravidian languages. It is also used to chant hymns and in traditional Vedic schools.

The Harvard-Kyoto Convention is a system for transliterating Sanskrit and other languages that use the Devanāgarī script into ASCII. It is predominantly used informally in e-mail, and for electronic texts.

ISO 15919 is one of a series of international standards for romanization by the International Organization for Standardization. It was published in 2001 and uses diacritics to map the much larger set of consonants and vowels in Brahmic and Nastaliq scripts to the Latin script.

The "Indian languages TRANSliteration" (ITRANS) is an ASCII transliteration scheme for Indic scripts, particularly for the Devanagari script.

There are several romanisation schemes for the Malayalam script, including ITRANS and ISO 15919.

Romanisation of Bengali is the representation of written Bengali language in the Latin script. Various romanisation systems for Bengali are used, most of which do not perfectly represent Bengali pronunciation. While different standards for romanisation have been proposed for Bengali, none has been adopted with the same degree of uniformity as Japanese or Sanskrit.

Indic Computing means "computing in Indic", i.e., Indian Scripts and Languages. It involves developing software in Indic Scripts/languages, Input methods, Localization of computer applications, web development, Database Management, Spell checkers, Speech to Text and Text to Speech applications and OCR in Indian languages.

The Sanskrit Library Phonetic basic encoding scheme (SLP1) is an ASCII transliteration scheme for the Sanskrit language from and to the Devanagari script.

The Velthuis system of transliteration is an ASCII transliteration scheme for the Sanskrit language from and to the Devanagari script. It was developed in about 1983 by Frans Velthuis, a scholar living in Groningen, Netherlands, who created a popular, high-quality software package in LaTeX for typesetting Devanāgarī. The primary documentation for the scheme is the system's clearly-written software manual. It is based on using the ISO 646 repertoire to represent mnemonically the accents used in standard scholarly transliteration. It does not use diacritics as IAST does. It may optionally use capital letters in a manner similar but not identical to the Harvard-Kyoto or ITRANS schemes.manual para 4.1

<span class="mw-page-title-main">Meitei input methods</span>

Meitei input methods are the methods that allow users of computers to input texts in the Meitei script, systematically for Meitei language.

References

  1. Monier-Williams, Monier (1899). A Sanskrit-English Dictionary (PDF). Oxford: Clarendon Press. pp. xxx.
  2. 1 2 "Tenth International Congress of Orientalists, Held at Geneva". The Journal of the Royal Asiatic Society of Great Britain and Ireland: 879–892. 1895. JSTOR   25207765.
  3. See, now, the Talk page, where m̐ is recommended.
  4. "Pali Keyboard". fsnow.com. Retrieved 21 October 2019.