ArabTeX

Last updated
The ArabTeX logo ArabTeX logo.svg
The ArabTeX logo

ArabTeX is a free software package providing support for the Arabic and Hebrew alphabets to TeX and LaTeX. Written by Klaus Lagally, it can take romanized ASCII or native script input to produce quality ligatures for Arabic, Persian, Urdu, Pashto, Sindhi, Western Punjabi (Lahnda), Maghribi, Uyghur, Kashmiri, Hebrew, Judeo-Arabic, Ladino and Yiddish. ArabTeX characters are placed within a TeX/LaTeX document using the command \RL{ ... } or the environment \begin{RLtext} ... \end{RLtext}. ArabTeX is released under the LaTeX Project Public License v1+. [1]

Contents

Example

Arabtex as-salam alaikum.png

\novocalize\RL{al-salAm `alaykum}

Bismillah.png

\documentclass[12pt]{article}\usepackage{arabtex}\begin{document}\setarab\fullvocalize\transtrue\arabtrue\begin{RLtext}  bismi al-ll_ahi al-rra.hm_ani al-rra.hImi  \end{RLtext}\end{document}

Common commands

Character table

LetterTransliterationUnicode name
اAARABIC LETTER ALEF
أa'ARABIC LETTER ALEF WITH HAMZA ABOVE
بbARABIC LETTER BEH
تtARABIC LETTER TEH
ث_tARABIC LETTER THEH
جj / ^gARABIC LETTER JEEM
ح.hARABIC LETTER HAH
خx / _hARABIC LETTER KHAH
دdARABIC LETTER DAL
ذ_dARABIC LETTER THAL
رrARABIC LETTER REH
زzARABIC LETTER ZAIN
سsARABIC LETTER SEEN
ش^sARABIC LETTER SHEEN
ص.sARABIC LETTER SAD
ض.dARABIC LETTER DAD
ط.tARABIC LETTER TAH
ظ.zARABIC LETTER ZAH
ع`ARABIC LETTER AIN
غ.gARABIC LETTER GHAIN
فfARABIC LETTER FEH
قqARABIC LETTER QAF
كkARABIC LETTER KAF
لlARABIC LETTER LAM
مmARABIC LETTER MEEM
نnARABIC LETTER NOON
وw / UARABIC LETTER WAW
هhARABIC LETTER HEH
يy / IARABIC LETTER YEH
َaARABIC FATHA
ُu / oARABIC DAMMA
ِi / eARABIC KASRA
پpARABIC LETTER PEH
چ^cARABIC LETTER TCHEH
ژ^zARABIC LETTER JEH
گgARABIC LETTER GAF
ک.kARABIC LETTER KEHEH
یy / I * ARABIC LETTER FARSI YEH
ۀH-iARABIC LETTER HEH WITH YEH
آ'AARABIC LETTER ALEF WITH MADDA ABOVE
ةTARABIC LETTER TEH MARBUTA
ء'ARABIC LETTER HAMZA ABOVE
ئ'yARABIC LETTER YEH WITH HAMZA ABOVE
ؤu'ARABIC LETTER WAW WITH HAMZA ABOVE
ًaNARABIC FATHATAN
ّxxARABIC SHADDA
،,ARABIC COMMA
؛;ARABIC SEMICOLON
؟?ARABIC QUESTION MARK
٪%ARABIC PERCENT SIGN
SPACE
..FULL STOP
-ZERO WIDTH JOINER
\hspace{0ex}ZERO WIDTH NON-JOINER
^* Activated by \setfarsi

Note that one can also overcome the problem with <yah> containing dots using the \yahnodots command.

See also

Related Research Articles

<span class="mw-page-title-main">Arabic alphabet</span> Alphabets for Arabic and other languages

The Arabic alphabet, or Arabic abjad, is the Arabic script as it is codified for writing Arabic. It is written from right to left in a cursive style and includes 29 letters. Most letters have contextual letterforms.

The Hebrew alphabet, known variously by scholars as the Ktav Ashuri, Jewish script, square script and block script, is an abjad script used in the writing of the Hebrew language and other Jewish languages, most notably Yiddish, Ladino, Judeo-Arabic, and Judeo-Persian. It is also used informally in Israel to write Levantine Arabic, especially among Druze. It is an offshoot of the Imperial Aramaic alphabet, which flourished during the Achaemenid Empire and which itself derives from the Phoenician alphabet.

<span class="mw-page-title-main">LaTeX</span> Document preparation system

LaTeX is a software system for document preparation. When writing, the writer uses plain text as opposed to the formatted text found in WYSIWYG word processors like Microsoft Word, LibreOffice Writer and Apple Pages. The writer uses markup tagging conventions to define the general structure of a document to stylise text throughout a document, and to add citations and cross-references. A TeX distribution such as TeX Live or MiKTeX is used to produce an output file suitable for printing or digital distribution.

TeX, stylized within the system as TeX, is a typesetting system which was designed and written by Donald Knuth and first released in 1978. TeX is a popular means of typesetting complex mathematical formulae; it has been noted as one of the most sophisticated digital typographical systems.

OpenType is a format for scalable computer fonts. It was built on its predecessor TrueType, retaining TrueType's basic structure and adding many intricate data structures for prescribing typographic behavior. OpenType is a registered trademark of Microsoft Corporation.

<span class="mw-page-title-main">Device independent file format</span> Typesetting file format

The device independent file format (DVI) is the output file format of the TeX typesetting program, designed by David R. Fuchs and implemented by Donald E. Knuth in 1982. Unlike the TeX markup files used to generate them, DVI files are not intended to be human-readable; they consist of binary data describing the visual layout of a document in a manner not reliant on any specific image format, display hardware or printer. DVI files are typically used as input to a second program which translates DVI files to graphical data. For example, most TeX software packages include a program for previewing DVI files on a user's computer display; this program is a driver. Drivers are also used to convert from DVI to popular page description languages and for printing.

Persian alphabet Writing system used for the Persian language

The Persian alphabet is a writing system used for the Persian language spoken in Iran and Afghanistan since the 7th century after the Muslim conquest of Persia.

XeTeX TeX typesetting engine

XeTeX is a TeX typesetting engine using Unicode and supporting modern font technologies such as OpenType, Graphite and Apple Advanced Typography (AAT). It was originally written by Jonathan Kew and is distributed under the X11 free software license.

Kaph is the eleventh letter of the Semitic abjads, including Phoenician kāp 𐤊, Hebrew kāf כ‎, Aramaic kāp 𐡊, Syriac kāp̄ ܟܟ‎, and Arabic kāf ك‎.

Ayin is the sixteenth letter of the Semitic abjads, including Phoenician ʿayin, Hebrew ʿayinע‎, Aramaic ʿē, Syriac ʿē ܥ, and Arabic ʿayn ع.

It is thought that the Arabic alphabet is a derivative of the Nabataean variation of the Aramaic alphabet, which descended from the Phoenician alphabet, which among others also gave rise to the Hebrew alphabet and the Greek alphabet, the latter one being in turn the base for the Latin and Cyrillic alphabets.

<span class="mw-page-title-main">Romanization of Arabic</span> Representation of Arabic in Latin script

The romanization of Arabic is the systematic rendering of written and spoken Arabic in the Latin script. Romanized Arabic is used for various purposes, among them transcription of names and titles, cataloging Arabic language works, language education when used instead of or alongside the Arabic script, and representation of the language in scientific publications by linguists. These formal systems, which often make use of diacritics and non-standard Latin characters and are used in academic settings or for the benefit of non-speakers, contrast with informal means of written communication used by speakers such as the Latin-based Arabic chat alphabet.

The diaeresis and the umlaut are two different diacritical marks that look alike. They both consist of two dots ¨ placed over a letter, usually a vowel; when that letter is an i or a j, the diacritic replaces the tittle: ï. In computer systems, both forms have the same code point. Their appearance in print or on screen may vary between typefaces but rarely within the same typeface.

Domari is an endangered Indo-Aryan language, spoken by Dom people scattered across the Middle East and North Africa. The language is reported to be spoken as far north as Azerbaijan and as far south as central Sudan, in Turkey, Iran, Iraq, Palestine, Israel, Jordan, Egypt, Sudan, Libya, Tunisia, Algeria, Morocco, Syria and Lebanon. Based on the systematicity of sound changes, it is known with a fair degree of certainty that the names Domari and Romani derive from the Indo-Aryan word ḍom. However, the Domari and Romani languages do not derive from the same ancestor idiom. Domari derives from an Indo-Aryan language. The Arabs referred to them as Nawar as they were a nomadic people that originally immigrated to the Middle East from the Indian subcontinent.

<span class="mw-page-title-main">GNU FreeFont</span> Font family

GNU FreeFont is a family of free OpenType, TrueType and WOFF vector fonts, implementing as much of the Universal Character Set (UCS) as possible, aside from the very large CJK Asian character set. The project was initiated in 2002 by Primož Peterlin and is now maintained by Steve White.

<span class="mw-page-title-main">Urdu alphabet</span> Perso-Arabic-based alphabet for Urdu of 40 letters

The Urdu alphabet, is the right-to-left alphabet used for Urdu. It is a modification of the Persian alphabet, which is itself a derivative of the Arabic alphabet. The Urdu alphabet has up to 39 or 40 distinct letters with no distinct letter cases and is typically written in the calligraphic Nastaʿlīq script, whereas Arabic is more commonly written in the Naskh style.

The computer program pdfTeX is an extension of Knuth's typesetting program TeX, and was originally written and developed into a publicly usable product by Hàn Thế Thành as a part of the work for his PhD thesis at the Faculty of Informatics, Masaryk University, Brno, Czech Republic. The idea of making this extension to TeX was conceived during the early 1990s, when Jiří Zlatuška and Phil Taylor discussed some developmental ideas with Donald Knuth at Stanford University. Knuth later met Hàn Thế Thành in Brno during his visit to the Faculty of Informatics to receive an honorary doctorate from Masaryk University.

The varieties of Arabic, a Semitic language within the Afroasiatic family originating in the Arabian Peninsula, are the linguistic systems that Arabic speakers speak natively. There are considerable variations from region to region, with degrees of mutual intelligibility that are often related to geographical distance and some that are mutually unintelligible. Many aspects of the variability attested to in these modern variants can be found in the ancient Arabic dialects in the peninsula. Likewise, many of the features that characterize the various modern variants can be attributed to the original settler dialects. Some organizations, such as SIL International, consider these approximately 30 different varieties to be different languages, while others, such as the Library of Congress, consider them all to be dialects of Arabic.

TexLab TeX editor

TexLab is a TeX editor based on parallel programming. Contrary to other TeX editors which use a text file as input, TexLab was designed to process small binary document modules. TexLab typesetting engine is implemented by a thread pool to execute MiKTeX commands, such as: LaTeX, XeTeX or pdfTeX. MiKTeX is the version of LaTeX for Microsoft Windows; it consists of a set of tools to process text. Christian Schenk is the creator of MiKTeX, he studied Computer Science at TU Berlin. When Christian owned his first PC, he began working on a fun project; this project eventually became MiKTeX. TexLab interacts with Windows API's and Win32 through a thin level of abstraction provided by the C++ classes of Wintempla. TexLab interacts with MiKTeX commands through Anonymous pipes which is an interprocess communication (IPC) method.

References