Fārsi written in Persian (Nastaʿlīq script)
|Pronunciation|| [fɒːɾˈsiː] (|
|70 million |
(110 million total speakers)
Official language in
Areas with significant numbers of Persian speakers (including dialects)
Countries where Persian is an official language
Persian ( /
The Western Iranian languages are a branch of the Iranian languages, attested from the time of Old Persian and Median.
The Indo-Iranian languages, Indo-Iranic languages, or Aryan languages constitute the largest and southeasternmost extant branch of the Indo-European language family. It has more than 1.5 billion speakers, stretching from Europe (Romani), Turkey and the Caucasus (Ossetian) eastward to Xinjiang (Sarikoli) and Assam (Assamese), and south to Sri Lanka (Sinhala) and the Maldives (Maldivian). Furthermore, there are large communities of Indo-Iranian speakers in northwestern Europe, North America and Australia.
The Indo-European languages are a language family of several hundred related languages and dialects.
The Persian language is classified as a continuation of Middle Persian, the official religious and literary language of the Sasanian Empire, itself a continuation of Old Persian, the language of the Achaemenid Empire.Its grammar is similar to that of many contemporary European languages. A Persian-speaking person may be referred to as Persophone.
Middle Persian also known as Pahlavi or Parsik, is the Middle Iranian language or ethnolect of southwestern Iran that during the Sasanian Empire (224–654) became a prestige dialect and so came to be spoken in other regions of the empire as well. Middle Persian is classified as a Western Iranian language. It descends from Old Persian and is the linguistic ancestor of Modern Persian.
The Sasanian Empire, also known as the Sassanian, Sasanid, Sassanid or Neo-Persian Empire, was the last kingdom of the Persian Empire before the rise of Islam. Named after the House of Sasan, it ruled from 224 to 651 AD. The Sasanian Empire succeeded the Parthian Empire and was recognised as one of the leading world powers alongside its neighbouring arch-rival the Roman-Byzantine Empire for a period of more than 400 years.
Old Persian is one of the two directly attested Old Iranian languages. Old Persian appears primarily in the inscriptions, clay tablets and seals of the Achaemenid era. Examples of Old Persian have been found in what is now Iran, Romania (Gherla), Armenia, Bahrain, Iraq, Turkey and Egypt, with the most important attestation by far being the contents of the Behistun Inscription. Recent research (2007) into the vast Persepolis Fortification Archive at the Oriental Institute at the University of Chicago have unearthed Old Persian tablets, which suggest Old Persian was a written language in use for practical recording and not only for royal display.
There are approximately 110 million Persian speakers worldwide, with the language holding official status in Iran, Afghanistan, and Tajikistan. For centuries, Persian has also been a prestigious cultural language in other regions of Western Asia, Central Asia, and South Asia by the various empires based in the regions.
Iran, also called Persia, and officially the Islamic Republic of Iran, is a country in Western Asia. With over 81 million inhabitants, Iran is the world's 18th most populous country. Comprising a land area of 1,648,195 km2 (636,372 sq mi), it is the second largest country in the Middle East and the 17th largest in the world. Iran is bordered to the northwest by Armenia and the Republic of Azerbaijan, to the north by the Caspian Sea, to the northeast by Turkmenistan, to the east by Afghanistan and Pakistan, to the south by the Persian Gulf and the Gulf of Oman, and to the west by Turkey and Iraq. The country's central location in Eurasia and Western Asia, and its proximity to the Strait of Hormuz, give it geostrategic importance. Tehran is the country's capital and largest city, as well as its leading economic and cultural center.
Afghanistan, officially the Islamic Republic of Afghanistan, is a landlocked country located in South-Central Asia. Afghanistan is bordered by Pakistan in the south and east; Iran in the west; Turkmenistan, Uzbekistan, and Tajikistan in the north; and in the far northeast, China. Its territory covers 652,000 square kilometers (252,000 sq mi) and much of it is covered by the Hindu Kush mountain range, which experiences very cold winters. The north consists of fertile plains, while the south-west consists of deserts where temperatures can get very hot in summers. Kabul serves as the capital and its largest city.
Tajikistan, officially the Republic of Tajikistan, is a mountainous, landlocked country in Central Asia with an area of 143,100 km2 (55,300 sq mi) and an estimated population of 8.7 million people as of 2016. It is bordered by Afghanistan to the south, Uzbekistan to the west, Kyrgyzstan to the north, and China to the east. The traditional homelands of the Tajik people include present-day Tajikistan as well as parts of Afghanistan and Uzbekistan.
Persian has had a considerable (mainly lexical) influence on neighboring languages, particularly the Turkic languages in Central Asia, Caucasus, and Anatolia, neighboring Iranian languages, as well as Armenian, Georgian, and Indo-Aryan languages, especially Urdu (a register of Hindustani). It also exerted some influence on Arabic, particularly Bahrani Arabic,while borrowing much vocabulary from it after the Arab conquest of Iran.
A lexicon, word-hoard, wordbook, or word-stock is the vocabulary of a person, language, or branch of knowledge. In linguistics, a lexicon is a language's inventory of lexemes. The word "lexicon" derives from the Greek λεξικόν (lexicon), neuter of λεξικός (lexikos) meaning "of or for words."
The Turkic languages are a language family of at least thirty-five documented languages, spoken by the Turkic peoples of Eurasia from Eastern Europe, the Caucasus, Central Asia, and West Asia all the way to North Asia and East Asia. The Turkic languages originated in a region of East Asia spanning Western China to Mongolia, where Proto-Turkic is thought to have been spoken, according to one estimate, around 2,500 years ago, from where they expanded to Central Asia and farther west during the first millennium.
Central Asia stretches from the Caspian Sea in the west to China in the east and from Afghanistan in the south to Russia in the north. The region consists of the former Soviet republics of Kazakhstan, Kyrgyzstan, Tajikistan, Turkmenistan, and Uzbekistan. It is also colloquially referred to as "the stans" as the countries generally considered to be within the region all have names ending with the Persian suffix "-stan", meaning "land of".
With a long history of literature in the form of Middle Persian before Islam, Persian was the first language in the Muslim world to break through Arabic's monopoly on writing, and the writing of poetry in Persian was established as a court tradition in many eastern courts.Some of the famous works of Persian literature are the Shahnameh of Ferdowsi, the works of Rumi, the Rubaiyat of Omar Khayyam, the Panj Ganj of Nizami Ganjavi, the Divān of Hafez and the two miscellanea of prose and verse by Saadi Shirazi, the Gulistan and the Bustan .
The terms Muslim world and Islamic world commonly refer to the Islamic community (Ummah), consisting of all those who adhere to the religion of Islam, or to societies where Islam is practiced. In a modern geopolitical sense, these terms refer to countries where Islam is widespread, although there are no agreed criteria for inclusion. The term Muslim-majority countries is an alternative often used for the latter sense.
Arabic is usually classified as a Central Semitic language, and linguists widely agree that the language first emerged in the 1st to 4th centuries CE.. It is now the lingua franca of the Arab world. It is named after the Arabs, a term initially used to describe peoples living in the area bounded by Mesopotamia in the east and the Anti-Lebanon mountains in the west, in northwestern Arabia, and in the Sinai Peninsula. Arabic is classified as a macrolanguage comprising 30 modern varieties, including its standard form, Modern Standard Arabic, which is derived from Classical Arabic.
Persian literature comprises oral compositions and written texts in the Persian language and it is one of the world's oldest literatures. It spans over two-and-a-half millennia. Its sources have been within Greater Iran including present-day Iran, Iraq, Afghanistan, the Caucasus, and Turkey, regions of Central Asia and South Asia where the Persian language has historically been either the native or official language. For instance, Rumi, one of best-loved Persian poets born in Balkh or Vakhsh, wrote in Persian and lived in Konya, then the capital of the Seljuks in Anatolia. The Ghaznavids conquered large territories in Central and South Asia and adopted Persian as their court language. There is thus Persian literature from Iran, Mesopotamia, Azerbaijan, the wider Caucasus, Turkey, western parts of Pakistan, India, Tajikistan and other parts of Central Asia. Not all Persian literature is written in Persian, as some consider works written by ethnic Persians in other languages, such as Greek and Arabic, to be included. At the same time, not all literature written in Persian is written by ethnic Persians or Iranians, as Turkic, Caucasian, and Indic poets and writers have also used the Persian language in the environment of Persianate cultures.
Persian is one of the Western Iranian languages within the Indo-European family. Other Western Iranian languages are the Kurdish languages, Gilaki, Mazanderani, Talysh, and Balochi. Persian is classified as a member of the Southwestern subgroup within Western Iranian along with Lari, Kumzari, and Luri.
Kurdish is a continuum of Northwestern Iranian languages spoken by the Kurds in Western Asia. Kurdish forms three dialect groups known as Northern Kurdish (Kurmanji), Central Kurdish (Sorani), and Southern Kurdish. A separate group of non-Kurdish Northwestern Iranian languages, the Zaza–Gorani languages, are also spoken by several million Kurds. Studies as of 2009 estimate between 8 and 20 million native Kurdish speakers in Turkey. The majority of the Kurds speak Northern Kurdish ("Kurmanji").
The Gilaki language is a Caspian language, and a member of the northwestern Iranian language branch, spoken in Iran's Gīlān Province. Gilaki is closely related to Mazandarani and the two languages have similar vocabularies. Though the Persian language has influenced Gilaki to a great extent, Gilaki remains an independent language with a northwestern Iranian origin. The Gilaki and Mazandarani languages share certain typological features with Caucasian languages, reflecting the history, ethnic identity, and close relatedness to the Caucasus region and Caucasian peoples of the Gilaki people and Mazandarani people.
Mazanderani (مازندرانی), also Tabari (طبری), is an Iranian language of the Northwestern branch, spoken mainly in Iran's Mazandaran, Tehran, Alborz, Semnan and Golestan provinces. As a member of the Northwestern branch, etymologically speaking it is rather closely related to Gilaki, and more distantly related to Persian, which belongs to the Southwestern branch. Mazandarani is closely related to Gilaki and the two languages have similar vocabularies. The Gilaki and Mazandarani languages share certain typological features with Caucasian languages, reflecting the history, ethnic identity, and close relatedness to the Caucasus region and Caucasian peoples of the Mazandarani people and Gilaki people. For the most part Mazanderani is mutually unintelligible with Persian.
In Persian, the language is known by several names:
Persian, the historically more widely used name of the language in English, is an anglicized form derived from Latin *Persianus < Latin Persia < Greek Περσίς (Persís) "Persia", a Hellenized form of Old Persian Pārsa. According to the Oxford English Dictionary , the term Persian as a language name is first attested in English in the mid-16th century.
Farsi is the Arabicized form of Pārsi, subsequent to Arab conquest of Iran, due to a lack of the phoneme /p/ in Standard Arabic (i.e., the /p/ was replaced with an /f/). The name Farsi, and the language as a whole, originated in the Fars Province , which itself is the Arabicized form of Pārs. Farsi is encountered in some linguistic literature as a name for the language, used both by Iranian and by foreign authors.
The Academy of Persian Language and Literature, however, has declared that the name Persian is more appropriate, as it has the longer tradition in western languages and better expresses the role of the language as a mark of cultural and national continuity.Some Persian language scholars such as Ehsan Yarshater, editor of Encyclopædia Iranica , and University of Arizona professor Kamran Talattof, have also rejected the usage of "Farsi" in their articles.
The international language-encoding standard ISO 639-1 uses the code
fa, as its coding system is mostly based on the local names. The more detailed standard ISO 639-3 uses the name "Persian" (code
fas) for the dialect continuum spoken across Iran and Afghanistan. This consists of the individual languages Dari (Afghan Persian) and Iranian Persian.
Currently, Voice of America, BBC World Service, Deutsche Welle, and Radio Free Europe/Radio Liberty use "Persian Service" for their broadcasts in the language. Radio Free Europe/Radio Liberty also includes a Tajik service and an Afghan (Dari) service. The American Association of Teachers of Persian, and the Centre for Promotion of Persian Language and Literature also use the name "Persian".
| History of the|
| Proto-Indo-European (c. 3000 BCE) |
| Proto-Indo-Iranian (c. 2000 BCE) |
| Proto-Iranian (c. 1500 BCE) |
| Old Persian (c. 525 – 300 BCE) |
| Middle Persian (c. 300 BCE – 800 CE) |
|Modern Persian (from 800)|
Persian is an Iranian language which belongs to the Indo-Iranian branch of the Indo-European family of languages. In general, Iranian languages are known from three periods, usually referred to as Old, Middle, and New (Modern) periods. These correspond to three eras in Iranian history; Old era being the period from sometime before Achaemenids, the Achaemenid era and sometime after Achaemenids (that is to 400–300 BC), Middle era being the next period most officially Sassanid era and sometime in post-Sassanid era, and the New era being the period afterwards down to present day.
According to available documents, the Persian language is "the only Iranian language"for which close philological relationships between all of its three stages are established and so that Old, Middle, and New Persian represent one and the same language of Persian; that is, New Persian is a direct descendant of Middle and Old Persian.
The known history of the Persian language can be divided into the following three distinct periods:
As a written language, Old Persian is attested in royal Achaemenid inscriptions. The oldest known text written in Old Persian is from the Behistun Inscription, dating to the time of king Darius I (reigned 522-486 BC).Examples of Old Persian have been found in what is now Iran, Romania (Gherla), Armenia, Bahrain, Iraq, Turkey and Egypt. Old Persian is one of the oldest Indo-European languages which is attested in original texts.
Related to Old Persian, but from a different branch of the Iranian language family, was Avestan, the language of the Zoroastrian liturgical texts.
The complex grammatical conjugation and declension of Old Persian yielded to the structure of Middle Persian in which the dual number disappeared, leaving only singular and plural, as did gender. Middle Persian developed the ezāfe construction, expressed through ī (modern ye), to indicate some of the relations between words that have been lost with the simplification of the earlier grammatical system.
Although the "middle period" of the Iranian languages formally begins with the fall of the Achaemenid Empire, the transition from Old to Middle Persian had probably already begun before the 4th century BC. However, Middle Persian is not actually attested until 600 years later when it appears in the Sassanid era (224–651) inscriptions, so any form of the language before this date cannot be described with any degree of certainty. Moreover, as a literary language, Middle Persian is not attested until much later, in the 6th or 7th century. From the 8th century onward, Middle Persian gradually began yielding to New Persian, with the middle-period form only continuing in the texts of Zoroastrianism.
Middle Persian is "essentially, though not in every detail, a later form of the same dialect as Old Persian".
The native name of Middle Persian was Parsig or Parsik, after the name of the ethnic group of the southwest, that is, "of Pars", Old Persian Parsa, New Persian Fars . This is the origin of the name Farsi as it is today used to signify New Persian. Following the collapse of the Sassanid state, Parsik came to be applied exclusively to (either Middle or New) Persian that was written in the Arabic script. From about the 9th century onward, as Middle Persian was on the threshold of becoming New Persian, the older form of the language came to be erroneously called Pahlavi , which was actually but one of the writing systems used to render both Middle Persian as well as various other Middle Iranian languages. That writing system had previously been adopted by the Sassanids (who were Persians, i.e. from the southwest) from the preceding Arsacids (who were Parthians, i.e. from the northeast). While Ibn al-Muqaffa' (eighth century) still distinguished between Pahlavi (i.e. Parthian) and Persian (in Arabic text: al-Farisiyah) (i.e. Middle Persian), this distinction is not evident in Arab commentaries written after that date.
Gernot Windfuhr considers new Persian as an evolution of the Old Persian language and the Middle Persian languagebut also states that none of the known Middle Persian dialects is the direct predecessor of Modern Persian. Ludwig Paul states: "The language of the Shahnameh should be seen as one instance of continuous historical development from Middle to New Persian."
"New Persian" (Modern) is conventionally divided into three stages:
Early New Persian remains largely intelligible to speakers of Contemporary Persian, as the morphology and, to a lesser extent, the lexicon of the language have remained relatively stable.
"New Persian" is taken to replace Middle Persian in the course of the 8th to 9th centuries, under Abbasid rule.With the decline of the Abbasids began the re-establishment of Persian national life and Persians laid the foundations for a renaissance in the realm of letters. New Persian as an independent literary language first emerges in Bactria through the adaptation of the spoken form of Sassanian Middle Persian court language called Pārsi-ye Dari. The cradle of the Persian literary renaissance lay in the east of Greater Iran in Greater Khorasan and Transoxiana close to the Amu Darya (modern day Afghanistan, Tajikstan, Uzbekistan and Turkmenistan). The vocabulary of the New Persian language was thus heavily influenced by other Eastern Iranian languages, particularly Sogdian.
The mastery of the newer speech having now been transformed from Middle into New Persian was already complete by the era of the three princely dynasties of Iranian origin, the Tahirid dynasty (820–872), Saffarid dynasty (860–903) and Samanid Empire (874–999), and could develop only in range and power of expression.
Abbas of Merv is mentioned as being the earliest minstrel to chant verse in the newer Persian tongue and after him the poems of Hanzala Badghisi were among the most famous between the Persian-speakers of the time.
The first poems of the Persian language, a language historically called Dari, emerged in Afghanistan.The first significant Persian poet was Rudaki. He flourished in the 10th century, when the Samanids were at the height of their power. His reputation as a court poet and as an accomplished musician and singer has survived, although little of his poetry has been preserved. Among his lost works is versified fables collected in the Kalila wa Dimna .
The language spread geographically from the 11th century on and was the medium through which among others, Central Asian Turks became familiar with Islam and urban culture. New Persian was widely used as a trans-regional lingua franca, a task for which it was particularly suitable due to its relatively simple morphological structure and this situation persisted until at least 19th century.In the late Middle Ages, new Islamic literary languages were created on the Persian model: Ottoman Turkish, Chagatai and Urdu, which are regarded as "structural daughter languages" of Persian.
"Classical Persian" loosely refers to the standardized language of medieval Persia used in literature and poetry. This is the language of the 10th to 12th centuries, which continued to be used as literary language and lingua franca under the "Persianized" Turko-Mongol dynasties during the 12th to 15th centuries, and under restored Persian rule during the 16th to 19th centuries.
Persian during this time served as lingua franca of Greater Persia and of much of the Indian subcontinent. It was also the official and cultural language of many Islamic dynasties, including the Samanids, Buyids, Tahirids, Ziyarids, the Mughal Empire, Timurids, Ghaznavids, Karakhanids, Seljuqs, Khwarazmians, the Sultanate of Rum, Delhi Sultanate, the Shirvanshahs, Safavids, Afsharids, Zands, Qajars, Khanate of Bukhara, Khanate of Kokand, Emirate of Bukhara, Khanate of Khiva, Ottomans and also many Mughal successors such as the Nizam of Hyderabad. Persian was the only non-European language known and used by Marco Polo at the Court of Kublai Khan and in his journeys through China.
Despite Anatolia having been ruled at various times prior to the Middle Ages by various Persian-speaking dynasties originating in Iran, the language lost its traditional foothold there with the demise of the Sasanian Empire. Centuries later however, the practise and usage of Persian in the region would be strongly revived. A branch of the Seljuks, the Sultanate of Rum, took Persian language, art and letters to Anatolia.They adopted Persian language as the official language of the empire. The Ottomans, which can roughly be seen as their eventual successors, took this tradition over. Persian was the official court language of the empire, and for some time, the official language of the empire. The educated and noble class of the Ottoman Empire all spoke Persian, such as Sultan Selim I, despite being Safavid Iran's archrival and a staunch opposer of Shia Islam. It was a major literary language in the empire. Some of the noted earlier Persian works during the Ottoman rule are Idris Bidlisi's Hasht Bihisht, which begun in 1502 and covered the reign of the first eight Ottoman rulers, and the Salim-Namah, a glorification of Selim I. After a period of several centuries, Ottoman Turkish (which was highly Persianised itself) had developed towards a fully accepted language of literature, which was even able to satisfy the demands of a scientific presentation. However, the number of Persian and Arabic loanwords contained in those works increased at times up to 88%.
The Persian language influenced the formation of many modern languages in West Asia, Europe, Central Asia, and South Asia. Following the Turko-Persian Ghaznavid conquest of South Asia, Persian was firstly introduced in the region by Turkic Central Asians.The basis in general for the introduction of Persian language into the subcontinent was set, from its earliest days, by various Persianized Central Asian Turkic and Afghan dynasties. For five centuries prior to the British colonization, Persian was widely used as a second language in the Indian subcontinent, due to the admiration the Mughals (who were of Turco-Mongol origin) had for the foreign language. It took prominence as the language of culture and education in several Muslim courts on the subcontinent and became the sole "official language" under the Mughal emperors. Beginning in 1843, though, English and Hindustani gradually replaced Persian in importance on the subcontinent. Evidence of Persian's historical influence there can be seen in the extent of its influence on certain languages of the Indian subcontinent. Words borrowed from Persian are still quite commonly used in certain Indo-Aryan languages, especially Urdu, also historically known as Hindustani. There is also a small population of Zoroastrian Iranis in India, who migrated in the 19th century to escape religious execution in Qajar Iran and speak a Dari dialect.
In the 19th century, under the Qajar dynasty, the dialect spoken in Tehran rose to prominence. This became the basis of what is now known as "Contemporary Standard Persian".
There is still substantial Arabic vocabulary, but many of these words have been integrated into Persian phonology and grammar. In addition, since the 19th century numerous Russian, French, and English terms have been borrowed, especially vocabulary related to technology. The Iranian National Academy of Persian Language and Literature is responsible for evaluating neologisms in order to devise their Persian equivalents.
There are three modern varieties of standard Persian:
All these three varieties are based on the classic Persian literature and its literary tradition. There are also several local dialects from Iran, Afghanistan and Tajikistan which slightly differ from the standard Persian. The Hazaragi dialect (in Central Afghanistan and Pakistan), Herati (in Western Afghanistan), Darwazi (in Afghanistan and Tajikistan), and the Tehrani accent (in Iran, the basis of standard Iranian Persian) are examples of these dialects. Persian-speaking peoples of Iran, Afghanistan, and Tajikistan can understand one another with a relatively high degree of mutual intelligibility.
The following are some languages closely related to Persian, or in some cases are considered dialects:
For other more distantly related branches of the Iranian language family, such as Kurdish and Balochi, see Iranian languages.
Iranian Persian has six vowels and twenty-three consonants.
Historically, Persian distinguished length. Early New Persian had a series of five long vowels (/iː/, /uː/, /ɒː/, /oː/ and /eː/) along with three short vowels /æ/, /i/ and /u/. At some point prior to the 16th century in the general area now modern Iran, /eː/ and /iː/ merged into /iː/, and /oː/ and /uː/ merged into /uː/. Thus, older contrasts such as شیرshēr "lion" vs. شیرshīr "milk", and زودzūd "quick" vs زورzōr "strong" were lost. However, there are exceptions to this rule, and in some words, ē and ō are preserved[ citation needed ] or merged into the diphthongs [eɪ] and [oʊ] (which are descendants of the diphthongs [æɪ] and [æʊ] in Early New Persian), instead of merging into /iː/ and /uː/. Examples of the exception can be found in words such as روشن[roʊʃæn] (bright).
However, in Dari, the archaic distinction of /eː/ and /iː/ (respectively known as یای مجهولYā-ye majhūl and یای معروفYā-ye ma'rūf) is still preserved as well as the distinction of /oː/ and /uː/ (known as واو مجهولWāw-e majhūl and واو معروفWāw-e ma'rūf). On the other hand, in standard Tajik, the length distinction has disappeared, and /iː/ merged with /i/ and /uː/ with /u/. Therefore, contemporary Afghan Dari dialects are the closest to the vowel inventory of Early New Persian.
According to most studies on the subject (e.g. Samareh 1977, Pisowicz 1985, Najafi 2001), the three vowels traditionally considered long (/i/, /u/, /ɒ/) are currently distinguished from their short counterparts (/e/, /o/, /æ/) by position of articulation rather than by length. However, there are studies (e.g. Hayes 1979, Windfuhr 1979) that consider vowel length to be the active feature of the system, with /ɒ/, /i/, and /u/ phonologically long or bimoraic and /æ/, /e/, and /o/ phonologically short or monomoraic.
There are also some studies that consider quality and quantity to be both active in the Iranian system (such as Toosarvandani 2004). That offers a synthetic analysis including both quality and quantity, which often suggests that Modern Persian vowels are in a transition state between the quantitative system of Classical Persian and a hypothetical future Iranian language, which will eliminate all traces of quantity and retain quality as the only active feature.
The length distinction is still strictly observed by careful reciters of classic-style poetry for all varieties (including Tajik).
|Plosive||p b||t d||k ɡ||( q ) ɢ|
|Fricative||f v||s z||ʃ ʒ||x ɣ||h|
|Flap or Tap||ɾ|
Suffixes predominate Persian morphology, though there are a small number of prefixes.Verbs can express tense and aspect, and they agree with the subject in person and number. There is no grammatical gender in Persian, and pronouns are not marked for natural gender.
Normal declarative sentences are structured as (S) (PP) (O) V: sentences have optional subjects, prepositional phrases, and objects followed by a compulsory verb. If the object is specific, the object is followed by the word rā and precedes prepositional phrases: (S) (O + rā) (PP) V.
Persian makes extensive use of word building and combining affixes, stems, nouns and adjectives. Persian frequently uses derivational agglutination to form new words from nouns, adjectives, and verbal stems. New words are extensively formed by compounding – two existing words combining into a new one, as is common in German.
While having a lesser influence on Arabicand other languages of Mesopotamia and its core vocabulary being of Middle Persian origin, New Persian contains a considerable amount of Arabic lexical items, which were Persianized and often took a different meaning and usage than the Arabic original. Persian loanwords of Arabic origin especially include Islamic terms. The Arabic vocabulary in other Iranian, Turkic and Indic languages is generally understood to have been copied from New Persian, not from Arabic itself.
John R. Perry, in his article Lexical Areas and Semantic Fields of Arabic, estimates that about 24 percent of an everyday vocabulary of 20,000 words in current Persian, and more than 25 percent of the vocabulary of classical and modern Persian literature, are of Arabic origin. The text frequency of these loan words is generally lower and varies by style and topic area. It may approach 25 percent of a text in literature.According to another source, about 40% of everyday Persian literary vocabulary is of Arabic origin. Among the Arabic loan words, relatively few (14 percent) are from the semantic domain of material culture, while a larger number are from domains of intellectual and spiritual life. Most of the Arabic words used in Persian are either synonyms of native terms or could be glossed in Persian.
The inclusion of Mongolian and Turkic elements in the Persian language should also be mentioned, ارتشarteš for "army", instead of the Uzbek قؤشینqoʻshin; سرلشکرsarlaškar; دریابانdaryābān; etc.) in the 20th century. Persian has likewise influenced the vocabularies of other languages, especially other Indo-European languages such as Armenian, Urdu, Bengali and (to a lesser extent) Hindi; the latter three through conquests of Persianized Central Asian Turkic and Afghan invaders; Turkic languages such as Ottoman Turkish, Chagatai, Tatar, Turkish, Turkmen, Azeri, Uzbek, and Karachay-Balkar; Caucasian languages such as Georgian, and to a lesser extent, Avar and Lezgin; Afro-Asiatic languages like Assyrian (List of loanwords in Assyrian Neo-Aramaic) and Arabic; and even Dravidian languages indirectly especially Telugu and Brahui; as well as Austronesian languages such as Indonesian and Malay. Persian has also had a significant lexical influence, via Turkish, on Albanian, Serbian, Croatian, Bosnian, and Serbo-Croatian, particularly as spoken in Bosnia and Herzegovina.not only because of the political role a succession of Turkic dynasties played in Iranian history, but also because of the immense prestige Persian language and literature enjoyed in the wider (non-Arab) Islamic world, which was often ruled by sultans and emirs with a Turkic background. The Turkish and Mongolian vocabulary in Persian is minor in comparison to that of Arabic and these words were mainly confined to military, pastoral terms and political sector (titles, administration, etc.). New military and political titles were coined based partially on Middle Persian (e.g.
Use of occasional foreign synonyms instead of Persian words can be a common practice in everyday communications as an alternative expression. In some instances in addition to the Persian vocabulary, the equivalent synonyms from multiple foreign languages can be used. For example, in Iranian colloquial Persian (not in Afghanistan or Tajikistan), the phrase "thank you" may be expressed using the French word مرسیmerci (stressed, however, on the first syllable), the hybrid Persian-Arabic phrase متشکّر امmotešakker am (متشکّرmotešakker being "thankful" in Arabic, commonly pronounced motčakker in Persian, and the verb امam meaning "I am" in Persian), or by the pure Persian phrase سپاسگزار امsepās-gozār am.
The vast majority of modern Iranian Persian and Dari text is written with the Arabic script. Tajiki, which is considered by some linguists to be a Persian dialect influenced by Russian and the Turkic languages of Central Asia,is written with the Cyrillic script in Tajikistan (see Tajik alphabet). There also exist several romanization systems for Persian.
Modern Iranian Persian and Afghan Persian are written using a modified variant of the Arabic alphabet, which uses different pronunciation and additional letters not found in Arabic. After the Muslim conquest of Persia, it took approximately 150 years before Persians adopted the Arabic script in place of the older alphabet. Previously, two different scripts were used, Pahlavi, used for Middle Persian, and the Avestan alphabet (in Persian, Dīndapirak or Din Dabire—literally: religion script), used for religious purposes, primarily for the Avestan but sometimes for Middle Persian.
In the modern Persian script, historically short vowels are usually not written, only the historically long ones are represented in the text, so words distinguished from each other only by short vowels are ambiguous in writing: Western Persian kerm "worm", karam "generosity", kerem "cream", and krom "chrome" are all spelled krm (کرم) in Persian. The reader must determine the word from context. The Arabic system of vocalization marks known as harakat is also used in Persian, although some of the symbols have different pronunciations. For example, a ḍammah is pronounced [ʊ~u], while in Iranian Persian it is pronounced [o]. This system is not used in mainstream Persian literature; it is primarily used for teaching and in some (but not all) dictionaries.
There are several letters generally only used in Arabic loanwords. These letters are pronounced the same as similar Persian letters. For example, there are four functionally identical letters for /z/ (ز ذ ض ظ), three letters for /s/ (س ص ث), two letters for /t/ (ط ت), two letters for /h/ (ح ه). On the other hand, there are four letters that don't exist in Arabic پ چ ژ گ.
The Persian alphabet adds four letters to the Arabic alphabet:
|/ʒ/||ژ||že (zhe or jhe)|
Historically, there was also a special letter for the sound /β/. This letter is no longer used, as the /β/-sound changed to /b/, i.e. archaic زڤان /zaβān/ > زبان /zæbɒn/ 'language'
The Persian alphabet also modifies some letters of the Arabic alphabet. For example, alef with hamza below ( إ ) changes to alef ( ا ); words using various hamzas get spelled with yet another kind of hamza (so that مسؤول becomes مسئول) even though the latter is also correct in Arabic; and teh marbuta ( ة ) changes to heh ( ه ) or teh ( ت ).
The letters different in shape are:
|Arabic Style letter||Persian Style letter||name|
The International Organization for Standardization has published a standard for simplified transliteration of Persian into Latin, ISO 233-3, titled "Information and documentation – Transliteration of Arabic characters into Latin characters – Part 3: Persian language – Simplified transliteration" but the transliteration scheme is not in widespread use.
Another Latin alphabet, based on the Common Turkic Alphabet, was used in Tajikistan in the 1920s and 1930s. The alphabet was phased out in favor of Cyrillic in the late 1930s.
Fingilish is Persian using ISO basic Latin alphabet. It is most commonly used in chat, emails and SMS applications. The orthography is not standardized, and varies among writers and even media (for example, typing 'aa' for the [ɒ] phoneme is easier on computer keyboards than on cellphone keyboards, resulting in smaller usage of the combination on cellphones).
The Cyrillic script was introduced for writing the Tajik language under the Tajik Soviet Socialist Republic in the late 1930s, replacing the Latin alphabet that had been used since the October Revolution and the Persian script that had been used earlier. After 1939, materials published in Persian in the Persian script were banned from the country.
The following text is from Article 1 of the Universal Declaration of Human Rights.
|Western Persian||همهی افراد بشر آزاد به دنیا میآیند و حیثیت و حقوقشان با هم برابر است، همه اندیشه و وجدان دارند و باید در برابر یکدیگر با روح برادری رفتار کنند.|
|Hamaye afrâd bašâr âzâd be donyâ miâyand o heysiyat o hoğuğe šân bâ ham barâbar ast hame šân andiše o vejdân dârand o bâjad dar barâbare yekdigar bâ ruhe barâdari raftâr konand.|
|Western Persian IPA||[hæmeje æfrɒde bæʃær ɒzɒd be donjɒ miɒjænd o hejsijæt o hoɢuɢe ʃɒn bɒ hæm bærɒbær æst hæme ʃɒn ændiʃe o vedʒdɒn dɒrænd o bɒjæd dær bærɒbære jekdiɡær bɒ ruhe bærɒdæri ræftɒr konænd]|
|Tajiki||Ҳамаи афроди башар озод ба дунё меоянд ва ҳайсияту ҳуқуқашон бо ҳам баробар аст, ҳамаашон андешаву виҷдон доранд ва бояд дар баробари якдигар бо рӯҳи бародарӣ рафтор кунанд.|
|English translation||All human beings are born free and equal in dignity and rights. They are endowed with reason and conscience and should act towards one another in a spirit of brotherhood.|
The Persians are an Iranian ethnic group that make up over half the population of Iran. They share a common cultural system and are native speakers of the Persian language, as well as closely related languages.
Tajiks are a Persian-speaking Iranian ethnic group native to Afghanistan, Tajikistan, and Uzbekistan. Tajiks are the largest ethnicity in Tajikistan, and the second largest in Afghanistan which constitutes over half of the global Tajik population. They speak varieties of Persian, a Western Iranian language. In Tajikistan, since the 1939 Soviet census, its small Pamiri and Yaghnobi ethnic groups are included as Tajiks. In China, the term is used to refer to its Pamiri ethnic groups, the Tajiks of Xinjiang, who speak the Eastern Iranian Pamiri languages. In Afghanistan, the Pamiris are counted as a separate ethnic group.
Darī or Dari Persian or synonymously Farsi is a variation of the Persian language spoken in Afghanistan. Dari is the term officially recognized and promoted since 1964 by the Afghan government for the Persian language, hence, it is also known as Afghan Persian in many Western sources. This has resulted in a naming dispute. Many Persian speakers in Afghanistan prefer and use the name "Farsi" and say the term Dari has been forced on them by the dominant Pashtun ethnic group as an attempt to distance Afghans from their cultural, linguistic, and historical ties to the Persian-speaking world, which includes Iran and Tajikistan.
Hindustani, also known as Hindi-Urdu and historically also known as Hindavi, Dehlavi and Rekhta, is a lingua franca of Northern India and Pakistan. It is an Indo-Aryan language, deriving its base primarily from the Khariboli dialect of Delhi. The language incorporates a large amount of vocabulary from Prakrit, Sanskrit, as well as Persian and Arabic. It is a pluricentric language, with two official forms, Modern Standard Hindi and Modern Standard Urdu, which are its standardised registers. According to Ethnologue's 2019 estimates, if Hindi and Urdu are taken together as Hindustani, the language would be the 3rd most spoken language in the world, with approximately 409.8 million native speakers and 785.6 million total speakers.
Uzbek is a Turkic language that is the first official and only declared national language of Uzbekistan. The language of Uzbeks, it is spoken by some 33 million native speakers in Uzbekistan and elsewhere in Central Asia.
Pashto, sometimes spelled Pukhto, is the language of the Pashtuns. It is known in Persian literature as Afghāni (افغانی) and in Hindustani literature as Paṭhānī. Speakers of the language are called Pashtuns/Pakhtuns/Pathans and sometimes Afghans. It is an Eastern Iranian language, belonging to the Indo-European family. Pashto is one of the two official languages of Afghanistan and it is the second-largest regional language of Pakistan, mainly spoken in the west and northwest of the country. In Pakistan, it is the majority language of the province of Khyber Pakhtunkhwa and the northern districts of Balochistan. Along with Dari Persian, Pashto is the main language among the Pashtun diaspora around the world. The total number of Pashto-speakers is estimated to be 45–60 million people worldwide.
The Dungan language is a Sinitic language spoken primarily in Kyrgyzstan and Kazakhstan by the Dungan people, an ethnic group related to the Hui people of China. Although it is derived from the Central Plains Mandarin of Gansu and Shaanxi, it is primarily written in Cyrillic and contains loanwords and archaisms not found in other modern varieties of Mandarin.
Hazaragi is an eastern variety of Persian that is spoken by the Hazara people, primarily in the Hazarajat region of central Afghanistan, as well as other Hazara-populated areas of their native living ground of Afghanistan. It is also spoken by the Hazaras of Pakistan and Iran and also by Hazara diaspora living elsewhere. It is mutually intelligible with Dari, one of the two official languages of Afghanistan.
Tajik or Tajiki, also called Tajiki Persian, is the variety of Persian spoken in Tajikistan and Uzbekistan. It is closely related to Dari Persian. Since the beginning of the twentieth century and collapse of the Soviet Union, Tajik has been considered by a number of writers and researchers to be a variety of Persian. The popularity of this conception of Tajik as a variety of Persian was such that, during the period in which Tajik intellectuals were trying to establish Tajik as a language separate from Persian language, Sadriddin Ayni, who was a prominent intellectual and educator, made a statement that Tajik was not a bastardized dialect of Persian. The issue of whether Tajik and Persian are to be considered two dialects of a single language or two discrete languages has political sides to it.
The Uyghur or Uighur language, formerly known as Eastern Turki, is a Turkic language with 10 to 15 million speakers, spoken primarily by the Uyghur people in the Xinjiang Uyghur Autonomous Region of Western China. Significant communities of Uyghur-speakers are located in Kazakhstan, Kyrgyzstan and Uzbekistan, and various other countries have Uyghur-speaking expatriate communities. Uyghur is an official language of the Xinjiang Uyghur Autonomous Region, and is widely used in both social and official spheres, as well as in print, radio, and television, and is used as a common language by other ethnic minorities in Xinjiang.
A pluricentric language or polycentric language is a language with several interacting codified standard forms, often corresponding to different countries. Examples include Armenian, Chinese, English, French, German, Korean, Portuguese, Serbo-Croatian, Spanish and Swedish. A language that has only one formally standardized version is monocentric. Examples include Japanese and Russian.
Karakalpak is a Turkic language spoken by Karakalpaks in Karakalpakstan. It is divided into two dialects, Northeastern Karakalpak and Southeastern Karakalpak. It developed alongside neighboring Kazakh and Uzbek languages, being markedly influenced by both. Typologically, Karakalpak belongs to the Kipchak branch of the Turkic languages, thus being closely related to and partially mutually intelligible to Kazakh.
Fārsīwān is a designation for Persian-speakers in Afghanistan, with diaspora in Iran and elsewhere abroad. More specifically, it is used to refer to a distinct group of farmers in Afghanistan and urban dwellers who are a subgroup of the Tajiks of Afghanistan and Tajikistan. The term excludes the Hazāra and Aymāq tribes who also speak dialects of Persian, but are generally believed to be distinct from the Tajiks. In Afghanistan, the Farsiwan are found predominantly in Herat and Farah provinces. They are roughly the same as the Persians of Eastern Iran. Although the term was originally coined with Persian language's lexical root (Pārsībān), the suffix has been transformed into a Pashto form (-wān), and is usually utilized by the Pashtuns to designate the Persian-speakers.
The Persian language has between six and eight vowel phonemes and twenty-three consonant phonemes. It features contrastive stress and syllable-final consonant clusters.
The Persian language historically influenced many of the modern languages and dialects of the Middle East, Central Asia, Eastern Europe, and South Asia including the standard register Urdu, the national language of Pakistan and an official language in seven states/territories of India. The relationship extends back to around the 3rd millennium BCE, as they both descend from the Proto-Indo-Iranian language.
The Iranian or Iranic languages are a branch of the Indo-Iranian languages in the Indo-European language family that are spoken natively by the Iranian peoples.
The Tajik language has been written in three alphabets over the course of its history: an adaptation of the Perso-Arabic script, an adaptation of the Latin script, and an adaptation of the Cyrillic script. Any script used specifically for Tajik may be referred to as the Tajik alphabet, which is written as алифбои тоҷикӣ in Cyrillic characters, الفبای تاجیکی with Arabic script, and alifboji toçikī in Latin script.
Ishkashimi is an Iranian language spoken predominantly in the Badakhshan Province in Afghanistan and in Gorno-Badakhshan Autonomous Region in Tajikistan.
Romanization of Persian or Latinization of Persian is the representation of the Persian language with the Latin script. Several different romanization schemes exist, each with its own set of rules driven by its own set of ideological goals.
Bible translations into Persian have been made since the fourth or fifth century, although few early manuscripts survive. There are both Jewish and Christian translations from the Middle Ages. Complete translations of the Hebrew Bible and Greek New Testament from original languages were first made in the 19th century by Protestant missionaries.
There are numerous reasons to study Persian: for one thing, Persian is an important language of the Middle East and Central Asia, spoken by approximately 70 million native speakers and roughly 110 million people worldwide.
Since the Arab conquest of the country in 7th century AD, many loan words have entered the language (which from this time has been written with a slightly modified version of the Arabic script) and the literature has been heavily influenced by the conventions of Arabic literature.
|Persian edition of Wikipedia, the free encyclopedia|