Catalan language

Last updated

català, valencià
Pronunciation [kətəˈla] , [valensiˈa]
Native to Spain, Andorra, France, Italy
Ethnicity Aragonese
Speakers L1: 4.1 million (2012) [1]
L2: 5.1 million
Total: 9.2 million
Early form
Standard forms
Latin (Catalan alphabet)
Catalan Braille
Signed Catalan
Official status
Official language in
Recognised minority
language in
Regulated by Institut d'Estudis Catalans
Acadèmia Valenciana de la Llengua
Language codes
ISO 639-1 ca
ISO 639-2 cat
ISO 639-3 cat
Glottolog stan1289
Linguasphere 51-AAA-e
Catalan language in Europe.png
  Territories where Catalan is spoken and is official
  Territories where Catalan is spoken but is not official
  Territories where Catalan is not historically spoken but is official
This article contains IPA phonetic symbols. Without proper rendering support, you may see question marks, boxes, or other symbols instead of Unicode characters. For an introductory guide on IPA symbols, see Help:IPA.
A speaker of Catalan (Majorcan dialect).
Artur Mas, former president of Catalonia, discussing individual identity, collective identity and language.

Catalan ( /ˈkætələn,-æn,ˌkætəˈlæn/ ; [3] [4] autonym: català, Eastern Catalan:  [kətəˈla] ), known in the Valencian Community and Carche as Valencian (autonym: valencià), is a Western Romance language. It is the official language of Andorra, [5] and an official language of three autonomous communities in eastern Spain: Catalonia, the Valencian Community, and the Balearic Islands. It also has semi-official status in the Italian comune of Alghero. [6] It is also spoken in the Pyrénées-Orientales department of France and in two further areas in eastern Spain: the eastern strip of Aragon and the Carche area in the Region of Murcia. The Catalan-speaking territories are often called the Països Catalans or "Catalan Countries".


The language evolved from Vulgar Latin in the Middle Ages around the eastern Pyrenees. Nineteenth-century Spain saw a Catalan literary revival, [7] [8] culminating in the early 1900s.

Etymology and pronunciation

Catalan Countries (Paisos Catalans
): (In orange, strict Catalan-speaking area) NE modern Spain (Catalonia, Valencian Community and Balearic Islands), SE. France (Roussillon, touching the Pyrenees) and Comune of Alghero (NW coast of Sardinia, an island belonging to Italy) Extensio de la llengua catalana als Paisos Catalans.png
Catalan Countries ( Països Catalans ): (In orange, strict Catalan-speaking area) NE modern Spain (Catalonia, Valencian Community and Balearic Islands), SE. France (Roussillon, touching the Pyrenees) and Comune of Alghero (NW coast of Sardinia, an island belonging to Italy)
The Crown of Aragon in 1443. King James the Conqueror [1208-1276] dictated his autobiographical chronicles entirely in Catalan. Some of this territory nowadays makes up the Catalan Countries. Aragonese Empire 1443.svg
The Crown of Aragon in 1443. King James the Conqueror [1208–1276] dictated his autobiographical chronicles entirely in Catalan. Some of this territory nowadays makes up the Catalan Countries .

The word Catalan is derived from the territorial name of Catalonia, itself of disputed etymology. The main theory suggests that Catalunya (Latin Gathia Launia) derives from the name Gothia or Gauthia ("Land of the Goths"), since the origins of the Catalan counts, lords and people were found in the March of Gothia, whence Gothland > Gothlandia > Gothalania > Catalonia theoretically derived. [9] [10]

In English, the term referring to a person first appears in the mid 14th century as Catelaner, followed in the 15th century as Catellain (from French). It is attested a language name since at least 1652. The word Catalan can be pronounced in English as /ˈkætələn/ , /ˈkætəlæn/ or /ˌkætəˈlæn/ . [11] [4]

The endonym is pronounced [kətəˈla] in the Eastern Catalan dialects, and [kataˈla] in the Western dialects. In the Valencian Community and Carche, the term valencià [valensiˈa, ba-] is frequently used instead. Thus, the name "Valencian", although often employed for referring to the varieties specific to the Valencian Community and Carche, is also used by Valencians as a name for the language as a whole, [12] synonymous with "Catalan". [13] [12] Both uses of the term have their respective entries in the dictionaries by the Acadèmia Valenciana de la Llengua [note 1] and the Institut d'Estudis Catalans. [note 2] See also status of Valencian below.


Homilies d'Organya (12th century) Homilies d'Organya.jpg
Homilies d'Organyà (12th century)
Fragment of the Greuges de Guitard Isarn (ca. 1080-1095), one of the earliest texts written almost completely in Catalan, predating the famous Homilies d'Organya by a century Greuges de Guitard Isarn.jpg
Fragment of the Greuges de Guitard Isarn (ca. 1080–1095), one of the earliest texts written almost completely in Catalan, predating the famous Homilies d'Organyà by a century
Linguistic map of Southwestern Europe Linguistic map Southwestern Europe-en.gif
Linguistic map of Southwestern Europe

Middle Ages

By the 9th century, Catalan had evolved from Vulgar Latin on both sides of the eastern end of the Pyrenees, as well as the territories of the Roman province of Hispania Tarraconensis to the south. [8] From the 8th century onwards the Catalan counts extended their territory southwards and westwards at the expense of the Muslims, bringing their language with them. [8] This process was given definitive impetus with the separation of the County of Barcelona from the Carolingian Empire in 988. [8]

In the 11th century, documents written in macaronic Latin begin to show Catalan elements, [15] with texts written almost completely in Romance appearing by 1080. [15] Old Catalan shared many features with Gallo-Romance, diverging from Old Occitan between the 11th and 14th centuries. [16]

During the 11th and 12th centuries the Catalan rulers expanded southward to the Ebro river, [8] and in the 13th century they conquered the Land of Valencia and the Balearic Islands. [8] The city of Alghero in Sardinia was repopulated with Catalan speakers in the 14th century. The language also reached Murcia, which became Spanish-speaking in the 15th century. [17]

In the Low Middle Ages, Catalan went through a golden age, reaching a peak of maturity and cultural richness. [8] Examples include the work of Majorcan polymath Ramon Llull (1232–1315), the Four Great Chronicles (13th–14th centuries), and the Valencian school of poetry culminating in Ausiàs March (1397–1459). [8] By the 15th century, the city of Valencia had become the sociocultural center of the Crown of Aragon, and Catalan was present all over the Mediterranean world. [8] During this period, the Royal Chancery propagated a highly standardized language. [8] Catalan was widely used as an official language in Sicily until the 15th century, and in Sardinia until the 17th. [17] During this period, the language was what Costa Carreras terms "one of the 'great languages' of medieval Europe". [8]

Martorell's outstanding [8] novel of chivalry Tirant lo Blanc (1490) shows a transition from Medieval to Renaissance values, something that can also be seen in Metge's work. [8] The first book produced with movable type in the Iberian Peninsula was printed in Catalan. [18] [8]

Start of the modern era


With the union of the crowns of Castille and Aragon in 1479, the Spanish kings ruled over different kingdoms, each with its own cultural, linguistic and political particularities, and they had to swear by the Laws of each territory before the respective Parliaments. But after the War of the Spanish Succession, Spain became an Absolute monarchy under Philip V, which led to the assimilation of the Crown of Aragon by the Crown of Castile through the Nueva Planta decrees, as a first step in the creation of the Spanish nation-state; as in other contemporary European states, this meant the imposition of the political and cultural characteristics of the dominant groups. [19] [20] Since the political unification of 1714, Spanish assimilation policies towards national minorities have been a constant. [21] [22] [23] [24] [25]

School map of Spain from 1850. On it, the State is shown divided into four parts:- "Fully constitutional Spain", which includes Castile and Andalusia, but also the Galician-speaking territories. - "Annexed or assimilated Spain": the territories of the Crown of Aragon, the larger part of which, with the exception of Aragon proper, are Catalan-speaking-, "Foral Spain", which includes Basque-speaking territories-, and "Colonial Spain", with the last overseas colonial territories. Mapa politico de Espana, 1850.jpg
School map of Spain from 1850. On it, the State is shown divided into four parts:- "Fully constitutional Spain", which includes Castile and Andalusia, but also the Galician-speaking territories. - "Annexed or assimilated Spain": the territories of the Crown of Aragon, the larger part of which, with the exception of Aragon proper, are Catalan-speaking-, "Foral Spain", which includes Basque-speaking territories-, and "Colonial Spain", with the last overseas colonial territories.

The process of assimilation began with secret instructions to the corregidores of the Catalan territory: they "will take the utmost care to introduce the Castilian language, for which purpose he will give the most temperate and disguised measures so that the effect is achieved, without the care being noticed." [26] From there, actions in the service of assimilation, discreet or aggressive, were continued, and reached to the last detail, such as, in 1799, the Royal Certificate forbidding anyone to "represent, sing and dance pieces that were not in Spanish." [27] Anyway, the use of Spanish gradually became more prestigious [17] and marked the start of the decline of Catalan. [8] [7] Starting in the 16th century, Catalan literature came under the influence of Spanish, and the nobles, part of the urban and literary classes became bilingual. [17]


With the Treaty of the Pyrenees (1659), Spain ceded the northern part of Catalonia to France, and soon thereafter the local Catalan varieties came under the influence of French, which in 1700 became the sole official language of the region. [5] [28]

Shortly after the French Revolution (1789), the French First Republic prohibited official use of, and enacted discriminating policies against, the regional languages of France, such as Catalan, Alsatian, Breton, Occitan, Flemish, and Basque.

France: 19th to 20th centuries

Official decree prohibiting the Catalan language in France Interdiction officielle de la langue catalana 2 avril 1700.jpg
Official decree prohibiting the Catalan language in France
"Speak French, be clean", school wall in Ayguatebia-Talau (Northern Catalonia), 2010 SpeakFrenchBeClean.jpg
"Speak French, be clean", school wall in Ayguatébia-Talau (Northern Catalonia), 2010

Following the French establishment of the colony of Algeria from 1830 onward, it received several waves of Catalan-speaking settlers. People from the Spanish Alicante province settled around Oran, whereas Algiers received immigration from Northern Catalonia and Menorca.

Their speech was known as patuet . [29] By 1911, the number of Catalan speakers was around 100,000. [30] After the declaration of independence of Algeria in 1962, almost all the Catalan speakers fled to Northern Catalonia (as Pieds-Noirs ) [31] or Alacant. [32]

The government of France formally recognizes only French as an official language. Nevertheless, on 10 December 2007, the General Council of the Pyrénées-Orientales officially recognized Catalan as one of the languages of the department [33] and seeks to further promote it in public life and education.

Spain: 18th to 20th centuries

In Spain, the decline of Catalan continued into the 18th century. The defeat of the pro-Habsburg coalition in the War of Spanish Succession (1714) initiated a series of laws which, among other centralizing measures, imposed the use of Spanish in legal documentation all over Spain.

However, the 19th century saw a Catalan literary revival ( Renaixença ), which has continued up to the present day. [5] This period starts with Aribau's Ode to the Homeland (1833); followed in the second half of the 19th century, and the early 20th by the work of Verdaguer (poetry), Oller (realist novel), and Guimerà (drama). [34] In the 19th century, the region of Carche, in the province of Murcia was repopulated with Valencian speakers. [35] Catalan spelling was standardized in 1913 and the language became official during the Second Spanish Republic (1931–1939). The Second Spanish Republic saw a brief period of tolerance, with most restrictions against Catalan lifted. [5]

The Catalan language and culture were frowned upon during the Spanish Civil War (1936–1939) and the subsequent decades in Francoist Catalonia. The Francoist dictatorship (1939–1975) imposed the use of Spanish in schools and in public administration in all of Spain. However, in 1944, it became mandatory by law for universities with Romance Philology to include the subject of Catalan Philology. [ citation needed ] Numerous and prestigious cultural contests were created to reward works produced in Catalan. In January 1944, the "Eugenio Nadal" award was created. In 1945, with the sponsorship and subsidy of the Government, the centenary of Mossèn Cinto Verdaguer was celebrated. In 1947 the Joan Martorell prize for novels in Catalan was awarded. In 1949, the Víctor Català award for short novels in Catalan and the Aedos awards for biographies, the Josep Ysart award for essays, and the Ossa Menor award, later renamed Carles Riba, were created. In 1951, a national prize was awarded to poetry in Catalan with the same financial amount as Spanish poetry. That same year, Selecta Editions was founded for works written in Catalan. And the Joanot Martorell is awarded to Josep Pla for his work El carrer estret. In subsequent years (50s, 60s and 70s) countless awards were born, such as the Lletra d'Or, Amadeu Oller for poetry, the Sant Jordi for novels (endowed with 150,000 pesetas), the Honor Award of Catalan Letters, the Verdaguer, the Josep Pla Prize, the Mercè Rodoreda Prize for short stories and narratives. [36] The first Catalan-language TV show was broadcast during the Franco period, in 1964. [37] The Francoist dictatorship (1939–1975) banned the use of Catalan in schools and in public administration. [38] [7] At the same time, oppression of the Catalan language and identity was carried out in schools, through governmental bodies, and in religious centers. [39] Franco's desire for a homogenous Spanish population resonated with some Catalans in favor of his regime, primarily members of the upper class, who began to reject the use of Catalan. Despite all of these hardships, Catalan continued to be used privately within households, and it was able to survive Francisco Franco's dictatorship. Several prominent Catalan authors resisted the suppression through literature. [40]

In addition to the loss of prestige for Catalan and its prohibition in schools, migration during the 1950s into Catalonia from other parts of Spain also contributed to the diminished use of the language. These migrants were often unaware of the existence of Catalan, and thus felt no need to learn or use it. Catalonia was the economic powerhouse of Spain, so these migrations continued to occur from all corners of the country. Employment opportunities were reduced for those who were not bilingual. [41]

Present day

Since the Spanish transition to democracy (1975–1982), Catalan has been institutionalized as an official language, language of education, and language of mass media; all of which have contributed to its increased prestige. [42] In Catalonia, there is an unparalleled large bilingual European non-state linguistic community. [42] The teaching of Catalan is mandatory in all schools, [5] but it is possible to use Spanish for studying in the public education system of Catalonia in two situations – if the teacher assigned to a class chooses to use Spanish, or during the learning process of one or more recently arrived immigrant students. [43] There is also some intergenerational shift towards Catalan. [5]

More recently, several Spanish political forces have tried to increase the use of Spanish in the Catalan educational system. As a result, in May 2022 the Spanish Supreme Court urged the Catalan regional government to enforce a measure by which 25% of all lessons must be taught in Spanish. [44]

According to the Statistical Institute of Catalonia, in 2013 the Catalan language is the second most commonly used in Catalonia, after Spanish, as a native or self-defining language: 7% of the population self-identifies with both Catalan and Spanish equally, 36.4% with Catalan and 47.5% only Spanish. [45] In 2003 the same studies concluded no language preference for self-identification within the population above 15 years old: 5% self-identified with both languages, 44.3% with Catalan and 47.5% with Spanish. [46] To promote use of Catalan, the Generalitat de Catalunya (Catalonia's official Autonomous government) spends part of its annual budget on the promotion of the use of Catalan in Catalonia and in other territories, with entities such as Consorci per a la Normalització Lingüística  [ ca; es ] (Consortium for Linguistic Normalization) [47] [48]

In Andorra, Catalan has always been the sole official language. [5] Since the promulgation of the 1993 constitution, several policies favoring Catalan have been enforced, like Catalan medium education. [5]

On the other hand, there are several language shift processes currently taking place. In the Northern Catalonia area of France, Catalan has followed the same trend as the other minority languages of France, with most of its native speakers being 60 or older (as of 2004). [5] Catalan is studied as a foreign language by 30% of the primary education students, and by 15% of the secondary. [5] The cultural association La Bressola promotes a network of community-run schools engaged in Catalan language immersion programs.

In Alicante province, Catalan is being replaced by Spanish and in Alghero by Italian. [42] There is also well ingrained diglossia in the Valencian Community, Ibiza, and to a lesser extent, in the rest of the Balearic islands. [5]

During the 20th century many Catalans emigrated or went into exile to Venezuela, Mexico, Cuba, Argentina and other South American countries. They formed a large number of Catalan colonies that today continue to maintain the Catalan language. [49] [50] They also founded many Catalan casals (associations). [51]

Classification and relationship with other Romance languages

Chart of Romance languages based on structural and comparative criteria, not on socio-functional ones. FP: Franco-Provencal, IR: Istro-Romanian. Romance-lg-classification-en.svg
Chart of Romance languages based on structural and comparative criteria, not on socio-functional ones. FP: Franco-Provençal, IR: Istro-Romanian.

One classification of Catalan is given by Pèire Bèc:

However, the ascription of Catalan to the Occitano-Romance branch of Gallo-Romance languages is not shared by all linguists and philologists, particularly among Spanish ones, such as Ramón Menéndez Pidal.

Catalan bears varying degrees of similarity to the linguistic varieties subsumed under the cover term Occitan language (see also differences between Occitan and Catalan and Gallo-Romance languages). Thus, as it should be expected from closely related languages, Catalan today shares many traits with other Romance languages.

Relationship with other Romance languages

Some include Catalan in Occitan, as the linguistic distance between this language and some Occitan dialects (such as the Gascon language) is similar to the distance among different Occitan dialects. Catalan was considered a dialect of Occitan until the end of the 19th century [52] and still today remains its closest relative. [53]

Catalan shares many traits with the other neighboring Romance languages (Occitan, French, Italian, Sardinian as well as Spanish and Portuguese among others). [35] However, despite being spoken mostly on the Iberian Peninsula, Catalan has marked differences with the Iberian Romance group (Spanish and Portuguese) in terms of pronunciation, grammar, and especially vocabulary; showing instead its closest affinity with languages native to France and northern Italy, particularly Occitan [54] [55] [56] and to a lesser extent Gallo-Romance (Franco-Provençal, French, Gallo-Italian). [57] [58] [59] [60] [54] [55] [56]

According to Ethnologue, the lexical similarity between Catalan and other Romance languages is: 87% with Italian; 85% with Portuguese and Spanish; 76% with Ladin and Romansh; 75% with Sardinian; and 73% with Romanian. [1]

Lexical comparison of 24 words among Romance languages:
17 cognates with Gallo-Romance, 5 isoglosses with Iberian Romance, 3 isoglosses with Occitan, and 1 unique word. [58] [59]
GlossCatalan Occitan (Campidanese) Sardinian Italian French Spanish Portuguese Romanian
cousincosícosinfradilicuginocousinprimoprimo, coirmãovăr
summerestiuestiuistadiestateétéverano, estío [61] verão, estio [61] vară
eveningvespreser, vèspreseruserasoirtarde, noche [62] tarde, serão [62] seară
morningmatímatinmangianumattinamatinmañanamanhã, matinadimineață
frying panpaellapadenapaellapadellapoêlesarténfrigideira, fritadeiratigaie
bedllitlièch, lèitletulettolitcama, lechocama, leitopat
birdocell, auaucèlpilloniuccellooiseauave, pájaroave, pássaropasăre
doggos, cagos, canhcanicanechienperro, cancão, cachorrocâine
buttermantegabodreburru, butiruburrobeurremantequilla, mantecamanteigaunt
piecetrostròç, petaçarrogupezzomorceau, piècepedazo, trozo [63] pedaço, bocadobucată
graygrisgriscanugrigiogrisgris, pardo [64] cinzento, grisgri, [65] sur, cenușiu
too muchmassatròptroputroppotropdemasiadodemais, demasiadoprea
to wantvolervòlerbolli(ri)volerevouloirquererquerera vrea
to takeprendreprene, prendrepigaiprendereprendretomar, prenderapanhar, levara lua
to praypregarpregarpregaipregareprierorarorar, rezar, pregara se ruga
to askdemanar/preguntardemandardimandai, preguntaidomandaredemanderpedir, preguntarpedir, perguntara cere, a întreba
to searchcercar/buscarcercarcircaicercarechercherbuscarprocurar, buscara căuta
to arrivearribararribararribaiarrivarearriverllegar, arribarchegara ajunge
to speakparlarparlarchistionnai, fueddaiparlareparlerhablar, parlarfalar, palrara vorbi
to eatmenjarmanjarpappaimangiaremangercomer (manyar in lunfardo; papear in slang)comer (papar in slang), manjara mânca
Catalan and Spanish cognates with different meanings [60]
Latin Catalan Spanish
accostare acostar "to bring closer" acostar "to put to bed"
levare llevar "to remove;
wake up"
llevar "to take"
trahere traure "to remove" traer "to bring"
circare cercar "to search" cercar "to fence"
collocare colgar "to bury" colgar "to hang"
mulier muller "wife" mujer "woman or wife"

During much of its history, and especially during the Francoist dictatorship (1939–1975), the Catalan language was ridiculed as a mere dialect of Spanish. [55] [56] This view, based on political and ideological considerations, has no linguistic validity. [55] [56] Spanish and Catalan have important differences in their sound systems, lexicon, and grammatical features, placing the language in features closer to Occitan (and French). [55] [56]

There is evidence that, at least from the 2nd century a.d., the vocabulary and phonology of Roman Tarraconensis was different from the rest of Roman Hispania. [54] Differentiation arose generally because Spanish, Asturian, and Galician-Portuguese share certain peripheral archaisms (Spanish hervir, Asturian and Portuguese ferver vs. Catalan bullir, Occitan bolir "to boil") and innovatory regionalisms (Sp novillo, Ast nuviellu vs. Cat torell, Oc taurèl "bullock"), while Catalan has a shared history with the Western Romance innovative core, especially Occitan. [66] [54]

Like all Romance languages, Catalan has a handful of native words which are unique to it, or rare elsewhere. These include:

The Gothic superstrate produced different outcomes in Spanish and Catalan. For example, Catalan fang "mud" and rostir "to roast", of Germanic origin, contrast with Spanish lodo and asar , of Latin origin; whereas Catalan filosa "spinning wheel" and templa "temple", of Latin origin, contrast with Spanish rueca and sien , of Germanic origin. [54]

The same happens with Arabic loanwords. Thus, Catalan alfàbia "large earthenware jar" and rajola "tile", of Arabic origin, contrast with Spanish tinaja and teja , of Latin origin; whereas Catalan oli "oil" and oliva "olive", of Latin origin, contrast with Spanish aceite and aceituna . [54] However, the Arabic element in Spanish is generally much more prevalent. [54]

Situated between two large linguistic blocks (Iberian Romance and Gallo-Romance), Catalan has many unique lexical choices, such as enyorar "to miss somebody", apaivagar "to calm somebody down", and rebutjar "reject". [54]

Geographic distribution

Catalan-speaking territories

Traditionally Catalan-speaking territories in dark gray; non-Catalan-speaking territories belonging to traditionally Catalan-speaking regions in light gray

Traditionally Catalan-speaking territories are sometimes called the Països Catalans (Catalan Countries), a denomination based on cultural affinity and common heritage, that has also had a subsequent political interpretation but no official status. Various interpretations of the term may include some or all of these regions.

Territories where Catalan is spoken [35]
StateTerritoryCatalan nameNotes
Andorra AndorraAndorraA sovereign state where Catalan is the national and the sole official language. The Andorrans speak a Western Catalan variety. [lower-alpha 1]
France Northern Catalonia Catalunya NordRoughly corresponding to the département of Pyrénées-Orientales. [35]
Spain Catalonia CatalunyaIn the Aran Valley (northwest corner of Catalonia), in addition to Occitan, which is the local language, Catalan, Spanish and French are also spoken. [35]
Valencian Community Comunitat ValencianaExcepting some regions in the west and south which have been Aragonese/Spanish-speaking since at least the 18th century. [35] The Western Catalan variety spoken there is known as "Valencian".

La Franja
La FranjaA part of the Autonomous Community of Aragon, specifically a strip bordering Western Catalonia. It comprises the comarques of Ribagorça, Llitera, Baix Cinca, and Matarranya.
Balearic Islands Illes BalearsComprising the islands of Mallorca, Menorca, Ibiza and Formentera.
Carche El CarxeA small area of the Autonomous Community of Murcia, settled in the 19th century. [35]
Italy Alghero L'AlguerA city in the Province of Sassari, on the island of Sardinia, where the Algherese dialect is spoken.

Number of speakers

The number of people known to be fluent in Catalan varies depending on the sources used. A 2004 study did not count the total number of speakers, but estimated a total of 9–9.5 million by matching the percentage of speakers to the population of each area where Catalan is spoken. [68] The web site of the Generalitat de Catalunya estimated that as of 2004 there were 9,118,882 speakers of Catalan. [69] These figures only reflect potential speakers; today it is the native language of only 35.6% of the Catalan population. [70] According to Ethnologue , Catalan had 4.1 million native speakers and 5.1 million second-language speakers in 2021. [1]

Geographical distribution of Catalan language by official status Llengua catalana al mon.svg
Geographical distribution of Catalan language by official status

According to a 2011 study the total number of Catalan speakers is over 9.8 million, with 5.9 million residing in Catalonia. More than half of them speak Catalan as a second language, with native speakers being about 4.4 million of those (more than 2.8 in Catalonia). [71] Very few Catalan monoglots exist; basically, virtually all of the Catalan speakers in Spain are bilingual speakers of Catalan and Spanish, with a sizable population of Spanish-only speakers of immigrant origin (typically born outside Catalonia or whose parents were both born outside Catalonia) [ citation needed ] existing in the major Catalan urban areas as well.

In Roussillon, only a minority of French Catalans speak Catalan nowadays, with French being the majority language for the inhabitants after a continued process of language shift. According to a 2019 survey by the Catalan government, 31.5% of the inhabitants of Catalonia have Catalan as first language at home whereas 52.7% have Spanish, 2.8% both Catalan and Spanish and 10.8% other languages. [72]

Spanish is the most spoken language in Barcelona (according to the linguistic census held by the Government of Catalonia in 2013) and it is understood almost universally. According to this census of 2013 Catalan is also very commonly spoken in the city of 1,501,262: it is understood by 95% of the population, while 72.3% over the age of 2 can speak it (1,137,816), 79% can read it (1,246.555), and 53% can write it (835,080). [73] The proportion in Barcelona who can speak it, 72.3%, [74] is lower than that of the overall Catalan population, of whom 81.2% over the age of 15 speak the language. Knowledge of Catalan has increased significantly in recent decades thanks to a language immersion educational system. An important social characteristic of the Catalan language is that all the areas where it is spoken are bilingual in practice: together with the French language in Roussillon, with Italian in Alghero, with Spanish and French in Andorra and with Spanish in the rest of the territories.

TerritoryStateUnderstand 1 [75] Can speak 2 [75]
Valencian CommunitySpain3,448,7802,407,951
Balearic IslandsSpain852,780706,065
Roussillon France203,121125,621
La Franja (Aragon)Spain47,25045,000
Alghero (Sardinia)Italy20,00017,625
Carche (Murcia)SpainNo dataNo data
Total Catalan-speaking territories 11,150,2189,062,637
Rest of WorldNo data350,000
1. ^ The number of people who understand Catalan includes those who can speak it.
2. ^ Figures relate to all self-declared capable speakers, not just native speakers.

Level of knowledge

Catalonia [76] 81.294.485.565.3
Valencian Community57.578.154.932.5
Balearic Islands74.693.179.646.9
Franja Oriental of Aragón88.898.572.930.3

(% of the population 15 years old and older).

Social use

AreaAt homeOutside home
Valencian Community3732
Balearic Islands4441
Franja Oriental of Aragón7061

(% of the population 15 years old and older).

Native language

Valencian Community1,047,00021.1%
Balearic Islands392,00036.1%
Franja Oriental of Aragon33,00070.2%

[77] [78] [79]


Catalan phonology varies by dialect. Notable features include: [80]

In contrast to other Romance languages, Catalan has many monosyllabic words, and these may end in a wide variety of consonants, including some consonant clusters. [80] Additionally, Catalan has final obstruent devoicing, which gives rise to an abundance of such couplets as amic ("male friend") vs. amiga ("female friend"). [80]

Central Catalan pronunciation is considered to be standard for the language. [81] The descriptions below are mostly representative of this variety. [82] For the differences in pronunciation between the different dialects, see the section on pronunciation of dialects in this article.


Vowels of Standard Eastern Catalan Catalan vowel chart.svg
Vowels of Standard Eastern Catalan

Catalan has inherited the typical vowel system of Vulgar Latin, with seven stressed phonemes: /a ɛ e i ɔ o u/, a common feature in Western Romance, with the exception of Spanish. [80] Balearic also has instances of stressed /ə/. [84] Dialects differ in the different degrees of vowel reduction, [85] and the incidence of the pair /ɛ e/. [86]

In Central Catalan, unstressed vowels reduce to three: /a e ɛ/ > [ə]; /o ɔ u/ > [u]; /i/ remains distinct. [87] The other dialects have different vowel reduction processes (see the section pronunciation of dialects in this article).

Examples of vowel reduction processes in Central Catalan [88]
The root is stressed in the first word and unstressed in the second
Front vowelsBack vowels
gel ("ice")
gelat ("ice cream")
pedra ("stone")
pedrera ("quarry")
banya ("he bathes")
banyem ("we bathe")
cosa ("thing")
coseta ("little thing")
tot ("everything")
total ("total")


Catalan consonants [89]
Bilabial Alveolar
/ Dental
Palatal Velar
Nasal m n ɲ ŋ
Plosive voiceless p t c ~ k
voiced b d ɟ ~ ɡ
Affricate voiceless ts
voiced dz
Fricative voiceless f s ʃ
voiced ( v ) z ʒ
Approximant central j w
lateral l ʎ
Tap ɾ
Trill r

The consonant system of Catalan is rather conservative.

  • /l/ has a velarized allophone in syllable coda position in most dialects. [90] However, /l/ is velarized irrespective of position in Eastern dialects like Majorcan [91] and standard Eastern Catalan.
  • /v/ occurs in Balearic, [92] Algherese, standard Valencian and some areas in southern Catalonia. [93] It has merged with /b/ elsewhere. [94]
  • Voiced obstruents undergo final-obstruent devoicing: /b/ > [p], /d/ > [t], /ɡ/ > [k]. [95]
  • Voiced stops become lenited to approximants in syllable onsets, after continuants: /b/ > [ β ], /d/ > [ ð ], /ɡ/ > [ ɣ ]. [96] Exceptions include /d/ after lateral consonants, and /b/ after /f/. In coda position, these sounds are realized as stops, [97] except in some Valencian dialects where they are lenited. [98]
  • There is some confusion in the literature about the precise phonetic characteristics of /ʃ/, /ʒ/, /tʃ/, /dʒ/. Some sources [92] describe them as "postalveolar". Others [99] [100] as "back alveolo-palatal", implying that the characters ɕ ʑ tɕ dʑ would be more accurate. However, in all literature only the characters for palato-alveolar affricates and fricatives are used, even when the same sources use ɕ ʑ for other languages like Polish and Chinese. [101] [102] [100]
  • The distribution of the two rhotics /r/ and /ɾ/ closely parallels that of Spanish. Between vowels, the two contrast, but they are otherwise in complementary distribution: in the onset of the first syllable in a word, [ r ] appears unless preceded by a consonant. Dialects vary in regards to rhotics in the coda with Western Catalan generally featuring [ ɾ ] and Central Catalan dialects featuring a weakly trilled [ r ] unless it precedes a vowel-initial word in the same prosodic unit, in which case [ ɾ ] appears. [103]
  • In careful speech, /n/, /m/, /l/ may be geminated. Geminated /ʎ/ may also occur. [92] Some analyze intervocalic [r] as the result of gemination of a single rhotic phoneme. [104] This is similar to the common analysis of Spanish and Portuguese rhotics. [105]

Phonological evolution


Catalan sociolinguistics studies the situation of Catalan in the world and the different varieties that this language presents. It is a subdiscipline of Catalan philology and other affine studies and has as an objective to analyze the relation between the Catalan language, the speakers and the close reality (including the one of other languages in contact).

Preferential subjects of study



Main dialects of Catalan Catalan dialects-en.png
Main dialects of Catalan

The dialects of the Catalan language feature a relative uniformity, especially when compared to other Romance languages; [60] both in terms of vocabulary, semantics, syntax, morphology, and phonology. [109] Mutual intelligibility between dialects is very high, [35] [110] [81] estimates ranging from 90% to 95%. [1] The only exception is the isolated idiosyncratic Algherese dialect. [60]

Catalan is split in two major dialectal blocks: Eastern and Western. [81] [109] The main difference lies in the treatment of unstressed a and e; which have merged to /ə/ in Eastern dialects, but which remain distinct as /a/ and /e/ in Western dialects. [60] [81] There are a few other differences in pronunciation, verbal morphology, and vocabulary. [35]

Western Catalan comprises the two dialects of Northwestern Catalan and Valencian; the Eastern block comprises four dialects: Central Catalan, Balearic, Rossellonese, and Algherese. [81] Each dialect can be further subdivided in several subdialects. The terms "Catalan" and "Valencian" (respectively used in Catalonia and the Valencian Community) refer to two varieties of the same language. [111] There are two institutions regulating the two standard varieties, the Institute of Catalan Studies in Catalonia and the Valencian Academy of the Language in the Valencian Community.

Central Catalan is considered the standard pronunciation of the language and has the largest number of speakers. [81] It is spoken in the densely populated regions of the Barcelona province, the eastern half of the province of Tarragona, and most of the province of Girona. [81]

Catalan has an inflectional grammar. Nouns have two genders (masculine, feminine), and two numbers (singular, plural). Pronouns additionally can have a neuter gender, and some are also inflected for case and politeness, and can be combined in very complex ways. Verbs are split in several paradigms and are inflected for person, number, tense, aspect, mood, and gender. In terms of pronunciation, Catalan has many words ending in a wide variety of consonants and some consonant clusters, in contrast with many other Romance languages. [80]

Main dialectal divisions of Catalan [81] [112]
BlockWestern CatalanEastern Catalan
Dialect Northwestern Valencian Central Balearic Northern/Rossellonese Algherese
Area Spain, Andorra Spain France Italy
Andorra, Provinces of Lleida, western half of Tarragona, La Franja Autonomous community of Valencia, Carche Provinces of Barcelona, eastern half of Tarragona, most of Girona Balearic islands Roussillon/Northern Catalonia City of Alghero in Sardinia



Catalan has inherited the typical vowel system of Vulgar Latin, with seven stressed phonemes: /a ɛ e i ɔ o u/, a common feature in Western Romance, except Spanish. [80] Balearic has also instances of stressed /ə/. [84] Dialects differ in the different degrees of vowel reduction, [85] and the incidence of the pair /ɛ e/. [86]

In Eastern Catalan (except Majorcan), unstressed vowels reduce to three: /a e ɛ/ > [ə]; /o ɔ u/ > [u]; /i/ remains distinct. [87] There are a few instances of unreduced [e], [o] in some words. [87] Algherese has lowered [ə] to [a].

In Majorcan, unstressed vowels reduce to four: /a e ɛ/ follow the Eastern Catalan reduction pattern; however /o ɔ/ reduce to [o], with /u/ remaining distinct, as in Western Catalan. [113]

In Western Catalan, unstressed vowels reduce to five: /e ɛ/ > [e]; /o ɔ/ > [o]; /a u i/ remain distinct. [114] [115] This reduction pattern, inherited from Proto-Romance, is also found in Italian and Portuguese. [114] Some Western dialects present further reduction or vowel harmony in some cases. [114] [116]

Central, Western, and Balearic differ in the lexical incidence of stressed /e/ and /ɛ/. [86] Usually, words with /ɛ/ in Central Catalan correspond to /ə/ in Balearic and /e/ in Western Catalan. [86] Words with /e/ in Balearic almost always have /e/ in Central and Western Catalan as well.[ vague ] [86] As a result, Central Catalan has a much higher incidence of /ɛ/. [86]

Different incidence of stressed /e/, /ə/, /ɛ/ [86]
set ("thirst")/ˈset//ˈsət//ˈsɛt//ˈset/
ven ("he sells")/ˈven//ˈvən//ˈbɛn//ˈven/
General differences in the pronunciation of unstressed vowels in different dialects [81] [117]
mare ("mother")/ˈmaɾe//ˈmaɾə/
cançó ("song")/kanˈso//kənˈso//kənˈsu/
posar ("to put")/poˈza(ɾ)//puˈza(ɾ)/
ferro ("iron")/ˈfɛro//ˈfɛru/
Detailed examples of vowel reduction processes in different dialects [88]
Word pairs:
the first with stressed root,
the second with unstressed root
gel ("ice")
gelat ("ice cream")
pera ("pear")
perera ("pear tree")
pedra ("stone")
pedrera ("quarry")
banya ("he bathes")
banyem ("we bathe")
Majorcan: banyam ("we bathe")
cosa ("thing")
coseta ("little thing")
tot ("everything")
total ("total")



Western Catalan: In verbs, the ending for 1st-person present indicative is -e in verbs of the 1st conjugation and -∅ in verbs of the 2nd and 3rd conjugations in most of the Valencian Community, or -o in all verb conjugations in the Northern Valencian Community and Western Catalonia.
E.g. parle, tem, sent (Valencian); parlo, temo, sento (Northwestern Catalan).

Eastern Catalan: In verbs, the ending for 1st-person present indicative is -o, -i, or -∅ in all conjugations.
E.g. parlo (Central), parl (Balearic), and parli (Northern), all meaning ('I speak').

1st-person singular present indicative forms
ConjugationEastern CatalanWestern CatalanGloss
1stparloparliparlparleparlo'I speak'
2ndtemotemitemtemtemo'I fear'
3rdpuresentosentisentsentsento'I feel', 'I hear'
inchoativepoleixopoleixipoleix or polescpolisc or polescpol(e)ixo'I polish'

Western Catalan: In verbs, the inchoative endings are -isc/-esc, -ix, -ixen, -isca/-esca.

Eastern Catalan: In verbs, the inchoative endings are -eixo, -eix, -eixen, -eixi.

Western Catalan: In nouns and adjectives, maintenance of /n/ of medieval plurals in proparoxytone words.
E.g. hòmens 'men', jóvens 'youth'.

Eastern Catalan: In nouns and adjectives, loss of /n/ of medieval plurals in proparoxytone words.
E.g. homes 'men', joves 'youth' (Ibicencan, however, follows the model of Western Catalan in this case [118] ).


Despite its relative lexical unity, the two dialectal blocks of Catalan (Eastern and Western) show some differences in word choices. [54] Any lexical divergence within any of the two groups can be explained as an archaism. Also, usually Central Catalan acts as an innovative element. [54]

Selection of different words between Western and Eastern Catalan
Gloss"mirror""boy""broom""navel""to exit"
Eastern Catalanmirallnoiescombrallombrígolsortir
Western Catalanespillxiquetgranerameliceixir


Casa de Convalescencia, Headquarters of the Institut d'Estudis Catalans Casa de Convalescencia - IEC.JPG
Casa de Convalescència, Headquarters of the Institut d'Estudis Catalans
Written varieties
Catalan (IEC)Valencian (AVL)gloss
conèixerconéixerto know
treuretrauretake out
néixernàixerto be born
mevameuamy, mine

Standard Catalan, virtually accepted by all speakers, [42] is mostly based on Eastern Catalan, [81] [119] which is the most widely used dialect. Nevertheless, the standards of the Valencian Community and the Balearics admit alternative forms, mostly traditional ones, which are not current in eastern Catalonia. [119]

The most notable difference between both standards is some tonic e accentuation, for instance: francès, anglès (IEC) – francés, anglés (AVL). Nevertheless, AVL's standard keeps the grave accent è, while pronouncing it as /e/ rather than /ɛ/, in some words like: què ('what'), or València. Other divergences include the use of tl (AVL) in some words instead of tll like in ametla/ametlla ('almond'), espatla/espatlla ('back'), the use of elided demonstratives (este 'this', eixe 'that') in the same level as reinforced ones (aquest, aqueix) or the use of many verbal forms common in Valencian, and some of these common in the rest of Western Catalan too, like subjunctive mood or inchoative conjugation in -ix- at the same level as -eix- or the priority use of -e morpheme in 1st person singular in present indicative (-ar verbs): jo compre instead of jo compro ('I buy').

In the Balearic Islands, IEC's standard is used but adapted for the Balearic dialect by the University of the Balearic Islands's philological section. In this way, for instance, IEC says it is correct writing cantam as much as cantem ('we sing') but the University says that the priority form in the Balearic Islands must be cantam in all fields. Another feature of the Balearic standard is the non-ending in the 1st person singular present indicative: jo compr ('I buy'), jo tem ('I fear'), jo dorm ('I sleep').

In Alghero, the IEC has adapted its standard to the Algherese dialect. In this standard one can find, among other features: the definite article lo instead of el, special possessive pronouns and determinants la mia ('mine'), lo sou/la sua ('his/her'), lo tou/la tua ('yours'), and so on, the use of -v-/v/ in the imperfect tense in all conjugations: cantava, creixiva, llegiva; the use of many archaic words, usual words in Algherese: manco instead of menys ('less'), calqui u instead of algú ('someone'), qual/quala instead of quin/quina ('which'), and so on; and the adaptation of weak pronouns.

In 2011, [120] the Aragonese government passed a decree approving the statutes of a new language regulator of Catalan in La Franja (the so-called Catalan-speaking areas of Aragon) as originally provided for by Law 10/2009. [121] The new entity, designated as Acadèmia Aragonesa del Català , shall allow a facultative education in Catalan and a standardization of the Catalan language in La Franja.

Status of Valencian

Subdialects of Valencian Subdialectes del valencia.svg
Subdialects of Valencian

Valencian is classified as a Western dialect, along with the northwestern varieties spoken in Western Catalonia (provinces of Lleida and the western half of Tarragona). [81] [112] Central Catalan has 90% to 95% inherent intelligibility for speakers of Valencian. [1]

Linguists, including Valencian scholars, deal with Catalan and Valencian as the same language. The official regulating body of the language of the Valencian Community, the Valencian Academy of Language (Acadèmia Valenciana de la Llengua, AVL) declares the linguistic unity between Valencian and Catalan varieties. [12]

[T]he historical patrimonial language of the Valencian people, from a philological standpoint, is the same shared by the autonomous communities of Catalonia and Balearic islands, and Principality of Andorra. Additionally, it is the patrimonial historical language of other territories of the ancient Crown of Aragon [...] The different varieties of these territories constitute a language, that is, a "linguistic system" [...] From this group of varieties, Valencian has the same hierarchy and dignity as any other dialectal modality of that linguistic system [...]

Ruling of the Valencian Language Academy of 9 February 2005, extract of point 1. [12] [122]

The AVL, created by the Valencian parliament, is in charge of dictating the official rules governing the use of Valencian, and its standard is based on the Norms of Castelló ( Normes de Castelló ). Currently, everyone who writes in Valencian uses this standard, except the Royal Academy of Valencian Culture (Acadèmia de Cultura Valenciana, RACV), which uses for Valencian an independent standard.

Despite the position of the official organizations, an opinion poll carried out between 2001 and 2004 [123] showed that the majority of the Valencian people consider Valencian different from Catalan. This position is promoted by people who do not use Valencian regularly. [42] Furthermore, the data indicates that younger generations educated in Valencian are much less likely to hold these views. A minority of Valencian scholars active in fields other than linguistics defends the position of the Royal Academy of Valencian Culture (Acadèmia de Cultura Valenciana, RACV), which uses for Valencian a standard independent from Catalan. [124]

This clash of opinions has sparked much controversy. For example, during the drafting of the European Constitution in 2004, the Spanish government supplied the EU with translations of the text into Basque, Galician, Catalan, and Valencian, but the latter two were identical. [125]


Word choices

Despite its relative lexical unity, the two dialectal blocks of Catalan (Eastern and Western) show some differences in word choices. [54] Any lexical divergence within any of the two groups can be explained as an archaism. Also, usually Central Catalan acts as an innovative element. [54]

Literary Catalan allows the use of words from different dialects, except those of very restricted use. [54] However, from the 19th century onwards, there has been a tendency towards favoring words of Northern dialects to the detriment of others, even though nowadays there is a greater freedom of choice.[ clarify ] [54]

Latin and Greek loanwords

Like other languages, Catalan has a large list of loanwords from Greek and Latin. This process started very early, and one can find such examples in Ramon Llull's work. [54] In the 14th and 15th centuries Catalan had a far greater number of Greco-Latin loanwords than other Romance languages, as is attested for example in Roís de Corella's writings. [54] The incorporation of learned, or "bookish" words from its own ancestor language, Latin, into Catalan is arguably another form of lexical borrowing through the influence of written language and the liturgical language of the Church. Throughout the Middle Ages and into the early modern period, most literate Catalan speakers were also literate in Latin; and thus they easily adopted Latin words into their writing—and eventually speech—in Catalan.

Word formation

The process of morphological derivation in Catalan follows the same principles as the other Romance languages, [126] where agglutination is common. Many times, several affixes are appended to a preexisting lexeme, and some sound alternations can occur, for example elèctric[əˈlɛktrik] ("electrical") vs. electricitat[ələktrisiˈtat]. Prefixes are usually appended to verbs, as in preveure ("foresee"). [126]

There is greater regularity in the process of word-compounding, where one can find compounded words formed much like those in English. [126]

Common types of word compounds in Catalan [126]
two nouns, the second assimilated to the firstpaper moneda"banknote paper"
noun delimited by an adjectiveestat major"military staff"
noun delimited by another noun and a prepositionmàquina d'escriure"typewriter"
verb radical with a nominal objectparacaigudes"parachute"
noun delimited by an adjective, with adjectival valuepit-roig"robin" (bird)

Writing system

The word novel*la
("novel") in a dictionary. The geminated L (l*l
) is a distinctive character used in Catalan. Catalan geminated L in a dictionary.jpg
The word novel·la ("novel") in a dictionary. The geminated L (l·l) is a distinctive character used in Catalan.
Billboard in Barcelona (detail), showing the word il*lusio
("illusion") Billboard in Barcelona (detail).png
Billboard in Barcelona (detail), showing the word il·lusió ("illusion")
Main forms A B C D E F G H I J K L M N O P Q R S T U V W X Y Z
Modified formsÀÇÉÈÍÏL·LÓÒÚÜ

Catalan uses the Latin script, with some added symbols and digraphs. [127] The Catalan orthography is systematic and largely phonologically based. [127] Standardization of Catalan was among the topics discussed during the First International Congress of the Catalan Language, held in Barcelona October 1906. Subsequently, the Philological Section of the Institut d'Estudis Catalans (IEC, founded in 1911) published the Normes ortogràfiques in 1913 under the direction of Antoni Maria Alcover and Pompeu Fabra. In 1932, Valencian writers and intellectuals gathered in Castelló de la Plana to make a formal adoption of the so-called Normes de Castelló , a set of guidelines following Pompeu Fabra's Catalan language norms. [128]

Pronunciation of Catalan special characters and digraphs (Central Catalan) [129]
PronunciationExamples [129]
ç/s/feliç[fəˈlis] ("happy")
gu/ɡ/ ([ɡ~ɣ]) before i and eguerra[ˈɡɛrə] ("war")
/ɡw/ elsewhereguant[ˈɡwan] ("glove")
ig[tʃ] in final positionraig[ˈratʃ] ("trickle")
ix/ʃ/ ([jʃ] in some dialects)caixa[ˈkaʃə] ("box")
ll/ʎ/lloc[ʎɔk] ("place")
l·lNormatively /l:/, but usually /l/novel·la[nuˈβɛlə] ("novel")
ny/ɲ/Catalunya[kətəˈɫuɲə] ("Catalonia")
qu/k/ before i and equi[ˈki] ("who")
/kw/ before other vowelsquatre[ˈkwatrə] ("four")
Intervocalic s is pronounced /z/
grossa[ˈɡɾɔsə] ("big-feminine)"
casa[ˈkazə] ("house")
tg, tj[ddʒ]fetge[ˈfeddʒə] ("liver"), mitjó[midˈdʒo] ("sock")
tx[tʃ]despatx[dəsˈpatʃ] ("office")
tz[ddz]dotze[ˈdoddzə] ("twelve")
Letters and digraphs with contextually conditioned pronunciations (Central Catalan) [129]
NotesExamples [129]
c/s/ before i and e
corresponds to ç in other contexts
feliç ("happy-masculine-singular") - felices ("happy-feminine-plural")
caço ("I hunt") - caces ("you hunt")
g/ʒ/ before e and i
corresponds to j in other positions
envejar ("to envy") - envegen ("they envy")
final g + stressed i, and final ig before other vowels,
are pronounced [tʃ]
corresponds to j~g or tj~tg in other positions
boig['bɔtʃ] ("mad-masculine") - boja['bɔʒə] ("mad-feminine") -boges['bɔʒəs] ("mad-feminine plural")
desig[də'zitʃ] ("wish") - desitjar ("to wish") - desitgem ("we wish")
gu/ɡ/ before e and i
corresponds to g in other positions
botiga ("shop") - botigues ("shops")
/ɡw/ before e and i
corresponds to gu in other positions
llengua ("language") - llengües ("languages")
qu/k/ before e and i
corresponds to c in other positions
vaca ("cow") - vaques ("cows")
/kw/ before e and i
corresponds to qu in other positions
obliqua ("oblique-feminine") - obliqües ("oblique-feminine plural")
x[ʃ~tʃ] initially and in onsets after a consonant
[ʃ] after i
otherwise, [ɡz] before stress, [ks] after
xarxa[ˈʃarʃə] ("net")
guix[ˈɡiʃ] ("chalk")
exacte[əɡˈzaktə] ("exact"), fax[ˈfaks] ("fax")


The grammar of Catalan is similar to other Romance languages. Features include: [130]

Gender and number inflection

Gender and number inflection of the word gat
("cat") Flexio of word Gat.jpg
Gender and number inflection of the word gat ("cat")
Regular noun with definite article: el gat ("the cat")
singularel gatla gata
pluralels gatsles gates
Adjective with 4 forms:
verd ("green")
Adjective with 3 forms:
feliç ("happy")
Adjective with 2 forms:
indiferent ("indifferent")

In gender inflection, the most notable feature is (compared to Portuguese, Spanish or Italian), the loss of the typical masculine suffix -o. Thus, the alternance of -o/-a, has been replaced by ø/-a. [80] There are only a few exceptions, like minso/minsa ("scarce"). [80] Many not completely predictable morphological alternations may occur, such as: [80]

Catalan has few suppletive couplets, like Italian and Spanish, and unlike French. Thus, Catalan has noi/noia ("boy"/"girl") and gall/gallina ("cock"/"hen"), whereas French has garçon/fille and coq/poule. [80]

There is a tendency to abandon traditionally gender-invariable adjectives in favor of marked ones, something prevalent in Occitan and French. Thus, one can find bullent/bullenta ("boiling") in contrast with traditional bullent/bullent. [80]

As in the other Western Romance languages, the main plural expression is the suffix -s, which may create morphological alternations similar to the ones found in gender inflection, albeit more rarely. [80] The most important one is the addition of -o- before certain consonant groups, a phonetic phenomenon that does not affect feminine forms: el pols/els polsos ("the pulse"/"the pulses") vs. la pols/les pols ("the dust"/"the dusts"). [131]


Sign in the town square of Begur, Catalonia, Spain. In placa de la vila
(literally "square of the town"), since the noun vila
("town") is feminine singular, the definite article carries the corresponding form, la
("the"). Begur - Placa de la Vila - Catalunya.jpg
Sign in the town square of Begur, Catalonia, Spain. In plaça de la vila (literally "square of the town"), since the noun vila ("town") is feminine singular, the definite article carries the corresponding form, la ("the").
Definite article in Standard Catalan
(elided forms in brackets) [132]
singularel (l')la (l')
Contractions of the definite article
articleelal (a l')del (de l')pel (per l')
Indefinite article

The inflection of determinatives is complex, specially because of the high number of elisions, but is similar to the neighboring languages. [126] Catalan has more contractions of preposition + article than Spanish, like dels ("of + the [plural]"), but not as many as Italian (which has sul, col, nel, etc.). [126]

Central Catalan has abandoned almost completely unstressed possessives (mon, etc.) in favor of constructions of article + stressed forms (el meu, etc.), a feature shared with Italian. [126]

Personal pronouns

Catalan stressed pronouns [133]
1st personjo, minosaltres
2nd personinformaltuvosaltres
respectful(vós) [134]
3rd personmasculineellells

The morphology of Catalan personal pronouns is complex, especially in unstressed forms, which are numerous (13 distinct forms, compared to 11 in Spanish or 9 in Italian). [126] Features include the gender-neutral ho and the great degree of freedom when combining different unstressed pronouns (65 combinations). [126]

Catalan pronouns exhibit T–V distinction, like all other Romance languages (and most European languages, but not Modern English). This feature implies the use of a different set of second person pronouns for formality.

This flexibility allows Catalan to use extraposition extensively, much more than French or Spanish. Thus, Catalan can have m'hi recomanaren ("they recommended me to him"), whereas in French one must say ils m'ont recommandé à lui, and Spanish me recomendaron a él. [126] This allows the placement of almost any nominal term as a sentence topic, without having to use so often the passive voice (as in French or English), or identifying the direct object with a preposition (as in Spanish). [126]


Simple forms of a regular verb of the first conjugation: portar ("to bring") [135]
Past participleportat (portat, portada, portats, portades)
Indicativejotuell / ella
ells / elles
Preterite (archaic)portíportaresportàportàremportàreuportaren
Subjunctivejotuell / ella
ells / elles
Imperativejotuell / ella
ells / elles

Like all the Romance languages, Catalan verbal inflection is more complex than the nominal. Suffixation is omnipresent, whereas morphological alternations play a secondary role. [126] Vowel alternances are active, as well as infixation and suppletion. However, these are not as productive as in Spanish, and are mostly restricted to irregular verbs. [126]

The Catalan verbal system is basically common to all Western Romance, except that most dialects have replaced the synthetic indicative perfect with a periphrastic form of anar ("to go") + infinitive. [126]

Catalan verbs are traditionally divided into three conjugations, with vowel themes -a-, -e-, -i-, the last two being split into two subtypes. However, this division is mostly theoretical. [126] Only the first conjugation is nowadays productive (with about 3500 common verbs), whereas the third (the subtype of servir, with about 700 common verbs) is semiproductive. The verbs of the second conjugation are fewer than 100, and it is not possible to create new ones, except by compounding. [126]


The grammar of Catalan follows the general pattern of Western Romance languages. The primary word order is subject–verb–object. [136] However, word order is very flexible. Commonly, verb-subject constructions are used to achieve a semantic effect. The sentence "The train has arrived" could be translated as Ha arribat el tren or El tren ha arribat. Both sentences mean "the train has arrived", but the former puts a focus on the train, while the latter puts a focus on the arrival. This subtle distinction is described as "what you might say while waiting in the station" versus "what you might say on the train." [137]

Catalan names

In Spain, every person officially has two surnames, one of which is the father's first surname and the other is the mother's first surname. [138] The law contemplates the possibility of joining both surnames with the Catalan conjunction i ("and"). [138] [139]

Sample text

Selected text [140] from Manuel de Pedrolo's 1970 novel Un amor fora ciutat ("A love affair outside the city").

OriginalWord-for-word translation [140] Free translation
Tenia prop de divuit anys quan vaig conèixerI was having close to eighteen years, when I go [past auxiliary] know (=I met)I was about eighteen years old when I met
en Raül, a l'estació de Manresa.the Raül, at the station of (=in) Manresa. Raül, at Manresa railway station.
El meu pare havia mort, inesperadament i encara jove,The my father had died, unexpectedly and still young,My father had died, unexpectedly and still young,
un parell d'anys abans, i d'aquells tempsa couple of years before, and of those times a couple of years before; and from that time
conservo un record de punyent solitud.I keep a memory of acute loneliness I still harbor memories of great loneliness.
Les meves relacions amb la mareThe my relations with the motherMy relationship with my mother
no havien pas millorat, tot el contrari, not had at all improved, all the contrary, had not improved; quite the contrary,
potser fins i tot empitjoravenperhaps even they were worsening and arguably it was getting even worse
a mesura que em feia step that (=in proportion as) myself I was making big (=I was growing up).as I grew up.
No existia, no existí mai entre nosaltres, Not it was existing, not it existed never between us,There did not exist, at no point had there ever existed between us
una comunitat d'interessos, d'afeccions. a community of interests, of affections. shared interests or affection.
Cal creure que cercava... una personaIt is necessary to believe that I was seeking... a person I guess I was seeking... a person
en qui centrar la meva vida whom to center the my life whom I could center my emotional life.

See also



  1. The Valencian Normative Dictionary of the Valencian Academy of the Language states that Valencian is a "Romance language spoken in the Valencian Community, as well as in Catalonia, the Balearic Islands, the French department of the Pyrénées-Orientales, the Principality of Andorra, the eastern flank of Aragon and the Sardinian town of Alghero (unique in Italy), where it receives the name of 'Catalan'."
  2. The Catalan Language Dictionary of the Institut d'Estudis Catalans states in the sixth definition of "Valencian" that, in the Valencian Community, it is equivalent to Catalan language.
  1. Although in business and daily life other languages are common, and due to immigration Catalan mother-tongue speakers are only 35.7% of the population. See Languages of Andorra.

Related Research Articles

<span class="mw-page-title-main">Institute for Catalan Studies</span> Catalan academic institution

The Institute for Catalan Studies, also known by the acronym IEC, is an academic institution which seeks to undertake research and study into "all elements of Catalan culture". It is based in Barcelona, Catalonia, Spain.

<span class="mw-page-title-main">Valencian language</span> Dialectal variant of the Catalan language spoken in the Valencian Community and El Carxe

Valencian or Valencian language is the official, historical and traditional name used in the Valencian Community (Spain), and unofficially in the El Carche comarca in Murcia (Spain), to refer to the Romance language also known as Catalan. The Valencian Community's 1982 Statute of Autonomy and the Spanish Constitution officially recognise Valencian as the regional language.

<span class="mw-page-title-main">Catalan Countries</span> Regions where Catalan is the native language

The Catalan Countries refers to those territories where the Catalan language is spoken. They include the Spanish regions of Catalonia, the Balearic Islands, Valencia, and parts of Aragon and Murcia (Carche), as well as the Principality of Andorra, the department of Pyrénées-Orientales in France, and the city of Alghero in Sardinia (Italy). In the context of Catalan nationalism, the term is sometimes used in a more restricted way to refer to just Catalonia, Valencia and the Balearic Islands. The Catalan Countries do not correspond to any present or past political or administrative unit, though most of the area belonged to the Crown of Aragon in the Middle Ages. Parts of Valencia (Spanish) and Catalonia (Occitan) are not Catalan-speaking.

The phonology of Catalan, a Romance language, has a certain degree of dialectal variation. Although there are two standard varieties, one based on Central Eastern dialect and another one based on South-Western or Valencian dialect, this article deals with features of all or most dialects, as well as regional pronunciation differences. Various studies have focused on different Catalan varieties; for example, Wheeler and Mascaró analyze Central Eastern varieties, the former focusing on the educated speech of Barcelona and the latter focusing more on the vernacular of Barcelona, and Recasens does a careful phonetic study of Central Eastern Catalan.

The Catalan and Valencian orthographies encompass the spelling and punctuation of standard Catalan and Valencian. There are also several adapted variants to the peculiarities of local dialects of Insular Catalan.

<span class="mw-page-title-main">Enric Valor i Vives</span>

Enric Valor i Vives was a valencian narrator and grammarian who made one of the most important contributions to the re-collection and recovery of Valencian lexicography and its standardization in the Valencian Country, Spain.

<span class="mw-page-title-main">Catalans</span> People from Catalonia, Spain

Catalans are a Romance ethnic group native to Catalonia, who speak Catalan. The current official category of "Catalans" is that of the citizens of Catalonia, an autonomous community in Spain and the inhabitants of the Roussillon historical region in southern France, today the Pyrénées Orientales department, also called Northern Catalonia and Pays Catalan in French.

<span class="mw-page-title-main">Languages of Spain</span>

The languages of Spain, or Spanish languages, are the languages spoken in Spain.

Manuel Sanchís Guarner was a Spanish philologist, historian and writer.

The Occitano-Romance or Gallo-Narbonnese, or rarely East Iberian, is a branch of the Romance language group that encompasses the Catalan/Valencian and Occitan languages spoken in parts of southern France and northeastern Spain.

<span class="mw-page-title-main">Names of the Catalan language</span>

The first names, or glossonyms, of the Catalan/Valencian language formed in a dialectal relation with Latin, in which Catalan existed as a variety. These names already expressed the relationship between the two languages. New names that related Catalan to Rome came about to dignify the Catalan language in the thirteenth century, though Latinists called it vulgar and the people planus, or pla.

<span class="mw-page-title-main">Valencian Community</span> Autonomous community of Spain

The Valencian Community is an autonomous community of Spain. It is the fourth most populous Spanish autonomous community after Andalusia, Catalonia and the Community of Madrid with more than five million inhabitants. Its homonymous capital Valencia is the third largest city and metropolitan area in Spain. It is located along the Mediterranean coast on the east side of the Iberian Peninsula. It borders with Catalonia to the north, Aragon and Castilla–La Mancha to the west, and Murcia to the south, and the Balearic Islands are to its east. The Valencian Community consists of three provinces which are Castellón, Valencia and Alicante.

<span class="mw-page-title-main">Joan Veny i Clar</span>

Joan Veny i Clar is a linguist and Catalan dialectologist from Majorca, considered one of the most prestigious and renowned of the Catalan Countries. He is the author of Els parlars catalans, an essential book for Catalan dialectology, synthesis of the dialectal variation of the entire space of the Catalan Countries; and furthermore a dense and rich work, made in conjunction with Lydia Pons: Linguistic Atlas of the Catalan Domain

<span class="mw-page-title-main">History of Catalan</span>

The Catalan language originated from Vulgar Latin in the Pyrenees Mountains between France and Spain. It diverged from the other Romance languages in the 9th century. At that time, Catalan spread quickly throughout the Iberian peninsula when the Catalan counts conquered Muslim territory. By the 11th century, the Catalan language was present in several feudal documents. Catalan was present throughout the Mediterranean by the 15th century. At that time, the city of Valencia was thriving.

Old Catalan is the modern denomination for Romance varieties that during the Middle Ages were spoken in territories that spanned roughly the territories of the Principality of Catalonia, the Kingdom of Valencia, the Balearic Islands, and the island of Sardinia; all of them then part of the Crown of Aragon. These varieties were part of a dialect continuum with what today is called Old Occitan that reached the Loire Valley in the north and Northern Italy in the east. Consequently, Old Catalan can be considered a dialect group of Old Occitan, or be classified as an Occitano-Romance variety side by side with Old Occitan.

<span class="mw-page-title-main">Josep Maria Nadal i Farreras</span>

Josep Maria Nadal i Farreras is Professor of History of Language at the University of Girona.

<span class="mw-page-title-main">Catalan dialects</span> Varieties of the Catalan language

The Catalan dialects feature a relative uniformity, especially when compared to other Romance languages; both in terms of vocabulary, semantics, syntax, morphology, and phonology. Mutual intelligibility between its dialects is very high, estimates ranging from 90% to 95%. The only exception is the isolated idiosyncratic Alguerese dialect.

The Spanish language is widely spoken in most of the Catalan-speaking territories, where it is partly characterized by language contact with the Catalan language. These territories are: Catalonia, the Valencian Community, the Balearic Islands, Andorra, and the easternmost areas of Aragon. This linguistic contact is encouraged by the fact that almost all of the Catalan speakers in these regions are Catalan–Spanish bilingual to a greater or lesser extent.

<span class="mw-page-title-main">Joan Solà i Cortassa</span> Spanish linguist & philologist

Joan Solà Cortassa was a Spanish linguist and philologist. He was professor of Catalan language and literature at the University of Barcelona from 1984 onwards, and vice president of the Institut d'Estudis Catalans (IEC) from 2009.

These are lists of spelling-to-sound correspondences in the Catalan language. The two main standard forms are used as primary transcriptions norms of their respective spelling forms.


  1. 1 2 3 4 5 Catalan at Ethnologue (25th ed., 2022) Closed Access logo transparent.svg
  2. 1 2 Some Iberian scholars may alternatively classify Catalan as Iberian Romance/East Iberian.
  3. "Definition of CATALAN".
  4. 1 2 "Definition of Catalan |".
  5. 1 2 3 4 5 6 7 8 9 10 11 Wheeler 2010, p. 191.
  6. Minder, Raphael (21 November 2016). "Italy's Last Bastion of Catalan Language Struggles to Keep It Alive". The New York Times. Archived from the original on 1 January 2022. Retrieved 21 January 2017.
  7. 1 2 3 Wheeler 2010, pp. 190–191.
  8. 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 Costa Carreras & Yates 2009, pp. 6–7.
  9. García Venero 2006.
  10. Burke 1900, p. 154.
  11. "Definition of CATALAN".
  12. 1 2 3 4 Acadèmia Valenciana de la Llengua (9 February 2005). "Acord de l'Acadèmia Valenciana de la Llengua (AVL), adoptat en la reunió plenària del 9 de febrer del 2005, pel qual s'aprova el dictamen sobre els principis i criteris per a la defensa de la denominació i l'entitat del valencià" (PDF) (in Valencian). p. 52. Archived from the original (PDF) on 23 September 2015. Retrieved 16 February 2013.
  13. Lledó 2011, pp. 334–337.
  14. Veny 1997, pp. 9–18.
  15. 1 2 3 Moran 2004, pp. 37–38.
  16. Riquer 1964.
  17. 1 2 3 4 Wheeler 2010, p. 190.
  18. Trobes en llaors de la Verge Maria ("Poems of praise of the Virgin Mary") 1474.
  19. Sales Vives, Pere (22 September 2020). L'Espanyolització de Mallorca: 1808-1932 (in Catalan). El Gall editor. p. 422. ISBN   9788416416707.
  20. Antoni Simon, Els orígens històrics de l'anticatalanisme, páginas 45-46, L'Espill, nº 24, Universitat de València
  21. Mayans Balcells, Pere (2019). Cròniques Negres del Català A L'Escola (in Catalan) (del 1979 ed.). p. 230. ISBN   978-84-947201-4-7.
  22. Lluís, García Sevilla (2021). Recopilació d'accions genocides contra la nació catalana (in Catalan). Base. p. 300. ISBN   9788418434983.
  23. Bea Seguí, Ignaci (2013). En cristiano! Policia i Guàrdia Civil contra la llengua catalana (in Catalan). Cossetània. p. 216. ISBN   9788490341339.
  24. "Enllaç al Manifest Galeusca on en l'article 3 es denuncia l'asimetria entre el castellà i les altres llengües de l'Estat Espanyol, inclosa el català". Archived from the original on 19 July 2008. Retrieved 2 August 2008.