This article has multiple issues. Please help improve it or discuss these issues on the talk page . (Learn how and when to remove these template messages)
|
A macrolanguage is a group of mutually intelligible speech varieties, or dialect continuum, that have no traditional name in common, and which may be considered distinct languages by their speakers. Macrolanguages are used as a book-keeping mechanism for the ISO 639 international standard of language codes. Macrolanguages are established to assist mapping between different sets of ISO language codes. Specifically, there may be a many-to-one correspondence between ISO 639-3, intended to identify all the thousands of languages of the world, and either of two other sets, ISO 639-1, established to identify languages in computer systems, and ISO 639-2, which encodes a few hundred languages for library cataloguing and bibliographic purposes. When such many-to-one ISO 639-2 codes are included in an ISO 639-3 context, they are called "macrolanguages" to distinguish them from the corresponding individual languages of ISO 639-3. [1] According to the ISO,
Some existing code elements in ISO 639-2, and the corresponding code elements in ISO 639-1, are designated in those parts of ISO 639 as individual language code elements, yet are in a one-to-many relationship with individual language code elements in [ISO 639-3]. For purposes of [ISO 639-3], they are considered to be macrolanguage code elements.
— ISO 639-3: Relationship between ISO 639-3 and the other parts of ISO 639 [2]
ISO 639-3 is curated by SIL International; ISO 639-2 is curated by the Library of Congress (USA).
The mapping often has the implication that it covers borderline cases where two language varieties may be considered strongly divergent dialects of the same language or very closely related languages (dialect continua); it may also encompass situations when there are language varieties that are considered to be varieties of the same language on the grounds of ethnic, cultural, and political considerations, rather than linguistic reasons.[ dubious – discuss ] However, this is not its primary function and the classification is not evenly applied.
For example, Chinese is a macrolanguage encompassing many languages that are not mutually intelligible, but the languages "Standard German", "Bavarian German", and other closely related languages do not form a macrolanguage, despite being more mutually intelligible. Other examples include Tajiki not being part of the Persian macrolanguage despite sharing much lexicon, and Urdu and Hindi not forming a macrolanguage despite forming a mutually intelligible dialect continuum. All dialects of Hindi are considered separate languages. Basically, ISO 639-2 and ISO 639-3 use different criteria for dividing language varieties into languages, 639-2 uses shared writing systems and literature more whereas 639-3 focuses on mutual intelligibility and shared lexicon. The macrolanguages exist within the ISO 639-3 code set to make mapping between the two sets easier.
The use of macrolanguages was applied in Ethnologue , starting in the 16th edition. [3] As of 21 December 2023 [update] , there are fifty-nine language codes in ISO 639-2 that are counted as macrolanguages in ISO 639-3. [4] The most recent registered macrolanguage is Sanskrit with code san, adopted in 15 December 2023, though it already existed as individual language for several years. [5]
Some of the macrolanguages had no individual language (as defined by 639-3) in ISO 639-2, e.g. "ara" (Arabic), but ISO 639-3 recognizes different varieties of Arabic as separate languages under some circumstances. Others, like "nor" (Norwegian) had their two individual parts (nno Nynorsk, nob Bokmål) already in 639-2. That means some languages (e.g. "arb" Standard Arabic) that were considered by ISO 639-2 to be dialects of one language ("ara") are now in ISO 639-3 in certain contexts considered to be individual languages themselves. This is an attempt to deal with varieties that may be linguistically distinct from each other, but are treated by their speakers as forms of the same language, e.g. in cases of diglossia. For example,
ISO 639-2 also includes codes for collections of languages; these are not the same as macrolanguages. These collections of languages are excluded from ISO 639-3, because they never refer to individual languages. Most such codes are included in ISO 639-5.
bnc
, hbs
, kln
, luy
)aka
– tw
hbs
– bs
, hr
, sr
msa
– id
nor
– nb
, nn
This list only includes official data from https://iso639-3.sil.org/code_tables/macrolanguage_mappings/data.
This is a complete list of the individual language codes that comprise the macrolanguages in the ISO 639-3 code tables as of 6 March 2023 [update] . [9]
aka is the ISO 639-3 language code for Akan. Its ISO 639-1 code is ak. There are two individual language codes assigned:
ara is the ISO 639-3 language code for Arabic. Its ISO 639-1 code is ar. There are twenty-eight individual language codes assigned:
The following codes were previously part of ara:
aym is the ISO 639-3 language code for Aymara. Its ISO 639-1 code is ay. There are two individual language codes assigned:
aze is the ISO 639-3 language code for Azerbaijani. Its ISO 639-1 code is az. There are two individual language codes assigned:
bal is the ISO 639-3 language code for Baluchi. There are three individual language codes assigned:
bik is the ISO 639-3 language code for Bikol. There are eight individual language codes assigned:
The following code was previously part of bik:
bnc is the ISO 639-3 language code for Bontok. There are five individual language codes assigned:
bua is the ISO 639-3 language code for Buriat. There are three individual language codes assigned:
chm is the ISO 639-3 language code for Mari, a language located in Russia. There are two individual language codes assigned:
cre is the ISO 639-3 language code for Cree. Its ISO 639-1 code is cr. There are six individual language codes assigned:
In addition, there are six closely associated individual codes:
In addition, there is one other language without individual codes closely associated, but not part of, this macrolanguage code:
del is the ISO 639-3 language code for Delaware. There are two individual language codes assigned:
den is the ISO 639-3 language code for Slave. There are two individual language codes assigned:
din is the ISO 639-3 language code for Dinka. There are five individual language codes assigned:
doi is the ISO 639-3 language code for Dogri. There are two individual language codes assigned:
est is the ISO 639-3 language code for Estonian. Its ISO 639-1 code is et. There are two individual language codes assigned:
fas is the ISO 639-3 language code for Persian. Its ISO 639-1 code is fa. There are two individual language codes assigned:
ful is the ISO 639-2 and ISO 639-3 language code for Fulah (also spelled Fula). Its ISO 639-1 code is ff. There are nine individual language codes assigned for varieties of Fulah:
gba is the ISO 639-3 language code for Gbaya located in the Central African Republic. There are six individual language codes assigned:
The following code was previously part of gba:
gon is the ISO 639-3 language code for Gondi. There are three individual language codes assigned:
The following code was previously part of gon:
grb is the ISO 639-3 language code for Grebo. There are five individual language codes assigned:
grn is the ISO 639-3 language code for Guarani. Its ISO 639-1 code is gn. There are five individual language codes assigned:
hai is the ISO 639-3 language code for Haida. There are two individual language codes assigned:
hbs is the ISO 639-3 language code for Serbo-Croatian. It formerly had an ISO 639-1 code sh but deprecated in 2000. There are four individual language codes assigned:
hmn is the ISO 639-3 language code for Hmong. There are twenty-five individual language codes assigned:
The following code was previously part of hmn:
iku is the ISO 639-3 language code for Inuktitut. Its ISO 639-1 code is iu. There are two individual language codes assigned:
ipk is the ISO 639-3 language code for Inupiaq. Its ISO 639-1 code is ik. There are two individual language codes assigned:
jrb is the ISO 639-3 language code for Judeo-Arabic. There are four individual language codes assigned:
The following code was previously part of jrb:
kau is the ISO 639-2 and ISO 639-3 language code for the Kanuri. Its ISO 639-1 code is kr. There are three individual language codes assigned in ISO 639-3 for varieties of Kanuri:
There are two other related languages that are not considered part of the macrolanguage under ISO 639:
kln is the ISO 639-3 language code for Kalenjin. There are nine individual language codes assigned:
kok is the ISO 639-3 language code for Konkani (macrolanguage). There are two individual language codes assigned:
Both languages are referred to as Konkani by their respective speakers.
kom is the ISO 639-3 language code for Komi. Its ISO 639-1 code is kv. There are two individual language codes assigned:
kon is the ISO 639-3 language code for Kongo. Its ISO 639-1 code is kg. There are three individual language codes assigned:
kpe is the ISO 639-3 language code for Kpelle. There are two individual language codes assigned:
kur is the ISO 639-3 language code for Kurdish. Its ISO 639-1 code is ku. There are three individual language codes assigned:
lah is the ISO 639-3 language code for Lahnda. There are seven individual language codes assigned.
lah does not include Panjabi/Punjabi (pan).
The following code was previously part of lah:
lav is the ISO 639-3 language code for Latvian. Its ISO 639-1 code is lv. There are two individual language codes assigned:
luy is the ISO 639-3 language code for Luyia. There are fourteen individual language codes assigned:
man is the ISO 639-3 language code for Mandingo. There are six individual language codes assigned:
The following codes were previously part of man:
mlg is the ISO 639-3 language code for Malagasy. Its ISO 639-1 code is mg. There are eleven individual language codes assigned:
The following codes were previously part of mlg:
mon is the ISO 639-3 language code for Mongolian. Its ISO 639-1 code is mn. There are two individual language codes assigned:
msa is the ISO 639-3 language code for Malay (macrolanguage). Its ISO 639-1 code is ms. There are thirty-six individual language codes assigned:
The following code was previously part of msa:
In addition, there is an individual code not part of this macrolanguage because it is categorized as a historical language:
mwr is the ISO 639-3 language code for Marwari. There are six individual language codes assigned:
nep is the ISO 639-3 language code for Nepali (macrolanguage). Its ISO 639-1 code is ne. There are two individual language codes assigned:
nor is the ISO 639-3 language code for Norwegian. Its ISO 639-1 code is no. There are two individual language codes assigned:
oji is the ISO 639-3 language code for Ojibwa. Its ISO 639-1 code is oj. There are seven individual language codes assigned:
In addition, there are three closely associated individual codes:
In addition, there are two other languages without individual codes closely associated, but not part of, this macrolanguage code:
ori is the ISO 639-3 language code for Oriya (macrolanguage). Its ISO 639-1 code is or. There are two individual language codes assigned:
orm is the ISO 639-3 language code for Oromo. Its ISO 639-1 code is om. There are four individual language codes assigned:
pus is the ISO 639-3 language code for Pashto. Its ISO 639-1 code is ps. There are three individual language codes assigned:
que is the ISO 639-3 language code for Quechua. Its ISO 639-1 code is qu. There are forty-three individual language codes assigned:
The following code was previously part of que:
raj is the ISO 639-3 language code for Rajasthani. There are six individual language codes assigned:
rom is the ISO 639-3 language code for Romany. There are seven individual language codes assigned:
In addition, there are nine individual codes not part of this macrolanguage but they are categorized as mixed languages:
san is the ISO 639-3 language code for Sanskrit. Its ISO 639-1 code is sa. As of 2024, it's the only macrolanguage with language type as Historical. There are two individual language codes assigned:
sqi is the ISO 639-3 language code for Albanian. Its ISO 639-1 code is sq. There are four individual language codes assigned:
srd is the ISO 639-3 language code for Sardinian. Its ISO 639-1 code is sc. There are four individual language codes assigned:
swa is the ISO 639-3 language code for Swahili. Its ISO 639-1 code is sw. There are two individual language codes assigned:
syr is the ISO 639-3 language code for Syriac. There are two individual language codes assigned:
tmh is the ISO 639-3 language code for Tamashek. There are four individual language codes assigned:
uzb is the ISO 639-3 language code for Uzbek. Its ISO 639-1 code is uz. There are two individual language codes assigned:
yid is the ISO 639-3 language code for Yiddish. Its ISO 639-1 code is yi. There are two individual language codes assigned:
zap is the ISO 639-3 language code for Zapotec. There are fifty-eight individual language codes assigned.
The following codes were previously part of zap:
In addition, there is an individual code not part of this macrolanguage because it is categorized as a historical language:
zha is the ISO 639-3 language code for Zhuang. Its ISO 639-1 code is za. There are sixteen individual language codes assigned:
The following codes were previously part of zha:
zho is the ISO 639-3 language code for Chinese. Its ISO 639-1 code is zh. There are sixteen individual language codes assigned, most of which are not actually languages but rather groups of Sinitic languages distinguished by isoglosses:
Although the Dungan language (dng) is a dialect of Mandarin, it is not listed under Chinese in ISO 639-3 due to separate historical and cultural development. [11]
ISO 639 also lists codes for Old Chinese (och) and Late Middle Chinese (ltc)). They are not listed under Chinese in ISO 639-3 because they are categorized as ancient and historical languages, respectively.
zza is the ISO 639-3 language code for Zaza. There are two individual language codes assigned:
ISO 639 is a standard by the International Organization for Standardization (ISO) concerned with representation of languages and language groups. It currently consists of four sets of code, named after each part which formerly described respective set ; a part 6 was published but withdrawn. It was first approved in 1967 as a single-part ISO Recommendation, ISO/R 639, superseded in 2002 by part 1 of the new series, ISO 639-1, followed by additional parts. All existing parts of the series were consolidated into a single standard in 2023, largely based on the text of ISO 639-4.
Za or ZA may refer to:
ISO 639-2:1998, Codes for the representation of names of languages — Part 2: Alpha-3 code, is the second part of the ISO 639 standard, which lists codes for the representation of the names of languages. The three-letter codes given for each language in this part of the standard are referred to as "Alpha-3" codes. There are 487 entries in the list of ISO 639-2 codes.
Hmong or Mong is a dialect continuum of the West Hmongic branch of the Hmongic languages spoken by the Hmong people of Sichuan, Yunnan, Guizhou, Guangxi, Hainan, northern Vietnam, Thailand, and Laos. There are some 2.7 million speakers of varieties that are largely mutually intelligible, including over 280,000 Hmong Americans as of 2013. Over half of all Hmong speakers speak the various dialects in China, where the Dananshan (大南山) dialect forms the basis of the standard language. However, Hmong Daw and Mong Leng are widely known only in Laos and the United States; Dananshan is more widely known in the native region of Hmong.
A mixed language, also referred to as a hybrid language, contact language, or fusion language, is a language that arises among a bilingual group combining aspects of two or more languages but not clearly deriving primarily from any single language. It differs from a creole or pidgin language in that, whereas creoles/pidgins arise where speakers of many languages acquire a common language, a mixed language typically arises in a population that is fluent in both of the source languages.
ISO 639-3:2007, Codes for the representation of names of languages – Part 3: Alpha-3 code for comprehensive coverage of languages, is an international standard for language codes in the ISO 639 series. It defines three-letter codes for identifying languages. The standard was published by International Organization for Standardization (ISO) on 1 February 2007.
This is a list of ISO 639-3 language codes starting with A.
This is a list of ISO 639-3 language codes starting with B.
This is a list of ISO 639-3 language codes starting with E.
This is a list of ISO 639-3 language codes starting with M.
This is a list of ISO 639-3 language codes starting with S.
This is a list of ISO 639-3 language codes starting with U.
This is a list of ISO 639-3 language codes starting with X.
An IETF BCP 47 language tag is a standardized code that is used to identify human languages on the Internet. The tag structure has been standardized by the Internet Engineering Task Force (IETF) in Best Current Practice (BCP) 47; the subtags are maintained by the IANA Language Subtag Registry.
Meo or MEO may refer to:
BJQ or bjq can refer to:
ISO 639-1 Code | ISO 639-2 Code | English name of Language | French name of Language | Date Added or Changed | Category of Change | Notes |
---|---|---|---|---|---|---|
[-sh] | (none) | Serbo-Croatian | serbo-croate | 2000-02-18 | Dep | This code was deprecated in 2000 because there were separate language codes for each individual language represented (Serbian, Croatian, and then Bosnian was added). It was published in a revision of ISO 639-1, but was never included in ISO 639-2. It is considered a macrolanguage (general name for a cluster of closely related individual languages) in ISO 639-3. Its deprecated status was reaffirmed by the ISO 639 JAC in 2005. |
sr | srp [scc] | Serbian | serbe | 2008-06-28 | CC | ISO 639-2/B code deprecated in favor of ISO 639-2/T code |
hr | hrv [scr] | Croatian | croate | 2008-06-28 | CC | ISO 639-2/B code deprecated in favor of ISO 639-2/T code |