In historical linguistics, grammaticalization (also known as grammatization or grammaticization) is a process of language change by which words representing objects and actions (i.e. nouns and verbs) become grammatical markers (such as affixes or prepositions). Thus it creates new function words from content words, rather than deriving them from existing bound, inflectional constructions. For example, the Old English verb willan 'to want', 'to wish' has become the Modern English auxiliary verb will, which expresses intention or simply futurity. Some concepts are often grammaticalized, while others, such as evidentiality, are not so much. [1]
For an understanding of this process, a distinction needs to be made between lexical items or content words, which carry specific lexical meaning, and grammatical items or function words, which serve mainly to express grammatical relationships between the different words in an utterance. Grammaticalization has been defined as "the change whereby lexical items and constructions come in certain linguistic contexts to serve grammatical functions, and, once grammaticalized, continue to develop new grammatical functions". [2] Where grammaticalization takes place, nouns and verbs which carry certain lexical meaning develop over time into grammatical items such as auxiliaries, case markers, inflections, and sentence connectives.
A well-known example of grammaticalization is that of the process in which the lexical cluster let us, for example in "let us eat", is reduced to let's as in "let's you and me fight". Here, the phrase has lost its lexical meaning of "allow us" and has become an auxiliary introducing a suggestion, the pronoun 'us' reduced first to a suffix and then to an unanalyzed phoneme.
In other areas of linguistics, the term grammaticalization has taken on a much broader meaning. These other senses of the term are discussed below.
The concept was developed in the works of Bopp (1816), Schlegel (1818), Humboldt (1825) and Gabelentz (1891). Humboldt, for instance, came up with the idea of evolutionary language. He suggested that in all languages grammatical structures evolved out of a language stage in which there were only words for concrete objects and ideas. In order to successfully communicate these ideas, grammatical structures slowly came into existence. Grammar slowly developed through four different stages, each in which the grammatical structure would be more developed. Though neo-grammarians like Brugmann rejected the separation of language into distinct "stages" in favour of uniformitarian assumptions, [3] they were positively inclined towards some of these earlier linguists' hypotheses. [4]
The term "grammaticalization" in the modern sense was coined by the French linguist Antoine Meillet in his L'évolution des formes grammaticales (1912). Meillet's definition was "the attribution of grammatical character to an erstwhile autonomous word". [5] Meillet showed that what was at issue was not the origins of grammatical forms but their transformations. He was thus able to present a notion of the creation of grammatical forms as a legitimate study for linguistics. Later studies in the field have further developed and altered Meillet's ideas and have introduced many other examples of grammaticalization.
During the second half of the twentieth century, the field of linguistics was strongly concerned with synchronic studies of language change, with less emphasis on historical approaches such as grammaticalization. It did however, mostly in Indo-European studies, remain an instrument for explaining language change.
It was not until the 1970s, with the growth of interest in discourse analysis and linguistic universals, that the interest for grammaticalization in linguistic studies began to grow again. A greatly influential work in the domain was Christian Lehmann 's Thoughts on Grammaticalization (1982). This was the first work to emphasize the continuity of research from the earliest period to the present, and it provided a survey of the major work in the field. Lehmann also invented a set of 'parameters', a method along which grammaticality could be measured both synchronically and diachronically. [6]
Another important work was Heine and Reh 's Grammaticalization and Reanalysis in African Languages (1984). This work focussed on African languages synchronically from the point of view of grammaticalization. They saw grammaticalization as an important tool for describing the workings of languages and their universal aspects and it provided an exhaustive list of the pathways of grammaticalization.
The great number of studies on grammaticalization in the last decade (up to 2018) show grammaticalization remains a popular item and is regarded as an important field within linguistic studies in general. Among recent publications there is a wide range of descriptive studies trying to come up with umbrella definitions and exhaustive lists, while others tend to focus more on its nature and significance, questioning the opportunities and boundaries of grammaticalization. An important and popular topic which is still debated is the question of unidirectionality.
It is difficult to capture the term "grammaticalization" in one clear definition (see the 'various views on grammaticalization' section below). However, there are some processes that are often linked to grammaticalization. These are semantic bleaching, morphological reduction, phonetic erosion, and obligatorification.
Semantic bleaching, or desemanticization, has been seen from early on as a characteristic of grammaticalization. It can be described as the loss of semantic content. More specifically, with reference to grammaticalization, bleaching refers to the loss of all (or most) lexical content of an entity while only its grammatical content is retained. [7] For example, James Matisoff described bleaching as "the partial effacement of a morpheme's semantic features, the stripping away of some of its precise content so it can be used in an abstracter, grammatical-hardware-like way". [8] John Haiman wrote that "semantic reduction, or bleaching, occurs as a morpheme loses its intention: From describing a narrow set of ideas, it comes to describe an ever broader range of them, and eventually may lose its meaning altogether". [9] He saw this as one of the two kinds of change that are always associated with grammaticalization (the other being phonetic reduction).
For example, both English suffixes -ly (as in bodily and angrily), and -like (as in catlike or yellow-like) ultimately come from an earlier Proto-Germanic etymon, *līką, which meant body or corpse. There is no salient trace of that original meaning in the present suffixes for the native speaker, but speakers instead treat the more newly-formed suffixes as bits of grammar that help them form new words. One could make the connection between the body or shape of a physical being and the abstract property of likeness or similarity, but only through metonymic reasoning, after one is explicitly made aware of this connection.
Once a linguistic expression has changed from a lexical to a grammatical meaning (bleaching), it is likely to lose morphological and syntactic elements that were characteristic of its initial category, but which are not relevant to the grammatical function. [10] This is called decategorialization, or morphological reduction.
For example, the demonstrative 'that' as in "that book" came to be used as a relative clause marker, and lost the grammatical category of number ('that' singular vs. 'those' plural), as in "the book that I know" versus "the things that I know".
Phonetic erosion (also called phonological attrition or phonological reduction), is another process that is often linked to grammaticalization. It implies that a linguistic expression loses phonetic substance when it has undergone grammaticalization. Heine writes that "once a lexeme is conventionalized as a grammatical marker, it tends to undergo erosion; that is, the phonological substance is likely to be reduced in some way and to become more dependent on surrounding phonetic material". [11]
Bernd Heine and Tania Kuteva have described different kinds of phonetic erosion for applicable cases:
'Going to' → 'gonna' (or even 'I am going to' → 'I'm gonna' → 'I'mma') and 'because' → 'coz' are examples of erosion in English. Some linguists trace erosion to the speaker's tendency to follow the principle of least effort, while others think that erosion is a sign of changes taking place.
However, phonetic erosion, a common process of language change that can take place with no connection to grammaticalization, is not a necessary property of grammaticalization. [12] For example, the Latin construction of the type clarā mente, meaning 'with a clear mind' is the source of modern Romance productive adverb formation, as in Italian chiaramente, and Spanish claramente 'clearly'. In both of those languages, -mente in this usage is interpretable by today's native speakers only as a morpheme signaling 'adverb' and it has undergone no phonological erosion from the Latin source, mente. This example also illustrates that semantic bleaching of a form in its grammaticalized morphemic role does not necessarily imply bleaching of its lexical source, and that the two can separate neatly in spite of maintaining identical phonological form: the noun mente is alive and well today in both Italian and Spanish with its meaning 'mind', yet native speakers do not recognize the noun 'mind' in the suffix -mente.
The phonetic erosion may bring a brand-new look to the phonological system of a language, by changing the inventory of phones and phonemes, making new arrangements in the phonotactic patterns of a syllable, etc. Special treatise on the phonological consequences of grammaticalization and lexicalization in the Chinese languages can be found in Wei-Heng Chen (2011), which provides evidence that a morphophonological change can later change into a purely phonological change, and evidence that there is a typological difference in the phonetic and phonological consequences of grammaticalization between monosyllabic languages (featuring an obligatory match between syllable and morpheme, with exceptions of either loanwords or derivations like reduplicatives or diminutives, other morphological alternations) vs non-monosyllabic languages (including disyllabic or bisyllabic Austronesian languages, Afro-Asiatic languages featuring a tri-consonantal word root, Indo-European languages without a 100% obligatory match between such a sound unit as syllable and such a meaning unit as morpheme or word, despite an assumed majority of monosyllabic reconstructed word stems/roots in the Proto-Indo-European hypothesis), a difference mostly initiated by the German linguist W. Humboldt, putting Sino-Tibetan languages in a sharp contrast to the other languages in the world in typology.
Obligatorification occurs when the use of linguistic structures becomes increasingly more obligatory in the process of grammaticalization. [13] Lehmann describes it as a reduction in transparadigmatic variability, by which he means that "the freedom of the language user with regard to the paradigm as a whole" is reduced. [14] Examples of obligatoriness can be found in the category of number, which can be obligatory in some languages or in specific contexts, in the development of articles, and in the development of personal pronouns of some languages. Some linguists, like Heine and Kuteva, stress the fact that even though obligatorification can be seen as an important process, it is not necessary for grammaticalization to take place, and it also occurs in other types of language change. [15]
Although these 'parameters of grammaticalization' are often linked to the theory, linguists such as Bybee et al. (1994) have acknowledged that independently, they are not essential to grammaticalization. In addition, most are not limited to grammaticalization but can be applied in the wider context of language change. Critics of the theory of grammaticalization have used these difficulties to claim that grammaticalization has no independent status of its own, that all processes involved can be described separately from the theory of grammaticalization. Janda, for example, wrote that "given that even writers on grammaticalization themselves freely acknowledge the involvement of several distinct processes in the larger set of phenomena, it is hard to avoid the conclusion that the notion of grammaticalization, too, tends to represent an epiphenomenal telescoping. That is, it may involve certain typical "path(way)s", but the latter seem to be built out of separate stepping-stones which can often be seen in isolation and whose individual outlines are always distinctly recognizable". [16]
In the process of grammaticalization, an uninflected lexical word (or content word) is transformed into a grammar word (or function word). The process by which the word leaves its word class and enters another is not sudden, but occurs by a gradual series of individual shifts. The overlapping stages of grammaticalization form a chain, generally called a cline. These shifts generally follow similar patterns in different languages. [17] Linguists do not agree on the precise definition of a cline or on its exact characteristics in given instances. It is believed that the stages on the cline do not always have a fixed position, but vary. However, Hopper and Traugott's famous pattern for the cline of grammaticalization illustrates the various stages of the form:
This particular cline is called "the cline of grammaticality" [18] or the "cycle of categorial downgrading", [19] and it is a common one. In this cline every item to the right represents a more grammatical and less lexical form than the one to its left.
It is very common for full verbs to become auxiliaries and eventually inflexional endings.
An example of this phenomenon can be seen in the change from the Old English (OE) verb willan ('to want/to wish') to an auxiliary verb signifying intention in Middle English (ME). In Present-Day English (PDE), this form is even shortened to 'll and no longer necessarily implies intention, but often is simply a mark of future tense (see shall and will). The PDE verb 'will' can thus be said to have less lexical meaning than its preceding form in OE. [20]
The final stage of grammaticalization has happened in many languages. For example, in Serbo-Croatian, the Old Church Slavonic verb [xъtěti] Error: {{Lang}}: Non-latn text (pos 2)/Latn script subtag mismatch (help) ("to want/to wish") has gone from a content word (hoće hoditi "s/he wants to walk") to an auxiliary verb in phonetically reduced form (on/onaće hoditi "s/he will walk") to a clitic (hoditi će), and finally to a fused inflection (hodiće "s/he will walk").
Compare the German verb wollen which has partially undergone a similar path of grammaticalization, and note the simultaneous existence of the non-grammaticalized Modern English verb to will (e.g. "He willed himself to continue along the steep path.") or hoteti in Serbo-Croatian (Hoċu da hodim = I want that I walk).
In Latin the original future tense forms (e.g. cantabo) were dropped when they became phonetically too close to the imperfect forms (cantabam). Instead, a phrase like cantare habeo (literally, 'I have got to sing') acquired the sense of futurity (cf. I have to sing). Finally it became the true future tense in almost all Romance languages and the auxiliary became a full-fledged inflection (cf. Spanish cantaré,cantarás,cantará, French je chanterai,tu chanteras,il/elle chantera, Italian canterò,canterai,canterà, 'I will sing', 'you will sing', 's/he will sing'). In some verbs the process went further and produced irregular forms—cf. Spanish haré (instead of *haceré, 'I'll do') and tendré (not *teneré, 'I'll have'; the loss of e followed by epenthesis of d is especially common)—and even regular forms (in Italian, the change of the a in the stem cantare to e in canterò has affected the whole class of conjugation type I verbs).
An illustrative example of this cline is in the orthography of Japanese compound verbs. Many Japanese words are formed by connecting two verbs, as in 'go and ask (listen)' (行って聞く, ittekiku), and in Japanese orthography lexical items are generally written with kanji (here 行く and 聞く), while grammatical items are written with hiragana (as in the connecting て). Compound verbs are thus generally written with a kanji for each constituent verb, but some suffixes have become grammaticalized, and are written in hiragana, such as 'try out, see' (〜みる, -miru), from 'see' (見る, miru), as in 'try eating (it) and see' (食べてみる, tabetemiru).
In Grammaticalization (2003) Hopper and Traugott state that the cline of grammaticalization has both diachronic and synchronic implications. Diachronically (i.e. looking at changes over time), clines represent a natural path along which forms or words change over time. However, synchronically (i.e. looking at a single point in time), clines can be seen as an arrangement of forms along imaginary lines, with at one end a 'fuller' or lexical form and at the other a more 'reduced' or grammatical form. [21] What Hopper and Traugott mean is that from a diachronic or historical point of view, changes of word forms is seen as a natural process, whereas synchronically, this process can be seen as inevitable instead of historical.
The studying and documentation of recurrent clines enable linguists to form general laws of grammaticalization and language change in general. It plays an important role in the reconstruction of older states of a language. Moreover, the documenting of changes can help to reveal the lines along which a language is likely to develop in the future.
The unidirectionality hypothesis is the idea that grammaticalization, the development of lexical elements into grammatical ones, or less grammatical into more grammatical, is the preferred direction of linguistic change and that a grammatical item is much less likely to move backwards rather than forwards on Hopper & Traugott's cline of grammaticalization. [22]
In the words of Bernd Heine, "grammaticalization is a unidirectional process, that is, it leads from less grammatical to more grammatical forms and constructions". [23] That is one of the strongest claims about grammaticalization, and is often cited as one of its basic principles. In addition, unidirectionality refers to a general developmental orientation which all (or the large majority) of the cases of grammaticalization have in common, and which can be paraphrased in abstract, general terms, independent of any specific case. [24]
The idea of unidirectionality is an important one when trying to predict language change through grammaticalization (and for making the claim that grammaticalization can be predicted). Lessau notes that "unidirectionality in itself is a predictive assertion in that it selects the general type of possible development (it predicts the direction of any given incipient case)," and unidirectionality also rules out an entire range of development types that do not follow this principle, hereby limiting the amount of possible paths of development. [25]
Although unidirectionality is a key element of grammaticalization, exceptions exist. Indeed, the possibility of counterexamples, coupled with their rarity, is given as evidence for the general operating principle of unidirectionality. According to Lyle Campbell, however, advocates often minimize the counterexamples or redefine them as not being part of the grammaticalization cline. [26] He gives the example of Hopper and Traugott (1993), who treat some putative counterexamples as cases of lexicalization in which a grammatical form is incorporated into a lexical item but does not itself become a lexical item. An example is the phrase to up the ante, which incorporates the preposition up (a function word) in a verb (a content word) but without up becoming a verb outside of this lexical item. Since it is the entire phrase to up the ante that is the verb, Hopper and Traugott argue that the word up itself cannot be said to have degrammaticalized, a view that is challenged to some extent by parallel usages such as to up the bid, to up the payment, to up the deductions, to up the medication, by the fact that in all cases the can be replaced by a possessive (my, your, her, Bill's, etc.), and by further extensions still: he upped his game 'he improved his performance'.
Examples that are not confined to a specific lexical item are less common. One is the English genitive -'s, which, in Old English, was a suffix but, in Modern English, is a clitic. As Jespersen (1894) put it,
In Modern English...(compared to OE) the -s is much more independent: it can be separated from its main word by an adverb such as else (somebody else's hat ), by a prepositional clause such as of England (the queen of England's power ), or even by a relative clause such as I saw yesterday (the man I saw yesterday's car)...the English genitive is in fact no longer a flexional form...historically attested facts show us in the most unequivocal way a development - not, indeed, from an originally self-existent word to a mere flexional ending, but the exactly opposite development of what was an inseparable part of a complicated flexional system to greater and greater emancipation and independence. [27]
Traugott cites a counterexample from function to content word proposed by Kate Burridge (1998): the development in Pennsylvania German of the auxiliary wotte of the preterite subjunctive modal welle 'would' (from 'wanted') into a full verb 'to wish, to desire'. [28]
In comparison to various instances of grammaticalization, there are relatively few counterexamples to the unidirectionality hypothesis, and they often seem to require special circumstances to occur. One is found in the development of Irish Gaelic with the origin of the first-person-plural pronoun muid (a function word) from the inflectional suffix -mid (as in táimid 'we are') because of a reanalysis based on the verb-pronoun order of the other persons of the verb. [29] Another well-known example is the degrammaticalization of the North Saami abessive ('without') case suffix -haga to the postposition haga 'without' and further to a preposition and a free-standing adverb. [30] Moreover, the morphologically analogous derivational suffix -naga 'stained with' (e.g., gáffenaga 'stained with coffee', oljonaga 'stained with oil') – itself based on the essive case marker *-na – has degrammaticalized into an independent noun naga 'stain'. [31]
Linguists have come up with different interpretation of the term 'grammaticalization', and there are many alternatives to the definition given in the introduction. The following will be a non-exhaustive list of authors who have written about the subject with their individual approaches to the nature of the term 'grammaticalization'.
Since then, the study of grammaticalization has become broader, and linguists have extended the term into various directions.
In linguistics, an allomorph is a variant phonetic form of a morpheme, or in other words, a unit of meaning that varies in sound and spelling without changing the meaning. The term allomorph describes the realization of phonological variations for a specific morpheme. The different allomorphs that a morpheme can become are governed by morphophonemic rules. These phonological rules determine what phonetic form, or specific pronunciation, a morpheme will take based on the phonological or morphological context in which it appears.
In linguistics, an affix is a morpheme that is attached to a word stem to form a new word or word form. The main two categories are derivational and inflectional affixes. Derivational affixes, such as un-, -ation, anti-, pre- etc., introduce a semantic change to the word they are attached to. Inflectional affixes introduce a syntactic change, such as singular into plural, or present simple tense into present continuous or past tense by adding -ing, -ed to an English word. All of them are bound morphemes by definition; prefixes and suffixes may be separable affixes.
A lexicon is the vocabulary of a language or branch of knowledge. In linguistics, a lexicon is a language's inventory of lexemes. The word lexicon derives from Greek word λεξικόν, neuter of λεξικός meaning 'of or for words'.
A morpheme is any of the smallest meaningful constituents within a linguistic expression and particularly within a word. Many words are themselves standalone morphemes, while other words contain multiple morphemes; in linguistic terminology, this is the distinction, respectively, between free and bound morphemes. The field of linguistic study dedicated to morphemes is called morphology.
In linguistics, morphology is the study of words, including the principles by which they are formed, and how they relate to one another within a language. Most approaches to morphology investigate the structure of words in terms of morphemes, which are the smallest units in a language with some independent meaning. Morphemes include roots that can exist as words by themselves, but also categories such as affixes that can only appear as part of a larger word. For example, in English the root catch and the suffix -ing are both morphemes; catch may appear as its own word, or it may be combined with -ing to form the new word catching. Morphology also analyzes how words behave as parts of speech, and how they may be inflected to express grammatical categories including number, tense, and aspect. Concepts such as productivity are concerned with how speakers create words in specific contexts, which evolves over the history of a language.
Morphological derivation, in linguistics, is the process of forming a new word from an existing word, often by adding a prefix or suffix, such as un- or -ness. For example, unhappy and happiness derive from the root word happy.
Madí—also known as Jamamadí after one of its dialects, and also Kapaná or Kanamanti (Canamanti)—is an Arawan language spoken by about 1,000 Jamamadi, Banawá, and Jarawara people scattered over Amazonas, Brazil.
Morphological typology is a way of classifying the languages of the world that groups languages according to their common morphological structures. The field organizes languages on the basis of how those languages form words by combining morphemes. Analytic languages contain very little inflection, instead relying on features like word order and auxiliary words to convey meaning. Synthetic languages, ones that are not analytic, are divided into two categories: agglutinative and fusional languages. Agglutinative languages rely primarily on discrete particles for inflection, while fusional languages "fuse" inflectional categories together, often allowing one word ending to contain several categories, such that the original root can be difficult to extract. A further subcategory of agglutinative languages are polysynthetic languages, which take agglutination to a higher level by constructing entire sentences, including nouns, as one word.
Hadza is a language isolate spoken along the shores of Lake Eyasi in Tanzania by around 1,000 Hadza people, who include in their number the last full-time hunter-gatherers in Africa. It is one of only three languages in East Africa with click consonants. Despite the small number of speakers, language use is vigorous, with most children learning it, but UNESCO categorizes the language as vulnerable.
A word is a basic element of language that carries meaning, can be used on its own, and is uninterruptible. Despite the fact that language speakers often have an intuitive grasp of what a word is, there is no consensus among linguists on its definition and numerous attempts to find specific criteria of the concept remain controversial. Different standards have been proposed, depending on the theoretical background and descriptive context; these do not converge on a single definition. Some specific definitions of the term "word" are employed to convey its different meanings at different levels of description, for example based on phonological, grammatical or orthographic basis. Others suggest that the concept is simply a convention used in everyday situations.
In linguistics, apophony is an alternation of vowel (quality) within a word that indicates grammatical information.
In linguistics, a suffix is an affix which is placed after the stem of a word. Common examples are case endings, which indicate the grammatical case of nouns and adjectives, and verb endings, which form the conjugation of verbs. Suffixes can carry grammatical information or lexical information . Inflection changes the grammatical properties of a word within its syntactic category. Derivational suffixes fall into two categories: class-changing derivation and class-maintaining derivation.
Deflexion is a diachronic linguistic process in inflectional languages typified by the degeneration of the inflectional structure of a language. All members of the Indo-European language family are subject to some degree of deflexional change. This phenomenon has been especially strong in Western European languages, such as English, French, and others.
Central Alaskan Yupʼik is one of the languages of the Yupik family, in turn a member of the Eskimo–Aleut language group, spoken in western and southwestern Alaska. Both in ethnic population and in number of speakers, the Central Alaskan Yupik people form the largest group among Alaska Natives. As of 2010 Yupʼik was, after Navajo, the second most spoken aboriginal language in the United States. Yupʼik should not be confused with the related language Central Siberian Yupik spoken in Chukotka and St. Lawrence Island, nor Naukan Yupik likewise spoken in Chukotka.
In linguistics, andative and venitive are a type of verbal deixis: verb forms which indicate 'going' or 'coming' motion, respectively, in reference to a particular location or person. Other terms sometimes seen are itive and ventive, or translocative and cislocative. They generally derive historically from the verbs go and come being reduced to auxiliary verbs or verbal affixes, and may in turn be grammaticalized to aspectual morphemes. Many languages of Siberia, California, West Africa, the Caucasus-Mideast-North Africa, and Oceania have such verb forms.
Odia grammar is the study of the morphological and syntactic structures, word order, case inflections, verb conjugation and other grammatical structures of Odia, an Indo-Aryan language spoken in South Asia.
In linguistic morphology, inflection is a process of word formation in which a word is modified to express different grammatical categories such as tense, case, voice, aspect, person, number, gender, mood, animacy, and definiteness. The inflection of verbs is called conjugation, while the inflection of nouns, adjectives, adverbs, etc. can be called declension.
In linguistics an accidental gap, also known as a gap, paradigm gap, accidental lexical gap, lexical gap, lacuna, or hole in the pattern, is a potential word, word sense, morpheme, or other form that does not exist in some language despite being theoretically permissible by the grammatical rules of that language. For example, a word pronounced is theoretically possible in English, as it would obey English phonological rules, but does not currently exist. Its absence is therefore an accidental gap, in the ontologic sense of the word accidental.
The Integrational theory of language is the general theory of language that has been developed within the general linguistic approach of integrational linguistics.
In historical linguistics, subjectification is a language change process in which a linguistic expression acquires meanings that convey the speaker's attitude or viewpoint. An English example is the word while, which, in Middle English, had only the sense of 'at the same time that'. It later acquired the meaning of 'although', indicating a concession on the part of the speaker.
From Language Sciences Volume 23, March (2001):