Bound and unbound morphemes

In morphology, a bound morpheme is a morpheme (the most basic unit of meaning) that can appear only as part of a larger word; a free morpheme or unbound morpheme is one that can stand alone or can appear with other morphemes in a lexeme. [1] A bound morpheme is also known as a bound form, and similarly a free morpheme is a free form. [2]

In linguistics, morphology is the study of words, how they are formed, and their relationship to other words in the same language. It analyzes the structure of words and parts of words, such as stems, root words, prefixes, and suffixes. Morphology also looks at parts of speech, intonation and stress, and the ways context can change a word's pronunciation and meaning. Morphology differs from morphological typology, which is the classification of languages based on their use of words, and lexicology, which is the study of words and how they make up a language's vocabulary.

A morpheme is the smallest grammatical unit in a language. A morpheme is not identical to a word: a morpheme may or may not stand alone, whereas a word, by definition, is freestanding. The field of linguistics dedicated to morphemes is called morphology. A morpheme that stands by itself is considered a root because it has a meaning of its own; a morpheme that depends on another morpheme to express an idea is an affix because it has a grammatical function. Every word comprises one or more morphemes.

A lexeme is a unit of lexical meaning that exists regardless of the number of inflectional endings it may have or the number of words it may contain. It is a basic abstract unit of meaning. Put more technically, a lexeme is an abstract unit of morphological analysis in linguistics, that roughly corresponds to a set of forms taken by a single word. For example, in English, run, runs, ran and running are forms of the same lexeme, which can be represented as RUN. A related concept is the lemma, which is a particular form of a lexeme that is chosen by convention to represent a canonical form of a lexeme. Lemmas, being a subset of lexemes, are likewise used in dictionaries as the headwords, and other forms of a lexeme are often listed later in the entry if they are not common conjugations of that word.
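The relationship between a lexeme, its word forms, and its lemma can be sketched as a small lookup table. This is only an illustration using the RUN example above: the names LEXEMES, lexeme_of and lemma_of are hypothetical, and treating the first listed form as the lemma is an assumption of the sketch, not a linguistic rule.

```python
# Toy lexicon: each lexeme (written in capitals by convention) maps to its word forms.
# Assumption of this sketch: the first form in each list is the lemma.
LEXEMES = {
    "RUN": ["run", "runs", "ran", "running"],
}


def lexeme_of(word_form):
    """Return the lexeme label a word form belongs to, or None if it is unknown."""
    for lexeme, forms in LEXEMES.items():
        if word_form in forms:
            return lexeme
    return None


def lemma_of(word_form):
    """Return the conventional citation form (lemma) for a word form, or None."""
    lexeme = lexeme_of(word_form)
    return LEXEMES[lexeme][0] if lexeme is not None else None


print(lexeme_of("ran"))      # RUN
print(lemma_of("running"))   # run
```

A real lexicon would need far more than a flat list per lexeme, for instance to handle homographs, but the lookup shows the many-forms-to-one-lexeme relationship.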

Roots and affixes

Many roots are free morphemes (ship- in "shipment"), and others are bound. Roots normally carry lexical meaning. Words like chairman that contain two free morphemes (chair and man) are referred to as compound words.

A root is a word with no prefix or suffix attached. The root word is the primary lexical unit of a word, and of a word family; it carries the most significant aspects of semantic content and cannot be reduced into smaller constituents. Content words in nearly all languages contain, and may consist only of, root morphemes. However, the term "root" is sometimes also used to describe the word minus its inflectional endings, but with its lexical endings in place. For example, chatters has the inflectional root or lemma chatter, but the lexical root chat. Inflectional roots are often called stems, and a root in the stricter sense may be thought of as a monomorphemic stem.

In linguistics, a compound is a lexeme that consists of more than one stem. Compounding, composition or nominal composition is the process of word formation that creates compound lexemes. That is, in familiar terms, compounding occurs when two or more words or signs are joined to make one longer word or sign. The meaning of the compound may be similar to or different from the meaning of its components in isolation. The component stems of a compound may be of the same part of speech—as in the case of the English word footpath, composed of the two nouns foot and path—or they may belong to different parts of speech, as in the case of the English word blackbird, composed of the adjective black and the noun bird. With very few exceptions, English compound words are stressed on their first component stem.

Affixes are always bound in English, but some languages like Arabic have forms that sometimes affix to words and sometimes stand alone. English language affixes are almost exclusively prefixes or suffixes: pre- in "precaution" and -ment in "shipment". Affixes may be inflectional, indicating how a certain word relates to other words in a larger phrase, or derivational, changing either the part of speech or the actual meaning of a word.
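The distinctions drawn so far (free versus bound, root versus affix, compound versus affixed word) can be made concrete with a hand-built inventory. The following is a minimal sketch, not an automatic segmenter: the segmentations are supplied by hand, and the names Morpheme, INVENTORY, SEGMENTED, bound_morphemes and is_compound are hypothetical. It uses only the examples shipment, precaution and chairman from the text.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Morpheme:
    form: str
    kind: str   # "root", "prefix", or "suffix"
    free: bool  # True if the morpheme can stand alone as a word


# Hand-written inventory covering only the examples in the text.
INVENTORY = {
    "ship": Morpheme("ship", "root", True),
    "caution": Morpheme("caution", "root", True),
    "chair": Morpheme("chair", "root", True),
    "man": Morpheme("man", "root", True),
    "pre-": Morpheme("pre-", "prefix", False),
    "-ment": Morpheme("-ment", "suffix", False),
}

# Segmentations are given by hand; nothing here discovers them.
SEGMENTED = {
    "shipment": ["ship", "-ment"],
    "precaution": ["pre-", "caution"],
    "chairman": ["chair", "man"],
}


def bound_morphemes(word):
    """List the bound morphemes in a pre-segmented word."""
    return [m for m in SEGMENTED[word] if not INVENTORY[m].free]


def is_compound(word):
    """A word counts as a compound here if it contains two or more free roots."""
    free_roots = [
        m for m in SEGMENTED[word]
        if INVENTORY[m].kind == "root" and INVENTORY[m].free
    ]
    return len(free_roots) >= 2


print(bound_morphemes("shipment"))   # ['-ment']
print(is_compound("chairman"))       # True
```

The inventory treats every root as free, which holds for these three words only; bound roots would need entries with kind "root" and free set to False.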

In linguistics, an affix is a morpheme that is attached to a word stem to form a new word or word form. Affixes may be derivational, like English -ness and pre-, or inflectional, like English plural -s and past tense -ed. They are bound morphemes by definition; prefixes and suffixes may be separable affixes. Affixation is the linguistic process that speakers use to form different words by adding morphemes at the beginning (prefixation), the middle (infixation) or the end (suffixation) of words.

Arabic Central Semitic language

Arabic (al-ʻarabiyyah [alʕaraˈbijːa] or ʻarabī [ˈʕarabiː] or [ʕaraˈbij]) is a Central Semitic language that first emerged in Iron Age northwestern Arabia and is now the lingua franca of the Arab world. It is named after the Arabs, a term initially used to describe peoples living in the area bounded by Mesopotamia in the east and the Anti-Lebanon mountains in the west, in northwestern Arabia, and in the Sinai Peninsula. Arabic is classified as a macrolanguage comprising 30 modern varieties, including its standard form, Modern Standard Arabic, which is derived from Classical Arabic.

English language West Germanic language

English is a West Germanic language that was first spoken in early medieval England and eventually became a global lingua franca. It is named after the Angles, one of the Germanic tribes that migrated to the area of Great Britain that later took their name, as England. Both names derive from Anglia, a peninsula in the Baltic Sea. The language is closely related to Frisian and Low Saxon, and its vocabulary has been significantly influenced by other Germanic languages, particularly Norse, and to a greater extent by Latin and French.

Cranberry morphemes are a special form of bound morpheme whose independent meaning has been displaced and that serve only to distinguish one word from another, as in cranberry, in which the free morpheme berry is preceded by the bound morpheme cran-, which meant "crane" in the earlier name for the berry, "crane berry".

In linguistic morphology, a cranberry morpheme is a type of bound morpheme that cannot be assigned an independent meaning or grammatical function, but nonetheless serves to distinguish one word from another.

Word formation

Words can be formed purely from bound morphemes, as in English permit, ultimately from Latin per "through" + mittō "I send", where per- and -mit are bound morphemes in English. However, such words are often analyzed as a single morpheme.

Latin Indo-European language of the Italic family

Latin is a classical language belonging to the Italic branch of the Indo-European languages. The Latin alphabet is derived from the Etruscan and Greek alphabets, and ultimately from the Phoenician alphabet.

Chinese offers a similar example: most of its morphemes are monosyllabic and identified with a Chinese character because of the largely morphosyllabic script, but disyllabic words exist that cannot be analyzed into independent morphemes, such as 蝴蝶 húdié 'butterfly'. In such cases the individual syllables and corresponding characters are used only in that word, and while they can be interpreted as bound morphemes 蝴 hú- and 蝶 -dié, the word is more commonly considered a single disyllabic morpheme. See polysyllabic Chinese morphemes for further discussion.

Chinese language family of languages

Chinese is a group of related, but in many cases not mutually intelligible, language varieties, forming the Sinitic branch of the Sino-Tibetan language family. Chinese is spoken by the Han majority and many minority ethnic groups in China. About 1.2 billion people speak some form of Chinese as their first language.

Linguists usually distinguish between productive and unproductive forms when speaking about morphemes. For example, the morpheme ten- in tenant was originally derived from the Latin word tenere, "to hold", and the same basic meaning is seen in such words as "tenable" and "tenure". But as ten- is not used in English to form new words, most linguists would not consider it to be a morpheme at all.

Analytic and synthetic languages

A language with a very low ratio of morphemes to words is an isolating language. Since such a language uses few bound morphemes, it expresses most grammatical relationships by word order or helper words, so it is an analytic language.

In contrast, a language that uses a substantial number of bound morphemes to express grammatical relationships is a synthetic language.
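The morphemes-per-word ratio that separates isolating from synthetic languages is a simple average. The sketch below assumes the words have already been segmented into morphemes by hand; the function name morphemes_per_word and the sample data are hypothetical and chosen only to show the arithmetic, not to characterize any real language.

```python
def morphemes_per_word(segmented_words):
    """Average number of morphemes per word, given pre-segmented words.

    Each word is a list of its morphemes, for example ["ship", "-ment"].
    """
    if not segmented_words:
        raise ValueError("need at least one word")
    total_morphemes = sum(len(word) for word in segmented_words)
    return total_morphemes / len(segmented_words)


# Every word is a single morpheme: the isolating extreme, ratio 1.0.
isolating_like = [["ship"], ["chair"], ["man"]]

# Words built with bound morphemes push the ratio above 1.
more_synthetic = [["ship", "-ment"], ["pre-", "caution"], ["man"]]

print(morphemes_per_word(isolating_like))   # 1.0
print(morphemes_per_word(more_synthetic))
```

A ratio near 1 corresponds to the isolating end of the scale; higher values indicate heavier use of bound morphemes. Any threshold between "analytic" and "synthetic" would be a matter of convention and is deliberately not encoded here.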

Related Research Articles

A clitic is a morpheme in morphology and syntax that has syntactic characteristics of a word, but depends phonologically on another word or phrase. In this sense, it is syntactically independent but phonologically dependent—always attached to a host. The term derives from the Greek for leaning. A clitic is pronounced like an affix, but plays a syntactic role at the phrase level. In other words, clitics have the form of affixes, but the distribution of function words. For example, the contracted forms of the auxiliary verbs in I'm and we've are clitics.

A lexicon, word-hoard, wordbook, or word-stock is the vocabulary of a person, language, or branch of knowledge. In linguistics, a lexicon is a language's inventory of lexemes. The word "lexicon" derives from the Greek λεξικόν (lexicon), neuter of λεξικός (lexikos) meaning "of or for words."

Morphological derivation, in linguistics, is the process of forming a new word from an existing word, often by adding a prefix or suffix, such as -ness or un-. For example, happiness and unhappy derive from the root word happy.

In linguistic morphology, an uninflected word is a word that has no morphological markers (inflection) such as affixes, ablaut, consonant gradation, etc., indicating declension or conjugation. If a word has an uninflected form, this is usually the form used as the lemma for the word.

Isolating language Language with a very low morpheme per word ratio

An isolating language is a type of language with a very low morpheme per word ratio and no inflectional morphology whatsoever. In the extreme case, each word contains a single morpheme. Currently, the most spoken purely-isolating language is Yoruba.

A synthetic language uses inflection or agglutination to express syntactic relationships within a sentence. Inflection is the addition of morphemes to a root word that assigns grammatical property to that word, while agglutination is the combination of two or more morphemes into one word. The information added by morphemes can include indications of a word's grammatical category, such as whether a word is the subject or object in the sentence. Morphology can be either relational or derivational.

Morphological typology is a way of classifying the languages of the world that groups languages according to their common morphological structures. The field organizes languages on the basis of how those languages form words by combining morphemes. Analytic languages contain very little inflection, instead relying on features like word order and auxiliary words to convey meaning. Synthetic languages, ones that are not analytic, are divided into two categories: agglutinative and fusional languages. Agglutinative languages rely primarily on discrete particles for inflection, while fusional languages "fuse" inflectional categories together, often allowing one word ending to contain several categories, such that the original root can be difficult to extract. Polysynthetic languages are a further subcategory of agglutinative languages; they take agglutination to a higher level by constructing entire sentences, including nouns, as one word.

In linguistics, a stem is a part of a word. The term is used with slightly different meanings.

In linguistics, apophony is any sound change within a word that indicates grammatical information.

In historical linguistics and language change, grammaticalization is a process of language change by which words representing objects and actions become grammatical markers. Thus it creates new function words by a process other than deriving them from existing bound, inflectional constructions, instead deriving them from content words. For example, the Old English verb willan 'to want', 'to wish' has become the Modern English auxiliary verb will, which expresses intention or simply futurity. Some concepts are often grammaticalized, while others, such as evidentiality, are grammaticalized less often.

In linguistics, a suprafix is a type of affix that gives a suprasegmental pattern to either a neutral base or a base with a preexisting suprasegmental pattern. This affix will, then, convey a derivational or inflectional meaning. This suprasegmental pattern acts like segmental phonemes within a morpheme; the suprafix is a combination of suprasegmental phonemes, organized into a pattern, that creates a morpheme. For example, a number of African languages express tense / aspect distinctions by tone. And English has a process of changing stress on verbs to create nouns.

English prefixes are affixes that are added before either simple roots or complex bases consisting of (a) a root and other affixes, (b) multiple roots, or (c) multiple roots and other affixes.

Vietnamese, like many languages in Southeast Asia, is an analytic language. Vietnamese lacks morphological marking of case, gender, number, and tense.

Odia morphology is the identification, analysis and description of the structure of morphemes and other units of meaning in the Odia language. Morphemes are the smallest units of the Odia language that carry and convey a unique meaning and are grammatically appropriate. A morpheme in Odia is the smallest meaningful constituent, one that combines phonemes into a meaningful expression through its form and structure. In other words, a morpheme in Odia is a combination of sounds that possesses and conveys a meaning. A morpheme is not necessarily a meaningful word in Odia. In Odia, every morpheme is either a base or an affix.

Inflection modification of a word to express different grammatical categories such as tense, mood, voice, aspect, person, number, gender and case

In grammar, inflection is the modification of a word to express different grammatical categories such as tense, case, voice, aspect, person, number, gender, and mood. It is found in many but not all languages. The inflection of verbs is also called conjugation, and one can refer to the inflection of nouns, adjectives, adverbs, pronouns, determiners, participles, prepositions, postpositions, numerals, articles etc., as declension.

In linguistic typology, polysynthetic languages are highly synthetic languages, i.e. languages in which words are composed of many morphemes. They are very highly inflected languages. Polysynthetic languages typically have long "sentence-words" such as the Yupik word tuntussuqatarniksaitengqiggtuq which means "He had not yet said again that he was going to hunt reindeer." The word consists of the morphemes tuntu-ssur-qatar-ni-ksaite-ngqiggte-uq with the meanings, reindeer-hunt-future-say-negation-again-third person-singular-indicative; and except for the morpheme tuntu "reindeer", none of the other morphemes can appear in isolation.
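The morpheme-by-morpheme gloss of the Yupik example can be lined up mechanically. This sketch assumes, as the hyphenation in the text suggests, that the final ending -uq carries the combined gloss "third person singular indicative"; the variable names are hypothetical, and the segmentation is copied from the text rather than computed.

```python
# Segmentation and glosses as given in the text; nothing is derived automatically.
morphemes = "tuntu-ssur-qatar-ni-ksaite-ngqiggte-uq".split("-")
glosses = [
    "reindeer",
    "hunt",
    "future",
    "say",
    "negation",
    "again",
    "third person singular indicative",  # assumed to be one gloss for -uq
]

# Pair each morpheme with its gloss, in order.
aligned = list(zip(morphemes, glosses))

# Per the text, only tuntu "reindeer" can appear in isolation; the rest are bound.
FREE = {"tuntu"}
bound = [m for m in morphemes if m not in FREE]

for morpheme, gloss in aligned:
    status = "free" if morpheme in FREE else "bound"
    print(f"{morpheme:10} {gloss:35} {status}")
```

Six of the seven morphemes in this single word are bound, which is what places the example at the polysynthetic end of the scale.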

References

  1. Kroeger, Paul (2005). Analyzing Grammar: An Introduction. Cambridge: Cambridge University Press. p. 13. ISBN 978-0-521-01653-7.
  2. Elson and Pickett (1968). Beginning Morphology and Syntax. SIL. p. 6. ISBN 0-88312-925-6. "Morphemes which may occur alone are called free forms; morphemes which never occur alone are called bound forms."