Khmer grammar

Last updated

This article describes the grammar of the Khmer (Cambodian) language, focusing on the standard dialect.

Contents

Word formation

Khmer is primarily an analytic language, with no inflection. There is some derivation by means of prefixes and infixes, but it is no longer always productive, [1] as those elements are often crystallized in words inherited from Old Khmer. Even in Old Khmer, the same affix could have multiple functions (for example, it could serve as a nominalizer in one word and as a causativizer in another).

A common infix has the form [-ɑm(n)-] or [-ɑn-] (or with other vowels), inserted after an initial consonant, especially to convert adjectives or verbs into nouns.

Compounding is common; in a compound of two nouns, the head generally comes first, often the reverse of the English order: "duck egg" is ពងទា[pɔːŋtiə], literally "egg-duck".

Word order

Since Khmer is an analytic language, word order is relatively fixed, as changes in word order often affect meaning. Khmer is generally a subject–verb–object (SVO) language. Topicalization is common: the topic of the sentence is often placed at the start, with the rest of the sentence a comment on that topic.

Like in English, prepositions are used rather than postpositions (words meaning "in", "on", etc. precede the noun that they govern). [2] The language is generally head-initial so modifiers come after the words modified (adjectives, possessives, demonstratives, relative clauses, etc. follow nouns; adverbs mostly follow verbs; and so on).

Nouns

Khmer nouns have no grammatical gender or grammatical number inflections. There are no articles, but indefiniteness is often expressed by the word for "one" following the noun. Plurality can be marked by postnominal particles, numerals, or reduplication of a following adjective, which, although it is similar to intensification, is usually differentiated by context: [3]

Possessives are formed by placing the noun or pronoun representing the possessor after the main noun, often with the word របស់[rəbɑh] between them. (The word របស់ is also a noun, meaning "thing".)

Numbers and classifiers

In Khmer, a number that indicates quantity follows the noun.

សៀវភៅ

siəw pʰɨw

book

បួន

ɓuən

four

សៀវភៅ បួន

{siəw pʰɨw} ɓuən

book four

four books

Exceptions include nouns indicating passage of time such as hours or days, units of measurements and currencies, all of which function as noun classifiers without the explicit mention of what is being classified. Reversal of the order can change the meaning:

Classifying particles for use with numerals and nouns exist but are optional, unlike Thai, except in introductory clauses. They are used for clarity or formality, and number precedes classifiers.

The following example illustrates the superfluous mention of what is being classified:

Counting in Khmer is based on a biquinary system (6 to 9 have the form "five one", "five two", etc.) However, the words for multiples of ten from 30 to 90 are not related to the basic Khmer numbers but are probably borrowed from Thai. The Khmer script has its own versions of the Arabic numerals.

The principal number words are listed in the following table, which gives Western and Khmer digits, Khmer spelling and IPA transcription: [1]

0សូន្យ[soun]
1មួយ[muəj]
2ពីរ[piː]20២០ម្ភៃ[mphej]
3បី[ɓəːj]30៣០សាមសិប[saːmsəp]
4បួន[ɓuən]40៤០សែសិប[saesəp]
5ប្រាំ[pram]50៥០ហាសិប[haːsəp]
6ប្រាំមូយ[prammuəj]60៦០ហុកសិប[hoksəp]
7ប្រាំពីរ[prampiː], [prampɨl]70៧០ចិតសិប[cətsəp]
8ប្រាំបី[pramɓəːj]80៨០ប៉ែតសិប[paetsəp]
9ប្រាំបួន[pramɓuən]90៩០កៅសិប[kaʋsəp]
10១០ដប់[ɗɑp]100១០០មួយរយ[muəjrɔːj]

Intermediate numbers are formed by compounding the above elements. Powers of ten are denoted by រយ[rɔːj] (100), ពាន់[pŏən] (1,000), ម៉ឺន[məɨn] (10,000), សែន[saen] (100,000), លាន[liən] (1,000,000) and កោដិ[kaot] (10,000,000). For more information, see Khmer numerals.

Ordinal numbers are formed by placing the particle ទី[tiː] before the corresponding cardinal number.

Demonstratives

In Khmer, demonstratives follow the nouns that they modify. In standard literary Khmer, there are two degrees of distance, but there are other distinctions in colloquial language. Demonstratives can also function as articles only after introductory clauses.

DemonstrativeAlternative FormsMeaning
នេះ (nih)ហ្នឹង (nəŋ)This
នោះ (nuh)ហ្នុង (noŋ)That

Pronouns

The pronominal system is complex and is full of honorific variations. There is generally no single pronoun appropriate for all situations, with the choice of pronoun based on age, gender, and relationship. Nouns referring to a specific person, like brother or uncle, can be used as pronouns even when one directly addresses the person.

For some examples of typical pronoun forms, see Khmer language – Social registers. Below is a table of the most common kinship terms that are also used as personal pronouns.

Kinship term Reciprocal TranslationNon-kinship usageNote
ឪពុក
/ʔəwpuk/
កូន
/koun/
father; childnoneformally refers to one's own father, other terms for "father/dad" include: /paː/, /ʔəw/, /bəidaː/, /puk/
ម្តាយ
/mdaːj/
កូន
/koun/
mother; childMany other terms are used commonly: /mae/, /maʔ/
បង
/bɑːŋ/
ប្អូន, អូន
/pʔoun/, /ʔoun/
older siblingan older man or woman of the same generation; the man in a romantic relationshipចែ /cae/ is a Chinese-derived term used to refer to a woman around the same age in Cambodia and is more often said among Chinese Cambodians and sometimes in urban settings
ប្អូន, អូនបងyounger siblinga younger person of the same generation; a child; the woman in a romantic relationshipOther colloquial terms are used to refer to younger generations using /ʔaː/ as a diminutive: /ʔaːnuh/, /miːnuh/, /niəŋ/
កូន
/koun/
ប៉ា, មែ
/paː/, /mae/
biological child or grandchilda young child; a person at least one generation younger
ចៅ
/caw/
តា, យាយ
/taː/, /jiəj/
grandchild, cousin of junior generationsa young child; a person at least one generation younger
តា
/taː/
ចៅ or កូន
/caw/ or /koun/
grandfatherany man of grandparents' generationother terms can be used depending on family structure and dialect
យាយ
/jiəj/
ចៅ or កូនgrandmothera middle-aged (married) womanother terms may be used depending on family structure and dialect
មីង
/miːŋ/
ក្មួយ
/kmuəj/
a parent's younger sister/sister-in-lawany woman of parents' generation, but younger than parents, a young (usually unmarried) woman (formal)អ៊ី /ʔiː/ is the equivalent Chinese term and, similar to /cae/, is sometimes used in urban areas and among Chinese Cambodians
ពូ
/puː/
ក្មួយ
/kmuəj/
a parent's younger brother/brother-in-lawany man of parents' generation, but younger than parents; also a man slightly older than the speaker (formal)an equivalent, but more colloquial, term is /miə/
អ៊ុំ
/ʔom/
ក្មួយ
/kmuəj/
a parent's older sibling; his/her spouseany person of parents' generation but older than parents
អ្នក
/nĕəʔ/
ប្អូន
/pʔoun/
"you" (when addressing an older brother- or sister-in-law)third person equivalent is បងថ្លៃ /bɑːŋtʰlai/

Adjectives

Adjectives in Khmer follow the noun; doubling the adjective can indicate plurality or intensify the meaning (see Nouns above). Adjectives follow verbs when they function as adverbs. Adjectives are actually stative verbs and are used without a copula and can be negated, like other verbs.

Comparatives are expressed using the word ជាង /ciəŋ/: "A X /ciəŋ/ [B]" (A is more X [than B]). The most common way to express superlatives is with ជាងគេ /ciəŋ keː/: "A X /ciəŋ keː/" (A is X-est of all). [3]

Reduplication

In Khmer, nouns and adjectives can be reduplicated. Reduplication can occur as perfect reduplicates or by altering the rhyme of words. Khmer also uses compound reduplication in which two phonologically unrelated words with similar or identical meanings are compounded.

The Khmer script includes the symbol ៗ, which indicates that the preceding word or phrase is to be repeated.

Verbs

As is typical of most East Asian languages, [4] Khmer verbs do not inflect at all; tense and aspect can be expressed using particles (such as កំពុង [kəmpuŋ], placed before a verb to express continuous aspect) and adverbs (such as "yesterday", "earlier", "tomorrow"), or may be understood from context. Serial verb construction is quite common.

Questions

Yes–no questions can be formed by placing the particle ទេ /teː/ at the end of a sentence. This particle can also serve as an emphatic particle (it is also used in negative sentences, as shown below), and so intonation may be required to indicate that a question is being asked.

In wh-questions, the question word generally remains in its usual grammatical position in the sentence, rather than being brought to the start as in English (that is, wh-fronting does not normally take place).

Negation

Verbs can be negated in three primary fashions, all of which convey a slightly different connotation or formality. The most common method of negation is a discontinuous construction placing មិន "[mɨn]" (not, not to be) before the verb and ending the verb phrase with ទេ "[teː]", which, as a stand-alone word can be either "no" or a particle contradicting a previous statement. Colloquially, the final "ទេ" may be omitted.

ខ្ញុំ

kʰɲom

ជឿ

cɨə

ខ្ញុំ ជឿ

kʰɲom cɨə

I believe

ខ្ញុំ

kʰɲom

មិន

mɨn

ជឿ

cɨə

ទេ

teː

ខ្ញុំ មិន ជឿ ទេ

kʰɲom mɨn cɨə teː

I don't believe

Another fairly common way of indicating negation uses អត់ (also spelled ឥត) "[ɑt]" instead of មិន. "អត់", as an independent word, means "without" or "lacking" and expresses a similar connotation when used to negate a verb.

ខ្ញុំ

kʰɲom

អត់

ɑt

ឃ្លាន

khliə̯n

ទេ

teː

ខ្ញុំ អត់ ឃ្លាន ទេ

kʰɲom ɑt khliə̯n teː

"I am not hungry." (lit. "I (am) without hunger".)

A third method is basically identical to the first method except ពុំ "[pum]" is used instead of "[mɨn]". This is used only in literary or very formal contexts.

Related Research Articles

<span class="mw-page-title-main">Khmer language</span> Austroasiatic language of Cambodia

Khmer is an Austroasiatic language spoken by the Khmer people, and the official and national language of Cambodia. Khmer has been influenced considerably by Sanskrit and Pali, especially in the royal and religious registers, through Hinduism and Buddhism. It is also the earliest recorded and earliest written language of the Mon–Khmer family, predating Mon and Vietnamese, due to Old Khmer being the language of the historical empires of Chenla, Angkor and, presumably, their earlier predecessor state, Funan.

Esperanto is the most widely used constructed language intended for international communication; it was designed with highly regular grammatical rules, and as such is considered an easy language to learn.

In grammar, a part of speech or part-of-speech is a category of words that have similar grammatical properties. Words that are assigned to the same part of speech generally display similar syntactic behavior, sometimes similar morphological behavior in that they undergo inflection for similar properties and even similar semantic behavior. Commonly listed English parts of speech are noun, verb, adjective, adverb, pronoun, preposition, conjunction, interjection, numeral, article, and determiner.

English grammar is the set of structural rules of the English language. This includes the structure of words, phrases, clauses, sentences, and whole texts.

<span class="mw-page-title-main">Chinese grammar</span> Grammar of the Standard Chinese language

The grammar of Standard Chinese shares many features with other varieties of Chinese. The language almost entirely lacks inflection; words typically have only one grammatical form. Categories such as number and verb tense are often not expressed by grammatical means, but there are several particles that serve to express verbal aspect and, to some extent, mood.

<span class="mw-page-title-main">Gilbertese language</span> Austronesian language spoken in Kiribati

Gilbertese or taetae ni Kiribati, also Kiribati, is an Austronesian language spoken mainly in Kiribati. It belongs to the Micronesian branch of the Oceanic languages.

The grammar of the Polish language is complex and characterized by a high degree of inflection, and has relatively free word order, although the dominant arrangement is subject–verb–object (SVO). There commonly are no articles, and there is frequent dropping of subject pronouns. Distinctive features include the different treatment of masculine personal nouns in the plural, and the complex grammar of numerals and quantifiers.

<span class="mw-page-title-main">Majhi language</span> Indo-Aryan language of Nepal

Majhi is an Indo-Aryan language spoken in parts of Nepal and formerly in some small pockets of neighboring India.:1 The language is associated with the Majhi people, an ethnic group in those regions who dwell historically near the Saptakoshi River and its tributaries and elsewhere in central and eastern Nepal. The Majhi people generally subsist off of work associated with rivers, including fishing and ferrying.:2 Majhi is written using the Devanagari writing system.

<span class="mw-page-title-main">Hindustani grammar</span> Grammatical features of the Hindustani lingua franca

Hindustani, the lingua franca of Northern India and Pakistan, has two standardised registers: Hindi and Urdu. Grammatical differences between the two standards are minor but each uses its own script: Hindi uses Devanagari while Urdu uses an extended form of the Perso-Arabic script, typically in the Nastaʿlīq style.

Ilocano grammar is the study of the morphological and syntactic structures of the Ilocano language, a language spoken in the northern Philippines by ethnic Ilocanos and Ilocano communities in the US, Saudi Arabia and other countries around the globe.

Pohnpeian is a Micronesian language spoken as the indigenous language of the island of Pohnpei in the Caroline Islands. Pohnpeian has approximately 30,000 (estimated) native speakers living in Pohnpei and its outlying atolls and islands with another 10,000-15,000 (estimated) living off island in parts of the US mainland, Hawaii and Guam. It is the second-most widely spoken native language of the Federated States of Micronesia the first being Chuukese.

This article is a description of the morphology, syntax, and semantics of Korean. For phonetics and phonology, see Korean phonology. See also Korean honorifics, which play a large role in the grammar.

Araki is a nearly extinct language spoken in the small island of Araki, south of Espiritu Santo Island in Vanuatu. Araki is gradually being replaced by Tangoa, a language from a neighbouring island.

Breton is a Brittonic Celtic language in the Indo-European family, and its grammar has many traits in common with these languages. Like most Indo-European languages it has grammatical gender, grammatical number, articles and inflections and, like the other Celtic languages, Breton has mutations. In addition to the singular–plural system, it also has a singulative–collective system, similar to Welsh. Unlike the other Brittonic languages, Breton has both a definite and indefinite article, whereas Welsh and Cornish lack an indefinite article and unlike the other extant Celtic languages, Breton has been influenced by French.

Vietnamese is an analytic language, meaning it conveys grammatical information primarily through combinations of words as opposed to suffixes. The basic word order is subject-verb-object (SVO), but utterances may be restructured so as to be topic-prominent. Vietnamese also has verb serialization. In Vietnamese the head of the phrase usually precedes its complements, nouns are classified according to series of lexical parameters, and pronouns may be absent from utterances. Question words in the language do not exhibit wh-movement.

<span class="mw-page-title-main">Inflection</span> Process of word formation

In linguistic morphology, inflection is a process of word formation in which a word is modified to express different grammatical categories such as tense, case, voice, aspect, person, number, gender, mood, animacy, and definiteness. The inflection of verbs is called conjugation, and one can refer to the inflection of nouns, adjectives, adverbs, pronouns, determiners, participles, prepositions and postpositions, numerals, articles, etc, as declension.

Tlahuitoltepec Mixe, called South Highland Mixe in Wichmann (1995), is a Mixe language spoken in Mexico.

Malay grammar is the body of rules that describe the structure of expressions in the Malay language and Indonesian. This includes the structure of words, phrases, clauses and sentences. In Malay and Indonesian, there are four basic parts of speech: nouns, verbs, adjectives, and grammatical function words (particles). Nouns and verbs may be basic roots, but frequently they are derived from other words by means of prefixes and suffixes.

<span class="mw-page-title-main">Baluan-Pam language</span> Oceanic language of Manus Province, Papua New Guinea

Baluan-Pam is an Oceanic language of Manus Province, Papua New Guinea. It is spoken on Baluan Island and on nearby Pam Island. The number of speakers, according to the latest estimate based on the 2000 Census, is 2,000. Speakers on Baluan Island prefer to refer to their language with its native name Paluai.

Levantine Arabic grammar is the set of rules by which Levantine Arabic creates statements, questions and commands. In many respects, it is quite similar to that of the other vernacular Arabic varieties.

References

  1. 1 2 David Smyth (1995). Colloquial Cambodian: A Complete Language Course. Routledge (UK). ISBN   0-415-10006-2.
  2. Huffman, Franklin. 1967. An outline of Cambodian Grammar. PhD thesis, Cornell University.
  3. 1 2 Franklin E. Huffman, Modern Spoken Cambodian, Yale University, 1970.
  4. East and Southeast Asian Languages: A First Look Archived 2012-11-20 at the Wayback Machine at Oxford University Press Online