Sesotho grammar

This article presents a brief overview of the grammar of Sesotho and provides links to more detailed articles.

Typology

The Sesotho language may be described in several ways depending on the aspect being considered.

Formatives

Bantu languages are agglutinative — words are constructed by combining discrete formatives (a.k.a. "morphemes") according to specific rules, and sentences are constructed by stringing together words according to somewhat less strict rules. Formatives alone cannot constitute words; formatives are the component parts of words.

These formatives may be classed generally into roots, stems, prefixes, concords, suffixes, verbal auxiliaries, enclitics, and proclitics.


Roots are the most basic irreducible elements of words and are immutable (except under purely phonetic changes). Entire words are built from roots by affixing other formatives around the root as appendages; [1] every word (except contractions and compounds) contains exactly one root, from which it derives its most basic meaning (though, technically speaking, the root by itself does not really have any meaning). Roots are the basis of the Sotho parts of speech.

The following words:

  1. [huˌʀutɑ] ho ruta ('to teach')
  2. [bɑliˌʀutʼile] ba le rutile ('they taught you [pl.]')
  3. [ʀɪ'ɑʀutʼɑnɑ] re a rutana ('we teach one another')
  4. [hɑbɑliˌʀutʼisise] ha ba le rutisise ('they do not teach you [pl.] properly')
  5. [muˌʀutʼehi] morutehi ('an academic')
  6. [tʰutʼɔ] thuto ('education')
  7. [muˌ'itʰutʼi] moithuti ('learner')

are all formed from the root [ʀutʼ] -rut-.

Although in some cases various phonetic processes may ultimately change the root's form in predictable ways (such as the nasalization in the last two examples above) the root itself is considered to be unchanged.
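The relationship between the seven words and their shared root can be sketched in a few lines of Python. This is only an illustration working on the written forms listed above: the names `WORDS`, `ROOT`, `NASALIZED_ROOT`, and `surface_root` are hypothetical, and the nasalized spelling `thut` is taken directly from the last two examples rather than derived by any phonological rule.

```python
# Orthographic forms copied from the list above.
WORDS = [
    "ho ruta",
    "ba le rutile",
    "re a rutana",
    "ha ba le rutisise",
    "morutehi",
    "thuto",
    "moithuti",
]

ROOT = "rut"             # the root as given in the text
NASALIZED_ROOT = "thut"  # assumed surface spelling after nasalization


def surface_root(word):
    """Return the root variant spelled inside the word, or None if absent."""
    for variant in (ROOT, NASALIZED_ROOT):
        if variant in word:
            return variant
    return None


for word in WORDS:
    print(f"{word:20} -> {surface_root(word)}")
```

The first five words show the plain root and the last two show the nasalized variant, which matches the observation that the root is treated as unchanged even when its surface form differs.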

There can be no doubt that words never emerged simply as roots. The root is a dead thing — the study of roots is primarily to aid the compilation of dictionaries, to further the study of comparative Bantu linguistics, and to help trace the evolution and connections of different languages. Many roots are shared by a wide range of Bantu languages. [2]

Some further examples of roots:

Note that although it is often true that the common root of a number of words may be defined as having some inherent meaning, very often the connection between words sharing common roots is tentative, and this is further evidence that prefix-less noun roots and stems are ultimately meaningless. Roots from a common source help to connect nouns with certain meanings, and often the class prefixes are merely incidental.


Stems are not much different from roots, and the difference between them is fairly arbitrary. Though all roots are also stems, stems often include derivational suffixes, which roots never include. Additionally, the ending [ɑ] -a is included in the verb stem but not in the root (if it were truly part of the core root, it would not be replaced in verb derivations and conjugations).

For example, from the verb root [ʀɑʀ] -rar- one may derive several words, including the following:

[hʊʀɑʀɑ] ho rara ('to entangle')
[mʊʀɑʀɑ] morara (nom. 3) ('grapes')
[lɪʀɑʀɑ] lerara (nom. 5) ('a single grape')
[hʊʀɑʀɑbʊl̩lɑ] ho rarabolla ('to solve')
[hʊʀɑʀɑhɑnɑ] ho rarahana (ass. vb.) ('to be entangled together')
[hʊʀɑʀɑhɑnɛlɑ] ho rarahanela (app. ass. vb.) ('to spiral')
[hʊʀɑʀɑnɑ] ho rarana (recip. vb.) ('to entangle each other')
[mɑʀɑʀɑnɛ] mararane (nom. rel.) ('entangled')
[hʊʀɑʀɛlɑ] ho rarela (app. vb.) ('to twist')
[hʊʀɑʀʊl̩lɑ] ho rarolla (rev. vb.) ('to untangle')
[tʰɑʀʊl̩lɔ] tharollo (nom. 9; pl. 10 [di] di-) ('solution')

These may all be listed under the same headword in a dictionary.

Note how, in the above example, not only do many of the words have slightly unexpected or expanded meanings, but the form [hʊʀɑʀɑbʊl̩lɑ] ho rarabolla uses an irregular derivation pattern.


Prefixes are affixes attached to the fronts of words (noun class prefixes are called such by convention, even though bare roots are not independent words). These are distinct from concords, since changing the prefix of a word may radically alter its meaning, while changing the concord attached to a stem does not change that stem's meaning.

[kʼɪlɪnɑnɛ'ɔ] Ke lenaneo ('it is a programme')


Concords are similar to prefixes in that they appear before the word stem. Verbs and qualificatives used to describe a noun are brought into agreement with that noun by using the appropriate concords.

There are seven basic types of concords in Sesotho. In addition, there are two immutable prefixes used with verbs that function similarly to concords.

[bɑt͡ɬʼa'ɪʀɑlɑ] Ba tla e rala ('they shall design it')


Suffixes appear at the ends of words. There are numerous suffixes in Sesotho serving varied functions. For example, verbs may be derived from other verbs through the employment of several verbal suffixes. Diminutives, augmentatives, and locatives may all be derived from nouns through the use of several suffixes. Most suffixes, except the noun locative suffix and verb inflexional suffixes, are derivational and create new stems.

Strictly speaking the final vowel -a in verb stems is a suffix, as it is often regularly replaced by other vowels in the derivation and inflexion of verbs and nouns.

[hɑ'ɑ'ɑbu'ɑɲeweŋ̩] Ha a a bua nyeweng ('she did not speak at the court trial')


Verbal auxiliaries are not to be confused with auxiliary verbs or deficient verbs. They may appear as prefixes or as infixes. [4] Basically, all formatives that may be affixed to the verb root, excluding suffixes and the objectival and subjectival concords, are verbal auxiliaries.

These include prefixes such as ha- used to negate verbs, and infixes such as -ka- used to form potential tenses.

The infix -a- used to form the past subjunctive (not to be confused with the infix -a- used to form the present indicative positive and the perfect indicative negative; and also used as a "focus marker") merges with the subjectival concord resulting in what is often termed the "auxiliary concord."

[kʼɪɑt͡ɬʼɑ] Ke a tla ('I am coming')
[hɑkʼɪnot͡ɬʼɑ] Ha ke no tla ('I shall not come')

Infix verbal auxiliaries may be further divided into simple infixes and verbal infixes. The main difference lies in the fact that, when forming the relative construction (participial sub-mood) of a verbal complex employing the infix, the verbal infixes may be detached from the main verb and carry the -ng suffix with the main verb converted to an infinitive object, [5] while a verb using a simple infix has to carry the suffix itself.

Ba ka bona ('they might see') [bɑkʼɑbɔnɑ] (simple infix used) ⇒ Ba ka bonang ('those who might see') [bɑkʼɑbɔnɑŋ̩]
Ba tla bona ('they shall see') [bɑt͡ɬʼɑbɔnɑ] (verbal infix used) ⇒ Ba tlang ho bona ('those who shall see') [bɑt͡ɬʼɑŋ̩hʊbɔnɑ]
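The contrast between the two kinds of infix in the relative construction can be sketched as a small Python function. This is a minimal illustration covering only the pattern of the two examples above; the function name `relative_form` and its parameters are hypothetical, and it works on written forms only, ignoring tone and any other phonological adjustment.

```python
def relative_form(concord, infix, verb, verbal_infix):
    """Build the relative (participial) form of a concord + infix + verb complex.

    verbal_infix=True:  the infix detaches, carries -ng, and the main verb
                        follows as an infinitive introduced by ho.
    verbal_infix=False: the main verb itself carries the -ng suffix.
    """
    if verbal_infix:
        return f"{concord} {infix}ng ho {verb}"
    return f"{concord} {infix} {verb}ng"


# Simple infix -ka-: the verb carries the suffix.
print(relative_form("Ba", "ka", "bona", verbal_infix=False))

# Verbal infix -tla-: the infix carries the suffix.
print(relative_form("Ba", "tla", "bona", verbal_infix=True))
```

The two calls reproduce Ba ka bonang and Ba tlang ho bona respectively. Whether a given infix is simple or verbal has to be supplied by the caller, since the text does not give a rule for predicting it.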


Enclitics (leaning-on words) are usually suffixed to verbs and convey a definite meaning. They were probably once separate words.

They may be divided into two categories: those that draw forward the stress (as normal suffixes), and those that don't alter the word's stress. The second type may result in words that don't have the stress on the penult (as is usual with Sesotho words).

Ha a sa le yo ('he is no longer there') [hɑˈɑsɑlɪjɔ] (stress on the penult)
Thola bo! ('please keep quiet!') [ˈtʰʊlɑbo] (stress on the antepenultimate syllable)


Proclitics are clitics that appear at the fronts of words. There is only one regular proclitic in Sesotho, le-, which is normally prefixed to nouns, pronouns, qualificatives, and adverbs as a conjunction, to convey the same meaning as English "and" when used between substantives. Some Indo-European languages have a post-clitic with a similar meaning (for example Latin -que [6] and Sanskrit -ca).

It may also be used to express the idea of "together with" and "even."

[n̩tʼɑtʼelɪm̩mɛ] Ntate le mme ('my father and mother')
[kʼɪkʼɔpʼɑnɪlɪjɛnɑ] Ke kopane le yena ('I met with her')
[lɪbɔnɑhɑbɑxolʷɪ] Le bona ha ba kgolwe ('Even they do not believe')

There are also a number of curious utterances where the proclitic is used to express emphatic negatives.

[lɪxɑlɛ] Le kgale ('Never', lit. 'And a long time')
[lɪlɪtʰɔ] Le letho ('Nothing', lit. 'And something')
[lɪhʊkʼɑ] Le ho ka ('Never', lit. 'And to be able')

This is similar to the use of the Latin "et" ('and') to mean "even" or "not", as in the supposed last words of Caesar – "Et tu, Brute?" meaning "Not (or even) you Brutus?".

The Sesotho word

The Sotho language is spoken conjunctively yet written disjunctively (that is, the spoken phonological words are not the same as the written orthographical words). [7] In the following discussion, the natural conjunctive word division will be indicated by joining the disjunctive elements with the symbol • in the Sesotho and the English translation.

ex:
[bɑtʰʊbɑlɪlɑpʼɑlɑhɑ'ɛbɑ'ɑmʊ'ɑɬʊlɑ]

Batho ba•lelapa la•hae ba•a•mo•ahlola

people of•family of•his they•judge•him

'His family members judge him'
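The difference between the conjunctive (spoken) and disjunctive (written) word division can be made concrete with a short Python sketch. It assumes only the convention stated above, namely that • joins elements that are written apart but belong to one phonological word; the function names are hypothetical.

```python
SENTENCE = "Batho ba•lelapa la•hae ba•a•mo•ahlola"


def phonological_words(sentence):
    """Words as spoken: elements joined by the bullet stay together."""
    return sentence.split()


def orthographic_words(sentence):
    """Words as written: every bullet becomes an ordinary space."""
    return sentence.replace("•", " ").split()


def disjunctive_spelling(sentence):
    """The sentence as it would appear in the disjunctive orthography."""
    return " ".join(orthographic_words(sentence))


print(len(phonological_words(SENTENCE)))  # 4
print(len(orthographic_words(SENTENCE)))  # 9
print(disjunctive_spelling(SENTENCE))
```

The same sentence therefore counts as four words when spoken but nine when written, which is the mismatch the bullet notation is meant to expose.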


Certain observations about the Sesotho word (and those of many other Bantu languages in general) may be made:

Not counting compounds and contractions, the word begins with zero or more proclitics, infixes, [4] and prefixes, followed by a stem, followed by zero or more suffixes (which extend the stem) and enclitics.

For example, in the word [kʼɪ'ɑliˌdumedisɑ] Ke•a•le•dumedisa ('I•greet•you[pl]') the stem is the verb stem [dumɛlɑ] -dumel(a) ('agree') surrounded by the subjectival concord [kʼɪ] ke- (first person singular), the present definite positive indicative infix marker [ɑ] -a-, the objectival concord [lɪ] -le- (second person plural), and the verb extension [isɑ] -isa (causative, but in this case it gives the idiomatic meaning of "greet").
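The word template described above (material before the stem, one stem, then suffixes and enclitics) can be sketched as a simple ordering check in Python. This is an illustrative model only: the class `Formative`, the category labels, and the function `is_well_formed` are hypothetical, concords are assumed to pattern with prefixes, and the `form` values are underlying forms, so joining them would not reproduce surface changes such as dumel- + -isa giving dumedisa.

```python
from dataclasses import dataclass

# Assumed category labels; concords are grouped with the pre-stem formatives.
BEFORE_STEM = {"proclitic", "prefix", "concord", "infix"}
AFTER_STEM = {"suffix", "enclitic"}


@dataclass(frozen=True)
class Formative:
    form: str  # underlying written form
    kind: str  # one of the labels above, or "stem"


def is_well_formed(formatives):
    """Check for exactly one stem with pre-stem items before and post-stem items after."""
    kinds = [f.kind for f in formatives]
    if kinds.count("stem") != 1:
        return False
    stem_at = kinds.index("stem")
    return (all(k in BEFORE_STEM for k in kinds[:stem_at])
            and all(k in AFTER_STEM for k in kinds[stem_at + 1:]))


# Ke•a•le•dumedisa, segmented as in the paragraph above.
KE_A_LE_DUMEDISA = [
    Formative("ke", "concord"),  # subjectival concord
    Formative("a", "infix"),     # present indicative positive marker
    Formative("le", "concord"),  # objectival concord
    Formative("dumel", "stem"),  # verb stem
    Formative("isa", "suffix"),  # causative extension
]

print(is_well_formed(KE_A_LE_DUMEDISA))
```

Reversing the list, or adding a second stem, makes the check fail, which mirrors the statement that compounds and contractions fall outside this template.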

The phonological interactions can be quite complex:

[ʊ'ɑm̩pʼon̩t͡sʰɑ]O•a•mpontsha ('he•shows•me') subject concord [ʊ]o- + present indicative positive marker [ɑ]-a- + objectival concord -N- + verb stem [bɔn]-bon(a) (see) + causative extension [isɑ]-isa

Here the formatives are distorted by two instances of nasalization.

No matter how many prefixes, suffixes, enclitics, and proclitics are appended to the word stem the complete word only has one main stressed syllable. This stress is most prominent on the final word in the sentence or "prosodic phrase." [8]

ex:
[hɑʀɪ'ɑxɔnɑhʊmʊ'elet͡sʼɑhʊbɑnɪʊneɑlɪmɑŋɑŋɑ]

Ha•re•a•kgona ho•mo•eletsa hobane o•ne a•le manganga

we•failed to•advise•him because he•PAST he•COPULATIVE stubborn

'We failed to advise him because he was stubborn'

ex:
[ʀɪt͡ɬʼɑjɑhɑʊt͡ʃʰɔ]

Re•tla•ya ha o•tjho

we•shall•go if you•say.so

Note the monosyllabic conjunctive [hɑ]ha.


Note that, unlike the Nguni languages, Sesotho does not have rules against juxtaposing strings of vowels:

[hɑ'ɑ'ɑpʼɑʀɑ]Ha•a•a•apara ('he•is•not•dressed') although the sequence [ɑ'ɑ]-a•a- (class 1 negative subjectival concord followed by present definite positive indicative marker) is usually pronounced as a long [ɑ] with a high falling tone, or simply as a short high tone.

Certain situations may make the word division complex. This can happen with contractions (especially with deficient verb constructions), and in some complex verb conjugations. In all these situations, however, each proper word has exactly one main stressed syllable.

Parts of speech

Each complete Sesotho word belongs to some part of speech.

In form, some parts of speech (adjectives, enumeratives, some relatives, and all verbs) are radical stems, which need affixes to form meaningful words; others (possessives and copulatives) are formed from full words by the employment of certain formatives; the rest (nouns, pronouns, adverbs, ideophones, conjunctives, and interjectives) are complete words themselves, which may or may not be modified with affixes to form new words.

The difference between the four types of qualificatives is merely in the concords used to associate them with the noun or pronoun they qualify. Since the simplest copulatives do not use any verbs whatsoever (zero copula), entire predicative sentences in Sesotho may be formed without the use of verbs.

Notes

  1. Bantuists do it with multiple appendages.
  2. Including the root *-ntu whence the name "Bantu languages" comes. Current work on Proto-Bantu has it that no true roots began with prenasalized consonants, and that the form of this root was actually *-jîntu, as in *mu-jîntu and *ba-jîntu.
  3. Although Westerners have historically tended to believe that African religions are polytheistic, the plural of this word, [miˌdimʊ] medimo, was specifically invented by Christian missionaries to aid in translating the Bible (which regularly speaks of "gods", a concept foreign to Sesotho ATR). Additionally, the noun is traditionally in class 1, but is used in class 3 by Christians and the Bible. There is not, and has never been, any confusion among Basotho that the class 2 [bɑdimʊ] Badimo may be the plural of the class 1 [muˌdimʊ] Modimo since, in the same way that [muˌdimʊ] Modimo was never used in the plural, [bɑdimʊ] Badimo is never used in the singular (an ancestor is referred to as "one of the ancestors").
  4. The use of this term in Bantu linguistics means "formatives placed in the middle of a word" and not the more common "formatives placed in the middle of a morpheme." Bantu languages, being agglutinative, construct words by placing affixes around a stem, and if an affix is always placed after other affixes but before the stem (such as in the verbal complex) then it is usually called an "infix."
  5. This is exactly the same as the behaviour of deficient verbs, and it is very likely that these infixes are grammaticalized contractions using originally Group VI deficient verbs. Additionally, in the negative (and sometimes in the positive) these infixes change to a form ending in the vowel /o/, which obviously comes from some coalescence with the vowel /ʊ/ (in the infinitive prefix ho-) and the vowel of the original deficient verb (/ɛ/ or /ɑ/ in the positive, and /ɪ/ in the negative). A possible (pre-contraction and grammaticalization) example would be:
    (pre-)Proto-Sotho–Tswana *kɪt͡ɬɑxʊdʒɑ ('I come to/shall eat'), *xɑkɪt͡ɬɪxʊdʒɑ ('I do not come to/shall not eat'),
    which in modern Sesotho appear as
    [kʼɪt͡ɬʼɑʒɑ]Ke tla ja, and [hɑkʼɪt͡ɬʼoʒɑ]Ha ke tlo ja
  6. Senatus Populusque Romanus.
  7. This is a common situation in many (written) Bantu languages, as their orthographies were invented by Europeans who spoke isolating languages. Notice how the class 15 prefix ho- is written separated from the verb stem (contrary to how the other class prefixes are indicated) because this is how infinitives are indicated in their languages. IsiZulu and other Nguni languages are written conjunctively, primarily due to the efforts of Doke and others. Consider the following example:
    [kʼɪt͡ɬʼɑ'uˌtʰusɑ]Ke tla o thusa
    I•FUT.+VE.INDIC•you•help
    'I will help you'
    This would be Ngizakusiza in isiZulu. The English free morphemes may usually be moved around to make valid statements, with some change in meaning:
    Help you I will
    Will I help you(?)
    But this is absolutely impossible to do with the Sesotho bound morphemes.
    *Thusa o ke tla
    *Tla ke o thusa
    When compared with other word division schemes, the orthographies used to write the non-Nguni South African languages are extremely disjunctive, since many Bantu language orthographies at least write the verbal complex (such as the example above) as a single orthographical word, but may write prefixes, concords, and clitics as separate words.
  8. Some researchers completely reject the notion that those Southern Bantu languages claimed to have word stress really do, and instead view it as phrasal stress (that is, the penultimate syllable in the prosodic phrase — not the word — is stressed). Although it is true that in normal speech it is usually the penultimate syllable of the prosodic phrase that is stressed, the existence of words with irregular stress patterns suggests that, in Sesotho at least, it is not entirely incorrect to say that stress is a lexical property of the word itself, not just the phrase, and that the word's inherent stress pattern is most prominent when the word is phrase-final.
