Affix grammar

An affix grammar is a kind of formal grammar used to describe the syntax of languages, mainly computer languages, using an approach based on how natural language is typically described.[1]

The grammatical rules of an affix grammar are those of a context-free grammar, except that certain parts of the nonterminals (the affixes) are used as arguments. If the same affix occurs multiple times in a rule, its values must agree, i.e. it must have the same value at every occurrence. In some types of affix grammar, more complex relationships between affix values are possible.

Example

We can describe an extremely simple fragment of English in the following manner:

Sentence → Subject Predicate
Subject → Noun
Predicate → Verb Object
Object → Noun
Noun → John
Noun → Mary
Noun → children
Noun → parents
Verb → like
Verb → likes
Verb → help
Verb → helps

This context-free grammar describes simple sentences such as

John likes children
Mary helps John
children help parents
parents like John

With more nouns and verbs, and more rules to introduce other parts of speech, a large range of English sentences can be described; so this is a promising approach for describing the syntax of English.

However, the given grammar also describes sentences such as

John like children
children helps parents

These sentences are wrong: in English, subject and verb have a grammatical number, which must agree.
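
The overgeneration can be checked mechanically. The following is a minimal sketch, not a general parser: because the rules above always expand Sentence to Noun Verb Noun, the hypothetical function `cfg_accepts` simply tests that shape against the word lists taken from the grammar.

```python
# Word lists transcribed from the context-free grammar in the text.
NOUNS = {"John", "Mary", "children", "parents"}
VERBS = {"like", "likes", "help", "helps"}


def cfg_accepts(sentence: str) -> bool:
    """Return True if the sentence has the shape Noun Verb Noun.

    This is the only shape the example grammar can derive:
    Sentence -> Subject Predicate -> Noun Verb Object -> Noun Verb Noun.
    """
    words = sentence.split()
    if len(words) != 3:
        return False
    subject, verb, obj = words
    return subject in NOUNS and verb in VERBS and obj in NOUNS


print(cfg_accepts("John likes children"))  # True
print(cfg_accepts("John like children"))   # True, although the sentence is wrong
```

Both calls succeed because nothing in the context-free rules ties the form of the verb to the number of the subject.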

An affix grammar can express this directly:

Sentence → Subject + number Predicate + number
Subject + number → Noun + number
Predicate + number → Verb + number Object
Object → Noun + number
Noun + singular → John
Noun + singular → Mary
Noun + plural → children
Noun + plural → parents
Verb + singular → likes
Verb + plural → like
Verb + singular → helps
Verb + plural → help
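
As an illustration, the agreement rule can be sketched in a few lines of Python. This is a hand-written check for this one grammar, under the assumption that every sentence has the shape Noun Verb Noun; the names `NOUN_NUMBER`, `VERB_NUMBER` and `affix_accepts` are hypothetical and not part of any affix grammar tool.

```python
# Lexical rules of the affix grammar: each word carries a value for the
# "number" affix.
NOUN_NUMBER = {
    "John": "singular",
    "Mary": "singular",
    "children": "plural",
    "parents": "plural",
}
VERB_NUMBER = {
    "likes": "singular",
    "like": "plural",
    "helps": "singular",
    "help": "plural",
}


def affix_accepts(sentence: str) -> bool:
    """Return True if the sentence is Noun Verb Noun with subject-verb agreement."""
    words = sentence.split()
    if len(words) != 3:
        return False
    subject, verb, obj = words
    if subject not in NOUN_NUMBER or verb not in VERB_NUMBER or obj not in NOUN_NUMBER:
        return False
    # Sentence -> Subject + number Predicate + number:
    # the affix "number" must have the same value at both occurrences.
    # The object's number is independent, as in the rule for Object.
    return NOUN_NUMBER[subject] == VERB_NUMBER[verb]


print(affix_accepts("John likes children"))  # True
print(affix_accepts("John like children"))   # False
```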

This grammar describes only correct English sentences, although it could be argued that

John likes John

is still incorrect and should instead read

John likes himself

This, too, can be incorporated using affixes, if the means of describing the relationships between different affix values are powerful enough. As remarked above, these means depend on the type of affix grammar chosen.

Types

In the simplest type of affix grammar, affixes can only take values from a finite domain, and affix values can only be related through agreement, as in the example. Applied in this way, affixes increase compactness of grammars, but do not add expressive power.
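
One way to see why finite-domain affixes add no expressive power is to expand every rule into one plain context-free rule per combination of affix values. The sketch below does this for the example grammar. The rule encoding, the `_value` naming of the expanded nonterminals, and the function `expand` are assumptions made for illustration only.

```python
from itertools import product

# Affix variables and their finite domains.
DOMAINS = {"number": ("singular", "plural")}

# A nonterminal is (name, affix); the affix is a variable from DOMAINS,
# a constant value, or None. A terminal is a plain string.
AFFIX_RULES = [
    (("Sentence", None), [("Subject", "number"), ("Predicate", "number")]),
    (("Subject", "number"), [("Noun", "number")]),
    (("Predicate", "number"), [("Verb", "number"), ("Object", None)]),
    (("Object", None), [("Noun", "number")]),
    (("Noun", "singular"), ["John"]),
    (("Noun", "singular"), ["Mary"]),
    (("Noun", "plural"), ["children"]),
    (("Noun", "plural"), ["parents"]),
    (("Verb", "singular"), ["likes"]),
    (("Verb", "plural"), ["like"]),
    (("Verb", "singular"), ["helps"]),
    (("Verb", "plural"), ["help"]),
]


def expand(rules, domains):
    """Instantiate each affix variable with every value of its domain."""
    cfg = set()
    for lhs, rhs in rules:
        nonterminals = [lhs] + [s for s in rhs if isinstance(s, tuple)]
        variables = sorted({affix for _, affix in nonterminals if affix in domains})
        for values in product(*(domains[v] for v in variables)):
            binding = dict(zip(variables, values))

            def plain(symbol):
                # Terminals stay as they are; nonterminals get the affix
                # value folded into their name.
                if isinstance(symbol, str):
                    return symbol
                name, affix = symbol
                if affix is None:
                    return name
                return f"{name}_{binding.get(affix, affix)}"

            cfg.add((plain(lhs), tuple(plain(s) for s in rhs)))
    return cfg


cfg_rules = expand(AFFIX_RULES, DOMAINS)
for left, right in sorted(cfg_rules):
    print(left, "→", " ".join(right))
```

The result is an ordinary context-free grammar with nonterminals such as `Subject_singular` and `Predicate_plural`. It is larger than the affix grammar but describes the same sentences, which is the sense in which affixes here improve compactness only.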

Another approach is to allow affixes to take arbitrary strings as values and allow concatenations of affixes to be used in rules. The ranges of allowable values for affixes can be described with context-free grammar rules. This produces the formalism of two-level grammars, also known as Van Wijngaarden grammars or 2VW grammars. These have been successfully used to describe complicated languages, in particular, the syntax of the Algol 68 programming language. However, it turns out that, even though affix values can only be manipulated with string concatenation, this formalism is Turing complete; hence, even the most basic questions about the language described by an arbitrary 2VW grammar are undecidable in general.

Extended Affix Grammars, developed in the 1980s, are a more restricted version of the same idea. They were mainly applied to describe the grammar of natural language, e.g. English.

Another possibility is to allow the values of affixes to be computed by code written in some programming language. Two basic approaches have been used.

References

  1. Koster, Cornelis H. A. "Affix grammars for natural languages." Attribute Grammars, Applications and Systems. Springer, Berlin, Heidelberg, 1991.