In linguistics, anaphora (/əˈnæfərə/) is the use of an expression whose interpretation depends upon another expression in context (its antecedent). In a narrower sense, anaphora is the use of an expression that depends specifically upon an antecedent expression and thus is contrasted with cataphora, which is the use of an expression that depends upon a postcedent expression. The anaphoric (referring) term is called an anaphor. For example, in the sentence "Sally arrived, but nobody saw her", the pronoun "her" is an anaphor, referring back to the antecedent "Sally". In the sentence "Before her arrival, nobody saw Sally", the pronoun "her" refers forward to the postcedent "Sally", so "her" is now a cataphor (and an anaphor in the broader, but not the narrower, sense). Usually, an anaphoric expression is a pro-form or some other kind of deictic (contextually dependent) expression. [1] Both anaphora and cataphora are species of endophora, referring to something mentioned elsewhere in a dialog or text.
Anaphora is an important concept for different reasons and on different levels: first, anaphora indicates how discourse is constructed and maintained; second, anaphora binds different syntactical elements together at the level of the sentence; third, anaphora presents a challenge to natural language processing in computational linguistics, since the identification of the reference can be difficult; and fourth, anaphora partially reveals how language is understood and processed, which is relevant to fields of linguistics interested in cognitive psychology. [2]
The term anaphora is used in two ways.
In a broad sense, it denotes the act of referring. Any time a given expression (e.g. a pro-form) refers to another contextual entity, anaphora is present.
In a second, narrower sense, the term anaphora denotes the act of referring backwards in a dialog or text; in languages that are written from left to right, the anaphor points to its left, toward its antecedent. Etymologically, anaphora derives from Ancient Greek ἀναφορά (anaphorá, "a carrying back"), from ἀνά (aná, "up") + φέρω (phérō, "I carry"). In this narrow sense, anaphora stands in contrast to cataphora, which denotes the act of referring forward in a dialog or text, or pointing to the right in languages that are written from left to right: Ancient Greek καταφορά (kataphorá, "a downward motion"), from κατά (katá, "downwards") + φέρω (phérō, "I carry"). A pro-form is a cataphor when it points to its right toward its postcedent. Both effects together are called either anaphora (in the broad sense) or, less ambiguously, endophora, a category that also includes self-reference. [3]
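The narrow-sense distinction comes down to linear order, which a few lines of Python can illustrate. This is a minimal sketch, not a resolution algorithm: the function names `tokenize` and `reference_direction` are hypothetical, and the link between the pro-form and its referent is supplied by hand rather than discovered.

```python
import string


def tokenize(sentence):
    """Split on whitespace and strip surrounding punctuation from each token."""
    return [word.strip(string.punctuation) for word in sentence.split()]


def reference_direction(sentence, pro_form, referent):
    """Label a pro-form by whether its referent precedes or follows it.

    Assumption: the caller already knows which expression the pro-form
    depends on; only the first occurrence of each word is considered.
    """
    tokens = tokenize(sentence)
    pro_index = tokens.index(pro_form)   # raises ValueError if absent
    ref_index = tokens.index(referent)   # raises ValueError if absent
    # Referent to the left: anaphor (narrow sense). To the right: cataphor.
    return "anaphor" if ref_index < pro_index else "cataphor"


backward = reference_direction("Sally arrived, but nobody saw her", "her", "Sally")
forward = reference_direction("Before her arrival, nobody saw Sally", "her", "Sally")
print(backward, forward)
```

In the broad sense of the term, both results would simply count as anaphora.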
Examples of anaphora (in the narrow sense) and cataphora are given next. Anaphors and cataphors appear in bold, and their antecedents and postcedents are underlined:
A further distinction is drawn between endophoric and exophoric reference. Exophoric reference occurs when an expression, an exophor, refers to something that is not directly present in the linguistic context, but is rather present in the situational context. Deictic pro-forms are stereotypical exophors, e.g.
Exophors cannot be anaphors, as they do not substantially refer within the dialog or text. There is, however, a question of which portions of a conversation or document a listener or reader actually accesses, and thus of whether all the references to which a term points within that language stream are noticed. If you hear only a fragment of what someone says using the pronoun her, you might never discover who she is. If you heard the rest of what the speaker said on the same occasion, you might discover who she is, either by anaphoric revelation or by exophoric implication: you realize who she must be from what else is said about her, even if her identity is not explicitly mentioned, as in the case of homophoric reference.
A listener might, for example, realize from other clauses and sentences that she is a queen because of some of her attributes or actions that are mentioned. But which queen? Homophoric reference occurs when a generic phrase obtains a specific meaning through knowledge of its context. For example, the referent of the phrase the Queen (using an emphatic definite article, not the less specific a Queen, but also not the more specific Queen Elizabeth) must be determined by the context of the utterance, which establishes the identity of the queen in question. Until more is revealed by additional contextual words, gestures, images or other media, a listener would not even know which monarchy or historical period is being discussed. Even after hearing that her name is Elizabeth, and even if an English or UK Queen Elizabeth is indicated, the listener does not know whether Queen Elizabeth I or Queen Elizabeth II is meant, and must await further clues in additional communications. Similarly, in discussing 'The Mayor' (of a city), the Mayor's identity must be understood broadly from the context that the speech references as the general 'object' of understanding: is a particular human person meant, a current, future or past office-holder, the office in a strict legal sense, or the office in a general sense that includes activities a mayor might conduct, or might even be expected to conduct, although they are not explicitly defined for this office?
The term anaphor is used in a special way in the generative grammar tradition. Here it denotes what would normally be called a reflexive or reciprocal pronoun, such as himself or each other in English, and analogous forms in other languages. The use of the term anaphor in this narrow sense is unique to generative grammar, and in particular, to the traditional binding theory. [4] This theory investigates the syntactic relationship that can or must hold between a given pro-form and its antecedent (or postcedent). In this respect, anaphors (reflexive and reciprocal pronouns) behave very differently from, for instance, personal pronouns. [5]
In some cases, an anaphor may refer not to its usual antecedent, but to its complement set. In the following example a, the anaphoric pronoun they refers to the children who are eating the ice-cream. By contrast, in example b, they seems to refer to the children who are not eating ice-cream:
In its narrower definition, an anaphoric pronoun must refer to some noun (phrase) that has already been introduced into the discourse. In complement anaphora cases, however, the anaphor refers to something that is not yet present in the discourse, since the pronoun's referent has not previously been introduced; this includes the case of 'everything but' what has been introduced. The set of ice-cream-eating-children in example b is introduced into the discourse, but the pronoun they then refers to the set of non-ice-cream-eating-children, a set which has not been explicitly mentioned. [7]
Both semantic and pragmatic considerations attend this phenomenon. Following work in discourse representation theory since the early 1980s, such as that of Kamp (1981) and Heim (File Change Semantics, 1982), and in generalized quantifier theory, such as that of Barwise and Cooper (1981), it was studied in a series of psycholinguistic experiments in the early 1990s by Moxey and Sanford (1993) and Sanford et al. (1994). [6] [8] In complement anaphora, as in the case of the pronoun in example b, the anaphor refers to some sort of complement set (i.e. only to the set of non-ice-cream-eating-children), to the maximal set (i.e. to all the children, both ice-cream-eating-children and non-ice-cream-eating-children), or to some hybrid or variant set, including potentially one of those noted to the right of example b. The various possible referents in complement anaphora are discussed by Corblin (1996), Kibble (1997), and Nouwen (2003). [7] Resolving complement anaphora is of interest for the light it sheds on brain access to information, calculation, mental modeling, and communication. [9] [10]
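The candidate referents discussed here can be pictured with ordinary set arithmetic. The sketch below is only an illustration under stated assumptions: the children's names are invented, and `candidate_referents` is a hypothetical helper, not a model taken from the cited literature.

```python
def candidate_referents(domain, reference_set):
    """Return the sets a plural pronoun such as 'they' might pick out.

    domain        -- every entity the quantified phrase ranges over
    reference_set -- the subset the sentence explicitly says something about
    """
    maximal = frozenset(domain)
    reference = frozenset(reference_set)
    if not reference <= maximal:
        raise ValueError("reference set must be drawn from the domain")
    return {
        "reference": reference,              # e.g. the ice-cream eaters
        "complement": maximal - reference,   # e.g. the non-eaters
        "maximal": maximal,                  # all the children
    }


# Hypothetical data: four children, two of whom are eating ice-cream.
children = {"Ana", "Ben", "Cleo", "Dev"}
eating_ice_cream = {"Ana", "Ben"}
sets = candidate_referents(children, eating_ice_cream)
print(sorted(sets["complement"]))
```

The complement set is never mentioned in the input; it is derived from the other two, which mirrors the point that the pronoun's referent has not been explicitly introduced into the discourse.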
Many theories attempt to explain how anaphors relate to and are traced back to their antecedents; centering theory (Grosz, Joshi, and Weinstein 1983) is one of them. Taking the computational theory of mind view of language, centering theory gives a computational analysis of underlying antecedents. In their original theory, Grosz, Joshi, and Weinstein (1983) propose that some discourse entities in utterances are more "central" than others, and that this degree of centrality imposes constraints on what can be the antecedent.
In the theory, there are different types of centers: forward facing, backwards facing, and preferred.
Forward facing centers: a ranked list of the discourse entities in an utterance. The ranking is debated, with some accounts focusing on theta relations (Yıldırım et al. 2004) and others providing definitive lists.
Backwards facing center: the highest ranked discourse entity of the previous utterance that is realised in the current utterance.
Preferred center: the highest ranked discourse entity in the current utterance.
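The three kinds of center can be computed mechanically once each utterance has a ranked entity list. The sketch below follows the formulation common in the centering literature: the preferred center is the top-ranked entity of the current utterance, and the backward-looking center is the top-ranked entity of the previous utterance that is realised in the current one. The two-utterance discourse and its subject-before-object ranking are assumptions made for illustration, since the ranking itself is debated.

```python
def preferred_center(forward_centers):
    """Top-ranked entity of an utterance's own ranked list, or None if empty."""
    return forward_centers[0] if forward_centers else None


def backward_center(previous_forward_centers, current_forward_centers):
    """Top-ranked entity of the previous utterance realised in the current one."""
    realised = set(current_forward_centers)
    for entity in previous_forward_centers:  # already in rank order
        if entity in realised:
            return entity
    return None  # no entity carries over, e.g. at the start of a discourse


# Hypothetical discourse, with pronouns already mapped to entities:
#   U1: "Sally greeted Tom."   U2: "He waved at her."
cf_u1 = ["Sally", "Tom"]   # assumed ranking: subject before object
cf_u2 = ["Tom", "Sally"]

cp_u2 = preferred_center(cf_u2)
cb_u2 = backward_center(cf_u1, cf_u2)
print(cp_u2, cb_u2)
```

Here the preferred and backward-looking centers of the second utterance differ, the kind of configuration that centering accounts treat as a shift in what the discourse is about.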
In linguistics and grammar, a pronoun is a word or a group of words that one may substitute for a noun or noun phrase.
In linguistics, deixis is the use of words or phrases to refer to a particular time, place, or person relative to the context of the utterance. Deixis exists in all known natural languages and is closely related to anaphora, with a sometimes unclear distinction between the two. In linguistic anthropology, deixis is seen as the same as, or a subclass of, indexicality.
In grammar, an antecedent is one or more words that establish the meaning of a pronoun or other pro-form. For example, in the sentence "John arrived late because traffic held him up," the word "John" is the antecedent of the pronoun "him." Pro-forms usually follow their antecedents, but sometimes precede them. In the latter case, the more accurate term would technically be postcedent, although this term is not commonly distinguished from antecedent because the definition of antecedent usually encompasses it. The linguistic term that is closely related to antecedent and pro-form is anaphora. Theories of syntax explore the distinction between antecedents and postcedents in terms of binding.
In linguistics, binding is the phenomenon in which anaphoric elements such as pronouns are grammatically associated with their antecedents. For instance, in the English sentence "Mary saw herself", the anaphor "herself" is bound by its antecedent "Mary". Binding can be licensed or blocked in certain contexts or syntactic configurations, e.g. the pronoun "her" cannot be bound by "Mary" in the English sentence "Mary saw her". While all languages have binding, restrictions on it vary even among closely related languages. Binding has been a major area of research in syntax and semantics since the 1970s and, as the name implies, is a core component of government and binding theory.
In linguistics, coreference, sometimes written co-reference, occurs when two or more expressions refer to the same person or thing; they have the same referent. For example, in Bill said Alice would arrive soon, and she did, the words Alice and she refer to the same person.
In generative grammar and related frameworks, a node in a parse tree c-commands its sister node and all of its sister's descendants. In these frameworks, c-command plays a central role in defining and constraining operations such as syntactic movement, binding, and scope. Tanya Reinhart introduced c-command in 1976 as a key component of her theory of anaphora. The term is short for "constituent command".
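The definition of c-command given here translates directly into a tree check. The following is a minimal sketch under assumptions: the `Node` class, the `c_commands` function, and the simplified bracketing of "Mary saw herself" are illustrative and not drawn from any particular parser or framework.

```python
class Node:
    """A parse-tree node that records its parent for upward lookups."""

    def __init__(self, label, children=()):
        self.label = label
        self.children = list(children)
        self.parent = None
        for child in self.children:
            child.parent = self

    def descendants(self):
        """Yield every node below this one, depth first."""
        for child in self.children:
            yield child
            yield from child.descendants()


def c_commands(a, b):
    """True if b is a sister of a or a descendant of one of a's sisters."""
    if a.parent is None:
        return False  # the root has no sisters
    for sister in a.parent.children:
        if sister is a:
            continue
        if sister is b or any(node is b for node in sister.descendants()):
            return True
    return False


# Assumed simplified structure: [S [NP Mary] [VP [V saw] [NP herself]]]
subject = Node("NP: Mary")
verb = Node("V: saw")
reflexive = Node("NP: herself")
verb_phrase = Node("VP", [verb, reflexive])
sentence = Node("S", [subject, verb_phrase])

print(c_commands(subject, reflexive), c_commands(reflexive, subject))
```

In this structure the subject c-commands the reflexive but not the reverse, an asymmetry that binding accounts rely on.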
In linguistics and philosophy, a presupposition is an implicit assumption about the world or background belief relating to an utterance whose truth is taken for granted in discourse. Examples of presuppositions include:
In formal linguistics, discourse representation theory (DRT) is a framework for exploring meaning under a formal semantics approach. One of the main differences between DRT-style approaches and traditional Montagovian approaches is that DRT includes a level of abstract mental representations within its formalism, which gives it an intrinsic ability to handle meaning across sentence boundaries. DRT was created by Hans Kamp in 1981. A very similar theory was developed independently by Irene Heim in 1982, under the name of File Change Semantics (FCS). Discourse representation theories have been used to implement semantic parsers and natural language understanding systems.
In linguistics, cataphora is the use of an expression or word that co-refers with a later, more specific expression in the discourse. The preceding expression, whose meaning is determined or specified by the later expression, may be called a cataphor. Cataphora is a type of anaphora, although the terms anaphora and anaphor are sometimes used in a stricter sense, denoting only cases where the order of the expressions is the reverse of that found in cataphora.
Cohesion is the grammatical and lexical linking within a text or sentence that holds a text together and gives it meaning. It is related to the broader concept of coherence.
A reciprocal pronoun is a pronoun that indicates a reciprocal relationship. A reciprocal pronoun can be used for one of the participants of a reciprocal construction, i.e. a clause in which two participants are in a mutual relationship. The reciprocal pronouns of English are one another and each other, and they form the category of anaphors along with reflexive pronouns.
In linguistics, locality refers to the proximity of elements in a linguistic structure. Constraints on locality limit the span over which rules can apply to a particular structure. Theories of transformational grammar use syntactic locality constraints to explain restrictions on argument selection, syntactic binding, and syntactic movement.
In semantics, a donkey sentence is a sentence containing a pronoun which is semantically bound but syntactically free. Donkey sentences are a classic puzzle in formal semantics and philosophy of language because they are fully grammatical and yet defy straightforward attempts to generate their formal language equivalents. In order to explain how speakers are able to understand them, semanticists have proposed a variety of formalisms, including systems of dynamic semantics such as discourse representation theory. Their name comes from the example sentence "Every farmer who owns a donkey beats it", in which "it" acts as a donkey pronoun because it is semantically but not syntactically bound by the indefinite noun phrase "a donkey". The phenomenon is known as donkey anaphora.
A bound variable pronoun is a pronoun that has a quantified determiner phrase (DP) – such as every, some, or who – as its antecedent.
In linguistics, sloppy identity is an interpretive property found with verb phrase ellipsis, in which the pronoun in the elided VP is interpreted differently from the pronoun in the antecedent VP.
Logophoricity is a binding phenomenon that may employ a morphologically different set of anaphoric forms in contexts where the referent is an entity whose speech, thoughts, or feelings are being reported. This entity may or may not be distant from the discourse, but the referent must reside in a clause external to the one in which the logophor resides. The specially formed anaphors that are morphologically distinct from the typical pronouns of a language are known as logophoric pronouns, a term coined by the linguist Claude Hagège. The linguistic importance of logophoricity lies in its capacity to remove ambiguity as to who is being referred to. A crucial element of logophoricity is the logophoric context, defined as the environment where use of logophoric pronouns is possible. Several syntactic and semantic accounts have been suggested. While some languages may not be purely logophoric, logophoric context may still be found in those languages; in those cases, it is common to find that in the place where logophoric pronouns would typically occur, non-clause-bounded reflexive pronouns appear instead.
The nearest referent is a grammatical term sometimes used when two or more possible referents of a pronoun, or other part of speech, cause ambiguity in a text. However, nearness (proximity) may not be the most meaningful criterion for a decision, particularly where word order, inflection and other aspects of syntax are more relevant.
An anaphoric macro is a type of programming macro that deliberately captures some form supplied to the macro which may be referred to by an anaphor. Anaphoric macros first appeared in Paul Graham's On Lisp and their name is a reference to linguistic anaphora—the use of words as a substitute for preceding words.
In linguistics, givenness is a phenomenon in which a speaker assumes that contextual information of a topic of discourse is already known to the listener. The speaker thus considers it unnecessary to supply further contextual information through an expression's linguistic properties, its syntactic form or position, or its patterns of stress and intonation. Givenness involves contextual information in a discourse that is given, or assumed to be known, by the addressee in the moment of utterance. Therefore, a given expression must be known from prior discourse.
Gregory Ward is an American linguist, academic and researcher. He is Professor of Linguistics, Gender & Sexuality Studies and, by courtesy, Philosophy at Northwestern University.