In linguistics, binding is the phenomenon in which anaphoric elements such as pronouns are grammatically associated with their antecedents.[ citation needed ] For instance in the English sentence "Mary saw herself", the anaphor "herself" is bound by its antecedent "Mary". Binding can be licensed or blocked in certain contexts or syntactic configurations, e.g. the pronoun "her" cannot be bound by "Mary" in the English sentence "Mary saw her". While all languages have binding, restrictions on it vary even among closely related languages. Binding has been a major area of research in syntax and semantics since the 1970s and, as the name implies, is a core component of government and binding theory. [1]
The following sentences illustrate some basic facts of binding. The words that bear the index i should be construed as referring to the same person or thing. [2]
These sentences illustrate some aspects of the distribution of reflexive and personal pronouns. In the first pair of sentences, the reflexive pronoun must appear for the indicated reading to be possible. In the second pair, the personal pronoun must appear for the indicated reading to be possible. The third pair shows that at times a personal pronoun must follow its antecedent, and the fourth pair further illustrates the same point, although the acceptability judgement is not as robust. Based on such data, one sees that reflexive and personal pronouns differ in their distribution and that linear order (of a pronoun in relation to its antecedent or postcedent) is a factor influencing where at least some pronouns can appear. A theory of binding should be capable of predicting and explaining the differences in distribution seen in sentences like these. It should be able to answer questions like: What explains where a reflexive pronoun must appear as opposed to a personal pronoun? When does linear order play a role in determining where pronouns can appear? What other factor (or factors) beyond linear order help predict where pronouns can appear?
The following three subsections consider the binding domains that are relevant for the distribution of pronouns and nouns in English. The discussion follows the outline provided by the traditional binding theory (see below), which divides nominals into three basic categories: reflexive and reciprocal pronouns, personal pronouns, and nouns (common and proper). [3]
When one examines the distribution of reflexive pronouns and reciprocal pronouns (which are often subsumed under the general category of "anaphor"), one sees that there are certain domains that are relevant, a "domain" being a syntactic unit that is clause-like. Reflexive and reciprocal pronouns often seek their antecedent close by, in a binding domain that is local, e.g.
These examples illustrate that there is a domain within which a reflexive or reciprocal pronoun should find its antecedent. The a-sentences are fine because the reflexive or reciprocal pronoun has its antecedent within the clause. The b-sentences, in contrast, do not allow the indicated reading, a fact illustrating that personal pronouns have a distribution that is different from that of reflexive and reciprocal pronouns. A related observation is that a reflexive and reciprocal pronoun often cannot seek its antecedent in a superordinate clause, e.g.
When the reflexive or reciprocal pronoun attempts to find an antecedent outside of the immediate clause containing it, it fails. In other words, it can hardly seek its antecedent in the superordinate clause. The binding domain that is relevant is the immediate clause containing it.
Personal pronouns have a distribution that is different from reflexive and reciprocal pronouns, a point that is evident with the first two b-sentences in the previous section. The local binding domain that is decisive for the distribution of reflexive and reciprocal pronouns is also decisive for personal pronouns, but in a different way. Personal pronouns seek their antecedent outside of the local binding domain containing them, e.g.
In these cases, the pronoun has to look outside of the embedded clause containing it to the matrix clause to find its antecedent. Hence based on such data, the relevant binding domain appears to be the clause. Further data illustrate, however, that the clause is actually not the relevant domain:
Since the pronouns appear within the same minimal clause containing their antecedents in these cases, one cannot argue that the relevant binding domain is the clause. The most one can say based on such data is that the domain is "clause-like".
The distribution of common and proper nouns is unlike that of reflexive, reciprocal, and personal pronouns. The relevant observation in this regard is that a noun is often reluctantly coreferential with another nominal that is within its binding domain or in a superordinate binding domain, e.g.
The readings indicated in the a-sentences are natural, whereas the b-sentences are very unusual. Indeed, sentences like these b-sentences were judged to be impossible in the traditional binding theory according to Condition C (see below). Given a contrastive context, however, the b-sentences can work, e.g. Susan does not admire Jane, but rather Susani admires Susani. One can therefore conclude that nouns are not sensitive to binding domains in the same way that reflexive, reciprocal, and personal pronouns are.
The following subsections illustrate the extent to which pure linear order impacts the distribution of pronouns. While linear order is clearly important, it is not the only factor influencing where pronouns can appear.
A simple hypothesis concerning the distribution of many anaphoric elements, of personal pronouns in particular, is that linear order plays a role. In most cases, a pronoun follows its antecedent, and in many cases, the coreferential reading is impossible if the pronoun precedes its antecedent. The following sentences suggest that pure linear can indeed be important for the distribution of pronouns:
While the coreferential readings indicated in these b-sentences are possible, they are unlikely. The order presented in the a-sentences is strongly preferred. The following, more extensive data sets further illustrate that linear order is important:
While the acceptability judgements here are nuanced, one can make a strong case that pure linear order is at least in part predictive of when the indicated reading is available. The a- and c-sentences allow the coreferential reading more easily than their b- and d-counterparts.
While linear order is an important factor influencing the distribution of pronouns, it is not the only factor. The following sentences are similar to the c- and d-sentences in the previous section insofar as an embedded clause is present.
While there may be a mild preference for the order in the a-sentences here, the indicated reading in the b-sentences is also available. Hence linear order is hardly playing a role in such cases. The relevant difference between these sentences and the c- and d-sentences in the previous section is that the embedded clauses here are adjunct clauses, whereas they are argument clauses above. The following examples involve adjunct phrases: [4]
The fact that the c-sentences marginally allow the indicated reading whereas the b-sentences do not at all allow this reading further demonstrates that linear order is important. But in this regard, the d-sentences are telling, since if linear order were the entire story, one would expect the d-sentences to be less acceptable than they are. The conclusion that one can draw from such data is that there are one or more other factors beyond linear order that are impacting the distribution of pronouns.
Given that linear order is not the only factor influencing the distribution of pronouns, the question is what other factor or factors might also be playing a role. The traditional binding theory (see below) took c-command to be the all important factor, but the importance of c-command for syntactic theorizing has been extensively criticized in recent years. [5] The primary alternative to c-command is functional rank. These two competing concepts (c-command vs. rank) have been debated extensively and they continue to be debated. C-command is a configurational notion; it is defined over concrete syntactic configurations. Syntactic rank, in contrast, is a functional notion that resides in the lexicon; it is defined over the ranking of the arguments of predicates. Subjects are ranked higher than objects, first objects are ranked higher than second objects, and prepositional objects are ranked lowest. The following two subsections briefly consider these competing notions.
C-command is a configurational notion that acknowledges the syntactic configuration as primitive. Basic subject-object asymmetries, which are numerous in many languages, are explained by the fact that the subject appears outside of the finite verb phrase (VP) constituent, whereas the object appears inside it. Subjects therefore c-command objects, but not vice versa. C-command is defined as follows:
Given the binary division of the clause (S → NP + VP) associated with most phrase structure grammars, this definition sees a typical subject c-commanding everything inside the verb phrase (VP), whereas everything inside the VP is incapable of c-commanding anything outside of the VP. Some basic binding facts are explained in this manner, e.g.
Sentence a is fine because the subject Larry c-commands the object himself, whereas sentence b does not work because the object Larry does not c-command the subject himself. The assumption has been that within its binding domain, a reflexive pronoun must be c-commanded by its antecedent. While this approach based on c-command makes a correct prediction much of the time, there are other cases where it fails to make the correct prediction, e.g.
The reading indicated is acceptable in this case, but if c-command were the key notion helping to explain where the reflexive can and must appear, then the reading should be impossible since himself is not c-commanded by Larry. [7]
As reflexive and personal pronouns occur in complementary distribution, the notion of c-command can also be used to explain where personal pronouns can appear. The assumption is that personal pronouns cannot c-command their antecedent, e.g.
In both examples, the personal pronoun she does not c-command its antecedent Alice, resulting in the grammaticality of both sentences despite reversed linear order.
The alternative to a c-command approach posits a ranking of syntactic functions (SUBJECT > FIRST OBJECT > SECOND OBJECT > PREPOSITIONAL OBJECT). [8] Subject-object asymmetries are addressed in terms of this ranking. Since subjects are ranked higher than objects, an object can have the subject as its antecedent, but not vice versa. With basic cases, this approach makes the same prediction as the c-command approach. The first two sentences from the previous section are repeated here:
Since the subject outranks the object, sentence a is predictably acceptable, the subject Larry outranking the object himself. Sentence b, in contrast, is bad because the subject reflexive pronoun himself outranks its postcedent Larry. In other words, this approach in terms of rank is assuming that within its binding domain, a reflexive pronoun may not outrank its antecedent (or postcedent). Consider the third example sentence from the previous section in this regard:
The approach based on rank does not require a particular configurational relationship to hold between a reflexive pronoun and its antecedent. In other words, it makes no prediction in this case, and hence does not make an incorrect prediction. The reflexive pronoun himself is embedded within the subject noun phrase, which means that it is not the subject and hence does not outrank the object Larry.
A theory of binding that acknowledges both linear order and rank can at least begin to predict many of the marginal readings. [9] When both linear order and rank combine, acceptability judgments are robust, e.g.
This ability to address marginal readings is something that an approach combining linear order and rank can accomplish, whereas an approach that acknowledges only c-command cannot do the same.
The exploration of binding phenomena got started in the 1970s and interest peaked in the 1980s with Government and Binding Theory, a grammar framework in the tradition of generative syntax that is still prominent today. [10] The theory of binding that became widespread at that time serves now merely as reference point (since it is no longer believed to be correct). This theory distinguishes between 3 different binding conditions: A, B, and C. The theory classifies nominals according to two features, [±anaphor] and [±pronominal], which are binary. The binding characteristics of a nominal are determined by the values of these features, either plus or minus. Thus, a nominal that is [-anaphor, -pronominal] is an R-expression (referring expression), such as a common noun or a proper name. A nominal that is [-anaphor, +pronominal] is a pronoun, such as he or they, and a nominal that is [+anaphor, -pronominal] is a reflexive pronoun, such as himself or themselves. [ clarification needed ] Note that the term anaphor here is being used in a specialized sense; it essentially means "reflexive". This meaning is specific to the Government and Binding framework and has not spread beyond this framework. [11]
Based on the classifications according to these two features, three conditions are formulated:
While the theory of binding that these three conditions represent is no longer held to be valid, as mentioned above, the associations with the three conditions are so firmly anchored in the study of binding that one often refers to, for example, "Condition A effects" or "Condition B effects" when describing binding phenomena.
In linguistics and grammar, a pronoun is a word or a group of words that one may substitute for a noun or noun phrase.
In mathematics, and in other disciplines involving formal languages, including mathematical logic and computer science, a variable may be said to be either free or bound. Some older books use the terms real variable and apparent variable for free variable and bound variable, respectively. A free variable is a notation (symbol) that specifies places in an expression where substitution may take place and is not a parameter of this or any container expression. The idea is related to a placeholder, or a wildcard character that stands for an unspecified symbol.
Government and binding is a theory of syntax and a phrase structure grammar in the tradition of transformational grammar developed principally by Noam Chomsky in the 1980s. This theory is a radical revision of his earlier theories and was later revised in The Minimalist Program (1995) and several subsequent papers, the latest being Three Factors in Language Design (2005). Although there is a large literature on government and binding theory which is not written by Chomsky, Chomsky's papers have been foundational in setting the research agenda.
A reflexive pronoun is a pronoun that refers to another noun or pronoun within the same sentence.
In linguistics, anaphora is the use of an expression whose interpretation depends upon another expression in context. In a narrower sense, anaphora is the use of an expression that depends specifically upon an antecedent expression and thus is contrasted with cataphora, which is the use of an expression that depends upon a postcedent expression. The anaphoric (referring) term is called an anaphor. For example, in the sentence Sally arrived, but nobody saw her, the pronoun her is an anaphor, referring back to the antecedent Sally. In the sentence Before her arrival, nobody saw Sally, the pronoun her refers forward to the postcedent Sally, so her is now a cataphor. Usually, an anaphoric expression is a pro-form or some other kind of deictic expression. Both anaphora and cataphora are species of endophora, referring to something mentioned elsewhere in a dialog or text.
In linguistics, a pro-form is a type of function word or expression (linguistics) that stands in for another word, phrase, clause or sentence where the meaning is recoverable from the context. They are used either to avoid repetitive expressions or in quantification.
In linguistics, coreference, sometimes written co-reference, occurs when two or more expressions refer to the same person or thing; they have the same referent. For example, in Bill said Alice would arrive soon, and she did, the words Alice and she refer to the same person.
In generative grammar and related frameworks, a node in a parse tree c-commands its sister node and all of its sister's descendants. In these frameworks, c-command plays a central role in defining and constraining operations such as syntactic movement, binding, and scope. Tanya Reinhart introduced c-command in 1976 as a key component of her theory of anaphora. The term is short for "constituent command".
In linguistics, the empty category principle (ECP) was proposed in Noam Chomsky's syntactic framework of government and binding theory. The ECP is supposed to be a universal syntactic constraint that requires certain types of empty categories, namely traces, to be properly governed.
A reciprocal pronoun is a pronoun that indicates a reciprocal relationship. A reciprocal pronoun can be used for one of the participants of a reciprocal construction, i.e. a clause in which two participants are in a mutual relationship. The reciprocal pronouns of English are one another and each other, and they form the category of anaphors along with reflexive pronouns.
In linguistics, an empty category, which may also be referred to as a covert category, is an element in the study of syntax that does not have any phonological content and is therefore unpronounced. Empty categories exist in contrast to overt categories which are pronounced. When representing empty categories in tree structures, linguists use a null symbol (∅) to depict the idea that there is a mental category at the level being represented, even if the word(s) are being left out of overt speech. The phenomenon was named and outlined by Noam Chomsky in his 1981 LGB framework, and serves to address apparent violations of locality of selection — there are different types of empty categories that each appear to account for locality violations in different environments. Empty categories are present in most of the world's languages, although different languages allow for different categories to be empty.
In generative linguistics, PRO is a pronominal determiner phrase (DP) without phonological content. As such, it is part of the set of empty categories. The null pronoun PRO is postulated in the subject position of non-finite clauses. One property of PRO is that, when it occurs in a non-finite complement clause, it can be bound by the main clause subject or the main clause object. The presence of PRO in non-finite clauses lacking overt subjects allows a principled solution for problems relating to binding theory.
The Tensed-S condition is a condition proposed in Noam Chomsky (1973) which essentially stipulates that certain classes of syntactic transformational rules cannot apply across clause boundaries. The condition is formalised as follows:
The Specified Subject Condition (SSC) is a condition proposed in Chomsky (1973) which restricts the application of certain syntactic transformational grammar rules. In many ways it is a counterpart to the Tensed-S Condition (TSC), applying to non-finite clauses and complex determiner phrases (DPs) which are not covered by the TSC. The rule was formalized as follows, where a "specified subject" is a lexical subject i.e. a subject with semantic content, like a proper noun, a complex DP, or a pronominal:
In linguistics, locality refers to the proximity of elements in a linguistic structure. Constraints on locality limit the span over which rules can apply to a particular structure. Theories of transformational grammar use syntactic locality constraints to explain restrictions on argument selection, syntactic binding, and syntactic movement.
A resumptive pronoun is a personal pronoun appearing in a relative clause, which restates the antecedent after a pause or interruption, as in This is the girli that whenever it rains shei cries.
A bound variable pronoun is a pronoun that has a quantified determiner phrase (DP) – such as every, some, or who – as its antecedent.
In linguistics, sloppy identity is an interpretive property that is found with verb phrase ellipsis where the identity of the pronoun in an elided VP is not identical to the antecedent VP.
Logophoricity is a phenomenon of binding relation that may employ a morphologically different set of anaphoric forms, in the context where the referent is an entity whose speech, thoughts, or feelings are being reported. This entity may or may not be distant from the discourse, but the referent must reside in a clause external to the one in which the logophor resides. The specially-formed anaphors that are morphologically distinct from the typical pronouns of a language are known as logophoric pronouns, originally coined by the linguist Claude Hagège. The linguistic importance of logophoricity is its capability to do away with ambiguity as to who is being referred to. A crucial element of logophoricity is the logophoric context, defined as the environment where use of logophoric pronouns is possible. Several syntactic and semantic accounts have been suggested. While some languages may not be purely logophoric, logophoric context may still be found in those languages; in those cases, it is common to find that in the place where logophoric pronouns would typically occur, non-clause-bounded reflexive pronouns appear instead.
In linguistics, crossover effects are restrictions on possible binding or coreference that hold between certain phrases and pronouns. Coreference that is normal and natural when a pronoun follows its antecedent becomes impossible, or at best just marginally possible, when "crossover" is deemed to have occurred, e.g. ?Who1 do his1 friends admire __1? The term itself refers to the traditional transformational analysis of sentences containing leftward movement, whereby it appears as though the fronted constituent crosses over the expression with which it is coindexed on its way to the front of the clause. Crossover effects are divided into strong crossover (SCO) and weak crossover (WCO). The phenomenon occurs in English and related languages, and it may be present in all natural languages that allow fronting.