Distributionalism

Distributionalism was a general theory of language and a discovery procedure for establishing elements and structures of language based on observed usage. The purpose of distributionalism was to provide a scientific basis for syntax as independent of meaning. Zellig Harris defined 'distribution' as follows. [1]

“The DISTRIBUTION of an element is the total of all environments in which it occurs, i.e. the sum of all the (different) positions (or occurrences) of an element relative to the occurrence of other elements[.]”
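As a rough illustration of this definition, the distribution of a word can be collected as the set of environments in which it occurs in a small corpus. The sketch below is not Harris's procedure: it assumes that an environment is simply the pair of immediately neighbouring tokens, and the function name `distributions`, the boundary markers `<s>` and `</s>`, and the toy corpus are all hypothetical.

```python
from collections import defaultdict


def distributions(sentences):
    """Map each token to the set of (left, right) environments it occurs in.

    Assumption for illustration: an environment is the pair of immediate
    neighbours, with "<s>" and "</s>" marking the sentence boundaries.
    """
    envs = defaultdict(set)
    for sentence in sentences:
        tokens = ["<s>"] + sentence.split() + ["</s>"]
        # Skip the two boundary markers; record the neighbours of each real token.
        for i in range(1, len(tokens) - 1):
            envs[tokens[i]].add((tokens[i - 1], tokens[i + 1]))
    return dict(envs)


# Hypothetical toy corpus.
corpus = [
    "the dog barks",
    "the cat sleeps",
    "a dog sleeps",
]

dist = distributions(corpus)
print(sorted(dist["dog"]))  # [('a', 'sleeps'), ('the', 'barks')]
```

A wider window or a richer notion of environment (for example, word classes instead of word forms) would give a different picture of the distribution; the one-token window is chosen here only for brevity.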

Based on this idea, an analysis of immediate constituents could be carried out by observing the environments in which an element, such as a word, appears in corpora. Critics of distributionalism, such as Louis Hjelmslev, pointed out that the analysis of occurrence adds nothing to traditional structural analysis, which is based on the hierarchical, step-by-step categorization of elements. Hjelmslev proposed glossematics, which combines the analysis of meaning and form. In American linguistics in the 1960s, however, distributionalism was replaced by Noam Chomsky's transformational generative grammar, which proposed that constituency structure is the manifestation of an innate grammar, allowing the preservation of autonomous syntax. [2]

Origins

Distributionalism can be said to have originated in the work of structuralist linguist Leonard Bloomfield and was more clearly formalised by Zellig S. Harris. [1] [3]

This theory emerged in the United States in the 1950s as a variant of structuralism, which was the mainstream linguistic theory at the time, and it dominated American linguistics for some time. [4] The use of "distribution" as a technical term for a component of discovery procedure probably goes back to Morris Swadesh in 1934; [5] it was then applied to the principles of phonematics, to establish which of the observable sounds of a language constitute the allophones of a phoneme and which should be kept as separate phonemes. [6] According to Turenne and Pomerol, distributionalism was in fact a second phase in the history of linguistics, following that of structuralism, and was dominant mainly from 1935 to 1960. [7] It is considered one of the scientific grounds of Noam Chomsky's generative grammar and had considerable influence on language teaching.

Distributionalism has much in common with structuralism. However, it appeared in the United States at a time when the theses of Ferdinand de Saussure were only just beginning to be known in Europe, so distributionalism must be considered an original theory in relation to Saussureanism.

The behaviourist psychological theories that made distributionalism possible are reminiscent of Pavlov's work on animals. According to these theories, human behaviour is fully explainable and its mechanics can be studied. The study of reflexes, for example, was expected to make it possible to predict certain behaviours. Leonard Bloomfield argued that language, like behaviour, could be analysed as a predictable mechanism, explicable by the external conditions of its appearance.

The notions of "mechanism", "inductive method" and "corpus" are key terms of distributionalism.

Mechanism vs Mentalism

Bloomfield called his thesis mechanism and opposed it to mentalism: in his view, speech cannot be explained as an effect of thoughts (intentions, beliefs, feelings). Thus, one must be able to account for linguistic behaviour and the hierarchical structure of the messages conveyed without any assumptions about the speakers' intentions and mental states. [8]

From the behaviourist perspective, a given stimulus corresponds to a given response. For distributionalists, however, meaning is unstable: it depends on the situation and is not observable. It must therefore be eliminated as an element of language analysis. The only regularity is of a morphosyntactic nature: it is the structural invariants of the morphosyntax that allow the language system to be reconstructed from an analysis of its observable elements, the words of a given corpus.

Salient features

The main idea of distributionalism is that linguistic units "are what they do", [9] which means that the identity of a linguistic unit is defined by its distribution. Zellig Harris considered meaning too intuitive to be a reliable ground for linguistic research. Language use has to be observed directly, by looking at all the environments in which a unit can occur. Harris advocated a distributional approach, since "difference of meaning correlates with difference of distribution." [10]
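The idea that units with similar distributions behave alike can be sketched by comparing the environments of two words. The example below is an illustration only, not a method taken from Harris: the Jaccard overlap is swapped in as a convenient similarity measure, an environment is assumed to be the pair of immediate neighbours, and the helper names `environments` and `jaccard` and the toy corpus are hypothetical.

```python
def environments(sentences, word):
    """Return the set of (left, right) neighbour pairs in which `word` occurs."""
    envs = set()
    for sentence in sentences:
        tokens = ["<s>"] + sentence.split() + ["</s>"]
        for i in range(1, len(tokens) - 1):
            if tokens[i] == word:
                envs.add((tokens[i - 1], tokens[i + 1]))
    return envs


def jaccard(a, b):
    """Share of environments common to both sets (0.0 if both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


# Hypothetical toy corpus.
toy_corpus = [
    "the dog sleeps",
    "the cat sleeps",
    "the dog eats",
    "the cat eats",
    "the dog barks",
    "she eats slowly",
]

dog = environments(toy_corpus, "dog")
cat = environments(toy_corpus, "cat")
eats = environments(toy_corpus, "eats")

print(jaccard(dog, cat))   # two of three environments are shared
print(jaccard(dog, eats))  # no shared environments
```

In this toy corpus "dog" and "cat" share most of their environments, while "dog" and "eats" share none, which is the kind of contrast a distributional analysis relies on to group units into classes without appealing to meaning.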

References

  1. Harris, Zellig. 1951. Methods in Structural Linguistics. Chicago: University of Chicago Press, xvi, 384 pp. (Ms. title Methods in Descriptive Linguistics. Repr. as "Phoenix Books" P 52 with the title Structural Linguistics, 1960; 7th impression, 1966; 1984.) [Completed 1946, Preface signed "Philadelphia, January 1947".]
  2. Shakeri, Mohammed Amin (2022). "Last Glossematic Conference: A Rich Source of Comparison with American Structural Linguistics" (PDF). nors.ku.dk. Institut for Nordiske Studier og Sprogvidenskab. Archived from the original (PDF) on 2023-06-15. Retrieved 2023-06-15.
  3. Harris, Zellig. 1954. "Distributional Structure". Word 10:2/3.146-162. (Also in Linguistics Today: Published on the occasion of the Columbia University Bicentennial ed. by Andre Martinet & Uriel Weinreich, 26-42. New York: Linguistic Circle of New York, 1954. Repr. in The Structure of Language: Readings in the philosophy of language ed. by Jerry A[lan] Fodor & Jerrold J[acob] Katz, 33-49. Englewood Cliffs, N.J.: Prentice-Hall, 1964, and also in Harris 1970a.775-794, and in 1981.3-22.)
  4. Spyns, Peter. 2000. Natural Language Processing in Medicine: Design, Implementation and Evaluation of an Analyser for Dutch. Leuven University Press. ISBN 978-90-5867-069-4, p. 36.
  5. Swadesh, Morris (1934). "The Phonemic Principle". Language. 10 (2): 117–129. doi:10.2307/409603. JSTOR 409603.
  6. Diderichsen, Paul (1958). Sivertsen, Eva (ed.). "The Importance of Distribution versus Other Criteria in Linguistic Analysis". Proceedings of the VIII International Congress of Linguists. Oslo University Press: 156–182.
  7. Turenne, Nicolas, and Jean‐Charles Pomerol. "Language Modeling." Knowledge Needs and Information Extraction (2013): 61-80.
  8. Glottopedia, s.v. Mentalism.
  9. Dilley. 1999. The Problem of Context. Berghahn Books, p. 62.
  10. Harris, Zellig. 1954. "Distributional Structure". Word 10:2/3, p. 156.
