Lexicon-Grammar is a method and a praxis of formalized description of human languages, [1][2] which considers that the systematic investigation of lexical entries is presently the main challenge of the scientific study of languages. [3] The development of Lexicon-Grammar began in the late 1960s under Maurice Gross.
The method of Lexicon-Grammar is inspired from hard sciences. [8] It focuses on data collection, hence on the real use of language, both from a quantitative and observational point of view.
Lexicon-grammar also requires formalisation. [9] The results of the description must be sufficiently formal to be usable for natural language processing, in particular through the development of parsers. [10][11][12][13][14] The information model is such that the results of the description take the form of two-dimensional tables, also called matrices. [6] Lexicon-grammar tables cross-tabulate lexical entries with their syntactic-semantic properties. [3][15] As a result, they make up a database of syntactic-semantic information. [16]
Experiments showed that several researchers or teams can make their observations cumulative. [17]
The term lexicon-grammar is used for the first time by Annibale Elia. [18]
Theoretical basis
The theoretical basis of Lexicon-grammar is Zellig Harris' distributionalism,[4] and in particular the notion of transformation in the sense of Zellig Harris. [6][19][20] Maurice Gross was a student of Zellig Harris. The conventions for the presentation of grammatical information are intended to be as simple and transparent as possible. [7] This concern comes from Zellig Harris, whose theory is oriented towards directly observable surface forms; this differs from Generative Grammar, which normally uses abstract structures such as deep structures.
Fact collection
The Lexicon-Grammar method is inspired by experimental science. [8] It emphasizes the collection of facts, confronting the researcher with the reality of language use, from a quantitative and an observational point of view.[21]
Quantitatively: a lexicon-grammar includes a program of systematic description of the lexicon, including observing for each lexical entry in which syntactic constructions it occurs. [3][22] This involves large-scale work, which can be carried out by teams and not by individual specialists. The exclusive search for general rules of syntax, independent of the lexical material they handle, is dismissed as a dead end.[23] This is different from Generative Grammar, which values the notion of generalization.
Observationally: methodological precautions are applied to ensure good reproducibility of observations, and in particular to guard against the risks associated with constructed examples.[24] One of these precautions is to take as a minimum unit of meaning the basic sentence.[25][26][15] Indeed, a word acquires a precise meaning only in a context; moreover, by inserting a word in a sentence, one has the advantage of manipulating a sequence that may be judged acceptable or unacceptable. It is at this price that syntactic-semantic properties are considered as defined with sufficient precision that it makes sense to test and check them against the whole lexicon.[3][6][27] These precautions have evolved in line with needs and the appearance of new technical means. Thus, from the beginning of the 1990s, the contributors of the Lexicon-Grammar have been able to use attested examples in text corpora more and more easily.[28][29] This new precaution has simply been added to the previous ones, positioning the Lexicon-Grammar simultaneously in introspective linguistics [30] and in corpus linguistics, much as advocated by Fillmore.[31] The American projects FrameNet[32] and VerbNet[33] show a relative convergence towards objectives close to those of Lexicon-Grammar.
Formalisation
Lexicon-grammar also requires formalisation.[9] The results of the description must be sufficiently formal to allow for:
verification by comparison with the reality of language use;
The results obtained through the application of these methodological principles by several dozen linguists over several decades constitute a database of syntactic-semantic information for natural language processing. The quality of this database can be assessed by considering:
its size, measurable by the number of entries,
the richness of the linguistic phenomena it covers, measurable by the number of properties,
Work has been carried out and published within the Lexicon–Grammar framework on predicative nouns since the 1970s,[44][45] and on fixed expressions since the 1980s.[46][47]
The notion of a nounpredicate comes from the work of Zellig Harris.[4] It is based on the following parallel: if, for example, the verb study is analyzed as the predicate in the sentence Luke studies eclipses, then it is natural to analyze the noun study—or the sequence carry out a study—as a predicate in the sentence Luke carries out a study on eclipses. In this case, the noun in question is called a noun predicate. The verb that accompanies it—here carry out—is referred to as a support verb(it). This idea has been systematically applied in the Lexicon–Grammar framework since the 1970s.[48][49][50][51][52][53]
Lexicon–Grammar contributors use the term 'fixed expression' when an expression (for example state of the art) has specific properties (here, its meaning) that justify giving it its own lexical entry, even though it is made up of several elements (state, of, the, and art) that can, in one way or another, be considered individual words.[46] A systematic program for describing such expressions has been undertaken within the Lexicon–Grammar framework since the 1980s.[19][54][55][56][57][58][59]
Cumulativity
These experiments have shown that several individuals or teams can arrive at identical results. This reproducibility ensures the cumulativity of the descriptions. This outcome is crucial for the future of language processing: the amount of data that must be gathered and represented within a coherent model is such that many research and development teams have to collaborate, and it must be possible to merge their results without having to rewrite substantial parts of the grammar and lexicon of each language. This requirement is far from easy to meet, as there are few known examples of grammars of significant size that are not the work of a single specialist.[17][60][61]
Interface with international standards
With the goal of improving the availability of the data in a clear and explicit way, part of the French Lexicon–Grammar has been transcoded into the ISO-standard LMF format.[62]
12De Bueriis, Giustino; Langella, Alberto Maria (2019). "Algebraic Lexicon Grammar". International Journal of Linguistics, Philology and Literature. 11 (3). Las Vegas: 21–42. Retrieved 5 December 2025. Page 22.
↑Leclère, Christian (2005). "The lexicon-grammar of French verbs: a syntactic database". Linguistic Informatics - State of the Art and the Future. First International Conference on Linguistic Informatics. Amsterdam/Philadelphia: Benjamins. pp.29–45. Retrieved 5 December 2025.
↑Gross, Maurice (1979). "On the failure of generative grammar". Language. 55 (4): 859–885. doi:10.2307/412748. Retrieved 5 December 2025.We were forced to conclude that we could obtain no generalization without a reasonably complete study of the lexical items of the language and their syntactic uses. (...) Accumulating data (...) in all natural sciences (...) is a fundamental activity, a necessary condition for evaluating the generality of phenomena.
↑Gross, Maurice (1988). "Methods and Tactics in the Construction of a Lexicon-Grammar"(PDF). Linguistics in the Morning Calm 2, Selected Papers. SICOL 1986. Yonsei University, Seoul, Korea: Hanshin Pub. Co. pp.177–197. Retrieved 5 December 2025. Page 182.
↑Fillmore, Charles (1992). "'Corpus linguistics' vs. 'Computer-aided armchair linguistics'". Directions in Corpus Linguistics. By Svartvik, Jan. Berlin / New York: Mouton de Gruyter. pp.35–60. doi:10.1093/oso/9780199292332.003.0007. ISBN3-11-012826-8. Retrieved 5 December 2025. (Proceedings from the 1991 Nobel Symposium on Corpus Linguistics, Stockholm.)
↑Gross, Maurice (2006). "État du lexique-grammaire du français et perspectives d'extension". History of the Language Sciences. By Auroux, Sylvain; Koerner, E.F.K.; Niederehe, Hand-J.; Versteegh, Kees (in French). Vol.3. Berlin-New York: Walter de Gruyter. pp.2122–2129. doi:10.1515/9783110167368.3. ISBN978-3-11-016736-8. Retrieved 11 December 2025.
↑Kyriacopoulou, Tita (2004). Analyse automatique des textes écrits: le cas du grec moderne (in French). Thessaloniki: University Studio Press. ISBN960-12-1376-7.
↑Gross, Maurice (1979). "On the failure of generative grammar". Language. 55 (4): 859–885. doi:10.2307/412748. Retrieved 12 December 2025. Footnote 6, page 865: The solution of Zellig Harris [to the analysis of] NPs like the solution of the equation by our teacher (...) directly relates the two sentences: Our teacher solved the equation. / Our teacher effected the solution of the equation. (...) The passive form enters by relativization into the NP: the solution of the equation that was effected by our teacher. By deletion of that was effected, we obtain the NP under analysis.
↑Labelle, Jacques (1974). "Le substantif symétrique". Cahier de linguistique. 5. Montréal: UQAM: 17–49.
↑Malik, Mahdi Mohamed; Royauté, Jean (2007). A Predicate Database for Assisting the Design of a Lexicon-Grammar of Predicative Nouns. Human Language Technology (LTC), Poznań, Poland. Springer. pp.312–324. doi:10.1007/978-3-642-04235-5_27.
↑Ranchhod, Elisabete (1990). Sintaxe dos Predicados Nominais com Estar. Linguística (in Portuguese). Vol.12. Lisbon: Instituto Nacional de Investigação Cientifica.
↑Blanc, Olivier; Constant, Matthieu (2005). Lexicalisation of grammars with parameterized graphs(PDF). International Conference of Recent Advances in Natural Language Processing (RANLP), Borovets, Bulgaria. pp.117–121. Retrieved 12 December 2025.
Gross, Maurice. 1994. "The lexicon grammar of a language: Application to French", in Asher, R.E. (ed.), Encyclopedia of Language and Linguistics, Oxford: Pergamon Press, pp.2195–2205.
郑定欧 (= Zheng Ding Ou) (org.). 2012. 词汇-语法五十年(1960-2010). Lexicon-grammar: 50 years. Lexique-Grammaire: 50 ans. Beijing/Guangzhou/Shanghai/Xi'an: World Publishing Corporation, 278 pages. (Introduction in Chinese).
This page is based on this Wikipedia article Text is available under the CC BY-SA 4.0 license; additional terms may apply. Images, videos and audio are available under their respective licenses.