Lexicon-grammar
Lexicon-Grammar is a method and a praxis of formalized description of human languages, [1][2] which considers that the systematic investigation of lexical entries is presently the main challenge of the scientific study of languages. [3] The development of Lexicon-Grammar began in the late 1960s under Maurice Gross.
Its theoretical basis is Zellig S. Harris's[4] [5] distributionalism, [3] and notably the notion of transformational rule. [6] The notational conventions are meant to be as clear and comprehensible as possible. [7]
The method of Lexicon-Grammar is inspired from hard sciences. [8] It focuses on data collection, hence on the real use of language, both from a quantitative and observational point of view.
Lexicon-grammar also requires formalisation. [9] The results of the description must be sufficiently formal to be usable for natural language processing, in particular through the development of parsers. [10] [11] [12] [13] [14] The information model is such that the results of the description take the form of two-dimensional tables, also called matrices. [6] Lexicon-grammar tables cross-tabulate lexical entries with their syntactic-semantic properties. [3] [15] As a result, they make up a database of syntactic-semantic information. [16]
Experiments showed that several researchers or teams can make their observations cumulative. [17]
The term lexicon-grammar is used for the first time by Annibale Elia. [18]
Theoretical basis
[edit | edit source]The theoretical basis of Lexicon-grammar is Zellig Harris' distributionalism,[4] and in particular the notion of transformation in the sense of Zellig Harris. [6][19][20] Maurice Gross was a student of Zellig Harris. The conventions for the presentation of grammatical information are intended to be as simple and transparent as possible. [7] This concern comes from Zellig Harris, whose theory is oriented towards directly observable surface forms; this differs from Generative Grammar, which normally uses abstract structures such as deep structures.
Fact collection
[edit | edit source]The Lexicon-Grammar method is inspired by experimental science. [8] It emphasizes the collection of facts, confronting the researcher with the reality of language use, from a quantitative and an observational point of view.[21]
Quantitatively: a lexicon-grammar includes a program of systematic description of the lexicon, including observing for each lexical entry in which syntactic constructions it occurs. [3][22] This involves large-scale work, which can be carried out by teams and not by individual specialists. The exclusive search for general rules of syntax, independent of the lexical material they handle, is dismissed as a dead end.[23] This is different from Generative Grammar, which values the notion of generalization.
Observationally: methodological precautions are applied to ensure good reproducibility of observations, and in particular to guard against the risks associated with constructed examples.[24] One of these precautions is to take as a minimum unit of meaning the basic sentence.[25] [26] [15] Indeed, a word acquires a precise meaning only in a context; moreover, by inserting a word in a sentence, one has the advantage of manipulating a sequence that may be judged acceptable or unacceptable. It is at this price that syntactic-semantic properties are considered as defined with sufficient precision that it makes sense to test and check them against the whole lexicon.[3] [6] [27] These precautions have evolved in line with needs and the appearance of new technical means. Thus, from the beginning of the 1990s, the contributors of the Lexicon-Grammar have been able to use attested examples in text corpora more and more easily.[28][29] This new precaution has simply been added to the previous ones, positioning the Lexicon-Grammar simultaneously in introspective linguistics [30] and in corpus linguistics, much as advocated by Fillmore.[31] The American projects FrameNet[32] and VerbNet[33] show a relative convergence towards objectives close to those of Lexicon-Grammar.
Formalisation
[edit | edit source]Lexicon-grammar also requires formalisation.[9] The results of the description must be sufficiently formal to allow for:
- verification by comparison with the reality of language use;
- application to automatic language processing, and more particularly to deep linguistic processing, in particular through the development of parsers by computer scientists.[10] [11] [12] [13] [14]
Results
[edit | edit source]The results obtained through the application of these methodological principles by several dozen linguists over several decades constitute a database of syntactic-semantic information for natural language processing. The quality of this database can be assessed by considering:
- its size, measurable by the number of entries,
- the richness of the linguistic phenomena it covers, measurable by the number of properties,
- and its degree of formalization.
For French, more than 75,000 entries have been established;[34] more or less substantial descriptions—always following the same model—exist for about ten other languages,[6][19][27][35][28][29][36][37] the best-represented being Italian,[38] Portuguese,[39] Modern Greek,[40][41] and Korean.[42][43]
Work has been carried out and published within the Lexicon–Grammar framework on predicative nouns since the 1970s,[44][45] and on fixed expressions since the 1980s.[46][47]
The notion of a noun predicate comes from the work of Zellig Harris.[4] It is based on the following parallel: if, for example, the verb study is analyzed as the predicate in the sentence Luke studies eclipses, then it is natural to analyze the noun study—or the sequence carry out a study—as a predicate in the sentence Luke carries out a study on eclipses. In this case, the noun in question is called a noun predicate. The verb that accompanies it—here carry out—is referred to as a support verb. This idea has been systematically applied in the Lexicon–Grammar framework since the 1970s.[48][49][50][51][52][53]
Lexicon–Grammar contributors use the term 'fixed expression' when an expression (for example state of the art) has specific properties (here, its meaning) that justify giving it its own lexical entry, even though it is made up of several elements (state, of, the, and art) that can, in one way or another, be considered individual words.[46] A systematic program for describing such expressions has been undertaken within the Lexicon–Grammar framework since the 1980s.[19][54][55][56][57][58][59]
Cumulativity
[edit | edit source]These experiments have shown that several individuals or teams can arrive at identical results. This reproducibility ensures the cumulativity of the descriptions. This outcome is crucial for the future of language processing: the amount of data that must be gathered and represented within a coherent model is such that many research and development teams have to collaborate, and it must be possible to merge their results without having to rewrite substantial parts of the grammar and lexicon of each language. This requirement is far from easy to meet, as there are few known examples of grammars of significant size that are not the work of a single specialist.[17][60][61]
Interface with international standards
[edit | edit source]With the goal of improving the availability of the data in a clear and explicit way, part of the French Lexicon–Grammar has been transcoded into the ISO-standard LMF format.[62]
References
[edit | edit source]- ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
- ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value). Page vi.
- ^ a b c d e Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value). Pages I-II.
- ^ a b c Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
- ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
- ^ a b c d e Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
- ^ a b Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value). Pages I-V.
- ^ a b Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value). Page 22.
- ^ a b Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value). Page 76.
- ^ a b Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
- ^ a b Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
- ^ a b Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
- ^ a b Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
- ^ a b Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
- ^ a b Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value). Page 160.
- ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
- ^ a b Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value). Page 379.
- ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value). Nous avons entrepris un projet de lexique-grammaire de la langue italienne (We have undertaken a project of a lexicon-grammar of the Italian language.)
- ^ a b c Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
- ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value). Pages 79-81.
- ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
- ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
- ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value). We were forced to conclude that we could obtain no generalization without a reasonably complete study of the lexical items of the language and their syntactic uses. (...) Accumulating data (...) in all natural sciences (...) is a fundamental activity, a necessary condition for evaluating the generality of phenomena.
- ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value). Pages 384-385.
- ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value). Page 182.
- ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
- ^ a b Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
- ^ a b Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
- ^ a b Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
- ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value). Pages 385-386.
- ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value). (Proceedings from the 1991 Nobel Symposium on Corpus Linguistics, Stockholm.)
- ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
- ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
- ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
- ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
- ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
- ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
- ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
- ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
- ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
- ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
- ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
- ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
- ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value). Footnote 6, page 865: The solution of Zellig Harris [to the analysis of] NPs like the solution of the equation by our teacher (...) directly relates the two sentences: Our teacher solved the equation. / Our teacher effected the solution of the equation. (...) The passive form enters by relativization into the NP: the solution of the equation that was effected by our teacher. By deletion of that was effected, we obtain the NP under analysis.
- ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
- ^ a b Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
- ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
- ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
- ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
- ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
- ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
- ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
- ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
- ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
- ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
- ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
- ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
- ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
- ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
- ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
- ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
- ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
See also
[edit | edit source]External links
[edit | edit source]- Lexicon-Grammar tables (click on Language resources - Lexicon-Grammar - Presentation or Download).
- A selection of bibliography on lexicon-grammars.
Selected bibliography
[edit | edit source]- Boons, Jean-Paul; Alain Guillet; Christian Leclère. 1976. La structure des phrases simples en français. Constructions intransitives, Genève : Droz. (in French)
- Guillet, Alain; Christian Leclère. 1992. La structure des phrases simples en français. Constructions transitives locatives, Genève : Droz. (in French)
- Gross, Maurice. 1994. "The lexicon grammar of a language: Application to French", in Asher, R.E. (ed.), Encyclopedia of Language and Linguistics, Oxford: Pergamon Press, pp. 2195–2205.
- 郑定欧 (= Zheng Ding Ou) (org.). 2012. 词汇-语法五十年(1960-2010). Lexicon-grammar: 50 years. Lexique-Grammaire : 50 ans. Beijing/Guangzhou/Shanghai/Xi'an : World Publishing Corporation, 278 pages. (Introduction in Chinese).