Morphology based automatic acquisition of large-coverage lexica
hal.structure.identifier | Laboratoire Bordelais de Recherche en Informatique [LaBRI] | |
hal.structure.identifier | Linguistic signs, grammar and meaning: computational logic for natural language [SIGNES] | |
dc.contributor.author | CLÉMENT, Lionel | |
hal.structure.identifier | Software tools for natural language [ATOLL] | |
dc.contributor.author | LANG, Bernard | |
hal.structure.identifier | Software tools for natural language [ATOLL] | |
dc.contributor.author | SAGOT, Benoît | |
dc.date.accessioned | 2024-04-15T09:50:21Z | |
dc.date.available | 2024-04-15T09:50:21Z | |
dc.date.created | 2004 | |
dc.date.issued | 2004 | |
dc.date.conference | 2004 | |
dc.identifier.uri | https://oskar-bordeaux.fr/handle/20.500.12278/198333 | |
dc.description.abstractEn | In this article, we introduce a new technique for constructing wide-coverage morphological lexica from large corpora and morphological knowledge, with an application to French. It relies on the idea that the existence of a hypothetical lemma can be guessed if several different words found in the corpus are best interpreted as morphological variants of this lemma. We first validated our technique by extracting verbs and adjectives from a general French corpus of 25 million words. Compared with other lexical resources available for French, our results are very satisfying, since we cover many words, often derived words, that are not always present in other lexica. Applying our algorithm to the acquisition of domain-specific adjectives from a botanical corpus also gave very good results, demonstrating that it can be used to extract domain-specific lexica. Moreover, it is generalizable to any language with substantial morphology. | |
dc.language.iso | en | |
dc.source.title | LREC 04 | |
dc.title.en | Morphology based automatic acquisition of large-coverage lexica | |
dc.type | Conference paper | |
dc.subject.hal | Computer Science [cs]/Other [cs.OH] | |
bordeaux.page | 1841-1844 | |
bordeaux.hal.laboratories | Laboratoire Bordelais de Recherche en Informatique (LaBRI) - UMR 5800 | * |
bordeaux.institution | Université de Bordeaux | |
bordeaux.institution | Bordeaux INP | |
bordeaux.institution | CNRS | |
bordeaux.conference.title | LREC 04 | |
bordeaux.country | PT | |
bordeaux.title.proceeding | LREC 04 | |
bordeaux.conference.city | Lisbon | |
bordeaux.peerReviewed | yes | |
hal.identifier | hal-00413189 | |
hal.version | 1 | |
hal.invited | no | |
hal.proceedings | yes | |
hal.popular | no | |
hal.audience | International | |
hal.origin.link | https://hal.archives-ouvertes.fr//hal-00413189v1 | |