TreeLex: a subcategorization lexicon automatically extracted from a French Treebank
Language
en
Conference paper with proceedings
This document was published in
ICGL, 2008
Abstract in English
TreeLex is a subcategorization lexicon of French verbs, automatically extracted from a syntactically annotated corpus. The lexicon comprises 1362 verbs (12353 occurrences). We present not only a list of verbs with their subcategorization frames but also an estimate of the number of different verb frames available in French in general. Additionally, we estimate the average number of frames per verb. After applying various factorization techniques, we obtain 58 frames for a function-based representation (on average, 1.72 frames per verb), and 160 frames for a richer representation based on function-category information (on average, 1.91 frames per verb).
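To make the two reported statistics concrete (number of distinct frames and average number of frames per verb), here is a minimal sketch of how such figures could be computed from annotated occurrences. The toy data, the function labels (`SUJ`, `OBJ`, `A-OBJ`) and the helper names `build_lexicon` and `frame_statistics` are illustrative assumptions, not the TreeLex extraction procedure, and the sketch does not model the factorization techniques mentioned in the abstract.

```python
from collections import defaultdict

# Hypothetical toy data: one (verb lemma, frame) pair per corpus occurrence.
# A frame is modeled here as a tuple of grammatical function labels (assumed labels).
occurrences = [
    ("manger", ("SUJ", "OBJ")),
    ("manger", ("SUJ",)),
    ("manger", ("SUJ", "OBJ")),  # repeated frame, counted once for the verb
    ("donner", ("SUJ", "OBJ", "A-OBJ")),
    ("dormir", ("SUJ",)),
]


def build_lexicon(occurrences):
    """Group occurrences by verb, keeping the set of distinct frames per verb."""
    lexicon = defaultdict(set)
    for verb, frame in occurrences:
        lexicon[verb].add(frame)
    return dict(lexicon)


def frame_statistics(lexicon):
    """Return (number of distinct frames overall, average frames per verb)."""
    if not lexicon:
        return 0, 0.0
    distinct_frames = set().union(*lexicon.values())
    average = sum(len(frames) for frames in lexicon.values()) / len(lexicon)
    return len(distinct_frames), average


lexicon = build_lexicon(occurrences)
n_frames, avg_frames = frame_statistics(lexicon)
print(n_frames, round(avg_frames, 2))
```

In this toy example the same frame can be shared by several verbs, which is why the number of distinct frames overall is counted separately from the per-verb average.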
Origin
Imported from HAL
Research units