Query selection methods for automated corpora construction with a use case in food-drug interactions
Langue
EN
Autre communication scientifique (congrès sans actes - poster - séminaire...)
Ce document a été publié dans
ACL Workshop on Biomedical Natural Language Processing, 2019-08-01, Florence. 2019p. 115–124
Association for Computational Linguistics
Résumé en anglais
In this paper, we address the problem of automatically constructing a relevant corpus of scientific articles about food-drug interactions. There is a growing number of scientific publications that describe food-drug ...Lire la suite >
In this paper, we address the problem of automatically constructing a relevant corpus of scientific articles about food-drug interactions. There is a growing number of scientific publications that describe food-drug interactions but currently building a high-coverage corpus that can be used for information extraction purposes is not trivial. We investigate several methods for automating the query selection process using an expert-curated corpus of food-drug interactions. Our experiments show that index term features along with a decision tree classifier are the best approach for this task and that feature selection approaches and in particular gain ratio outperform frequency-based methods for query selection.< Réduire
Unités de recherche