Contribution à la construction d’ontologies et à la recherche d’information : application au domaine médical

DRAME, Khadim

The system will be going down for regular maintenance. Please save your work and logout.

dc.contributor.advisor	Salamon, Roger
dc.contributor.advisor	Diallo, Gayo
dc.contributor.advisor	Mougin, Fleur
dc.contributor.author	DRAME, Khadim
dc.contributor.other	Melançon, Guy
dc.contributor.other	Hernandez, Nathalie
dc.contributor.other	Dartigues, Jean-François
dc.date	2014-12-10
dc.identifier.uri	http://www.theses.fr/2014BORD0444/abes
dc.identifier.uri	https://tel.archives-ouvertes.fr/tel-01166042
dc.identifier.nnt	2014BORD0444
dc.description.abstract	Ce travail vise à permettre un accès efficace à des informations pertinentes malgré le volume croissant des données disponibles au format électronique. Pour cela, nous avons étudié l’apport d’une ontologie au sein d’un système de recherche d'information (RI).Nous avons tout d’abord décrit une méthodologie de construction d’ontologies. Ainsi, nous avons proposé une méthode mixte combinant des techniques de traitement automatique des langues pour extraire des connaissances à partir de textes et la réutilisation de ressources sémantiques existantes pour l’étape de conceptualisation. Nous avons par ailleurs développé une méthode d’alignement de termes français-anglais pour l’enrichissement terminologique de l’ontologie. L’application de notre méthodologie a permis de créer une ontologie bilingue de la maladie d’Alzheimer.Ensuite, nous avons élaboré des algorithmes pour supporter la RI sémantique guidée par une ontologie. Les concepts issus d’une ontologie ont été utilisés pour décrire automatiquement les documents mais aussi pour reformuler les requêtes. Nous nous sommes intéressés à : 1) l’identification de concepts représentatifs dans des corpus, 2) leur désambiguïsation, 3), leur pondération selon le modèle vectoriel, adapté aux concepts et 4) l’expansion de requêtes. Ces propositions ont permis de mettre en œuvre un portail de RI sémantique dédié à la maladie d’Alzheimer. Par ailleurs, le contenu des documents à indexer n’étant pas toujours accessible dans leur ensemble, nous avons exploité des informations incomplètes pour déterminer les concepts pertinents permettant malgré tout de décrire les documents. Pour cela, nous avons proposé deux méthodes de classification de documents issus d’un large corpus, l’une basée sur l’algorithme des k plus proches voisins et l’autre sur l’analyse sémantique explicite. Ces méthodes ont été évaluées sur de larges collections de documents biomédicaux fournies lors d’un challenge international.
dc.description.abstractEn	This work aims at providing efficient access to relevant information among the increasing volume of digital data. Towards this end, we studied the benefit from using ontology to support an information retrieval (IR) system.We first described a methodology for constructing ontologies. Thus, we proposed a mixed method which combines natural language processing techniques for extracting knowledge from text and the reuse of existing semantic resources for the conceptualization step. We have also developed a method for aligning terms in English and French in order to enrich terminologically the resulting ontology. The application of our methodology resulted in a bilingual ontology dedicated to Alzheimer’s disease.We then proposed algorithms for supporting ontology-based semantic IR. Thus, we used concepts from ontology for describing documents automatically and for query reformulation. We were particularly interested in: 1) the extraction of concepts from texts, 2) the disambiguation of terms, 3) the vectorial weighting schema adapted to concepts and 4) query expansion. These algorithms have been used to implement a semantic portal about Alzheimer’s disease. Further, because the content of documents are not always fully available, we exploited incomplete information for identifying the concepts, which are relevant for indexing the whole content of documents. Toward this end, we have proposed two classification methods: the first is based on the k nearest neighbors’ algorithm and the second on the explicit semantic analysis. The two methods have been evaluated on large standard collections of biomedical documents within an international challenge.
dc.language.iso	fr
dc.subject	Construction d’ontologie
dc.subject	Réutilisation de RTO
dc.subject	Recherche d’information
dc.subject	Indexation sémantique
dc.subject	Classification de documents biomédicaux
dc.subject	Maladie d’Alzheimer
dc.subject.en	Ontology construction
dc.subject.en	TOR reuse
dc.subject.en	Information retrieval
dc.subject.en	Semantic indexing
dc.subject.en	Biomedical document classification
dc.subject.en	Alzheimer’s disease
dc.title	Contribution à la construction d’ontologies et à la recherche d’information : application au domaine médical
dc.title.en	Contribution to ontology building and to semantic information retrieval : application to medical domain
dc.type	Thèses de doctorat
bordeaux.hal.laboratories	Université de Bordeaux. Centre de recherche en épidémiologie et biostatistique
bordeaux.type.institution	Bordeaux
bordeaux.thesis.discipline	Santé publique. Option Informatique et Santé
bordeaux.ecole.doctorale	École doctorale Sociétés, politique, santé publique (Bordeaux)
star.origin.link	https://www.theses.fr/2014BORD0444
dc.contributor.rapporteur	Lô, Moussa
dc.contributor.rapporteur	Zweigenbaum, Pierre
bordeaux.COinS	ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.title=Contribution%20%C3%A0%20la%20construction%20d%E2%80%99ontologies%20et%20%C3%A0%20la%20recherche%20d%E2%80%99information%20:%20application%20au%20domaine%20m%C3%A9dical&rft.atitle=Contribution%20%C3%A0%20la%20construction%20d%E2%80%99ontologies%20et%20%C3%A0%20la%20recherche%20d%E2%80%99information%20:%20application%20au%20domaine%20m%C3%A9dical&rft.au=DRAME,%20Khadim&rft.genre=unknown

Files in this item

Files	Size	Format	View
There are no files associated with this item.

This item appears in the following Collection(s)

Thèses de l’Université de Bordeaux

Show simple item record