metaMatch: an algorithm for taxonomic assignment in metagenomics

FRIGERIO, Jean-Marc; CHAUMEIL, Philippe; GAY, Pierre; KERMARREC, Lenaïg; RIMET, Frédéric; BOUCHEZ, Agnes; FRANC, Alain



hal.structure.identifier	Biodiversité, Gènes & Communautés [BioGeCo]
dc.contributor.author	FRIGERIO, Jean-Marc
hal.structure.identifier	Biodiversité, Gènes & Communautés [BioGeCo]
dc.contributor.author	CHAUMEIL, Philippe
hal.structure.identifier	Mésocentre de Calcul Intensif Aquitain [MCIA]
dc.contributor.author	GAY, Pierre
dc.contributor.author	KERMARREC, Lenaïg
hal.structure.identifier	Centre Alpin de Recherche sur les Réseaux Trophiques et Ecosystèmes Limniques [CARRTEL]
dc.contributor.author	RIMET, Frédéric
hal.structure.identifier	Centre Alpin de Recherche sur les Réseaux Trophiques et Ecosystèmes Limniques [CARRTEL]
dc.contributor.author	BOUCHEZ, Agnes
hal.structure.identifier	Biodiversité, Gènes & Communautés [BioGeCo]
dc.contributor.author	FRANC, Alain
dc.date.created	2000
dc.date.issued	2012
dc.date.conference	2012-10-01
dc.description.abstractEn	Community ecology faces a new challenge now that next-generation sequencing can yield data from hundreds of microbial community samples. Combined with accurate and reliable taxonomic assessment, this approach yields hundreds of new datasets that will contribute to a better understanding of community assemblies formed under various environmental and historical conditions. Algorithms that classify sequences by comparison to a reference library are the most widely used tools for assessing the community composition of environmental samples. However, because they are computationally intensive, almost all of these algorithms (the most standard being BLAST and its derivatives) use heuristics designed to speed up the database exploration phase, at the cost of being less strict about the quality of the match between a query and a reference. The problem is naturally distributable, as all (query, reference) comparisons are independent. Here, we present a tool that performs comparisons between queries (say, one million reads) and reference sequences (say, several thousand), and its implementation on two infrastructures: a cluster at MCIA (Mésocentre de Calcul Intensif en Aquitaine) and the EGI production grid. We show how tracking the large number of jobs generated was nearly impossible with gLite, and how this problem could be solved using DIRAC. We compare run time and result quality between a run on the Avakas cluster and a run on the EGI grid. As a perspective, we will develop a user-friendly interface enabling this tool to be used routinely on the grid as a diagnostic by users not acquainted with the computing subtleties of the grid.
dc.language.iso	fr
dc.title	metaMatch: un algorithme pour l'assignation taxonomique en métagénomique
dc.type	Conference paper
dc.subject.hal	Computer Science [cs]/Distributed, Parallel, and Cluster Computing [cs.DC]
bordeaux.conference.title	journées scientifiques mésocentres et France Grilles 2012
bordeaux.country	FR
bordeaux.conference.city	Paris
bordeaux.peerReviewed	yes
hal.identifier	hal-00766072
hal.version	1
hal.invited	no
hal.proceedings	no
hal.conference.end	2012-10-03
hal.popular	no
hal.audience	Not specified
hal.origin.link	https://hal.archives-ouvertes.fr//hal-00766072v1


This document appears in the following collection(s)

BioGeCo (Biodiversité Gènes & Communautés) - UMR 1202
