Afficher la notice abrégée

hal.structure.identifierBiodiversité, Gènes & Communautés [BioGeCo]
dc.contributor.authorFRIGERIO, Jean-Marc
hal.structure.identifierBiodiversité, Gènes & Communautés [BioGeCo]
dc.contributor.authorCHAUMEIL, Philippe
hal.structure.identifierMésocentre de Calcul Intensif Aquitain [MCIA]
dc.contributor.authorGAY, Pierre
dc.contributor.authorKERMARREC, Lenaïg
hal.structure.identifierCentre Alpin de Recherche sur les Réseaux Trophiques et Ecosystèmes Limniques [CARRTEL]
dc.contributor.authorRIMET, Frédéric
hal.structure.identifierCentre Alpin de Recherche sur les Réseaux Trophiques et Ecosystèmes Limniques [CARRTEL]
dc.contributor.authorBOUCHEZ, Agnes
hal.structure.identifierBiodiversité, Gènes & Communautés [BioGeCo]
dc.contributor.authorFRANC, Alain
dc.date.created2000
dc.date.issued2012
dc.date.conference2012-10-01
dc.description.abstractEnCommunity ecology faces a new challenge as the next-generation sequencing approaches can yield data from hundreds of microbial community samples. This way, combined with accurate and reliable taxonomic assessment, yields hundreds of new data that will contribute to a better understanding of community assemblies formed under various environmental and historical conditions. Algorithms classifying sequences by comparison to a reference library are the most widely used tools for assessing community composition of environmental samples. However, as they are computationally intensive, almost all these algorithms (most standard being BLAST and similar offsprings) use heuristics designed to speed up the database exploration phase, at the cost of being less strict with the quality of the match between a query and a reference. This problem is naturally distributable, as all comparisons (query, reference) are independent. Here, we present a tool enabling comparisons between queries ( say, one million reads) and reference sequences (say, several thousands), and its implementation on two infrastructures: a cluster in MCIA (Mésocentre de Calcul Intensif en Aquitaine) and a production grid EGI. We show how tracking the large number of jobs generated was nearly impossible with gLite, and how this problem could be solved using Dirac. We compare time and quality between a run on Avakas and on the grid EGI. As a perspective, we will develop a user friendly interface enabling this tool to be used routinely on the grid as a diagnostic for a user not acquainted with computing subtleties of the grid.
dc.language.isofr
dc.titlemetaMatch: un algorithme pour l'assignation taxonomique en métagénomique
dc.typeCommunication dans un congrès
dc.subject.halInformatique [cs]/Calcul parallèle, distribué et partagé [cs.DC]
bordeaux.conference.titlejournées scientifiques mésocentres et France Grilles 2012
bordeaux.countryFR
bordeaux.conference.cityParis
bordeaux.peerReviewedoui
hal.identifierhal-00766072
hal.version1
hal.invitednon
hal.proceedingsnon
hal.conference.end2012-10-03
hal.popularnon
hal.audienceNon spécifiée
hal.origin.linkhttps://hal.archives-ouvertes.fr//hal-00766072v1
bordeaux.COinSctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.title=metaMatch:%20un%20algorithme%20pour%20l'assignation%20taxonomique%20en%20m%C3%A9tag%C3%A9nomique&rft.atitle=metaMatch:%20un%20algorithme%20pour%20l'assignation%20taxonomique%20en%20m%C3%A9tag%C3%A9nomique&rft.date=2012&rft.au=FRIGERIO,%20Jean-Marc&CHAUMEIL,%20Philippe&GAY,%20Pierre&KERMARREC,%20Lena%C3%AFg&RIMET,%20Fr%C3%A9d%C3%A9ric&rft.genre=unknown


Fichier(s) constituant ce document

FichiersTailleFormatVue

Il n'y a pas de fichiers associés à ce document.

Ce document figure dans la(les) collection(s) suivante(s)

Afficher la notice abrégée