Afficher la notice abrégée

hal.structure.identifierBiodiversité, Gènes & Communautés [BioGeCo]
hal.structure.identifierfrom patterns to models in computational biodiversity and biotechnology [PLEIADE]
dc.contributor.authorFRIGERIO, Jean-Marc
hal.structure.identifierCentre Alpin de Recherche sur les Réseaux Trophiques et Ecosystèmes Limniques [CARRTEL]
dc.contributor.authorRIMET, Frédéric
hal.structure.identifierCentre Alpin de Recherche sur les Réseaux Trophiques et Ecosystèmes Limniques [CARRTEL]
dc.contributor.authorBOUCHEZ, Agnes
hal.structure.identifierBiodiversité, Gènes & Communautés [BioGeCo]
dc.contributor.authorCHANCEREL, Emilie
hal.structure.identifierBiodiversité, Gènes & Communautés [BioGeCo]
hal.structure.identifierfrom patterns to models in computational biodiversity and biotechnology [PLEIADE]
dc.contributor.authorCHAUMEIL, Philippe
hal.structure.identifierBiodiversité, Gènes & Communautés [BioGeCo]
hal.structure.identifierfrom patterns to models in computational biodiversity and biotechnology [PLEIADE]
dc.contributor.authorSALIN, Franck
hal.structure.identifierInstitut du développement et des ressources en informatique scientifique [IDRIS]
dc.contributor.authorTHÉROND, Sylvie
hal.structure.identifierSwedish University of Agricultural Sciences = Sveriges lantbruksuniversitet [SLU]
dc.contributor.authorKAHLERT, Maria
hal.structure.identifierBiodiversité, Gènes & Communautés [BioGeCo]
hal.structure.identifierfrom patterns to models in computational biodiversity and biotechnology [PLEIADE]
dc.contributor.authorFRANC, Alain
dc.date.created2016-11-28
dc.description.abstractEnMetabarcoding on amplicons is rapidly expanding as a method to produce molecular based inventories of microbial communities. Here, we work on freshwater diatoms, which are microalgae possibly inventoried both on a morphological and a molecular basis. We have developed an algorithm, in a program called diagno-syst, based a the notion of informative read, which carries out supervised clustering of reads by mapping them exactly one by one on all reads of a well curated and taxonomically annotated reference database. This program has been run on a HPC (and HTC) infrastructure to address computation load. We compare optical and molecular based inventories on 10 samples from Léman lake, and 30 from Swedish rivers. We track all possibilities of mismatches between both approaches, and compare the results with standard pipelines (with heuristics) like Mothur. We find that the comparison with optics is more accurate when using exact calculations, at the price of a heavier computation load. It is crucial when studying the long tail of biodiversity, which may be overestimated by pipelines or algorithms using heuristics instead (more false positive). This work supports the analysis that these methods will benefit from progress in, first, building an agreement between molecular based and morphological based systematics and, second, having as complete as possible publicly available reference databases.
dc.language.isoen
dc.title.endiagno-syst: a tool for accurate inventories in metabarcoding
dc.typeDocument de travail - Pré-publication
dc.subject.halInformatique [cs]/Bio-informatique [q-bio.QM]
dc.identifier.arxiv1611.09410
hal.identifierhal-01426764
hal.version1
hal.origin.linkhttps://hal.archives-ouvertes.fr//hal-01426764v1
bordeaux.COinSctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.au=FRIGERIO,%20Jean-Marc&RIMET,%20Fr%C3%A9d%C3%A9ric&BOUCHEZ,%20Agnes&CHANCEREL,%20Emilie&CHAUMEIL,%20Philippe&rft.genre=preprint


Fichier(s) constituant ce document

FichiersTailleFormatVue

Il n'y a pas de fichiers associés à ce document.

Ce document figure dans la(les) collection(s) suivante(s)

Afficher la notice abrégée