Mapler: Assessing assembly quality in taxonomically-rich metagenomes sequenced with HiFi reads
FRIOUX, Clémence
Pleiade, from patterns to models in computational biodiversity and biotechnology [PLEIADE]
See more >
Pleiade, from patterns to models in computational biodiversity and biotechnology [PLEIADE]
FRIOUX, Clémence
Pleiade, from patterns to models in computational biodiversity and biotechnology [PLEIADE]
Pleiade, from patterns to models in computational biodiversity and biotechnology [PLEIADE]
VICEDOMINI, Riccardo
Scalable, Optimized and Parallel Algorithms for Genomics [GenScale]
Institut des sciences informatiques et de leurs interactions - CNRS Sciences informatiques [INS2I-CNRS]
< Reduce
Scalable, Optimized and Parallel Algorithms for Genomics [GenScale]
Institut des sciences informatiques et de leurs interactions - CNRS Sciences informatiques [INS2I-CNRS]
Language
en
Autre communication scientifique (congrès sans actes - poster - séminaire...)
This item was published in
2025 - Journées scientifiques Agroécologie et Numérique, 2025-01-28, Dijon. 2025p. 1-1
English Abstract
Evaluating the quality of metagenome assemblies can be a challenging task, especially when no reference genome is available and when comparing samples at various taxonomic complexity and sequencing depth. A high quality ...Read more >
Evaluating the quality of metagenome assemblies can be a challenging task, especially when no reference genome is available and when comparing samples at various taxonomic complexity and sequencing depth. A high quality assembly is expected not only to produce high quality bins, but also to be representative of most of the read sequences, especially in complex samples where algorithms struggle reconstructing low-abundance genomes. Recent studies showed a great improvement in number and quality of bins obtained with highly accurate PacBio HiFi long reads. It remains however to be assessed how much of the sample these bins represent, especially in highly complex environmental samples. There is therefore a need to use and compare other evaluation methods. We designed and implemented Mapler, a metagenomic assembly and evaluation pipeline with a primary focus on evaluating the quality of HiFi-based metagenome assemblies. It incorporates state-of-the-art tools for assembly, binning, and assembly evaluation. In addition to classifying assembly bins in classical quality categories according to their marker gene content and taxonomic assignment, Mapler analyzes the alignment of reads on contigs. To do so, it calculates the ratio of mapped reads and bases, and separately analyzes mapped and unmapped reads via their k-mer frequency, read quality, and taxonomic assignment.Read less <
English Keywords
Metagenomics
MAGs
Complex ecosystems
Metagenome assembly
ANR Project
Computationel models of crop plant microbial biodiversity - ANR-22-PEAE-0011
Origin
Hal imported