Mostrar el registro sencillo del ítem

hal.structure.identifierAnalyse, ingénierie et contrôle des micro-organismes [MICROCOSME]
hal.structure.identifierLaboratoire Interdisciplinaire de Physique [Saint Martin d’Hères] [LIPhy]
hal.structure.identifierBIOP - Fluctuations, Régulation et Evolution des Systèmes Vivants [BIOP-LIPhy]
dc.contributor.authorBELCOUR, Arnaud
hal.structure.identifierDynamics, Logics and Inference for biological Systems and Sequences [Dyliss]
dc.contributor.authorHAMON-GIRAUD, Pauline
hal.structure.identifierDynamics, Logics and Inference for biological Systems and Sequences [Dyliss]
dc.contributor.authorMATAIGNE, Alice
hal.structure.identifierDynamics, Logics and Inference for biological Systems and Sequences [Dyliss]
dc.contributor.authorRUIZ, Baptiste
hal.structure.identifierDynamics, Logics and Inference for biological Systems and Sequences [Dyliss]
dc.contributor.authorCUNFF, Yann Le
hal.structure.identifierDynamics, Logics and Inference for biological Systems and Sequences [Dyliss]
dc.contributor.authorGOT, Jeanne
hal.structure.identifierOptimisation des procédés en Agriculture, Agroalimentaire et Environnement [UR OPAALE]
dc.contributor.authorAWHANGBO, Lorraine
hal.structure.identifierOptimisation des procédés en Agriculture, Agroalimentaire et Environnement [UR OPAALE]
dc.contributor.authorLEBRETON, Megane
hal.structure.identifierPleiade, from patterns to models in computational biodiversity and biotechnology [PLEIADE]
dc.contributor.authorFRIOUX, Clémence
hal.structure.identifierLaboratoire de Biologie Intégrative des Modèles Marins [LBI2M]
dc.contributor.authorDITTAMI, Simon
hal.structure.identifierOptimisation des procédés en Agriculture, Agroalimentaire et Environnement [UR OPAALE]
dc.contributor.authorDABERT, P.
hal.structure.identifierDynamics, Logics and Inference for biological Systems and Sequences [Dyliss]
dc.contributor.authorSIEGEL, Anne
hal.structure.identifierDynamics, Logics and Inference for biological Systems and Sequences [Dyliss]
dc.contributor.authorBLANQUART, Samuel
dc.date.created2025
dc.description.abstractEnPurpose: Metabarcoding, and metagenomic sequencing have enabled the characterization of highly diverse environmental communities. The challenge of estimating the metabolic functions carried out by these communities has led to the development of several state-of-the-art methods, most of which are tailored to a specific gene marker. However, the increasing diversity of approaches resulting from advances in sequencing technologies drives the need for methods capable of handling heterogeneous microbial community data. Moreover, predictions often depend on their internal analysis pipelines and are influenced by the underlying databases, which link marker genes to specific functional annotations. This limits users' ability to evaluate the quality of predictions by tracing internal data and processes. Finally, users are constrained by the specific annotations provided by these methods (e.g. EC numbers), limiting their ability to conduct further specialized analyses based on intermediate results.Methods: EsMeCaTa predicts consensus proteomes and their associated functions from taxonomic affiliations. A key feature of EsMeCaTa is its explainability and flexibility. To support the flexible integration of heterogeneous sequencing data, EsMeCaTa utilizes taxonomic affiliations obtained through analyses of diverse sequencing datasets. To provide insight into the knowledge available for each taxonomic affiliation and to interpret the relevance of predicted functions, EsMeCaTa identifies a taxonomic rank within a given affiliation that is sufficiently represented by documented proteomes in the UniProt database. The proteins of the UniProt proteomes are clustered and filtered according to a threshold to create consensus proteomes. These consensus proteomes are automatically annotated with functional information (e.g., EC numbers, GO terms) but they are also designed to be used in further customized annotation workflows. Functional annotations are reported in a functional table, which can be enriched with taxon abundances to generate comprehensive functional profiles.Results: EsMeCaTa predictions have been validated using multiple datasets and compared to a state-of-the-art method. Additionally, it was applied to a novel metabarcoding dataset from a methanogenic reactor, characterizing the microbial community and biogas production across different time points and intake condition. Our results demonstrate the link between biogas production, intake condition and the dynamics of the metabolic functions predicted by EsMeCaTa in the microbial communities.
dc.description.sponsorshipDeciphering plant-microbiome interactions to enhance crop defense to bioagressors - ANR-20-PCPA-0004
dc.description.sponsorshipLes origines microbiennes potentielles des propriétés biostimulantes des extraits d'un holobinte d'algues brunes - ANR-20-CE43-0013
dc.language.isoen
dc.rights.urihttp://creativecommons.org/licenses/by/
dc.title.enEstimating consensus proteomes and metabolic functions from taxonomic affiliations
dc.typeDocument de travail - Pré-publication
dc.identifier.doi10.1101/2022.03.16.484574
dc.subject.halInformatique [cs]/Bio-informatique [q-bio.QM]
hal.identifierhal-03697249
hal.version1
hal.origin.linkhttps://hal.archives-ouvertes.fr//hal-03697249v1
bordeaux.COinSctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.au=BELCOUR,%20Arnaud&HAMON-GIRAUD,%20Pauline&MATAIGNE,%20Alice&RUIZ,%20Baptiste&CUNFF,%20Yann%20Le&rft.genre=preprint


Archivos en el ítem

ArchivosTamañoFormatoVer

No hay archivos asociados a este ítem.

Este ítem aparece en la(s) siguiente(s) colección(ones)

Mostrar el registro sencillo del ítem