Does clustering of DNA barcodes agree with botanical classification directly at high taxonomic levels? Trees in French Guiana as a case study
ABOUABDALLAH, Mohamed Anwar
Biodiversité, Gènes & Communautés [BioGeCo]
Pleiade, from patterns to models in computational biodiversity and biotechnology [PLEIADE]
Biodiversité, Gènes & Communautés [BioGeCo]
Pleiade, from patterns to models in computational biodiversity and biotechnology [PLEIADE]
FRANC, Alain
Biodiversité, Gènes & Communautés [BioGeCo]
Pleiade, from patterns to models in computational biodiversity and biotechnology [PLEIADE]
Biodiversité, Gènes & Communautés [BioGeCo]
Pleiade, from patterns to models in computational biodiversity and biotechnology [PLEIADE]
ABOUABDALLAH, Mohamed Anwar
Biodiversité, Gènes & Communautés [BioGeCo]
Pleiade, from patterns to models in computational biodiversity and biotechnology [PLEIADE]
Biodiversité, Gènes & Communautés [BioGeCo]
Pleiade, from patterns to models in computational biodiversity and biotechnology [PLEIADE]
FRANC, Alain
Biodiversité, Gènes & Communautés [BioGeCo]
Pleiade, from patterns to models in computational biodiversity and biotechnology [PLEIADE]
< Leer menos
Biodiversité, Gènes & Communautés [BioGeCo]
Pleiade, from patterns to models in computational biodiversity and biotechnology [PLEIADE]
Idioma
en
Article de revue
Este ítem está publicado en
Molecular Ecology Resources. 2022
Wiley/Blackwell
Resumen en inglés
Characterizing biodiversity is one of the main challenges for the coming decades. Most diversity has not been morphologically described, and barcoding is now complementing morphological-based taxonomy to further develop ...Leer más >
Characterizing biodiversity is one of the main challenges for the coming decades. Most diversity has not been morphologically described, and barcoding is now complementing morphological-based taxonomy to further develop inventories. Both approaches have been cross-validated at the level of species and OTUs. However, many known species are not listed in reference databases. One path to speed up inventories using barcoding is to directly identify individuals at coarser taxonomic levels. We therefore studied in barcoding of plants whether morphological-based and molecular-based approaches are in agreement at genus, family and order levels. We used Agglomerative Hierarchical Clustering (with Ward, Complete and Single Linkage) and Stochastic Block Models (SBM), with two dissimilarity measures (Smith-Waterman scores, kmers). The agreement between morphological-based and molecular-based classifications ranges in most of the cases from good to very good at taxonomic levels above species, even though it decreases when taxonomic levels increase, or when using the tetramer-based distance. Agreement is correlated with the entropy of morphological-based classification and with the ratio of the mean within- and mean between-groups dissimilarities. The Ward method globally leads to the best agreement, whereas Single Linkage can show poor behaviours. SBM provides a useful tool to test whether or not the dissimilarities are structured by the botanical groups. These results suggest that automatic clustering and group identification at taxonomic levels above species are possible in barcoding.< Leer menos
Palabras clave en inglés
barcoding
clustering
French Guianan trees
stochastic block model
taxonomy
Ward method
Proyecto ANR
CEnter of the study of Biodiversity in Amazonia - ANR-10-LABX-0025
Orígen
Importado de HalCentros de investigación