ALUOME, Christelle; AUBERT, Gregoire; ALVES CARVALHO, Susete; LE PASLIER, Marie-Christine; BURSTIN, Judith; BRUNEL, Dominique

doi:10.1186/s13104-016-1903-z


Metadata

ALUOME, Christelle
Biodiversité, Gènes & Communautés [BioGeCo]

AUBERT, Gregoire
Agroécologie [Dijon]

ALVES CARVALHO, Susete
Agroécologie [Dijon]

Journal article

This document was published in

BMC Research Notes. 2016, vol. 9, no. 1, pp. 1-9

BioMed Central

Abstract

Background: The continuing increase in the size and quality of “short read” raw data significantly improves the quality of the assemblies obtained with various bioinformatics tools. However, building a reference genome sequence remains a significant challenge for most plant species because of the large number of repeated sequences, which are problematic for a high-quality whole-genome de novo assembly. Furthermore, most SNP identification approaches in plant genetics and breeding consider only the “Gene-space” regions, which include the promoter, exon and intron sequences. Results: We developed the iPEA protocol to produce a de novo Gene-space assembly by iteratively reconstructing the non-coding sequence flanking the Unigene cDNA sequence through the addition of next-generation DNA-seq data. The approach was developed on the large diploid genome of pea (Pisum sativum L.), which is rich in repetitive sequences. The final Gene-space assembly included 35,400 contigs (97 Mb), covering 88% of the 40,227 contigs (53.1 Mb) of the PsCam_low-copy Unigene set. Its accuracy was validated by the results of the GenoPea 13.2 K SNP Array built from it. Conclusion: The iPEA protocol allows the reconstruction of a Gene-space from RNA-seq and DNA-seq data with limited computing resources.
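The iterative idea in the abstract (start from a cDNA-derived seed, pull in genomic reads that overlap its ends, extend, and repeat) can be illustrated with a toy sketch. This is not the authors' iPEA implementation, which relies on read extraction and assembly steps not detailed in this record. The function names, the exact-overlap matching rule, the `min_overlap` and `max_rounds` parameters, and the example sequence are all assumptions made for illustration; real data would also need reverse-complement handling and tolerance for sequencing errors.

```python
def extend_right(contig, reads, min_overlap):
    """Extend the 3' end of contig with the read that adds the most new bases.

    A read is usable when a prefix of it (at least min_overlap long)
    exactly matches the end of the contig. Hypothetical helper.
    """
    best = contig
    for read in reads:
        # Try the longest overlap first; keep at least one new base in the read.
        longest = min(len(contig), len(read) - 1)
        for k in range(longest, min_overlap - 1, -1):
            if contig.endswith(read[:k]):
                candidate = contig + read[k:]
                if len(candidate) > len(best):
                    best = candidate
                break
    return best


def extend_left(contig, reads, min_overlap):
    """Extend the 5' end by reversing the strings and reusing extend_right."""
    reversed_reads = [r[::-1] for r in reads]
    return extend_right(contig[::-1], reversed_reads, min_overlap)[::-1]


def iterative_extend(seed, reads, min_overlap=5, max_rounds=20):
    """Grow a seed in both directions until no read extends it further."""
    contig = seed
    for _ in range(max_rounds):
        extended = extend_right(contig, reads, min_overlap)
        extended = extend_left(extended, reads, min_overlap)
        if extended == contig:  # converged: nothing new was added this round
            break
        contig = extended
    return contig


if __name__ == "__main__":
    # Made-up 32-base "genomic" sequence with a seed taken from its middle.
    genome = "AACCGGTTACGTAGCTAGGCTTAACGGATCCA"
    seed = genome[12:20]
    # Simulated error-free reads: 10-base windows taken every 2 bases.
    reads = [genome[i:i + 10] for i in range(0, 23, 2)]
    print(iterative_extend(seed, reads) == genome)
```

In this toy case the 8-base seed grows back to the full 32-base sequence within a few rounds, because every 5-base overlap in the made-up sequence is unique. Repeated sequences break that uniqueness, which is consistent with the abstract's point that repeats make whole-genome assembly difficult and that restricting the target to the Gene-space keeps the problem tractable.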

Keywords

gene-space

unigene

next-generation sequencing (NGS)

assembly

iterative process

limited computing resources

De novo construction of a “Gene-space” for diploid plant genome rich in repetitive sequences by an iterative Process of Extraction and Assembly of NGS reads (iPEA protocol) with limited computing resources
