Semi-artificial datasets as a resource for validation of bioinformatics pipelines for plant virus detection
hal.structure.identifier | Gembloux Agro-Bio Tech [Faculté universitaire des sciences agronomiques de Gembloux] [[FUSAGx]] | |
dc.contributor.author | TAMISIER, Lucie | |
dc.contributor.author | HAEGEMAN, Annelies | |
dc.contributor.author | FOUCART, Yoika | |
dc.contributor.author | FOUILLIEN, Nicolas | |
dc.contributor.author | AL RWAHNIH, Maher | |
dc.contributor.author | BUZKAN, Nihal | |
hal.structure.identifier | Biologie du fruit et pathologie [BFP] | |
dc.contributor.author | CANDRESSE, Thierry | |
dc.contributor.author | CHIUMENTI, Michela | |
dc.contributor.author | DE JONGHE, Kris | |
dc.contributor.author | LEFEBVRE, Marie | |
dc.contributor.author | MARGARIA, Paolo | |
dc.contributor.author | REYNARD, Jean Sébastien | |
dc.contributor.author | STEVENS, Kristian | |
dc.contributor.author | KUTNJAK, Denis | |
dc.contributor.author | MASSART, Sébastien | |
dc.date.accessioned | 2025-01-08T03:06:20Z | |
dc.date.available | 2025-01-08T03:06:20Z | |
dc.date.issued | 2021 | |
dc.identifier.issn | 2804-3871 | |
dc.identifier.uri | https://oskar-bordeaux.fr/handle/20.500.12278/204175 | |
dc.description.abstractEn | The widespread use of High-Throughput Sequencing (HTS) for detection of plant viruses and sequencing of plant virus genomes has led to the generation of large amounts of data and of bioinformatics challenges to process them. Many bioinformatics pipelines for virus detection are available, making the choice of a suitable one difficult. A robust benchmarking is needed for the unbiased comparison of the pipelines, but there is currently a lack of reference datasets that could be used for this purpose. We present 7 semi-artificial datasets composed of real RNA-seq datasets from virus-infected plants spiked with artificial virus reads. Each dataset addresses challenges that could prevent virus detection. We also present 3 real datasets showing a challenging virus composition as well as 8 completely artificial datasets to test haplotype reconstruction software. With these datasets that address several diagnostic challenges, we hope to encourage virologists, diagnosticians and bioinformaticians to evaluate and benchmark their pipeline(s). | |
dc.language.iso | en | |
dc.publisher | Peer Community In | |
dc.subject.en | HTS | |
dc.subject.en | Bioinformatics | |
dc.subject.en | Plant virus detection | |
dc.subject.en | Benchmark | |
dc.title.en | Semi-artificial datasets as a resource for validation of bioinformatics pipelines for plant virus detection | |
dc.type | Article de revue | |
dc.identifier.doi | 10.24072/pcjournal.62 | |
dc.subject.hal | Sciences du Vivant [q-bio] | |
bordeaux.journal | Peer Community Journal | |
bordeaux.page | e53 | |
bordeaux.volume | 1 | |
bordeaux.hal.laboratories | Biologie du Fruit & Pathologie (BFP) - UMR 1332 | * |
bordeaux.institution | Université de Bordeaux | |
bordeaux.institution | INRAE | |
bordeaux.peerReviewed | oui | |
hal.identifier | hal-04107125 | |
hal.version | 1 | |
hal.popular | non | |
hal.audience | Internationale | |
hal.origin.link | https://hal.archives-ouvertes.fr//hal-04107125v1 | |
bordeaux.COinS | ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.jtitle=Peer%20Community%20Journal&rft.date=2021&rft.volume=1&rft.spage=e53&rft.epage=e53&rft.eissn=2804-3871&rft.issn=2804-3871&rft.au=TAMISIER,%20Lucie&HAEGEMAN,%20Annelies&FOUCART,%20Yoika&FOUILLIEN,%20Nicolas&AL%20RWAHNIH,%20Maher&rft.genre=article |
Fichier(s) constituant ce document
Fichiers | Taille | Format | Vue |
---|---|---|---|
Il n'y a pas de fichiers associés à ce document. |