Show simple item record

dc.rights.licenseopenen_US
hal.structure.identifierBordeaux population health [BPH]
dc.contributor.authorAYLLÓN-BENÍTEZ, Aarón
hal.structure.identifierBordeaux population health [BPH]
dc.contributor.authorMOUGIN, Fleur
IDREF: 116242337
dc.contributor.authorALLALI, J.
hal.structure.identifierStatistics In System biology and Translational Medicine [SISTM]
hal.structure.identifierBordeaux population health [BPH]
dc.contributor.authorTHIEBAUT, Rodolphe
dc.contributor.authorTHEBAULT, P.
dc.date.accessioned2020-10-19T07:28:17Z
dc.date.available2020-10-19T07:28:17Z
dc.date.issued2018-11-27
dc.identifier.issn1932-6203 (Electronic) 1932-6203 (Linking)en_US
dc.identifier.urihttps://oskar-bordeaux.fr/handle/20.500.12278/11392
dc.description.abstractEnMOTIVATION: The recent revolution in new sequencing technologies, as a part of the continuous process of adopting new innovative protocols has strongly impacted the interpretation of relations between phenotype and genotype. Thus, understanding the resulting gene sets has become a bottleneck that needs to be addressed. Automatic methods have been proposed to facilitate the interpretation of gene sets. While statistical functional enrichment analyses are currently well known, they tend to focus on well-known genes and to ignore new information from less-studied genes. To address such issues, applying semantic similarity measures is logical if the knowledge source used to annotate the gene sets is hierarchically structured. In this work, we propose a new method for analyzing the impact of different semantic similarity measures on gene set annotations. RESULTS: We evaluated the impact of each measure by taking into consideration the two following features that correspond to relevant criteria for a "good" synthetic gene set annotation: (i) the number of annotation terms has to be drastically reduced and the representative terms must be retained while annotating the gene set, and (ii) the number of genes described by the selected terms should be as large as possible. Thus, we analyzed nine semantic similarity measures to identify the best possible compromise between both features while maintaining a sufficient level of details. Using Gene Ontology to annotate the gene sets, we obtained better results with node-based measures that use the terms' characteristics than with measures based on edges that link the terms. The annotation of the gene sets achieved with the node-based measures did not exhibit major differences regardless of the characteristics of terms used.
dc.language.isoENen_US
dc.rightsAttribution 3.0 United States*
dc.rights.urihttp://creativecommons.org/licenses/by/3.0/us/*
dc.subject.enSISTM
dc.subject.enERIAS
dc.title.enA new method for evaluating the impacts of semantic similarity measures on the annotation of gene sets
dc.title.alternativePLoS Oneen_US
dc.typeArticle de revueen_US
dc.identifier.doi10.1371/journal.pone.0208037en_US
dc.subject.halSciences du Vivant [q-bio]/Santé publique et épidémiologieen_US
dc.identifier.pubmed30481204en_US
bordeaux.journalPLoS ONEen_US
bordeaux.pagee0208037en_US
bordeaux.volume13en_US
bordeaux.hal.laboratoriesBordeaux Population Health Research Center (BPH) - U1219en_US
bordeaux.issue11en_US
bordeaux.institutionUniversité de Bordeauxen_US
bordeaux.teamSISTM_BPH
bordeaux.peerReviewedouien_US
bordeaux.inpressnonen_US
hal.identifierhal-01938961
hal.version1
hal.date.transferred2021-03-06T02:39:17Z
hal.exporttrue
bordeaux.COinSctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.jtitle=PLoS%20ONE&rft.date=2018-11-27&rft.volume=13&rft.issue=11&rft.spage=e0208037&rft.epage=e0208037&rft.eissn=1932-6203%20(Electronic)%201932-6203%20(Linking)&rft.issn=1932-6203%20(Electronic)%201932-6203%20(Linking)&rft.au=AYLLO%CC%81N-BENI%CC%81TEZ,%20Aaro%CC%81n&MOUGIN,%20Fleur&ALLALI,%20J.&THIEBAUT,%20Rodolphe&THEBAULT,%20P.&rft.genre=article


Files in this item

Thumbnail
Thumbnail

This item appears in the following Collection(s)

Show simple item record