Mostrar el registro sencillo del ítem
A novel substitution matrix fitted to the compositional bias in Mollicutes improves the prediction of homologous relationships
hal.structure.identifier | Centre de Bioinformatique de Bordeaux [CBIB] | |
hal.structure.identifier | Biological systems and models, bioinformatics and sequences [SYMBIOSE] | |
dc.contributor.author | LEMAITRE, Claire | |
hal.structure.identifier | Centre de Bioinformatique de Bordeaux [CBIB] | |
dc.contributor.author | BARRÉ, Aurélien | |
hal.structure.identifier | Interactions hôtes-agents pathogènes [Toulouse] [IHAP] | |
hal.structure.identifier | Institut National de la Recherche Agronomique [INRA] | |
dc.contributor.author | CITTI, Christine | |
hal.structure.identifier | Laboratoire de Lyon [ANSES] | |
dc.contributor.author | TARDY, Florence | |
hal.structure.identifier | Contrôle des maladies animales exotiques et émergentes [UMR CMAEE] | |
dc.contributor.author | THIAUCOURT, François | |
hal.structure.identifier | Biologie du fruit et pathologie [BFP] | |
dc.contributor.author | SIRAND-PUGNET, Pascal | |
hal.structure.identifier | Centre de Bioinformatique de Bordeaux [CBIB] | |
hal.structure.identifier | Laboratoire Bordelais de Recherche en Informatique [LaBRI] | |
dc.contributor.author | THÉBAULT, Patricia | |
dc.date.issued | 2011 | |
dc.identifier.issn | 1471-2105 | |
dc.description.abstractEn | Substitution matrices are key parameters for the alignment of two protein sequences, and consequently for most comparative genomics studies. The composition of biological sequences can vary importantly between species and groups of species, and classical matrices such as those in the BLOSUM series fail to accurately estimate alignment scores and statistical significance with sequences sharing marked compositional biases. We present a general and simple methodology to build matrices that are especially fitted to the compositional bias of proteins. Our approach is inspired from the one used to build the BLOSUM matrices and is based on learning substitution and amino acid frequencies on real sequences with the corresponding compositional bias. We applied it to the large scale comparison of Mollicute AT-rich genomes. The new matrix, MOLLI60, was used to predict pairwise orthology relationships, as well as homolog families among 24 Mollicute genomes. We show that this new matrix enables to better discriminate between true and false orthologs and improves the clustering of homologous proteins, with respect to the use of the classical matrix BLOSUM62. We show in this paper that well-fitted matrices can improve the predictions of orthologous and homologous relationships among proteins with a similar compositional bias. With the ever-increasing number of sequenced genomes, our approach could prove valuable in numerous comparative studies focusing on atypical genomes. | |
dc.description.sponsorship | Etude à grande échelle des génomes des mycoplasmes de ruminants : évolution et adaptation de bactéries minimales à des hôtes complexes - ANR-07-GMGE-0001 | |
dc.language.iso | en | |
dc.publisher | BioMed Central | |
dc.subject | substitution matrix | |
dc.subject | mollicutes | |
dc.subject.en | orthologous predictions | |
dc.subject.en | biochemistry and molecular biology | |
dc.subject.en | biotechnology and applied microbiology | |
dc.subject.en | mathematical and computational biology | |
dc.title.en | A novel substitution matrix fitted to the compositional bias in Mollicutes improves the prediction of homologous relationships | |
dc.type | Article de revue | |
dc.identifier.doi | 10.1186/1471-2105-12-457 | |
dc.subject.hal | Sciences du Vivant [q-bio]/Biochimie, Biologie Moléculaire/Génomique, Transcriptomique et Protéomique [q-bio.GN] | |
dc.subject.hal | Sciences du Vivant [q-bio]/Bio-Informatique, Biologie Systémique [q-bio.QM] | |
dc.subject.hal | Informatique [cs]/Bio-informatique [q-bio.QM] | |
bordeaux.journal | BMC Bioinformatics | |
bordeaux.page | 457 | |
bordeaux.volume | 12 | |
bordeaux.issue | 1 | |
bordeaux.peerReviewed | oui | |
hal.identifier | hal-00784414 | |
hal.version | 1 | |
hal.popular | non | |
hal.audience | Internationale | |
hal.origin.link | https://hal.archives-ouvertes.fr//hal-00784414v1 | |
bordeaux.COinS | ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.jtitle=BMC%20Bioinformatics&rft.date=2011&rft.volume=12&rft.issue=1&rft.spage=457&rft.epage=457&rft.eissn=1471-2105&rft.issn=1471-2105&rft.au=LEMAITRE,%20Claire&BARR%C3%89,%20Aur%C3%A9lien&CITTI,%20Christine&TARDY,%20Florence&THIAUCOURT,%20Fran%C3%A7ois&rft.genre=article |
Archivos en el ítem
Archivos | Tamaño | Formato | Ver |
---|---|---|---|
No hay archivos asociados a este ítem. |