Taming the complexity of 'n-ary' relations in comparative genomics
SHERMAN, David James
Models and Algorithms for the Genome [ MAGNOME]
Modèles et algorithmes pour la Bioinformatique et la Visualisation d'informations; [MABIOVIS]
Models and Algorithms for the Genome [ MAGNOME]
Modèles et algorithmes pour la Bioinformatique et la Visualisation d'informations; [MABIOVIS]
SHERMAN, David James
Models and Algorithms for the Genome [ MAGNOME]
Modèles et algorithmes pour la Bioinformatique et la Visualisation d'informations; [MABIOVIS]
< Leer menos
Models and Algorithms for the Genome [ MAGNOME]
Modèles et algorithmes pour la Bioinformatique et la Visualisation d'informations; [MABIOVIS]
Idioma
en
Communication dans un congrès
Este ítem está publicado en
9th International Conference on Genome Biology and Bioinformatics, 2013-11-07, Atlanta, Georgia. 2013-11
Resumen en inglés
Microbial genomes used in biotechnology applications are now routinely sequenced in groups rather than individually, in order to more unambiguously identify specific variations that are linked to phenotype. These 'paraphyletic' ...Leer más >
Microbial genomes used in biotechnology applications are now routinely sequenced in groups rather than individually, in order to more unambiguously identify specific variations that are linked to phenotype. These 'paraphyletic' sequencing strategies certainly result in growing volumes of sequence data, but these in turn are dominated by the n-ary relations between genomes obtained from systematic comparison, classification, and network inference. In the worse case, relations can grow geometrically while the genomes grow arithmetically. Comparative genomics is increasingly becoming a question of taming the complexity of these n-ary relations, and requires rethinking analyses in terms of new distributed computing paradigms. We will discuss a number of examples of large-scale comparative genomics in biotechnologically interesting hemiascomycete yeasts, and see how the MapReduce and NoSQL paradigms can be used to rethink representation, querying, and analysis of large groups of closely related genomes. I will further illustrate how reference-agnostic systematic comparisons can produce comprehensive views of the genomes as a group, and can drive comparative inference of metabolic models.< Leer menos
Palabras clave en italiano
comparative genomics
data-mining
NoSQL
Map-reduce
Orígen
Importado de HalCentros de investigación