BROQUEDIS, François; FURMENTO, Nathalie; GOGLIN, Brice; NAMYST, Raymond; WACRENIER, Pierre-André

doi:10.1007/978-3-642-02303-3_7

BROQUEDIS, François
Efficient runtime systems for parallel architectures [RUNTIME]
Laboratoire Bordelais de Recherche en Informatique [LaBRI]

FURMENTO, Nathalie
Efficient runtime systems for parallel architectures [RUNTIME]
Laboratoire Bordelais de Recherche en Informatique [LaBRI]

GOGLIN, Brice
Efficient runtime systems for parallel architectures [RUNTIME]
Laboratoire Bordelais de Recherche en Informatique [LaBRI]

Conference paper

Published in

International Workshop on OpenMP (IWOMP), 2009-06-03, Dresden. 2009

Abstract

Exploiting the full computational power of current hierarchical multiprocessor machines requires a very careful distribution of threads and data across the underlying non-uniform architecture so as to avoid memory access penalties. Directive-based programming languages such as OpenMP provide programmers with an easy way to structure the parallelism of their application and to transmit this information to the runtime system. Our runtime, which is based on a multi-level thread scheduler combined with a NUMA-aware memory manager, converts this information into "scheduling hints" to solve thread/memory affinity issues. It enables dynamic load distribution guided by application structure and hardware topology, thus helping to achieve performance portability. First experiments show that mixed solutions (migrating both threads and data) outperform next-touch-based data distribution policies and open possibilities for new optimizations.

Dynamic Task and Data Placement over NUMA Architectures: an OpenMP Runtime Perspective
