ForestGOMP: an efficient OpenMP environment for NUMA architectures
hal.structure.identifier | Laboratoire Bordelais de Recherche en Informatique [LaBRI] | |
hal.structure.identifier | Efficient runtime systems for parallel architectures [RUNTIME] | |
dc.contributor.author | BROQUEDIS, François | |
hal.structure.identifier | Laboratoire Bordelais de Recherche en Informatique [LaBRI] | |
hal.structure.identifier | Efficient runtime systems for parallel architectures [RUNTIME] | |
dc.contributor.author | FURMENTO, Nathalie | |
hal.structure.identifier | Laboratoire Bordelais de Recherche en Informatique [LaBRI] | |
hal.structure.identifier | Efficient runtime systems for parallel architectures [RUNTIME] | |
dc.contributor.author | GOGLIN, Brice | |
hal.structure.identifier | Laboratoire Bordelais de Recherche en Informatique [LaBRI] | |
hal.structure.identifier | Efficient runtime systems for parallel architectures [RUNTIME] | |
dc.contributor.author | WACRENIER, Pierre-André | |
hal.structure.identifier | Laboratoire Bordelais de Recherche en Informatique [LaBRI] | |
hal.structure.identifier | Efficient runtime systems for parallel architectures [RUNTIME] | |
dc.contributor.author | NAMYST, Raymond | |
dc.date.accessioned | 2024-04-15T09:49:21Z | |
dc.date.available | 2024-04-15T09:49:21Z | |
dc.date.issued | 2010 | |
dc.identifier.issn | 0885-7458 | |
dc.identifier.uri | https://oskar-bordeaux.fr/handle/20.500.12278/198248 | |
dc.description.abstractEn | Exploiting the full computational power of current hierarchical multiprocessor machines requires a very careful distribution of threads and data among the underlying non-uniform architecture so as to avoid remote memory access penalties. Directive-based programming languages such as OpenMP can greatly help to perform such a distribution by providing programmers with an easy way to structure the parallelism of their application and to transmit this information to the runtime system. Our runtime, which is based on a multi-level thread scheduler combined with a NUMA-aware memory manager, converts this information into Scheduling Hints related to thread-memory affinity issues. These hints enable dynamic load distribution guided by application structure and hardware topology, thus helping to achieve performance portability. Several experiments show that mixed solutions (migrating both threads and data) outperform work-stealing-based balancing strategies and Next-Touch-based data distribution policies. These techniques provide insights into additional optimizations. | |
dc.language.iso | en | |
dc.publisher | Springer Verlag | |
dc.title.en | ForestGOMP: an efficient OpenMP environment for NUMA architectures | |
dc.type | Journal article | |
dc.identifier.doi | 10.1007/s10766-010-0136-3 | |
dc.subject.hal | Computer Science [cs]/Operating Systems [cs.OS] | |
bordeaux.journal | International Journal of Parallel Programming | |
bordeaux.hal.laboratories | Laboratoire Bordelais de Recherche en Informatique (LaBRI) - UMR 5800 | * |
bordeaux.institution | Université de Bordeaux | |
bordeaux.institution | Bordeaux INP | |
bordeaux.institution | CNRS | |
bordeaux.peerReviewed | yes | |
hal.identifier | inria-00496295 | |
hal.version | 1 | |
hal.popular | no | |
hal.audience | International | |
hal.origin.link | https://hal.archives-ouvertes.fr//inria-00496295v1 | |