An Efficient OpenMP Runtime System for Hierarchical Architectures
hal.structure.identifier | Efficient runtime systems for parallel architectures [RUNTIME] | |
hal.structure.identifier | Laboratoire Bordelais de Recherche en Informatique [LaBRI] | |
dc.contributor.author | THIBAULT, Samuel | |
hal.structure.identifier | Efficient runtime systems for parallel architectures [RUNTIME] | |
hal.structure.identifier | Laboratoire Bordelais de Recherche en Informatique [LaBRI] | |
dc.contributor.author | BROQUEDIS, François | |
hal.structure.identifier | Efficient runtime systems for parallel architectures [RUNTIME] | |
hal.structure.identifier | Laboratoire Bordelais de Recherche en Informatique [LaBRI] | |
dc.contributor.author | GOGLIN, Brice | |
hal.structure.identifier | Efficient runtime systems for parallel architectures [RUNTIME] | |
hal.structure.identifier | Laboratoire Bordelais de Recherche en Informatique [LaBRI] | |
dc.contributor.author | NAMYST, Raymond | |
hal.structure.identifier | Efficient runtime systems for parallel architectures [RUNTIME] | |
hal.structure.identifier | Laboratoire Bordelais de Recherche en Informatique [LaBRI] | |
dc.contributor.author | WACRENIER, Pierre-André | |
dc.date.accessioned | 2024-04-15T09:56:40Z | |
dc.date.available | 2024-04-15T09:56:40Z | |
dc.date.issued | 2007 | |
dc.date.conference | 2007-06-04 | |
dc.identifier.uri | https://oskar-bordeaux.fr/handle/20.500.12278/198855 | |
dc.description.abstractEn | Exploiting the full computational power of always deeper hierarchical multiprocessor machines requires a very careful distribution of threads and data among the underlying non-uniform architecture. The emergence of multi-core chips and NUMA machines makes it important to minimize the number of remote memory accesses, to favor cache affinities, and to guarantee fast completion of synchronization steps. By using the BubbleSched platform as a threading backend for the GOMP OpenMP compiler, we are able to easily transpose affinities of thread teams into scheduling hints using abstractions called bubbles. We then propose a scheduling strategy suited to nested OpenMP parallelism. The resulting preliminary performance evaluations show an important improvement of the speedup on a typical NAS OpenMP benchmark application. | |
dc.language.iso | en | |
dc.subject.en | SMP | |
dc.subject.en | OpenMP | |
dc.subject.en | Nested Parallelism | |
dc.subject.en | Hierarchical Thread Scheduling | |
dc.subject.en | Bubbles | |
dc.subject.en | Multi-Core | |
dc.subject.en | NUMA | |
dc.subject.en | SMP. | |
dc.title.en | An Efficient OpenMP Runtime System for Hierarchical Architectures | |
dc.type | Communication dans un congrès | |
dc.identifier.doi | 10.1007/978-3-540-69303-1_19 | |
dc.subject.hal | Informatique [cs]/Architectures Matérielles [cs.AR] | |
dc.subject.hal | Informatique [cs]/Calcul parallèle, distribué et partagé [cs.DC] | |
dc.identifier.arxiv | 0706.2073 | |
bordeaux.page | 148--159 | |
bordeaux.hal.laboratories | Laboratoire Bordelais de Recherche en Informatique (LaBRI) - UMR 5800 | * |
bordeaux.institution | Université de Bordeaux | |
bordeaux.institution | Bordeaux INP | |
bordeaux.institution | CNRS | |
bordeaux.conference.title | International Workshop on OpenMP (IWOMP) | |
bordeaux.country | CN | |
bordeaux.conference.city | Beijing | |
bordeaux.peerReviewed | oui | |
hal.identifier | inria-00154502 | |
hal.version | 1 | |
hal.invited | non | |
hal.proceedings | oui | |
hal.popular | non | |
hal.audience | Internationale | |
hal.origin.link | https://hal.archives-ouvertes.fr//inria-00154502v1 | |
bordeaux.COinS | ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.date=2007&rft.spage=148--159&rft.epage=148--159&rft.au=THIBAULT,%20Samuel&BROQUEDIS,%20Fran%C3%A7ois&GOGLIN,%20Brice&NAMYST,%20Raymond&WACRENIER,%20Pierre-Andr%C3%A9&rft.genre=unknown |
Fichier(s) constituant ce document
Fichiers | Taille | Format | Vue |
---|---|---|---|
Il n'y a pas de fichiers associés à ce document. |