THIBAULT, Samuel; BROQUEDIS, François; GOGLIN, Brice; NAMYST, Raymond; WACRENIER, Pierre-André

doi:10.1007/978-3-540-69303-1_19

hal.structure.identifier	Efficient runtime systems for parallel architectures [RUNTIME]
hal.structure.identifier	Laboratoire Bordelais de Recherche en Informatique [LaBRI]
dc.contributor.author	THIBAULT, Samuel
hal.structure.identifier	Efficient runtime systems for parallel architectures [RUNTIME]
hal.structure.identifier	Laboratoire Bordelais de Recherche en Informatique [LaBRI]
dc.contributor.author	BROQUEDIS, François
hal.structure.identifier	Efficient runtime systems for parallel architectures [RUNTIME]
hal.structure.identifier	Laboratoire Bordelais de Recherche en Informatique [LaBRI]
dc.contributor.author	GOGLIN, Brice
hal.structure.identifier	Efficient runtime systems for parallel architectures [RUNTIME]
hal.structure.identifier	Laboratoire Bordelais de Recherche en Informatique [LaBRI]
dc.contributor.author	NAMYST, Raymond
hal.structure.identifier	Efficient runtime systems for parallel architectures [RUNTIME]
hal.structure.identifier	Laboratoire Bordelais de Recherche en Informatique [LaBRI]
dc.contributor.author	WACRENIER, Pierre-André
dc.date.accessioned	2024-04-15T09:56:40Z
dc.date.available	2024-04-15T09:56:40Z
dc.date.issued	2007
dc.date.conference	2007-06-04
dc.identifier.uri	https://oskar-bordeaux.fr/handle/20.500.12278/198855
dc.description.abstractEn	Exploiting the full computational power of always deeper hierarchical multiprocessor machines requires a very careful distribution of threads and data among the underlying non-uniform architecture. The emergence of multi-core chips and NUMA machines makes it important to minimize the number of remote memory accesses, to favor cache affinities, and to guarantee fast completion of synchronization steps. By using the BubbleSched platform as a threading backend for the GOMP OpenMP compiler, we are able to easily transpose affinities of thread teams into scheduling hints using abstractions called bubbles. We then propose a scheduling strategy suited to nested OpenMP parallelism. The resulting preliminary performance evaluations show an important improvement of the speedup on a typical NAS OpenMP benchmark application.
dc.language.iso	en
dc.subject.en	SMP
dc.subject.en	OpenMP
dc.subject.en	Nested Parallelism
dc.subject.en	Hierarchical Thread Scheduling
dc.subject.en	Bubbles
dc.subject.en	Multi-Core
dc.subject.en	NUMA
dc.subject.en	SMP.
dc.title.en	An Efficient OpenMP Runtime System for Hierarchical Architectures
dc.type	Communication dans un congrès
dc.identifier.doi	10.1007/978-3-540-69303-1_19
dc.subject.hal	Informatique [cs]/Architectures Matérielles [cs.AR]
dc.subject.hal	Informatique [cs]/Calcul parallèle, distribué et partagé [cs.DC]
dc.identifier.arxiv	0706.2073
bordeaux.page	148--159
bordeaux.hal.laboratories	Laboratoire Bordelais de Recherche en Informatique (LaBRI) - UMR 5800	*
bordeaux.institution	Université de Bordeaux
bordeaux.institution	Bordeaux INP
bordeaux.institution	CNRS
bordeaux.conference.title	International Workshop on OpenMP (IWOMP)
bordeaux.country	CN
bordeaux.conference.city	Beijing
bordeaux.peerReviewed	oui
hal.identifier	inria-00154502
hal.version	1
hal.invited	non
hal.proceedings	oui
hal.popular	non
hal.audience	Internationale
hal.origin.link	https://hal.archives-ouvertes.fr//inria-00154502v1
bordeaux.COinS	ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.date=2007&rft.spage=148--159&rft.epage=148--159&rft.au=THIBAULT,%20Samuel&BROQUEDIS,%20Fran%C3%A7ois&GOGLIN,%20Brice&NAMYST,%20Raymond&WACRENIER,%20Pierre-Andr%C3%A9&rft.genre=unknown

Archivos en el ítem

Archivos	Tamaño	Formato	Ver
No hay archivos asociados a este ítem.

Este ítem aparece en la(s) siguiente(s) colección(ones)

Laboratoire Bordelais de Recherche en Informatique (LaBRI) - UMR 5800

Mostrar el registro sencillo del ítem

An Efficient OpenMP Runtime System for Hierarchical Architectures

Archivos en el ítem

Este ítem aparece en la(s) siguiente(s) colección(ones)