BROQUEDIS, François; FURMENTO, Nathalie; GOGLIN, Brice; WACRENIER, Pierre-André; NAMYST, Raymond

doi:10.1007/s10766-010-0136-3

hal.structure.identifier	Laboratoire Bordelais de Recherche en Informatique [LaBRI]
hal.structure.identifier	Efficient runtime systems for parallel architectures [RUNTIME]
dc.contributor.author	BROQUEDIS, François
hal.structure.identifier	Laboratoire Bordelais de Recherche en Informatique [LaBRI]
hal.structure.identifier	Efficient runtime systems for parallel architectures [RUNTIME]
dc.contributor.author	FURMENTO, Nathalie
hal.structure.identifier	Laboratoire Bordelais de Recherche en Informatique [LaBRI]
hal.structure.identifier	Efficient runtime systems for parallel architectures [RUNTIME]
dc.contributor.author	GOGLIN, Brice
hal.structure.identifier	Laboratoire Bordelais de Recherche en Informatique [LaBRI]
hal.structure.identifier	Efficient runtime systems for parallel architectures [RUNTIME]
dc.contributor.author	WACRENIER, Pierre-André
hal.structure.identifier	Laboratoire Bordelais de Recherche en Informatique [LaBRI]
hal.structure.identifier	Efficient runtime systems for parallel architectures [RUNTIME]
dc.contributor.author	NAMYST, Raymond
dc.date.accessioned	2024-04-15T09:49:21Z
dc.date.available	2024-04-15T09:49:21Z
dc.date.issued	2010
dc.identifier.issn	0885-7458
dc.identifier.uri	https://oskar-bordeaux.fr/handle/20.500.12278/198248
dc.description.abstractEn	Exploiting the full computational power of current hierarchical multiprocessor machines requires a very careful distribution of threads and data across the underlying non-uniform architecture so as to avoid remote memory access penalties. Directive-based programming languages such as OpenMP can greatly help to perform such a distribution by providing programmers with an easy way to structure the parallelism of their application and to transmit this information to the runtime system. Our runtime, which is based on a multi-level thread scheduler combined with a NUMA-aware memory manager, converts this information into Scheduling Hints related to thread-memory affinity issues. These hints enable dynamic load distribution guided by application structure and hardware topology, thus helping to achieve performance portability. Several experiments show that mixed solutions (migrating both threads and data) outperform work-stealing based balancing strategies and Next-Touch-based data distribution policies. These techniques provide insights about additional optimizations.
dc.language.iso	en
dc.publisher	Springer Verlag
dc.title.en	ForestGOMP: an efficient OpenMP environment for NUMA architectures
dc.type	Journal article
dc.identifier.doi	10.1007/s10766-010-0136-3
dc.subject.hal	Computer Science [cs]/Operating Systems [cs.OS]
bordeaux.journal	International Journal of Parallel Programming
bordeaux.hal.laboratories	Laboratoire Bordelais de Recherche en Informatique (LaBRI) - UMR 5800	*
bordeaux.institution	Université de Bordeaux
bordeaux.institution	Bordeaux INP
bordeaux.institution	CNRS
bordeaux.peerReviewed	yes
hal.identifier	inria-00496295
hal.version	1
hal.popular	no
hal.audience	International
hal.origin.link	https://hal.archives-ouvertes.fr//inria-00496295v1
bordeaux.COinS	ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.jtitle=International%20Journal%20of%20Parallel%20Programming&rft.date=2010&rft.eissn=0885-7458&rft.issn=0885-7458&rft.au=BROQUEDIS,%20Fran%C3%A7ois&FURMENTO,%20Nathalie&GOGLIN,%20Brice&WACRENIER,%20Pierre-Andr%C3%A9&NAMYST,%20Raymond&rft.genre=article

This item appears in the following Collection(s)

Laboratoire Bordelais de Recherche en Informatique (LaBRI) - UMR 5800