JEANNOT, Emmanuel; MERCIER, Guillaume; TESSIER, François

La plateforme OSKAR Bordeaux évolue pour rejoindre l'archive ouverte HAL. Retrouvez tous vos dépôts sur le nouveau portail HAL UB : https://u-bordeaux.hal.science/. Pour toute aide ou information, contactez-nous info@oskar-bordeaux.fr

Afficher la notice abrégée

hal.structure.identifier	Laboratoire Bordelais de Recherche en Informatique [LaBRI]
hal.structure.identifier	Efficient runtime systems for parallel architectures [RUNTIME]
dc.contributor.author	JEANNOT, Emmanuel
hal.structure.identifier	Laboratoire Bordelais de Recherche en Informatique [LaBRI]
hal.structure.identifier	Efficient runtime systems for parallel architectures [RUNTIME]
dc.contributor.author	MERCIER, Guillaume
hal.structure.identifier	Efficient runtime systems for parallel architectures [RUNTIME]
hal.structure.identifier	Laboratoire Bordelais de Recherche en Informatique [LaBRI]
dc.contributor.author	TESSIER, François
dc.date.accessioned	2024-04-15T09:43:38Z
dc.date.available	2024-04-15T09:43:38Z
dc.date.issued	2013-03-22
dc.identifier.uri	https://oskar-bordeaux.fr/handle/20.500.12278/197783
dc.description.abstract	Les générations actuelles de grappes de noeuds NUMA possèdent des processeurs multicoeurs ou manycore. Le programmation efficace de telles architectures est un véritable défi parce que de nombreux détails matériels doivent être pris en considération, en particulier la hiérarchie mémoire. Afin d'améliorer les performances des applications parallèles, une idée séduisante est de diminuer le coût de leurs communications en faisant correspondre leur schéma de communication à l'architecture matérielle sous-jacente. Dans ce rapport de recherche, nous détaillons l'algorithme et les techniques proposés afin d'obtenir ce résultat : d'abord, nous collectons deux informations-clefs, à savoir, le schéma de communication et les détails matériels de l'architecture-cible. Ensuite, nous calculons une permutation des numéros de rang des processus de l'application. Pour finir, ces nouveaux numéros de rang sont utilisés dans les opérations de communication en vue de diminuer les coûts de communication de l'application.
dc.description.abstractEn	Current generations of NUMA node clusters feature multicore or manycore processors. Programming such architectures efficiently is a challenge because numerous hardware characteristics have to be taken into account, especially the memory hierarchy. One appealing idea to improve the performance of parallel applications is to decrease their communication costs by matching the communication pattern to the underlying hardware architecture. In this report, we detail the algorithm and techniques proposed to achieve such a result: first, we gather both the communication pattern information and the hardware details. Then we compute a relevant reordering of the various process ranks of the application. Finally, those new ranks are used to reduce the communication costs of the application.
dc.language.iso	en
dc.subject.en	Parallel programming
dc.subject.en	High performance computing
dc.subject.en	Multicore processing
dc.title.en	Process Placement in Multicore Clusters: Algorithmic Issues and Practical Techniques
dc.type	Rapport
dc.subject.hal	Informatique [cs]/Calcul parallèle, distribué et partagé [cs.DC]
bordeaux.page	32
bordeaux.hal.laboratories	Laboratoire Bordelais de Recherche en Informatique (LaBRI) - UMR 5800	*
bordeaux.institution	Université de Bordeaux
bordeaux.institution	Bordeaux INP
bordeaux.institution	CNRS
bordeaux.type.institution	INRIA
bordeaux.type.report	rr
hal.identifier	hal-00803548
hal.version	1
hal.audience	Non spécifiée
hal.origin.link	https://hal.archives-ouvertes.fr//hal-00803548v1
bordeaux.COinS	ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.date=2013-03-22&rft.spage=32&rft.epage=32&rft.au=JEANNOT,%20Emmanuel&MERCIER,%20Guillaume&TESSIER,%20Fran%C3%A7ois&rft.genre=unknown

Fichier(s) constituant ce document

Fichiers	Taille	Format	Vue
Il n'y a pas de fichiers associés à ce document.

Ce document figure dans la(les) collection(s) suivante(s)

Laboratoire Bordelais de Recherche en Informatique (LaBRI) - UMR 5800

Afficher la notice abrégée

Process Placement in Multicore Clusters: Algorithmic Issues and Practical Techniques

Fichier(s) constituant ce document

Ce document figure dans la(les) collection(s) suivante(s)