JAEGER, Julien; BARTHOU, Denis

La plateforme OSKAR Bordeaux évolue pour rejoindre l'archive ouverte HAL. Retrouvez tous vos dépôts sur le nouveau portail HAL UB : https://u-bordeaux.hal.science/. Pour toute aide ou information, contactez-nous info@oskar-bordeaux.fr

Afficher la notice abrégée

hal.structure.identifier	Parallélisme, Réseaux, Systèmes, Modélisation [PRISM]
dc.contributor.author	JAEGER, Julien
hal.structure.identifier	Efficient runtime systems for parallel architectures [RUNTIME]
hal.structure.identifier	Laboratoire Bordelais de Recherche en Informatique [LaBRI]
dc.contributor.author	BARTHOU, Denis
dc.date.accessioned	2024-04-15T09:43:47Z
dc.date.available	2024-04-15T09:43:47Z
dc.date.issued	2012-12-18
dc.date.conference	2012-12-18
dc.identifier.uri	https://oskar-bordeaux.fr/handle/20.500.12278/197790
dc.description.abstractEn	Stencil based computation on structured grids is a kernel at the heart of a large number of scientific applications. The variety of stencil kernels used in practice make this computation pattern difficult to assemble into a high performance computing library. With the multiplication of cores on a single chip, answering architectural alignment requirements became an even more important key to high performance. In addition to vector accesses, data layout optimization must also consider concurrent parallel accesses. In this paper, we develop a strategy to automatically generate stencil codes for multicore vector architectures, searching for the best data layout possible to answer architectural alignment problems. We introduce a new method for aligning multidimensional data structures, called multipadding, that can be adapted to specificities of multicores and GPUs architectures. We present multiple methods with different level of complexity. We show on different stencil patterns that generated codes with multipadding display better performances than existing optimizations.
dc.description.sponsorship	Vers le Petaflop pour LQCD - ANR-08-COSI-0010
dc.language.iso	en
dc.source.title	IEEE Proceedings of High Performance Computing conference
dc.title.en	Automatic efficient data layout for multithreaded stencil codes on CPUs and GPUs
dc.type	Communication dans un congrès
dc.subject.hal	Informatique [cs]/Calcul parallèle, distribué et partagé [cs.DC]
bordeaux.page	1-10
bordeaux.hal.laboratories	Laboratoire Bordelais de Recherche en Informatique (LaBRI) - UMR 5800	*
bordeaux.institution	Université de Bordeaux
bordeaux.institution	Bordeaux INP
bordeaux.institution	CNRS
bordeaux.conference.title	High Performance Computing conference
bordeaux.country	IN
bordeaux.title.proceeding	IEEE Proceedings of High Performance Computing conference
bordeaux.peerReviewed	oui
hal.identifier	hal-00793201
hal.version	1
hal.invited	non
hal.proceedings	oui
hal.popular	non
hal.audience	Internationale
hal.origin.link	https://hal.archives-ouvertes.fr//hal-00793201v1
bordeaux.COinS	ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.btitle=IEEE%20Proceedings%20of%20High%20Performance%20Computing%20conference&rft.date=2012-12-18&rft.spage=1-10&rft.epage=1-10&rft.au=JAEGER,%20Julien&BARTHOU,%20Denis&rft.genre=unknown

Fichier(s) constituant ce document

Fichiers	Taille	Format	Vue
Il n'y a pas de fichiers associés à ce document.

Ce document figure dans la(les) collection(s) suivante(s)

Laboratoire Bordelais de Recherche en Informatique (LaBRI) - UMR 5800

Afficher la notice abrégée

Automatic efficient data layout for multithreaded stencil codes on CPUs and GPUs

Fichier(s) constituant ce document

Ce document figure dans la(les) collection(s) suivante(s)