hal.structure.identifier: Parallélisme, Réseaux, Systèmes, Modélisation [PRISM]
dc.contributor.author: JAEGER, Julien
hal.structure.identifier: Efficient runtime systems for parallel architectures [RUNTIME]
hal.structure.identifier: Laboratoire Bordelais de Recherche en Informatique [LaBRI]
dc.contributor.author: BARTHOU, Denis
dc.date.accessioned: 2024-04-15T09:43:47Z
dc.date.available: 2024-04-15T09:43:47Z
dc.date.issued: 2012-12-18
dc.date.conference: 2012-12-18
dc.identifier.uri: https://oskar-bordeaux.fr/handle/20.500.12278/197790
dc.description.abstractEn: Stencil-based computation on structured grids is a kernel at the heart of a large number of scientific applications. The variety of stencil kernels used in practice makes this computation pattern difficult to assemble into a high-performance computing library. With the multiplication of cores on a single chip, meeting architectural alignment requirements has become an even more important key to high performance. In addition to vector accesses, data layout optimization must also consider concurrent parallel accesses. In this paper, we develop a strategy to automatically generate stencil codes for multicore vector architectures, searching for the best possible data layout to address architectural alignment problems. We introduce a new method for aligning multidimensional data structures, called multipadding, that can be adapted to the specificities of multicore and GPU architectures. We present multiple methods with different levels of complexity. We show on different stencil patterns that codes generated with multipadding perform better than existing optimizations.
dc.description.sponsorship: Vers le Petaflop pour LQCD - ANR-08-COSI-0010
dc.language.iso: en
dc.source.title: IEEE Proceedings of High Performance Computing conference
dc.title.en: Automatic efficient data layout for multithreaded stencil codes on CPUs and GPUs
dc.type: Conference paper
dc.subject.hal: Computer Science [cs]/Distributed, Parallel, and Cluster Computing [cs.DC]
bordeaux.page: 1-10
bordeaux.hal.laboratories: Laboratoire Bordelais de Recherche en Informatique (LaBRI) - UMR 5800*
bordeaux.institution: Université de Bordeaux
bordeaux.institution: Bordeaux INP
bordeaux.institution: CNRS
bordeaux.conference.title: High Performance Computing conference
bordeaux.country: IN
bordeaux.title.proceeding: IEEE Proceedings of High Performance Computing conference
bordeaux.peerReviewed: yes
hal.identifier: hal-00793201
hal.version: 1
hal.invited: no
hal.proceedings: yes
hal.popular: no
hal.audience: International
hal.origin.link: https://hal.archives-ouvertes.fr//hal-00793201v1
bordeaux.COinS: ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.btitle=IEEE%20Proceedings%20of%20High%20Performance%20Computing%20conference&rft.date=2012-12-18&rft.spage=1-10&rft.epage=1-10&rft.au=JAEGER,%20Julien&BARTHOU,%20Denis&rft.genre=unknown


Files in this document

There are no files associated with this document.
