Automatic Mapping of Stream Programs on Multicore Architectures
hal.structure.identifier | Laboratoire d'Intégration des Systèmes et des Technologies [LIST (CEA)] | |
dc.contributor.author | DE OLIVEIRA CASTRO, Pablo | |
hal.structure.identifier | Laboratoire d'Intégration des Systèmes et des Technologies [LIST (CEA)] | |
dc.contributor.author | LOUISE, Stéphane | |
hal.structure.identifier | Laboratoire Bordelais de Recherche en Informatique [LaBRI] | |
hal.structure.identifier | Efficient runtime systems for parallel architectures [RUNTIME] | |
dc.contributor.author | BARTHOU, Denis | |
dc.date.accessioned | 2024-04-15T09:48:01Z | |
dc.date.available | 2024-04-15T09:48:01Z | |
dc.date.conference | 2010-07-07 | |
dc.identifier.uri | https://oskar-bordeaux.fr/handle/20.500.12278/198143 | |
dc.description.abstractEn | Stream languages explicitly describe fork-join and pipeline parallelism, o ering a powerful programming model for general multi- core systems. This parallelism description can be exploited on hybrid architectures, eg. composed of Graphics Processing Units (GPUs) and general purpose multicore processors. In this paper, we present a novel approach to optimize stream programs for hybrid architectures composed of GPU and multicore CPUs. The ap- proach focuses on memory and communication performance bottlenecks for this kind of architecture. The initial task graph of the stream program is rst transformed so as to reduce fork-join synchronization costs. The transformation is obtained through the application of a sequence of some optimizing elementary stream restructurations enabling communication e cient mappings. Then tasks are scheduled in a software pipeline and coarsened with a coarsening level adapted to their placement (CPU of GPU). Our experiments show the importance of both the synchroniza- tion cost reduction and of the coarsening step on performance, adapting the grain of parallelism to the CPUs and to the GPU. | |
dc.language.iso | en | |
dc.title.en | Automatic Mapping of Stream Programs on Multicore Architectures | |
dc.type | Communication dans un congrès | |
dc.subject.hal | Informatique [cs]/Calcul parallèle, distribué et partagé [cs.DC] | |
bordeaux.hal.laboratories | Laboratoire Bordelais de Recherche en Informatique (LaBRI) - UMR 5800 | * |
bordeaux.institution | Université de Bordeaux | |
bordeaux.institution | Bordeaux INP | |
bordeaux.institution | CNRS | |
bordeaux.conference.title | International Workshop on Compilers for Parallel Computers | |
bordeaux.country | AT | |
bordeaux.conference.city | Vienna | |
bordeaux.peerReviewed | oui | |
hal.identifier | hal-00551680 | |
hal.version | 1 | |
hal.invited | non | |
hal.proceedings | non | |
hal.conference.end | 2010-07-09 | |
hal.popular | non | |
hal.audience | Non spécifiée | |
hal.origin.link | https://hal.archives-ouvertes.fr//hal-00551680v1 | |
bordeaux.COinS | ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.au=DE%20OLIVEIRA%20CASTRO,%20Pablo&LOUISE,%20St%C3%A9phane&BARTHOU,%20Denis&rft.genre=unknown |
Fichier(s) constituant ce document
Fichiers | Taille | Format | Vue |
---|---|---|---|
Il n'y a pas de fichiers associés à ce document. |