Exploiting the Cell/BE architecture with the StarPU unified runtime system
hal.structure.identifier | Laboratoire Bordelais de Recherche en Informatique [LaBRI] | |
hal.structure.identifier | Efficient runtime systems for parallel architectures [RUNTIME] | |
dc.contributor.author | AUGONNET, Cédric | |
hal.structure.identifier | Laboratoire Bordelais de Recherche en Informatique [LaBRI] | |
dc.contributor.author | THIBAULT, Samuel | |
hal.structure.identifier | Laboratoire Bordelais de Recherche en Informatique [LaBRI] | |
hal.structure.identifier | Efficient runtime systems for parallel architectures [RUNTIME] | |
dc.contributor.author | NAMYST, Raymond | |
hal.structure.identifier | Department of Computer Science [Amsterdam] | |
dc.contributor.author | NIJHUIS, Maik | |
dc.contributor.editor | Springer Verlag | |
dc.date.accessioned | 2024-04-15T09:51:44Z | |
dc.date.available | 2024-04-15T09:51:44Z | |
dc.date.issued | 2009 | |
dc.date.conference | 2009-07-20 | |
dc.identifier.uri | https://oskar-bordeaux.fr/handle/20.500.12278/198456 | |
dc.description.abstractEn | Core specialization is currently one of the most promising ways for designing power-efficient multicore chips. However, approaching the theoretical peak performance of such heterogeneous multicore architectures with specialized accelerators, is a complex issue. While substantial effort has been devoted to efficiently offloading parts of the computation, designing an execution model that unifies all computing units is the main challenge. We therefore designed the StarPU runtime system for providing portable support for heterogeneous multicore processors to high performance applications and compiler environments. StarPU provides a high-level, unified execution model which is tightly coupled to an expressive data management library. In addition to our previous results on using multicore processors alongside with graphic processors, we show that StarPU is flexible enough to efficiently exploit the heterogeneous resources in the Cell processor. We present a scalable design supporting multiple different accelerators while minimizing the overhead on the overall system. Using experiments with classical linear algebra algorithms, we show that StarPU improves programmability and provides performance portability. | |
dc.language.iso | en | |
dc.title.en | Exploiting the Cell/BE architecture with the StarPU unified runtime system | |
dc.type | Communication dans un congrès | |
dc.subject.hal | Informatique [cs]/Calcul parallèle, distribué et partagé [cs.DC] | |
bordeaux.hal.laboratories | Laboratoire Bordelais de Recherche en Informatique (LaBRI) - UMR 5800 | * |
bordeaux.institution | Université de Bordeaux | |
bordeaux.institution | Bordeaux INP | |
bordeaux.institution | CNRS | |
bordeaux.conference.title | SAMOS Workshop | |
bordeaux.country | GR | |
bordeaux.conference.city | SAMOS | |
bordeaux.peerReviewed | oui | |
hal.identifier | inria-00378705 | |
hal.version | 1 | |
hal.invited | non | |
hal.proceedings | oui | |
hal.popular | non | |
hal.audience | Internationale | |
hal.origin.link | https://hal.archives-ouvertes.fr//inria-00378705v1 | |
bordeaux.COinS | ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.date=2009&rft.au=AUGONNET,%20C%C3%A9dric&THIBAULT,%20Samuel&NAMYST,%20Raymond&NIJHUIS,%20Maik&rft.genre=unknown |
Fichier(s) constituant ce document
Fichiers | Taille | Format | Vue |
---|---|---|---|
Il n'y a pas de fichiers associés à ce document. |