Adaptive Task Size Control on High Level Programming for GPU/CPU Work Sharing
BOKU, Taisuke
Graduate School of Systems and Information Engineering [Tsukuba]
Center for Computational Sciences [Tsukuba] [CCS]
Graduate School of Systems and Information Engineering [Tsukuba]
Center for Computational Sciences [Tsukuba] [CCS]
SATO, Mitsuhisa
Center for Computational Sciences [Tsukuba] [CCS]
Graduate School of Systems and Information Engineering [Tsukuba]
See more >
Center for Computational Sciences [Tsukuba] [CCS]
Graduate School of Systems and Information Engineering [Tsukuba]
BOKU, Taisuke
Graduate School of Systems and Information Engineering [Tsukuba]
Center for Computational Sciences [Tsukuba] [CCS]
Graduate School of Systems and Information Engineering [Tsukuba]
Center for Computational Sciences [Tsukuba] [CCS]
SATO, Mitsuhisa
Center for Computational Sciences [Tsukuba] [CCS]
Graduate School of Systems and Information Engineering [Tsukuba]
Center for Computational Sciences [Tsukuba] [CCS]
Graduate School of Systems and Information Engineering [Tsukuba]
KODAMA, Yuetsu
Graduate School of Systems and Information Engineering [Tsukuba]
Center for Computational Sciences [Tsukuba] [CCS]
Graduate School of Systems and Information Engineering [Tsukuba]
Center for Computational Sciences [Tsukuba] [CCS]
NAMYST, Raymond
Laboratoire Bordelais de Recherche en Informatique [LaBRI]
Efficient runtime systems for parallel architectures [RUNTIME]
Laboratoire Bordelais de Recherche en Informatique [LaBRI]
Efficient runtime systems for parallel architectures [RUNTIME]
THIBAULT, Samuel
Laboratoire Bordelais de Recherche en Informatique [LaBRI]
Efficient runtime systems for parallel architectures [RUNTIME]
Laboratoire Bordelais de Recherche en Informatique [LaBRI]
Efficient runtime systems for parallel architectures [RUNTIME]
AUMAGE, Olivier
Laboratoire Bordelais de Recherche en Informatique [LaBRI]
Efficient runtime systems for parallel architectures [RUNTIME]
< Reduce
Laboratoire Bordelais de Recherche en Informatique [LaBRI]
Efficient runtime systems for parallel architectures [RUNTIME]
Language
en
Communication dans un congrès
This item was published in
The 2013 International Symposium on Advances of Distributed and Parallel Computing (ADPC 2013), 2013-12-18, Vietri sul Mare. 2013-12-19
English Abstract
On the work sharing among GPUs and CPU cores on GPU equipped clusters, it is a critical issue to keep load balance among these heterogeneous computing resources. We have been developing a runtime system for this problem ...Read more >
On the work sharing among GPUs and CPU cores on GPU equipped clusters, it is a critical issue to keep load balance among these heterogeneous computing resources. We have been developing a runtime system for this problem on PGAS language named XcalableMP- dev/StarPU [1]. Through the development, we found the necessity of adaptive load balancing for GPU/CPU work sharing to achieve the best performance for various application codes. In this paper, we enhance our language system XcalableMP-dev/StarPU to add a new feature which can control the task size to be assigned to these heterogeneous resources dynamically during application execution. As a result of performance evaluation on several benchmarks, we confirmed the proposed feature correctly works and the performance with heterogeneous work sharing provides up to about 40% higher performance than GPU-only utilization even for relatively small size of problems.Read less <
Origin
Hal imported