Sparse direct solvers with accelerators over DAG runtimes
LACOSTE, Xavier
Parallel tools for Numerical Algorithms and Resolution of essentially Hyperbolic problems [BACCHUS]
Parallel tools for Numerical Algorithms and Resolution of essentially Hyperbolic problems [BACCHUS]
RAMET, Pierre
Parallel tools for Numerical Algorithms and Resolution of essentially Hyperbolic problems [BACCHUS]
Laboratoire Bordelais de Recherche en Informatique [LaBRI]
See more >
Parallel tools for Numerical Algorithms and Resolution of essentially Hyperbolic problems [BACCHUS]
Laboratoire Bordelais de Recherche en Informatique [LaBRI]
LACOSTE, Xavier
Parallel tools for Numerical Algorithms and Resolution of essentially Hyperbolic problems [BACCHUS]
Parallel tools for Numerical Algorithms and Resolution of essentially Hyperbolic problems [BACCHUS]
RAMET, Pierre
Parallel tools for Numerical Algorithms and Resolution of essentially Hyperbolic problems [BACCHUS]
Laboratoire Bordelais de Recherche en Informatique [LaBRI]
< Reduce
Parallel tools for Numerical Algorithms and Resolution of essentially Hyperbolic problems [BACCHUS]
Laboratoire Bordelais de Recherche en Informatique [LaBRI]
Language
en
Rapport
This item was published in
2012p. 11
English Abstract
The current trend in the high performance computing shows a dramatic increase in the number of cores on the shared memory compute nodes. Algorithms, especially those related to linear algebra, need to be adapted to these ...Read more >
The current trend in the high performance computing shows a dramatic increase in the number of cores on the shared memory compute nodes. Algorithms, especially those related to linear algebra, need to be adapted to these new computer architectures in order to be efficient. PASTIX is a sparse parallel direct solver, that incorporates a dynamic scheduler for strongly hierarchical modern architectures. In this paper, we study the replacement of this internal highly integrated scheduling strategy by two generic runtime frameworks: DAGUE and STARPU. Those runtimes will give the opportunity to execute the factorization tasks graph on emerging computers equipped with accelerators. As for previous work done in dense linear algebra, we present the kernels used for GPU computations inspired by the MAGMA library and the DAG algorithm used with those two runtimes. A comparative study of the performances of the supernodal solver with the three different schedulers is performed on manycore architectures and the improvements obtained with accelerators are presented with the STARPU runtime. These results demonstrate that these DAG runtimes provide uniform programming interfaces to obtain high performance on different architectures on irregular problems as sparse direct factorizations.Read less <
Origin
Hal imported