Sparse direct solvers with accelerators over DAG runtimes
hal.structure.identifier | Parallel tools for Numerical Algorithms and Resolution of essentially Hyperbolic problems [BACCHUS] | |
dc.contributor.author | LACOSTE, Xavier | |
hal.structure.identifier | Parallel tools for Numerical Algorithms and Resolution of essentially Hyperbolic problems [BACCHUS] | |
hal.structure.identifier | Laboratoire Bordelais de Recherche en Informatique [LaBRI] | |
dc.contributor.author | RAMET, Pierre | |
hal.structure.identifier | Innovative Computing Laboratory [Knoxville] [ICL] | |
dc.contributor.author | FAVERGE, Mathieu | |
hal.structure.identifier | Innovative Computing Laboratory [Knoxville] [ICL] | |
dc.contributor.author | ICHITARO, Yamazaki | |
hal.structure.identifier | Innovative Computing Laboratory [Knoxville] [ICL] | |
dc.contributor.author | DONGARRA, Jack | |
dc.date.accessioned | 2024-04-15T09:45:22Z | |
dc.date.available | 2024-04-15T09:45:22Z | |
dc.date.issued | 2012 | |
dc.identifier.uri | https://oskar-bordeaux.fr/handle/20.500.12278/197914 | |
dc.description.abstractEn | The current trend in the high performance computing shows a dramatic increase in the number of cores on the shared memory compute nodes. Algorithms, especially those related to linear algebra, need to be adapted to these new computer architectures in order to be efficient. PASTIX is a sparse parallel direct solver, that incorporates a dynamic scheduler for strongly hierarchical modern architectures. In this paper, we study the replacement of this internal highly integrated scheduling strategy by two generic runtime frameworks: DAGUE and STARPU. Those runtimes will give the opportunity to execute the factorization tasks graph on emerging computers equipped with accelerators. As for previous work done in dense linear algebra, we present the kernels used for GPU computations inspired by the MAGMA library and the DAG algorithm used with those two runtimes. A comparative study of the performances of the supernodal solver with the three different schedulers is performed on manycore architectures and the improvements obtained with accelerators are presented with the STARPU runtime. These results demonstrate that these DAG runtimes provide uniform programming interfaces to obtain high performance on different architectures on irregular problems as sparse direct factorizations. | |
dc.language.iso | en | |
dc.title.en | Sparse direct solvers with accelerators over DAG runtimes | |
dc.type | Rapport | |
dc.subject.hal | Informatique [cs]/Analyse numérique [cs.NA] | |
dc.subject.hal | Informatique [cs]/Calcul parallèle, distribué et partagé [cs.DC] | |
bordeaux.page | 11 | |
bordeaux.hal.laboratories | Laboratoire Bordelais de Recherche en Informatique (LaBRI) - UMR 5800 | * |
bordeaux.institution | Université de Bordeaux | |
bordeaux.institution | Bordeaux INP | |
bordeaux.institution | CNRS | |
bordeaux.type.institution | INRIA | |
bordeaux.type.report | rr | |
hal.identifier | hal-00700066 | |
hal.version | 1 | |
hal.audience | Non spécifiée | |
hal.origin.link | https://hal.archives-ouvertes.fr//hal-00700066v1 | |
bordeaux.COinS | ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.date=2012&rft.spage=11&rft.epage=11&rft.au=LACOSTE,%20Xavier&RAMET,%20Pierre&FAVERGE,%20Mathieu&ICHITARO,%20Yamazaki&DONGARRA,%20Jack&rft.genre=unknown |
Fichier(s) constituant ce document
Fichiers | Taille | Format | Vue |
---|---|---|---|
Il n'y a pas de fichiers associés à ce document. |