LACOSTE, Xavier; FAVERGE, Mathieu; RAMET, Pierre; THIBAULT, Samuel; BOSILCA, George

doi:10.1109/IPDPSW.2014.9

The system will be going down for regular maintenance. Please save your work and logout.

hal.structure.identifier	High-End Parallel Algorithms for Challenging Numerical Simulations [HiePACS]
dc.contributor.author	LACOSTE, Xavier
hal.structure.identifier	High-End Parallel Algorithms for Challenging Numerical Simulations [HiePACS]
hal.structure.identifier	Laboratoire Bordelais de Recherche en Informatique [LaBRI]
dc.contributor.author	FAVERGE, Mathieu
hal.structure.identifier	High-End Parallel Algorithms for Challenging Numerical Simulations [HiePACS]
hal.structure.identifier	Laboratoire Bordelais de Recherche en Informatique [LaBRI]
dc.contributor.author	RAMET, Pierre
hal.structure.identifier	Laboratoire Bordelais de Recherche en Informatique [LaBRI]
hal.structure.identifier	Efficient runtime systems for parallel architectures [RUNTIME]
dc.contributor.author	THIBAULT, Samuel
hal.structure.identifier	Innovative Computing Laboratory [Knoxville] [ICL]
dc.contributor.author	BOSILCA, George
dc.date.accessioned	2024-04-15T09:41:29Z
dc.date.available	2024-04-15T09:41:29Z
dc.date.created	2014-01-06
dc.date.issued	2014-05-19
dc.date.conference	2014-05-19
dc.identifier.uri	https://oskar-bordeaux.fr/handle/20.500.12278/197601
dc.description.abstract	Les architectures de calcul intègrent de plus en plus de coeurs de calcul partageant une même mémoire nécessairement hiérarchique. Les algorithmes, en particulier ceux relatifs à l'algèbre linéaire, nécessitent d'être adaptés à ces nouvelles architectures pour être efficaces. PaStIX est un solveur direct parallèle pour matrices creuses qui intègre un ordonnanceur dynamique pour des architectures hiérarchiques de grande taille. Dans ce papier, nous étudions la possibilité de remplacer cette stratégie interne d'ordonnancement par deux supports d'exécution génériques~: PaRSEC et StarPU. Ces supports d'exécution offrent la possibilité de dérouler le graphe de tâches de la factorisation numérique sur des noeuds de calcul disposant d'accélérateurs. Nous présentons une étude comparative des performances de notre solveur supernodal avec ces trois ordonnanceurs sur des architectures multicoeurs, et en particulier les gains obtenus avec plusieurs accélérateurs GPU. Ces résultats montrent qu'une approche basée sur un \DAG{} offre une interface de programmation uniforme pour réaliser du calcul haute performance sur des problèmes irréguliers comme ceux de l'algèbre linéaire creuse.
dc.description.abstractEn	The ongoing hardware evolution exhibits an escalation in the number, as well as in the heterogeneity, of the computing resources. The pressure to maintain reasonable levels of performance and portability, forces the application developers to leave the traditional programming paradigms and explore alternative solutions. PaStiX is a parallel sparse direct solver, based on a dynamic scheduler for modern hierarchical architectures. In this paper, we study the replacement of the highly specialized internal scheduler in PaStiX by two generic runtime frameworks: PaRSEC and StarPU. The tasks graph of the factorization step is made available to the two runtimes, providing them with the opportunity to optimize it in order to maximize the algorithm efficiency for a predefined execution environment. A comparative study of the performance of the PaStiX solver with the three schedulers - native PaStiX, StarPU and PaRSEC schedulers - on different execution contexts is performed. The analysis highlights the similarities from a performance point of view between the different execution supports. These results demonstrate that these generic DAG-based runtimes provide a uniform and portable programming interface across heterogeneous environments, and are, therefore, a sustainable solution for hybrid environments.
dc.description.sponsorship	Solveurs pour architectures hétérogènes utilisant des supports d'exécution - ANR-13-MONU-0007
dc.language.iso	en
dc.publisher	IEEE
dc.subject.en	multicore
dc.subject.en	GPU
dc.subject.en	DAG based runtime
dc.subject.en	Sparse linear solver
dc.title.en	Taking advantage of hybrid systems for sparse direct solvers via task-based runtimes
dc.type	Communication dans un congrès
dc.identifier.doi	10.1109/IPDPSW.2014.9
dc.subject.hal	Informatique [cs]/Calcul parallèle, distribué et partagé [cs.DC]
dc.identifier.arxiv	1405.2636
bordeaux.page	29-38
bordeaux.hal.laboratories	Laboratoire Bordelais de Recherche en Informatique (LaBRI) - UMR 5800	*
bordeaux.institution	Université de Bordeaux
bordeaux.institution	Bordeaux INP
bordeaux.institution	CNRS
bordeaux.conference.title	HCW'2014 workshop of IPDPS
bordeaux.country	US
bordeaux.conference.city	Phoenix
bordeaux.peerReviewed	oui
hal.identifier	hal-00987094
hal.version	1
hal.invited	non
hal.proceedings	oui
hal.conference.end	2014-05-23
hal.popular	non
hal.audience	Internationale
hal.origin.link	https://hal.archives-ouvertes.fr//hal-00987094v1
bordeaux.COinS	ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.date=2014-05-19&rft.spage=29-38&rft.epage=29-38&rft.au=LACOSTE,%20Xavier&FAVERGE,%20Mathieu&RAMET,%20Pierre&THIBAULT,%20Samuel&BOSILCA,%20George&rft.genre=unknown

Files in this item

Files	Size	Format	View
There are no files associated with this item.

This item appears in the following Collection(s)

Laboratoire Bordelais de Recherche en Informatique (LaBRI) - UMR 5800

Show simple item record

Taking advantage of hybrid systems for sparse direct solvers via task-based runtimes

Files in this item

This item appears in the following Collection(s)