LACOSTE, Xavier; FAVERGE, Mathieu; RAMET, Pierre; THIBAULT, Samuel; BOSILCA, George

La plateforme OSKAR Bordeaux évolue pour rejoindre l'archive ouverte HAL. Retrouvez tous vos dépôts sur le nouveau portail HAL UB : https://u-bordeaux.hal.science/. Pour toute aide ou information, contactez-nous info@oskar-bordeaux.fr

Afficher la notice abrégée

hal.structure.identifier	High-End Parallel Algorithms for Challenging Numerical Simulations [HiePACS]
dc.contributor.author	LACOSTE, Xavier
hal.structure.identifier	High-End Parallel Algorithms for Challenging Numerical Simulations [HiePACS]
hal.structure.identifier	Laboratoire Bordelais de Recherche en Informatique [LaBRI]
dc.contributor.author	FAVERGE, Mathieu
hal.structure.identifier	High-End Parallel Algorithms for Challenging Numerical Simulations [HiePACS]
hal.structure.identifier	Laboratoire Bordelais de Recherche en Informatique [LaBRI]
dc.contributor.author	RAMET, Pierre
hal.structure.identifier	Laboratoire Bordelais de Recherche en Informatique [LaBRI]
hal.structure.identifier	Efficient runtime systems for parallel architectures [RUNTIME]
dc.contributor.author	THIBAULT, Samuel
hal.structure.identifier	Innovative Computing Laboratory [Knoxville] [ICL]
dc.contributor.author	BOSILCA, George
dc.date.accessioned	2024-04-15T09:41:56Z
dc.date.available	2024-04-15T09:41:56Z
dc.date.created	2014-01-06
dc.date.issued	2014-01-06
dc.identifier.uri	https://oskar-bordeaux.fr/handle/20.500.12278/197642
dc.description.abstract	Les architectures de calcul intègrent de plus en plus de coeurs de calcul partageant une même mémoire nécessairement hiérarchique. Les algorithmes, en particulier ceux relatifs à l'algèbre linéaire, nécessitent d'être adaptés à ces nouvelles architectures pour être efficaces. PaStIX est un solveur direct parallèle pour matrices creuses qui intègre un ordonnanceur dynamique pour des architectures hiérarchiques de grande taille. Dans ce papier, nous étudions la possibilité de remplacer cette stratégie interne d'ordonnancement par deux supports d'exécution génériques~: PaRSEC et StarPU. Ces supports d'exécution offrent la possibilité de dérouler le graphe de tâches de la factorisation numérique sur des noeuds de calcul disposant d'accélérateurs. Nous présentons une étude comparative des performances de notre solveur supernodal avec ces trois ordonnanceurs sur des architectures multicoeurs, et en particulier les gains obtenus avec plusieurs accélérateurs GPU. Ces résultats montrent qu'une approche basée sur un \DAG{} offre une interface de programmation uniforme pour réaliser du calcul haute performance sur des problèmes irréguliers comme ceux de l'algèbre linéaire creuse.
dc.description.abstractEn	The ongoing hardware evolution exhibits an escalation in the number, as well as in the heterogeneity, of the computing resources. The pressure to maintain reasonable levels of performance and portability, forces the application developers to leave the traditional programming paradigms and explore alternative solutions. PaStiX is a parallel sparse direct solver, based on a dynamic scheduler for modern hierarchical architectures. In this paper, we study the replacement of the highly specialized internal scheduler in PaStiX by two generic runtime frameworks: PaRSEC and StarPU. The tasks graph of the factorization step is made available to the two runtimes, providing them with the opportunity to optimize it in order to maximize the algorithm efficiency for a predefined execution environment. A comparative study of the performance of the PaStiX solver with the three schedulers - native PaStiX, StarPU and PaRSEC schedulers - on different execution contexts is performed. The analysis highlights the similarities from a performance point of view between the different execution supports. These results demonstrate that these generic DAG-based runtimes provide a uniform and portable programming interface across heterogeneous environments, and are, therefore, a sustainable solution for hybrid environments.
dc.language.iso	en
dc.subject.en	Sparse linear solver
dc.subject.en	DAG based runtime
dc.subject.en	multicore
dc.subject.en	GPU
dc.title.en	Taking advantage of hybrid systems for sparse direct solvers via task-based runtimes
dc.type	Rapport
dc.subject.hal	Informatique [cs]/Calcul parallèle, distribué et partagé [cs.DC]
bordeaux.page	25
bordeaux.hal.laboratories	Laboratoire Bordelais de Recherche en Informatique (LaBRI) - UMR 5800	*
bordeaux.institution	Université de Bordeaux
bordeaux.institution	Bordeaux INP
bordeaux.institution	CNRS
bordeaux.type.institution	INRIA
bordeaux.type.report	rr
hal.identifier	hal-00925017
hal.version	1
hal.audience	Non spécifiée
hal.origin.link	https://hal.archives-ouvertes.fr//hal-00925017v1
bordeaux.COinS	ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.date=2014-01-06&rft.spage=25&rft.epage=25&rft.au=LACOSTE,%20Xavier&FAVERGE,%20Mathieu&RAMET,%20Pierre&THIBAULT,%20Samuel&BOSILCA,%20George&rft.genre=unknown

Fichier(s) constituant ce document

Fichiers	Taille	Format	Vue
Il n'y a pas de fichiers associés à ce document.

Ce document figure dans la(les) collection(s) suivante(s)

Laboratoire Bordelais de Recherche en Informatique (LaBRI) - UMR 5800

Afficher la notice abrégée

Taking advantage of hybrid systems for sparse direct solvers via task-based runtimes

Fichier(s) constituant ce document

Ce document figure dans la(les) collection(s) suivante(s)