PARCOACH: Combining static and dynamic validation of MPI collective communications
hal.structure.identifier | DAM Île-de-France [DAM/DIF] | |
dc.contributor.author | SAILLARD, Emmanuelle | |
hal.structure.identifier | DAM Île-de-France [DAM/DIF] | |
dc.contributor.author | CARRIBAULT, Patrick | |
hal.structure.identifier | Efficient runtime systems for parallel architectures [RUNTIME] | |
hal.structure.identifier | Laboratoire Bordelais de Recherche en Informatique [LaBRI] | |
dc.contributor.author | BARTHOU, Denis | |
dc.date.accessioned | 2024-04-15T09:57:58Z | |
dc.date.available | 2024-04-15T09:57:58Z | |
dc.date.issued | 2014 | |
dc.identifier.issn | 1094-3420 | |
dc.identifier.uri | https://oskar-bordeaux.fr/handle/20.500.12278/198959 | |
dc.description.abstractEn | Most scientific applications today are parallelized with MPI communications. Collective MPI communications must be executed in the same order, and the same number of times, by all processes in their communicator; otherwise the program does not conform to the standard, and a deadlock or other undefined behavior can occur. As the control flow involving these collective operations becomes more complex, in particular when it includes conditionals on process ranks, ensuring the correctness of such code is error-prone. In this paper we propose a static analysis that detects when such a situation occurs, combined with a code transformation that prevents deadlocks. We focus on blocking MPI collective operations in SPMD applications, assuming MPI calls are not nested in multithreaded regions. We show on several benchmarks that our techniques have a small impact on performance and are easy to integrate into the development process. | |
dc.language.iso | en | |
dc.publisher | SAGE Publications | |
dc.subject.en | MPI | |
dc.subject.en | debugging | |
dc.subject.en | collective | |
dc.subject.en | static analysis | |
dc.subject.en | correctness | |
dc.title.en | PARCOACH: Combining static and dynamic validation of MPI collective communications | |
dc.type | Journal article | |
dc.identifier.doi | 10.1177/1094342014552204 | |
dc.subject.hal | Computer Science [cs]/Distributed, Parallel, and Cluster Computing [cs.DC] | |
bordeaux.journal | International Journal of High Performance Computing Applications | |
bordeaux.page | 10.1177/1094342014552204 | |
bordeaux.hal.laboratories | Laboratoire Bordelais de Recherche en Informatique (LaBRI) - UMR 5800 | * |
bordeaux.institution | Université de Bordeaux | |
bordeaux.institution | Bordeaux INP | |
bordeaux.institution | CNRS | |
bordeaux.peerReviewed | yes | |
hal.identifier | hal-01078762 | |
hal.version | 1 | |
hal.popular | no | |
hal.audience | International | |
hal.origin.link | https://hal.archives-ouvertes.fr//hal-01078762v1 | |
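The ordering requirement stated in the abstract (every process in a communicator must call the same collectives, in the same order, the same number of times) can be illustrated with a small sketch. This is not the PARCOACH analysis or its code transformation; it is a plain-Python toy model, and the names `toy_program_trace` and `first_mismatch` are hypothetical helpers introduced only for this illustration. It assumes each rank's sequence of collective calls is already known as a list of names.

```python
from typing import List, Optional, Tuple


def toy_program_trace(rank: int) -> List[str]:
    """Collectives a hypothetical SPMD program would call on `rank`.

    The conditional on the rank makes even ranks call an extra
    barrier, which is the kind of pattern the abstract warns about.
    """
    trace = ["MPI_Bcast"]
    if rank % 2 == 0:  # rank-dependent conditional
        trace.append("MPI_Barrier")
    trace.append("MPI_Reduce")
    return trace


def first_mismatch(
    traces: List[List[str]],
) -> Optional[Tuple[int, List[Optional[str]]]]:
    """Return (step, calls per rank) for the first step where ranks disagree.

    A rank that has no call left at that step is reported as None.
    Returns None when all ranks issue identical sequences.
    """
    longest = max((len(t) for t in traces), default=0)
    for step in range(longest):
        calls = [t[step] if step < len(t) else None for t in traces]
        if len(set(calls)) > 1:
            return step, calls
    return None


# Model four ranks running the toy program and compare their sequences.
traces = [toy_program_trace(rank) for rank in range(4)]
result = first_mismatch(traces)
print(result)
```

In this toy model the ranks agree on the first collective and diverge on the second, where even ranks reach a barrier while odd ranks reach a reduction. In a real MPI run that divergence is the point at which a deadlock or other undefined behavior could occur.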