pioman: a pthread-based Multithreaded Communication Engine
DENIS, Alexandre
Efficient runtime systems for parallel architectures [RUNTIME]
Topology-Aware System-Scale Data Management for High-Performance Computing [TADAAM]
Laboratoire Bordelais de Recherche en Informatique [LaBRI]
Efficient runtime systems for parallel architectures [RUNTIME]
Topology-Aware System-Scale Data Management for High-Performance Computing [TADAAM]
Laboratoire Bordelais de Recherche en Informatique [LaBRI]
DENIS, Alexandre
Efficient runtime systems for parallel architectures [RUNTIME]
Topology-Aware System-Scale Data Management for High-Performance Computing [TADAAM]
Laboratoire Bordelais de Recherche en Informatique [LaBRI]
< Réduire
Efficient runtime systems for parallel architectures [RUNTIME]
Topology-Aware System-Scale Data Management for High-Performance Computing [TADAAM]
Laboratoire Bordelais de Recherche en Informatique [LaBRI]
Langue
en
Communication dans un congrès
Ce document a été publié dans
Euromicro International Conference on Parallel, Distributed and Network-based Processing, 2015-03, Turku. 2015-03
Résumé en anglais
Recent cluster architectures include dozens of cores per node, with all cores sharing the network resources. To program such architectures, hybrid models mixing MPI+threads, and in particular MPI+OpenMP are gaining popularity. ...Lire la suite >
Recent cluster architectures include dozens of cores per node, with all cores sharing the network resources. To program such architectures, hybrid models mixing MPI+threads, and in particular MPI+OpenMP are gaining popularity. This imposes new requirements on communication libraries, such as the need for MPI_THREAD_MULTIPLE level of multi-threading support. Moreover, the high number of cores brings new op-portunities to parallelize communication libraries, so as to have proper background progression of communication and commu-nication/computation overlap. In this paper, we present pioman, a generic framework to be used by MPI implementations, that brings seamless asynchronous progression of communication by opportunistically using available cores. It uses system threads and thus is composable with any runtime system used for multithreading. Through various benchmarks, we demonstrate that our pioman-based MPI implementation exhibits very good properties regarding overlap, progression, and multithreading, and outperforms state-of-art MPI implementations.< Réduire
Mots clés en anglais
MPI
pioman
NewMadeleine
Origine
Importé de halUnités de recherche