Analysing the Variability of OpenMP Programs Performances on Multicore Architectures
TOUATI, Sid
Parallélisme, Réseaux, Systèmes, Modélisation [PRISM]
Architectures, Languages and Compilers to Harness the End of Moore Years [ALCHEMY]
Parallélisme, Réseaux, Systèmes, Modélisation [PRISM]
Architectures, Languages and Compilers to Harness the End of Moore Years [ALCHEMY]
BARTHOU, Denis
Efficient runtime systems for parallel architectures [RUNTIME]
Laboratoire Bordelais de Recherche en Informatique [LaBRI]
Efficient runtime systems for parallel architectures [RUNTIME]
Laboratoire Bordelais de Recherche en Informatique [LaBRI]
TOUATI, Sid
Parallélisme, Réseaux, Systèmes, Modélisation [PRISM]
Architectures, Languages and Compilers to Harness the End of Moore Years [ALCHEMY]
Parallélisme, Réseaux, Systèmes, Modélisation [PRISM]
Architectures, Languages and Compilers to Harness the End of Moore Years [ALCHEMY]
BARTHOU, Denis
Efficient runtime systems for parallel architectures [RUNTIME]
Laboratoire Bordelais de Recherche en Informatique [LaBRI]
< Reduce
Efficient runtime systems for parallel architectures [RUNTIME]
Laboratoire Bordelais de Recherche en Informatique [LaBRI]
Language
en
Communication dans un congrès
This item was published in
Fourth Workshop on Programmability Issues for Heterogeneous Multicores (MULTIPROG-2011), 2011-01-23, Heraklion. 2011p. 14
English Abstract
In [8], we demonstrated that contrary to sequential applications, parallel OpenMP appli- cations su er from a severe instability in performances. That is, running the same parallel OpenMP application with the same data ...Read more >
In [8], we demonstrated that contrary to sequential applications, parallel OpenMP appli- cations su er from a severe instability in performances. That is, running the same parallel OpenMP application with the same data input multiple times may exhibit a high variability of execution times. In this article, we continue our research e ort to analyse the reason of such performance variability. With the architectural complexity of the new state of the art hardware designs, comes a need to better understand the interactions between the operating system layers, the applications and the underlying hardware platforms. The ability to characterise and to quantify those interactions can be useful in the processes of performance evaluation and analysis, compiler optimisations and operating system job scheduling allowing to achieve better performance stability, reproducibility and predictability. Under- standing the performance instability in current multicore architectures is even more complicated by the variety of factors and sources in uencing the applications performances. This article focus on the e ects of thread binding, co-running processes, L2 cache sharing, automatic hardware prefetcher and memory page sizes.Read less <
Origin
Hal imported