Sizing and Partitioning Strategies for Burst-Buffers to Reduce IO Contention
Language
en
Communication dans un congrès
This item was published in
IPDPS 2019 - 33rd IEEE International Parallel and Distributed Processing Symposium, 2019-05-20, Rio de Janeiro.
English Abstract
Burst-Buffers are high throughput and small size storage which are being used as an intermediate storage between the PFS (Parallel File System) and the computational nodes of modern HPC systems. They can allow to hinder ...Read more >
Burst-Buffers are high throughput and small size storage which are being used as an intermediate storage between the PFS (Parallel File System) and the computational nodes of modern HPC systems. They can allow to hinder to contention to the PFS, a shared resource whose read and write performance increase slower than processing power in HPC systems. A second usage is to accelerate data transfers and to hide the latency to the PFS. In this paper, we concentrate on the first usage. We propose a model for Burst-Buffers and application transfers. We consider the problem of dimensioning and sharing the Burst-Buffers between several applications. This dimensioning can be done either dynamically or statically. The dynamic allocation considers that any application can use any available portion of the Burst-Buffers. The static allocation considers that when a new application enters the system, it is assigned some portion of the Burst-Buffers, which cannot be used by the other applications until that application leaves the system and its data is purged from it. We show that the general sharing problem to guarantee fair performance for all applications is an NP-Complete problem. We propose a polynomial time algorithms for the special case of finding the optimal buffer size such that no application is slowed down due to PFS contention, both in the static and dynamic cases. Finally, we provide evaluations of our algorithms in realistic settings. We use those to discuss how to minimize the overhead of the static allocation of buffers compared to the dynamic allocation.Read less <
ANR Project
Ordonnancement de données pour le calcul haute-performance - ANR-17-CE25-0004
Initiative d'excellence de l'Université de Bordeaux - ANR-10-IDEX-0003
Initiative d'excellence de l'Université de Bordeaux - ANR-10-IDEX-0003
Origin
Hal imported