Data-Locality Aware Dynamic Schedulers for Independent Tasks with Replicated Inputs
hal.structure.identifier | Reformulations based algorithms for Combinatorial Optimization [Realopt] | |
dc.contributor.author | BEAUMONT, Olivier | |
hal.structure.identifier | Reformulations based algorithms for Combinatorial Optimization [Realopt] | |
dc.contributor.author | LAMBERT, Thomas | |
hal.structure.identifier | École normale supérieure de Lyon [ENS de Lyon] | |
hal.structure.identifier | Optimisation des ressources : modèles, algorithmes et ordonnancement [ROMA] | |
dc.contributor.author | MARCHAL, Loris | |
hal.structure.identifier | École normale supérieure - Rennes [ENS Rennes] | |
dc.contributor.author | THOMAS, Bastien | |
dc.date.accessioned | 2024-04-04T03:05:09Z | |
dc.date.available | 2024-04-04T03:05:09Z | |
dc.date.conference | 2018-05-21 | |
dc.identifier.uri | https://oskar-bordeaux.fr/handle/20.500.12278/193221 | |
dc.description.abstractEn | In this paper we concentrate on a crucial parameter for efficiency in Big Data and HPC applications: data locality. We focus on the scheduling of a set of independant tasks, each depending on an input file. We assume that each of these input files has been replicated several times and placed in local storage of different nodes of a cluster, similarly of what we can find on HDFS system for example. We consider two optimization problems, related to the two natural metrics: makespan optimization (under the constraint that only local tasks are allowed) and communication optimization (under the constraint of never letting a processor idle in order to optimize makespan). For both problems we investigate the performance of dynamic schedulers, in particular the basic greedy algorithm we can for example find in the default MapReduce scheduler. First we theoretically study its performance, with probabilistic models, and provide a lower bound for communication metric and asymptotic behaviour for both metrics. Second we propose simulations based on traces from a Hadoop cluster to compare the different dynamic schedulers and assess the expected behaviour obtained with the theoretical study. | |
dc.description.sponsorship | Solveurs pour architectures hétérogènes utilisant des supports d'exécution - ANR-13-MONU-0007 | |
dc.language.iso | en | |
dc.publisher | IEEE | |
dc.title.en | Data-Locality Aware Dynamic Schedulers for Independent Tasks with Replicated Inputs | |
dc.type | Communication dans un congrès | |
dc.identifier.doi | 10.1109/IPDPSW.2018.00187 | |
dc.subject.hal | Informatique [cs]/Calcul parallèle, distribué et partagé [cs.DC] | |
bordeaux.page | 1-8 | |
bordeaux.hal.laboratories | Institut de Mathématiques de Bordeaux (IMB) - UMR 5251 | * |
bordeaux.institution | Université de Bordeaux | |
bordeaux.institution | Bordeaux INP | |
bordeaux.institution | CNRS | |
bordeaux.conference.title | IPDPSW 2018 IEEE International Parallel and Distributed Processing Symposium Workshops | |
bordeaux.country | CA | |
bordeaux.conference.city | Vancouver | |
bordeaux.peerReviewed | oui | |
hal.identifier | hal-01878977 | |
hal.version | 1 | |
hal.invited | non | |
hal.proceedings | oui | |
hal.conference.end | 2018-05-25 | |
hal.popular | non | |
hal.audience | Internationale | |
hal.origin.link | https://hal.archives-ouvertes.fr//hal-01878977v1 | |
bordeaux.COinS | ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.spage=1-8&rft.epage=1-8&rft.au=BEAUMONT,%20Olivier&LAMBERT,%20Thomas&MARCHAL,%20Loris&THOMAS,%20Bastien&rft.genre=unknown |
Fichier(s) constituant ce document
Fichiers | Taille | Format | Vue |
---|---|---|---|
Il n'y a pas de fichiers associés à ce document. |