Centralized Versus Distributed Schedulers for Multiple Bag-of-Tasks Applications
BEAUMONT, Olivier
Laboratoire Bordelais de Recherche en Informatique [LaBRI]
Algorithmics for computationally intensive applications over wide scale distributed platforms [CEPAGE]
Laboratoire Bordelais de Recherche en Informatique [LaBRI]
Algorithmics for computationally intensive applications over wide scale distributed platforms [CEPAGE]
CARTER, Larry
Department of Computer Science and Engineering [Univ California San Diego] [CSE - UC San Diego]
Department of Computer Science and Engineering [Univ California San Diego] [CSE - UC San Diego]
FERRANTE, Jeanne
Department of Computer Science and Engineering [Univ California San Diego] [CSE - UC San Diego]
Voir plus >
Department of Computer Science and Engineering [Univ California San Diego] [CSE - UC San Diego]
BEAUMONT, Olivier
Laboratoire Bordelais de Recherche en Informatique [LaBRI]
Algorithmics for computationally intensive applications over wide scale distributed platforms [CEPAGE]
Laboratoire Bordelais de Recherche en Informatique [LaBRI]
Algorithmics for computationally intensive applications over wide scale distributed platforms [CEPAGE]
CARTER, Larry
Department of Computer Science and Engineering [Univ California San Diego] [CSE - UC San Diego]
Department of Computer Science and Engineering [Univ California San Diego] [CSE - UC San Diego]
FERRANTE, Jeanne
Department of Computer Science and Engineering [Univ California San Diego] [CSE - UC San Diego]
Department of Computer Science and Engineering [Univ California San Diego] [CSE - UC San Diego]
MARCHAL, Loris
Algorithms and Scheduling for Distributed Heterogeneous Platforms [GRAAL]
Laboratoire de l'Informatique du Parallélisme [LIP]
< Réduire
Algorithms and Scheduling for Distributed Heterogeneous Platforms [GRAAL]
Laboratoire de l'Informatique du Parallélisme [LIP]
Langue
en
Article de revue
Ce document a été publié dans
IEEE Transactions on Parallel and Distributed Systems. 2008, vol. 19, p. 698―709
Institute of Electrical and Electronics Engineers
Résumé en anglais
Multiple applications that execute concurrently on heterogeneous platforms compete for CPU and network resources. In this paper we consider the problem of scheduling applications to ensure fair and efficient execution on ...Lire la suite >
Multiple applications that execute concurrently on heterogeneous platforms compete for CPU and network resources. In this paper we consider the problem of scheduling applications to ensure fair and efficient execution on a distributed network of processors. We limit our study to the case where communication is restricted to a tree embedded in the network, and the applications consist of a large number of independent tasks (Bags of Tasks) that originate at the tree\'s root. The tasks of a given application all have the same computation and communication requirements, but these requirements can vary for different applications. The goal of scheduling is to maximize throughput of each application while ensuring a fair sharing of resources between applications. We can find the optimal asymptotic rates by solving a linear programming problem that expresses all necessary problem constraints, and we show how to construct a periodic schedule from any linear program solution. For single-level trees, the solution is characterized by processing tasks with larger communication-to-computation ratios at children with larger bandwidths. For multi-level trees, this approach requires global knowledge of all application and platform parameters. For large-scale platforms, such global coordination by a centralized scheduler may be unrealistic. Thus, we also investigate decentralized schedulers that use only local information at each participating resource. We assess their performance via simulation, and compare to an optimal centralized solution obtained via linear programming. The best of our decentralized heuristics achieves the same performance on about two-thirds of our test cases, but is far worse in a few cases. While our results are based on simple assumptions and do not explore all parameters (such as the maximum number of tasks that can be held on a node), they provide insight into the important question of fairly and optimally scheduling heterogeneous applications on heterogeneous grids.< Réduire
Project ANR
ALgorithmique des Plates-formes A Grande Echelle - ANR-05-MMSA-0006
Origine
Importé de halUnités de recherche