Process Placement in Multicore Clusters: Algorithmic Issues and Practical Techniques
hal.structure.identifier | Laboratoire Bordelais de Recherche en Informatique [LaBRI] | |
hal.structure.identifier | Efficient runtime systems for parallel architectures [RUNTIME] | |
dc.contributor.author | JEANNOT, Emmanuel | |
hal.structure.identifier | Laboratoire Bordelais de Recherche en Informatique [LaBRI] | |
hal.structure.identifier | Efficient runtime systems for parallel architectures [RUNTIME] | |
dc.contributor.author | MERCIER, Guillaume | |
hal.structure.identifier | Laboratoire Bordelais de Recherche en Informatique [LaBRI] | |
hal.structure.identifier | Efficient runtime systems for parallel architectures [RUNTIME] | |
dc.contributor.author | TESSIER, François | |
dc.date.accessioned | 2024-04-15T09:42:09Z | |
dc.date.available | 2024-04-15T09:42:09Z | |
dc.date.created | 2013-05-29 | |
dc.date.issued | 2014 | |
dc.identifier.issn | 1045-9219 | |
dc.identifier.uri | https://oskar-bordeaux.fr/handle/20.500.12278/197659 | |
dc.description.abstractEn | Current generations of NUMA node clusters feature multicore or manycore processors. Programming such architectures efficiently is a challenge because numerous hardware characteristics have to be taken into account, especially the memory hierarchy. One appealing idea to improve the performance of parallel applications is to decrease their communication costs by matching the communication pattern to the underlying hardware architecture. In this report, we detail the algorithm and techniques proposed to achieve such a result: first, we gather both the communication pattern information and the hardware details. Then we compute a relevant reordering of the various process ranks of the application. Finally, those new ranks are used to reduce the communication costs of the application. | |
dc.language.iso | en | |
dc.publisher | Institute of Electrical and Electronics Engineers | |
dc.title.en | Process Placement in Multicore Clusters: Algorithmic Issues and Practical Techniques | |
dc.type | Article de revue | |
dc.identifier.doi | 10.1109/TPDS.2013.104 | |
dc.subject.hal | Informatique [cs]/Calcul parallèle, distribué et partagé [cs.DC] | |
bordeaux.journal | IEEE Transactions on Parallel and Distributed Systems | |
bordeaux.page | 993 - 1002 | |
bordeaux.volume | 25 | |
bordeaux.hal.laboratories | Laboratoire Bordelais de Recherche en Informatique (LaBRI) - UMR 5800 | * |
bordeaux.issue | 4 | |
bordeaux.institution | Université de Bordeaux | |
bordeaux.institution | Bordeaux INP | |
bordeaux.institution | CNRS | |
bordeaux.peerReviewed | oui | |
hal.identifier | hal-00921605 | |
hal.version | 1 | |
hal.popular | non | |
hal.audience | Internationale | |
hal.origin.link | https://hal.archives-ouvertes.fr//hal-00921605v1 | |
bordeaux.COinS | ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.jtitle=IEEE%20Transactions%20on%20Parallel%20and%20Distributed%20Systems&rft.date=2014&rft.volume=25&rft.issue=4&rft.spage=993%20-%201002&rft.epage=993%20-%201002&rft.eissn=1045-9219&rft.issn=1045-9219&rft.au=JEANNOT,%20Emmanuel&MERCIER,%20Guillaume&TESSIER,%20Fran%C3%A7ois&rft.genre=article |
Files in this item
Files | Size | Format | View |
---|---|---|---|
There are no files associated with this item. |