Automatic Mapping of Parallel Pattern-based Algorithms on Heterogeneous Architectures

EasyChair Preprint no. 5597

15 pages. Date: May 23, 2021


Nowadays, specialized hardware is often found in clusters to improve compute performance and energy efficiency. Porting and tuning scientific codes for these heterogeneous clusters requires significant development effort. To reduce this effort while maintaining high performance, modern parallel programming models introduce a second layer of abstraction, where an architecture-agnostic source code can be maintained and automatically optimized for the target architecture. However, with increasing heterogeneity, mapping an application to a specific architecture itself becomes a complex decision that requires weighing processor features against algorithmic properties. Furthermore, architecture-agnostic global transformations are necessary to maximize the simultaneous utilization of different processors. Therefore, we introduce a combinatorial optimization approach to globally transform and automatically map parallel algorithms to heterogeneous architectures. We derive a global transformation and mapping algorithm that is based on a static performance model. Moreover, we demonstrate the approach on four typical algorithmic kernels, showing automatic and global transformations such as re-ordering, pipelining, and cache blocking, as well as optimal mapping strategies to an exemplary CPU-GPU compute node. Our algorithm achieves performance on par with hand-tuned implementations of all four kernels.

Keyphrases: abstract pattern tree, algorithmic efficiency, automatic mapping, heterogeneous systems, parallel algorithms, performance analysis, performance optimization, programming languages, programming models

BibTeX entry
BibTeX does not have the right entry for preprints. This is a hack for producing the correct reference:
@Booklet{EasyChair:5597,
  author = {Lukas Trümper and Julian Miller and Christian Terboven and Matthias S. Müller},
  title = {Automatic Mapping of Parallel Pattern-based Algorithms on Heterogeneous Architectures},
  howpublished = {EasyChair Preprint no. 5597},

  year = {EasyChair, 2021}}