You are here:
Theoretical Results on Optimal Partitoning for Matrix-Matrix Multiplication with Two Processors

Theoretical Results on Optimal Partitoning for Matrix-Matrix Multiplication with Two Processors

Publication Type  Report
Year of Publication  2011
Authors  Ashley DeFlumere; Alexey Lastovetsky
Key Words  matrix multiplication; data partitioning; parallel computing
Abstract  

In this report, we consider a simple but important linear algebra kernel, matrix-matrix multiplication. Building multi-core processors based on heterogeneous cores is an important current trend. In this context, it is of great interest to study optimal matrix partitioning algorithms for small cases (i.e. small number of cores). Indeed, the general case, with relatively high numbers of heterogeneous resources is now well understood, however the problem is in general NP-Complete when one aims at balancing the load while minimizing the communications. Nonetheless several approximation algorithms have been successfully designed. Nevertheless, negative complexity results do not apply for very few heterogeneous cores. Additionally, the case of a small number of processors is useful as a model for heterogeneous clusters and clusters of clusters. In this paper, we provide a complete study of 2 heterogeneous resources and we prove that in this case, the optimal partitioning is based on non-standard decomposition techniques.

Export  Tagged XML BibTex
AttachmentSize
ucd-csi-2011-09.pdf678.89 KB