Parallel matrix multiplication
Nikola TomikjFaculty of Computer Science and Engineering, Ss. Cyril and Methodius University, Skopje, MacedoniaMarjan GuševFaculty of Computer Science and Engineering, Ss. Cyril and Methodius University, Skopje, Macedonia
2018en
ABI
Abstract
Utilizing all CPU cores available for numerical computations is a topic of considerable interest in HPC. This paper analyzes and compares four different parallel algorithms for matrix multiplication without block partitioning using OpenMP. The comparison of the algorithms is based on the achieved speed, memory bandwidth and efficient use of the cache of the algorithms.
Identifiers
Citations and references
Cited by 20 references