← Назад к работе
Работы, на которые ссылается эта работа
Работ: 29
Работа: Accelerating Matrix Multiplication with CPU Multithreading and CUDA Block-Based GPU Parallelization
Optimizing Machine Learning Models with CUDA: A Comprehensive Performance Analysis
L Niteesh, M B Ampareeshan, Suganiya Murugan
Статья2025Цитирований: 4ABIA Flexible Parallel Runtime for Large Scale Block-Based Matrix Multiplication
Keyan Liu, Shaohua Song, Ningnan Zhou +1
Глава2012Цитирований: 2ABILarge Matrix Multiplication Algorithms: Analysis and Comparison
Khalil Mouhah, Hind Faiz, Safae Bourhnane
Статья2023Цитирований: 2ABI