First Page | Document Content | |
---|---|---|
Date: 2016-04-21 15:49:27Algebra Computing Computer architecture Parallel computing Graphics hardware Numerical linear algebra GPGPU Video cards Basic Linear Algebra Subprograms General-purpose computing on graphics processing units Matrix CUDA | Performance, Design, and Autotuning of Batched GEMM for GPUs Ahmad Abdelfattah1 , Azzam Haidar1 , Stanimire Tomov1 , and Jack Dongarra123 1Add to Reading ListSource URL: icl.cs.utk.eduDownload Document from Source WebsiteFile Size: 1,27 MBShare Document on Facebook |