Parallel computing / GPGPU / Numerical linear algebra / Computational science / Programming paradigms / OpenCL / General-purpose computing on graphics processing units / Basic Linear Algebra Subprograms / Automatic vectorization / Compute kernel / Kernel / Matrix multiplication algorithm
Date: 2016-01-27 04:16:13

Writing a performance-portable matrix multiplication

Source URL: www.des.udc.es

File Size: 445,49 KB

Similar Documents

Diploma / Master's / Bachelor's thesis: OpenCL/CUDA Reliability Analysis OPENCL & Image Source: NVIDIA

Intel® OpenCL Implicit Vectorization Module Nadav Rotem Software Developer, Intel® November 2011

Patterns and Rewrite Rules for Systematic Code Generation From High-Level Functional Patterns to High-Performance OpenCL Code Michel Steuwer Christian Fensch

Step-by-step: Building OpenCL-enabled OpenCV from source OpenCV version: 2.4.6 Dr. Harris Gasparakis, 1 This tutorial is a very introductory, step-by-step guide to obtaining, configuring, and bui

Overhauling SC Atomics in C11 and OpenCL Mark Batty Alastair F. Donaldson John Wickerson
