<--- Back to Details
First PageDocument Content
Computing / Concurrent computing / Parallel computing / Computer programming / OpenMP / Roofline model / Multi-core processor / Manycore processor / Thread / Benchmark / CUDA / Data parallelism
Date: 2014-11-13 12:51:32
Computing
Concurrent computing
Parallel computing
Computer programming
OpenMP
Roofline model
Multi-core processor
Manycore processor
Thread
Benchmark
CUDA
Data parallelism

Roofline Model Toolkit: A Practical Tool for Architectural and Program Analysis Yu Jung Lo, Samuel Williams, Brian Van Straalen, Terry J. Ligocki, Matthew J. Cordery, Nicholas J. Wright, Mary W. Hall, and Leonid Oliker U

Add to Reading List

Source URL: www.dcs.warwick.ac.uk

Download Document from Source Website

File Size: 339,96 KB

Share Document on Facebook

Similar Documents

ENParallel Programming   CUDA allows some synchronization between threads via __syncthreads()

ENParallel Programming  CUDA allows some synchronization between threads via __syncthreads()

DocID: 1v8qO - View Document

PL/CUDA – In-database massive parallel analytics KaiGai Kohei <> The PG-Strom Development Team Case Study: Drug-Discovery In-database

PL/CUDA – In-database massive parallel analytics KaiGai Kohei <> The PG-Strom Development Team Case Study: Drug-Discovery In-database

DocID: 1v2rB - View Document

Diplom- / Master- / Bachelorarbeit OpenCL/CUDA Reliability Analysis OPENCL & Image Source: NVIDIA

Diplom- / Master- / Bachelorarbeit OpenCL/CUDA Reliability Analysis OPENCL & Image Source: NVIDIA

DocID: 1uzZ2 - View Document

1  Toward Parallel CFA with Datalog, MPI, and CUDA THOMAS GILRAY, University of Maryland SIDHARTH KUMAR, University of Utah We present our recent experience working to design parallel functional control-flow analysis (CF

1 Toward Parallel CFA with Datalog, MPI, and CUDA THOMAS GILRAY, University of Maryland SIDHARTH KUMAR, University of Utah We present our recent experience working to design parallel functional control-flow analysis (CF

DocID: 1u2XE - View Document

MPI-CUDA parallel linear solvers for block-tridiagonal matrices in the context of SLEPc’s eigensolversI A. Lamas Davi˜ na, J. E. Roman∗ D. Sistemes Inform` atics i Computaci´

MPI-CUDA parallel linear solvers for block-tridiagonal matrices in the context of SLEPc’s eigensolversI A. Lamas Davi˜ na, J. E. Roman∗ D. Sistemes Inform` atics i Computaci´

DocID: 1tNci - View Document