Computer architecture / Computing / Parallel computing / Computer programming / Roofline model / Software optimization / Software testing / FLOPS / Central processing unit / Supercomputer / Benchmark / Instructions per cycle
Date: 2018-05-02 20:35:50

arXiv:1801.09212v2 [cs.PF] 2 May
BOPS, Not FLOPS! A New Metric and Roofline Performance Model for Datacenter Computing

Source URL: arxiv.org

File Size: 1,16 MB

Similar Documents

BOPS, Not FLOPS! A New Metric, Measuring Tool, and Roofline Performance Model For Datacenter Computing Chen Zheng ICT, CAS

DocID: 1xVt0

Cache-aware Roofline model: Upgrading the loft Aleksandar Ilic, Frederico Pratas, and Leonel Sousa INESC-ID/IST, Technical University of Lisbon, Portugal {ilic,fcpp,las}@inesc-id.pt

DocID: 1rBXE

Roofline Model Toolkit: A Practical Tool for Architectural and Program Analysis Yu Jung Lo, Samuel Williams, Brian Van Straalen, Terry J. Ligocki, Matthew J. Cordery, Nicholas J. Wright, Mary W. Hall, and Leonid Oliker U

DocID: 1rrNN

Design of Parallel and High Performance Computing HS 2013 Markus Püschel, Torsten Hoefler Department of Computer Science ETH Zurich

DocID: 1rlc8

Auto-tuning the 27-point Stencil for Multicore Kaushik Datta², Samuel Williams¹, Vasily Volkov², Jonathan Carter¹, Leonid Oliker¹, John Shalf¹, and Katherine Yelick¹ ¹CRD/NERSC, Lawrence Berkeley National Laborat

DocID: 1r4gA