To main content

Graphics processing unit (GPU) programming strategies and trends in GPU computing

Abstract

In this article, we have given an overview of hardware and traditional optimization techniques for the GPU. We have furthermore given a step-by-step guide to profile driven development, in which bottlenecks and possible solutions are outlined. The focus is on state-of-the-art hardware with accompanying tools, and we have addressed the most prominent bottlenecks: memory, arithmetics, and latencies.

Category

Academic article

Language

English

Author(s)

Affiliation

  • SINTEF Digital / Mathematics and Cybernetics
  • University of Oslo

Date

04.05.2012

Year

2013

Published in

Journal of Parallel and Distributed Computing

ISSN

0743-7315

Publisher

Academic Press

Volume

73

Issue

1

Page(s)

4 - 13

View this publication at Cristin