"An Optimizing Compiler For GPGPU Programs With Input-Data Sharing" by Yi Yang, Ping Xiang et al.

Scopus Export 2010-2014

Title

An Optimizing Compiler For GPGPU Programs With Input-Data Sharing

Creator

Yi Yang, NC State University
Ping Xiang, University of Central Florida
Jingfei Kong, University of Central Florida
Huiyang Zhou, NC State University

Keywords

Compiler; GPGPU

Abstract

Developing high-performance GPGPU programs is challenging for application developers because performance depends on how well the code leverages the hardware features of specific graphics processors. To solve this problem and relieve application developers of low-level, hardware-specific optimizations, we introduce a novel compiler to optimize GPGPU programs. Our compiler takes as input a naive GPU kernel function, which is functionally correct but written without any consideration for performance optimization. The compiler then analyzes the code, identifies memory access patterns, and generates optimized code. The proposed compiler optimizations target one category of scientific and media processing algorithms, characterized by input-data sharing when computing neighboring output pixels/elements. Many commonly used algorithms, such as matrix multiplication and convolution, share this characteristic. For these algorithms, novel approaches are proposed to enforce memory coalescing and achieve effective data reuse. Data prefetching and hardware-specific tuning are also performed automatically within our compiler framework. Experimental results on a set of applications show that our compiler achieves very high performance, either superior or very close to that of the highly fine-tuned library NVIDIA CUBLAS 2.1.
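The input-data sharing described in the abstract can be illustrated with a small CPU-side sketch. This is not the paper's compiler or its generated code; it is a plain Python model, under the assumption that each read of an input matrix element counts as one "global memory" load and that a small local list stands in for GPU shared memory. The function names naive_matmul and tiled_matmul and the load counters are hypothetical, introduced only for illustration. The sketch shows that neighboring output elements of a matrix multiplication reuse the same input rows and columns, so staging input tiles once per block of outputs reduces the number of loads by roughly the tile width.

```python
def naive_matmul(a, b, n):
    """Model of a naive kernel: one 'thread' per output element,
    every operand is fetched from 'global memory' each time it is used."""
    loads = 0
    c = [[0] * n for _ in range(n)]
    for row in range(n):
        for col in range(n):
            acc = 0
            for k in range(n):
                acc += a[row][k] * b[k][col]
                loads += 2  # one element of a, one element of b
            c[row][col] = acc
    return c, loads


def tiled_matmul(a, b, n, tile):
    """Model of a data-reuse version: each tile x tile block of outputs
    stages the input tiles it needs once, then reuses them locally."""
    assert n % tile == 0, "sketch assumes n is a multiple of tile"
    loads = 0
    c = [[0] * n for _ in range(n)]
    for br in range(0, n, tile):          # block row of the output
        for bc in range(0, n, tile):      # block column of the output
            for bk in range(0, n, tile):  # step along the shared dimension
                # Stage one tile of each input (stand-in for shared memory).
                a_tile = [[a[br + i][bk + k] for k in range(tile)]
                          for i in range(tile)]
                b_tile = [[b[bk + k][bc + j] for j in range(tile)]
                          for k in range(tile)]
                loads += 2 * tile * tile
                # All outputs in the block reuse the staged tiles.
                for i in range(tile):
                    for j in range(tile):
                        acc = c[br + i][bc + j]
                        for k in range(tile):
                            acc += a_tile[i][k] * b_tile[k][j]
                        c[br + i][bc + j] = acc
    return c, loads


if __name__ == "__main__":
    n, tile = 8, 4
    a = [[i + j for j in range(n)] for i in range(n)]
    b = [[i * j + 1 for j in range(n)] for i in range(n)]
    c_naive, naive_loads = naive_matmul(a, b, n)
    c_tiled, tiled_loads = tiled_matmul(a, b, n, tile)
    print(c_naive == c_tiled, naive_loads, tiled_loads)
```

In this model the naive version performs 2n^3 loads and the tiled version 2n^3 / tile, which is the kind of reuse that shared-memory tiling exploits on a GPU. Memory coalescing, prefetching, and hardware-specific tuning are not modeled here.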

Publication Date

3-15-2010

Publication Title

Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP

Pages

343-344

Document Type

Article; Proceedings Paper

Personal Identifier

scopus

DOI Link

https://doi.org/10.1145/1693453.1693505

Copyright Status

Unknown

Scopus ID

77749268130 (Scopus)

Source API URL

https://api.elsevier.com/content/abstract/scopus_id/77749268130

STARS Citation

Yang, Yi; Xiang, Ping; Kong, Jingfei; and Zhou, Huiyang, "An Optimizing Compiler For GPGPU Programs With Input-Data Sharing" (2010). Scopus Export 2010-2014. 1586.
https://stars.library.ucf.edu/scopus2010/1586

This document is currently not available here.
