Neighborhood-Aware Address Translation For Irregular Gpu Applications
Keywords
Computer architecture; GPU; Virtual address
Abstract
Recent studies on commercial hardware demonstrated that irregular GPU workloads could bottleneck on virtual-To-physical address translations. GPU's single-instructionmultiple-Thread (SIMT) execution can generate many concurrent memory accesses, all of which require address translation before accesses can complete. Unfortunately, many of these address translation requests often miss in the TLB, generating many concurrent page table walks. In this work, we investigate how to reduce address translation overheads for such applications. We observe that many of these concurrent page walk requests, while irregular from the perspective of a single GPU wavefront, still fall on neighboring virtual page addresses. The address mappings for these neighboring pages are typically stored in the same 64-byte cache line. Since cache lines are the smallest granularity of memory access, the page table walker implicitly reads address mappings (i.e., page table entries or PTEs) of many neighboring pages during the page walk of a single virtual address (VA). However, in the conventional hardware, mappings not associated with the original request are simply discarded. In this work, we propose mechanisms to coalesce the address translation needs of all pending page table walks in the same neighborhood that happen to have their address mappings fall on the same cache line. This is almost free; the page table walker (PTW) already reads a full cache line containing address mappings of all pages in the same neighborhood. We find this simple scheme can reduce the number of accesses to the inmemory page table by 37% on average. This speeds up a set of GPU workloads by an average of 1:7×.
Publication Date
12-12-2018
Publication Title
Proceedings of the Annual International Symposium on Microarchitecture, MICRO
Volume
2018-October
Number of Pages
352-363
Document Type
Article; Proceedings Paper
Personal Identifier
scopus
DOI Link
https://doi.org/10.1109/MICRO.2018.00036
Copyright Status
Unknown
Socpus ID
85060020814 (Scopus)
Source API URL
https://api.elsevier.com/content/abstract/scopus_id/85060020814
STARS Citation
Shin, Seunghee; Lebeane, Michael; Solihin, Yan; and Basu, Arkaprava, "Neighborhood-Aware Address Translation For Irregular Gpu Applications" (2018). Scopus Export 2015-2019. 10048.
https://stars.library.ucf.edu/scopus2015/10048