Gridgas: An I/O-Efficient Heterogeneous Fpga+Cpu Computing Platform For Very Large-Scale Graph Analytics

Keywords

Graph processing; Heterogeneous system

Abstract

In this paper, we develop a highly scalable approach to constructing an efficient heterogeneous graph processing engine in order to handle extremely large graph size beyond its on-board memory capacity. Our FPGA-based computing engine not only surpasses cutting-edge GPU-based engines in terms of computing performance and energy efficiency, but also proves to be highly versatile and thus can be applied to many types of low-latency and high-Throughput graph analytic tasks central to the next-generation graph-based machine learning. We analyze in detail the difference between GPU's and FPGA's architectures and provide several fundamental reasons why, for irregular computations, FPGA may surpass GPU in computing latency and energy efficiency, and discuss some 'golden rules' for designing an efficient FPGA+CPU heterogeneous platform and GPU's inefficiency when handling extremely large-scale graph datasets. To validate our approach, we implement our FPGA-based GridGAS computing engine with a KC705 Xilinx FPGA board and a baseline implementation using a Quadro K420 GPU following the same approach and test with large-scale graph datasets. Using PCIe 2.0 x8 only, our architecture achieves up to 170.4 MTEPS and 14.8 times speedup over the GPU baseline for datasets exceeding 1.4 GB in size.

Publication Date

12-1-2018

Publication Title

Proceedings - 2018 International Conference on Field-Programmable Technology, FPT 2018

Number of Pages

249-252

Document Type

Article; Proceedings Paper

Personal Identifier

scopus

DOI Link

https://doi.org/10.1109/FPT.2018.00045

Socpus ID

85068316326 (Scopus)

Source API URL

https://api.elsevier.com/content/abstract/scopus_id/85068316326

This document is currently not available here.

Share

COinS