Repository at Hanyang University: Optimization of a GPU-based Sparse Matrix Multiplication for Large Sparse Networks


Full metadata record

DC Field	Value	Language
dc.contributor.author	김상욱	-
dc.date.accessioned	2021-10-26T05:57:01Z	-
dc.date.available	2021-10-26T05:57:01Z	-
dc.date.issued	2020-04	-
dc.identifier.citation	2020 IEEE 36th International Conference on Data Engineering (ICDE), pp. 925-936	en_US
dc.identifier.isbn	978-1-7281-2903-7	-
dc.identifier.issn	2375-026X	-
dc.identifier.uri	https://ieeexplore.ieee.org/document/9101654?arnumber=9101654&SID=EBSCO:edseee	-
dc.identifier.uri	https://repository.hanyang.ac.kr/handle/20.500.11754/165755	-
dc.description.abstract	Sparse matrix multiplication (spGEMM) is widely used to analyze sparse network data and to extract important information based on its matrix representation. As it contains a high degree of data parallelism, many efficient implementations using data-parallel programming platforms such as CUDA and OpenCL have been introduced on graphics processing units (GPUs). Several well-known spGEMM techniques, such as cuSPARSE and CUSP, often do not utilize the GPU resources fully, owing to the load imbalance between threads in the expansion process and high memory contention in the merge process. Furthermore, even though several outer-product-based spGEMM techniques have been proposed to solve the load balancing problem in expansion, they still do not utilize the GPU resources fully, because severe computation load variations exist among the multiple thread blocks. To solve these challenges, this paper proposes a new optimization pass called Block Reorganizer, which balances the total computations of each computing unit on target GPUs, based on the outer-product-based expansion process, and reduces the memory pressure during the merge process. For expansion, it first identifies the actual computation amount for each block, and then performs two thread block transformation processes based on their characteristics: 1) B-Splitting to transform heavy-computation blocks into multiple small blocks and 2) B-Gathering to aggregate multiple small-computation blocks into a larger block. While merging, it improves the overall performance by performing B-Limiting to limit the number of blocks on each computing unit. Experimental results show that it improves the total performance of kernel execution by 1.43x, on average, when compared to the row-product-based spGEMM, for NVIDIA Titan Xp GPUs on real-world datasets.	en_US
dc.description.sponsorship	Thanks to Myung-Hwan Jang and Hyuck-Moo Gwon for all their help and feedback. We also thank the anonymous reviewers who provided good suggestions for improving the quality of this work. This work was supported by Samsung Research Funding & Incubation Center of Samsung Electronics under Project Number SRFC-IT1901-03. Yongjun Park is the corresponding author.	en_US
dc.language.iso	en	en_US
dc.publisher	IEEE ICDE 2020	en_US
dc.subject	Sparse matrix multiplication	en_US
dc.subject	sparse network	en_US
dc.subject	GPU	en_US
dc.subject	linear algebra	en_US
dc.title	Optimization of a GPU-based Sparse Matrix Multiplication for Large Sparse Networks	en_US
dc.type	Article	en_US
dc.identifier.doi	10.1109/ICDE48307.2020.00085	-
dc.relation.page	925-936	-
dc.contributor.googleauthor	Lee, Jeongmyung	-
dc.contributor.googleauthor	Kang, Seokwon	-
dc.contributor.googleauthor	Yu, Yongseung	-
dc.contributor.googleauthor	Jo, Yong-Yeon	-
dc.contributor.googleauthor	Kim, Sang-Wook	-
dc.contributor.googleauthor	Park, Yongjun	-
dc.relation.code	20200060	-
dc.sector.campus	S	-
dc.sector.daehak	COLLEGE OF ENGINEERING[S]	-
dc.sector.department	SCHOOL OF COMPUTER SCIENCE	-
dc.identifier.pid	wook	-
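
The expansion-side idea in the abstract (estimate each block's computation, then split heavy blocks and gather light ones) can be illustrated with a small host-side sketch. This is not the paper's implementation: the function names `block_workloads` and `reorganize`, the thresholds `heavy` and `light`, and the greedy gathering rule are all assumptions made for illustration, and the abstract does not state the actual criteria used. The sketch assumes one block per outer product, whose work is nnz(A[:, k]) * nnz(B[k, :]). B-Limiting, which applies to the merge phase, is not modeled here.

```python
from typing import List, Tuple

Piece = Tuple[int, int]  # (original block id k, amount of work assigned)


def block_workloads(a_col_nnz: List[int], b_row_nnz: List[int]) -> List[int]:
    """Work of outer product k = nnz(A[:, k]) * nnz(B[k, :])."""
    return [a * b for a, b in zip(a_col_nnz, b_row_nnz)]


def reorganize(workloads: List[int], heavy: int, light: int) -> List[List[Piece]]:
    """Hypothetical block reorganization (thresholds are illustrative).

    - Blocks with work > heavy are split into pieces of at most `heavy`.
    - Blocks with work < light are gathered until the group reaches `light`.
    - All other blocks are kept as they are.
    """
    units: List[List[Piece]] = []
    pending: List[Piece] = []
    pending_work = 0
    for k, w in enumerate(workloads):
        if w == 0:
            continue  # empty outer product, nothing to schedule
        if w > heavy:
            # Splitting: cut one heavy block into several smaller units.
            remaining = w
            while remaining > 0:
                piece = min(heavy, remaining)
                units.append([(k, piece)])
                remaining -= piece
        elif w < light:
            # Gathering: accumulate light blocks into one larger unit.
            pending.append((k, w))
            pending_work += w
            if pending_work >= light:
                units.append(pending)
                pending, pending_work = [], 0
        else:
            units.append([(k, w)])
    if pending:
        units.append(pending)  # flush any leftover light blocks
    return units


if __name__ == "__main__":
    work = block_workloads([100, 1, 2, 10, 1], [50, 1, 3, 10, 2])
    for unit in reorganize(work, heavy=2000, light=8):
        print(unit)
```

In this toy input, the single 5000-operation block is cut into three units and the three tiny blocks are merged into one unit, so the per-unit work varies far less than the per-block work did. Total work is unchanged; only its distribution across units changes.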

Appears in Collections: COLLEGE OF ENGINEERING[S] > COMPUTER SCIENCE > Articles
