Repository at Hanyang University: Asymptotically Optimal Merging on ManyCore GPUs


Full metadata record

DC Field	Value	Language
dc.contributor.author	KUTZNER ARNE HOLGER	-
dc.date.accessioned	2018-03-19T00:11:31Z	-
dc.date.available	2018-03-19T00:11:31Z	-
dc.date.issued	2012-12	-
dc.identifier.citation	IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2012, 95(12), P.2769-2777	en_US
dc.identifier.issn	0916-8532	-
dc.identifier.uri	https://www.jstage.jst.go.jp/article/transinf/E95.D/12/E95.D_2769/_article	-
dc.identifier.uri	http://hdl.handle.net/20.500.11754/48466	-
dc.description.abstract	We propose a family of algorithms for efficient merging on contemporary GPUs, where each algorithm requires O(m log(n/m + 1)) element comparisons, with m and n the sizes of the input sequences and m <= n. According to the lower bounds for merging, all proposed algorithms are asymptotically optimal with respect to the number of necessary comparisons. First, we introduce a parallel-structured algorithm that splits a merging problem of size 2^l into 2^i subproblems of size 2^(l-i), for an arbitrary i with 0 <= i <= l. This algorithm represents a merger for i = l, but it is rather inefficient in this case. Efficiency is improved by moving to a two-stage approach in which the splitting process stops at a predetermined level and transfers control to several block-mergers operating in parallel. We formally prove the asymptotic optimality of the splitting process and show that, for symmetrically sized inputs, our approach delivers up to 4 times faster runtimes than the thrust::merge function of the Thrust library. To assess the value of our merging technique in the context of sorting, we construct and evaluate a MergeSort on top of it. In our benchmarks, the resulting MergeSort clearly outperforms the MergeSort implementation provided by the Thrust library as well as Cederman's GPU-optimized variant of QuickSort.	en_US
dc.language.iso	en	en_US
dc.publisher	Institute of Electronics, Information and Communication Engineers	en_US
dc.subject	parallel algorithms	en_US
dc.subject	GPGPU	en_US
dc.subject	complexity	en_US
dc.subject	merging	en_US
dc.subject	sorting	en_US
dc.title	Asymptotically Optimal Merging on ManyCore GPUs	en_US
dc.type	Article	en_US
dc.relation.no	12	-
dc.relation.volume	E95D	-
dc.identifier.doi	10.1587/transinf.E95.D.2769	-
dc.relation.page	2769-2777	-
dc.relation.journal	IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS	-
dc.contributor.googleauthor	Kutzner, Arne	-
dc.contributor.googleauthor	Kim, Pok-Son	-
dc.contributor.googleauthor	Park, Won-Kwang	-
dc.relation.code	2012203910	-
dc.sector.campus	S	-
dc.sector.daehak	COLLEGE OF ENGINEERING[S]	-
dc.sector.department	DEPARTMENT OF INFORMATION SYSTEMS	-
dc.identifier.pid	kutzner	-

Appears in Collections: COLLEGE OF ENGINEERING[S] > INFORMATION SYSTEMS > Articles
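
The abstract describes splitting one merging problem into equally sized, independent subproblems that separate block-mergers can then process in parallel. The sketch below illustrates that general idea only; it is not the authors' algorithm and makes no claim about their comparison bound. It swaps in a standard co-ranking binary search to find the split points, and the names `co_rank` and `split_merge` are hypothetical. The subproblems are merged one after another here, whereas on a GPU each would be assigned to its own thread block.

```python
from heapq import merge as seq_merge


def co_rank(k, a, b):
    """Return (i, j) with i + j == k such that a[:i] and b[:j] together
    hold exactly the first k elements of the stable merge of a and b."""
    lo = max(0, k - len(b))
    hi = min(k, len(a))
    while lo < hi:
        i = (lo + hi) // 2
        j = k - i  # here 1 <= j <= len(b), so b[j - 1] is a valid index
        if a[i] <= b[j - 1]:
            # a[i] still belongs to the first k outputs: take more from a
            lo = i + 1
        else:
            hi = i
    return lo, k - lo


def split_merge(a, b, parts):
    """Merge sorted lists a and b via `parts` independent subproblems."""
    total = len(a) + len(b)
    # Split points in the output, mapped back to positions in a and b.
    bounds = [co_rank(total * t // parts, a, b) for t in range(parts + 1)]
    out = []
    for (i0, j0), (i1, j1) in zip(bounds, bounds[1:]):
        # Each slice pair is self-contained; no other slice is needed
        # to merge it, so the pairs could be processed in parallel.
        out.extend(seq_merge(a[i0:i1], b[j0:j1]))
    return out


if __name__ == "__main__":
    a = [1, 3, 5, 7, 9, 11, 13, 15]
    b = [2, 2, 4, 6, 8, 10, 12, 14]
    print(split_merge(a, b, 4))
```

With a total input size of 2^l = 16 and 2^i = 4 parts, as in this usage example, every subproblem produces exactly 2^(l-i) = 4 output elements, which mirrors the size relation stated in the abstract.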
