245 0

Full metadata record

DC FieldValueLanguage
dc.contributor.author박영준-
dc.date.accessioned2019-11-24T16:35:30Z-
dc.date.available2019-11-24T16:35:30Z-
dc.date.issued2017-04-
dc.identifier.citationIEICE ELECTRONICS EXPRESS, v. 14, no. 7, Article no. 20161158en_US
dc.identifier.issn1349-2543-
dc.identifier.urihttps://www.jstage.jst.go.jp/article/elex/14/7/14_14.20161158/_article-
dc.identifier.urihttps://repository.hanyang.ac.kr/handle/20.500.11754/113738-
dc.description.abstractGPU spatial multitasking has been proven to be quite effective at executing different applications concurrently using SM partitioning. However, while it maximizes total throughput, latency-critical applications often cannot meet their deadlines due to the increased execution time. Furthermore, SM partitioning cannot allocate the appropriate L1 cache size per kernel. To solve these problems, this paper proposes a new application-aware resource allocation framework called GPU Fine-Tuner, for assigning appropriate resources to GPU kernels. To minimize the execution time of latency-constrained applications, it assigns them more SMs when performance is not affected. It also increases the cache size of SMs for cache-sensitive kernels using resource borrowing from neighbors for cache-insensitive kernels. Experimental results show that the Fine-Tuner outperforms GPU spatial multitasking with up to 15% less average latency without performance degradation.en_US
dc.description.sponsorshipThis work was supported in part by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIP) (No. NRF-2015R1C1A1A01053844 and No. NRF-2015K2A1A2070541) and the Korea Institute for Advancement of Technology (KIAT) grant funded by the Korean government (Motie: Ministry of Trade, Industry & Energy, HRD Program for Software-SoC convergence) (No. N0001883).en_US
dc.language.isoen_USen_US
dc.publisherIEICE-INST ELECTRONICS INFORMATION COMMUNICATIONS ENGen_US
dc.subjectGPGPUen_US
dc.subjectmultitaskingen_US
dc.subjectenergyen_US
dc.subjectresource sharingen_US
dc.subjectworkload balancingen_US
dc.titleEfficient GPU multitasking with latency minimization and cache boostingen_US
dc.typeArticleen_US
dc.relation.no7-
dc.relation.volume14-
dc.identifier.doi10.1587/elex.14.20161158-
dc.relation.page1-12-
dc.relation.journalIEICE ELECTRONICS EXPRESS-
dc.contributor.googleauthorKim, Jiho-
dc.contributor.googleauthorChu, Minsung-
dc.contributor.googleauthorPark, Yongjun-
dc.relation.code2017010142-
dc.sector.campusS-
dc.sector.daehakCOLLEGE OF ENGINEERING[S]-
dc.sector.departmentDEPARTMENT OF COMPUTER SCIENCE-
dc.identifier.pidyongjunpark-
Appears in Collections:
COLLEGE OF ENGINEERING[S](공과대학) > COMPUTER SCIENCE(컴퓨터소프트웨어학부) > Articles
Files in This Item:
There are no files associated with this item.
Export
RIS (EndNote)
XLS (Excel)
XML


qrcode

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

BROWSE