2019-04 | Adaptive Cooperation of Prefetching and Warp Scheduling on GPUs | 박영준 |
2016-06 | APRES: Improving Cache Efficiency by Exploiting Load Characteristics on GPUs | 박영준 |
2016-07 | A bypass first policy for energy-efficient last level caches | 박영준 |
2017-05 | A Comparative Study of Programming Environments Exploiting Heterogeneous Systems | 박영준 |
2019-06 | A compiler-based approach for GPGPU performance calibration using TLP modulation (WIP paper) | 박영준 |
2020-07 | Convergence-Aware Neural Network Training | 박영준 |
2015-11 | An eDRAM-Based Approximate Register File for GPUs | 박영준 |
2017-04 | Efficient GPU multitasking with latency minimization and cache boosting | 박영준 |
2015-07 | Enabling Efficient Alias Speculation | 박영준 |
2017-06 | Enabling Energy Efficient Image Encryption using Approximate Memoization | 박영준 |
2019-06 | GATE: A Generalized Dataflow-level Approximation Tuning Engine For Data Parallel Architectures | 박영준 |
2019-06 | Improving GPU Multitasking Efficiency Using Dynamic Resource Sharing | 박영준 |
2020-08 | LOCKED-Free Journaling: Improving the Coalescing Degree in EXT4 Journaling | 박영준 |
2019-04 | Microarchitecture-Aware Code Generation for Deep Learning on Single-ISA Heterogeneous Multi-Core Mobile Processors | 박영준 |
2020-07 | Navigator: Dynamic Multi-kernel Scheduling to Improve GPU Performance | 박영준 |
2020-04 | Optimization of GPU-based Sparse Matrix Multiplication for Large Sparse Networks | 박영준 |
2020-02 | PreScaler: An Efficient System-aware Precision Scaling Framework on Heterogeneous Systems | 박영준 |
2020-11 | Resource-Aware Device Allocation of Data-Parallel Applications on Heterogeneous Systems | 박영준 |
2017-06 | Selective DRAM cache bypassing for improving bandwidth on DRAM/NVM hybrid main memory systems | 박영준 |
2012-03 | SIMD Defragmenter: Efficient ILP Realization on Data-parallel Architectures | 박영준 |