DEEPTOOLS: Compiler and Execution Runtime Extensions for RAPiD AI Accelerator
- Title
- DEEPTOOLS: Compiler and Execution Runtime Extensions for RAPiD AI Accelerator
- Author
- 최정욱
- Keywords
- Deep Learning; Machine learning accelerators; Software stack for AI
- Issue Date
- 2019-09
- Publisher
- IEEE COMPUTER SOC
- Citation
- IEEE MICRO, v. 39, no. 5, Page. 102-111
- Abstract
- The ubiquitous adoption of systems specialized for AI requires bridging two seemingly conflicting challenges-the need to deliver extreme processing efficiencies while employing familiar programming interfaces, making them compelling even for nonexpert users. We take a significant first step towards this goal and present an end-to-end software stack for the RAPID AI accelerator developed by IBM Research. We present a set of software extensions, called DEEPTOOLS, that leverage and work within popular deep learning frameworks. DEEPTOOLS requires no additional user input and enables aggressive, accelerator-specific performance optimization akin to a full, custom framework. DEEPTOOLS has two key components: 1) a compiler runtime called DeepRT, which automatically identifies how best to execute a given DNN graph on RAPID and constructs the requisite program binaries; and 2) an execution runtime called RAPiDLiB, which triggers and manages the execution of compute and data-transfer operations on RAPID. We integrate DEEPTOOLS with TensorFlow and map popular DNNs (AlexNet, VGG, ResNet, LSTM) to RAPID. We demonstrate substantial improvement in performance over hand-tuned mappings.
- URI
- https://ieeexplore.ieee.org/document/8782645https://repository.hanyang.ac.kr/handle/20.500.11754/154102
- ISSN
- 0272-1732; 1937-4143
- DOI
- 10.1109/MM.2019.2931584
- Appears in Collections:
- COLLEGE OF ENGINEERING[S](공과대학) > ELECTRONIC ENGINEERING(융합전자공학부) > Articles
- Files in This Item:
There are no files associated with this item.
- Export
- RIS (EndNote)
- XLS (Excel)
- XML