375 0

Full metadata record

DC FieldValueLanguage
dc.contributor.author최정욱-
dc.date.accessioned2022-10-27T01:47:23Z-
dc.date.available2022-10-27T01:47:23Z-
dc.date.issued2021-02-
dc.identifier.citation2021 IEEE INTERNATIONAL SOLID-STATE CIRCUITS CONFERENCE (ISSCC), page. 144-145en_US
dc.identifier.issn0193-6530en_US
dc.identifier.urihttps://ieeexplore.ieee.org/document/9365791en_US
dc.identifier.urihttps://repository.hanyang.ac.kr/handle/20.500.11754/175861-
dc.description.abstractLow-precision computation is the key enabling factor to achieve high compute densities (TOPS/W and TOPS/mm 2 ) in AI hardware accelerators across cloud and edge platforms. However, robust deep learning (DL) model accuracy equivalent to high-precision computation must be maintained. Improvements in bandwidth, architecture, and power management are also required to harness the benefit of reduced precision by feeding and supporting more parallel engines to achieve high sustained utilization and optimize performance within a given product power envelope. In this work, we present a 4-core AI chip in 7nm EUV technology that exploits cutting-edge algorithmic advances for iso-accurate models in low-precision training and inference [1, 2] and aggressive circuit/architecture optimization to achieve leading-edge power-performance. The chip supports fp16 (DLFIoat16 [8]) and hybrid-fp8 (hfp8) [1] formats for training and inference of DL models, as well as int4 and int2 formats for highly scaled inference.en_US
dc.language.isoenen_US
dc.publisherIEEEen_US
dc.titleA 7nm 4-Core AI Chip with 25.6TFLOPS Hybrid FP8 Training, 102.4TOPS INT4 Inference and Workload-Aware Throttlingen_US
dc.typeArticleen_US
dc.identifier.doi10.1109/ISSCC42613.2021.9365791en_US
dc.relation.page144-145-
dc.contributor.googleauthorAgrawal, Ankur-
dc.contributor.googleauthorLee, Sae Kyu-
dc.contributor.googleauthorSilberman, Joel-
dc.contributor.googleauthorZiegler, Matthew-
dc.contributor.googleauthorKang, Mingu-
dc.contributor.googleauthorVenkataramani, Swagath-
dc.contributor.googleauthorCao, Nianzheng-
dc.contributor.googleauthorFleischer, Bruce-
dc.contributor.googleauthorGuillorn, Michael-
dc.contributor.googleauthorChoi, Jungwook-
dc.sector.campusS-
dc.sector.daehakCOLLEGE OF ENGINEERING[S]-
dc.sector.departmentSCHOOL OF ELECTRONIC ENGINEERING-
dc.identifier.pidchoij-
Appears in Collections:
COLLEGE OF ENGINEERING[S](공과대학) > ELECTRONIC ENGINEERING(융합전자공학부) > Articles
Files in This Item:
There are no files associated with this item.
Export
RIS (EndNote)
XLS (Excel)
XML


qrcode

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

BROWSE