경량 딥러닝 모델의 초저정밀도 양자화를 위한 학습 방식의 개선
- Title
- 경량 딥러닝 모델의 초저정밀도 양자화를 위한 학습 방식의 개선
- Other Titles
- Improving training method for very low bit weight quantization of Light Deep Learning Model
- Author
- 최정욱
- Issue Date
- 2020-11
- Publisher
- The Institute of Electronics and Information Engineers (IEIE)
- Citation
- Proceedings of the 2020 IEIE Fall Conference, pp. 601-604
- Abstract
- Deep learning model quantization is one of the most effective techniques for making a model lighter and more computationally efficient.
Among many quantization algorithms, PROFIT [1] is specialized for sub-4-bit quantization of mobile networks. However, this method suffers sudden accuracy degradation at 2-bit precision.
In this paper, we propose an improved training method to address this problem in 2-bit weight quantization. We adopt AIWQ, a metric for the activation instability induced by weight quantization [1], and derive a threshold value from this metric. Using the threshold, we stop training the quantized layers that are highly sensitive to weight quantization and fine-tune the remaining quantized layers with a different learning rate and scheduler. With this improved training method, we raise the 2-bit weight quantization accuracy of lightweight deep learning models, including EfficientNet-B0 and MobileNetV2.
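The layer-selection step described in the abstract can be sketched as follows. This is a minimal illustration, not the paper's implementation: the `aiwq` values and the `threshold` are hypothetical stand-ins for the per-layer AIWQ metric and the threshold the authors derive from it.

```python
def split_layers_by_sensitivity(aiwq_per_layer, threshold):
    """Partition layers by their AIWQ sensitivity to weight quantization.

    Layers at or above the threshold are frozen (training stopped);
    the rest are fine-tuned with a separate learning rate and scheduler.
    """
    frozen = {name for name, s in aiwq_per_layer.items() if s >= threshold}
    trainable = set(aiwq_per_layer) - frozen
    return frozen, trainable

# Illustrative AIWQ values; layer names and numbers are assumptions.
aiwq = {"stem_conv": 0.9, "block2": 0.2, "block3": 0.7, "head": 0.1}
threshold = 0.5  # hypothetical; the paper derives this from the AIWQ metric

frozen, trainable = split_layers_by_sensitivity(aiwq, threshold)
# frozen layers keep their quantized weights fixed; trainable layers
# continue fine-tuning under a different learning-rate schedule.
```

In a real quantization-aware training loop, freezing would amount to excluding the frozen layers' parameters from the optimizer (or setting `requires_grad=False` in PyTorch) while the remaining layers continue training.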
- URI
- https://www.dbpia.co.kr/journal/articleDetail?nodeId=NODE10521871
- https://repository.hanyang.ac.kr/handle/20.500.11754/172399
- Appears in Collections:
- COLLEGE OF ENGINEERING[S](공과대학) > ELECTRONIC ENGINEERING(융합전자공학부) > Articles
- Files in This Item:
There are no files associated with this item.
- Export
- RIS (EndNote)
- XLS (Excel)
- XML