
High-Precision Lightweight Quantization Inference Method for Prevalent Activation Functions in Transformer Models in Edge Device Deployment
YANG Yun-hui, CHENG Hu, WEI Jing-he, LIU Guo-zhu, SANG Xian-zhen
ACTA ELECTRONICA SINICA ›› 2024, Vol. 52 ›› Issue (10) : 3301-3311.
High-Precision Lightweight Quantization Inference Method for Prevalent Activation Functions in Transformer Models in Edge Device Deployment
{{custom_ref.label}} |
{{custom_citation.content}}
{{custom_citation.annotation}}
|
/
〈 |
|
〉 |