面向Transformer模型边缘端部署的常用激活函数高精度轻量级量化推理方法
杨赟辉, 程虎, 魏敬和, 刘国柱, 桑贤侦
High-Precision Lightweight Quantization Inference Method for Prevalent Activation Functions in Transformer Models in Edge Device Deployment
YANG Yun-hui, CHENG Hu, WEI Jing-he, LIU Guo-zhu, SANG Xian-zhen
电子学报
.
2024, (10): 3301
-3311
.
DOI: 10.12263/DZXB.20240435