High-Precision Lightweight Quantization Inference Method for Prevalent Activation Functions in Transformer Models in Edge Device Deployment

YANG Yun-hui, CHENG Hu, WEI Jing-he, LIU Guo-zhu, SANG Xian-zhen

ACTA ELECTRONICA SINICA ›› 2024, Vol. 52 ›› Issue (10) : 3301-3311.

PDF(2067 KB)
CIE Homepage  |  Join CIE  |  Login CIE  |  中文 
PDF(2067 KB)
ACTA ELECTRONICA SINICA ›› 2024, Vol. 52 ›› Issue (10) : 3301-3311. DOI: 10.12263/DZXB.20240435
PAPERS

High-Precision Lightweight Quantization Inference Method for Prevalent Activation Functions in Transformer Models in Edge Device Deployment

    {{javascript:window.custom_author_en_index=0;}}
  • {{article.zuoZhe_EN}}
Author information +

HeighLight

{{article.keyPoints_en}}

Abstract

{{article.zhaiyao_en}}

Key words

QR code of this article

Cite this article

Download Citations
{{article.zuoZheEn_L}}. {{article.title_en}}[J]. {{journal.qiKanMingCheng_EN}}, 2024, 52(10): 3301-3311. https://doi.org/10.12263/DZXB.20240435

References

References

{{article.reference}}

Funding

RIGHTS & PERMISSIONS

{{article.copyrightStatement_en}}
{{article.copyrightLicense_en}}
PDF(2067 KB)

Accesses

Citation

Detail

Sections
Recommended

/