[1] SAK H,SENIOR A,BEAUFAYS F.Long short-term memory based recurrent neural network architectures for large vocabulary speech recognition[J].arXiv Preprint,2014,arXiv:1402.1128. [2] MIKOLOV T,KARAFIÁT M,BURGET L,et al.Recurrent neural network based language model[A].Eleventh Annual Conference of the International Speech Communication Association[C].Chiba,Japan:ISCA.2010.1045-1048. [3] CHO K,VANMerriënboer B,GULCEHRE C,et al.Learning phrase representations using RNN encoder decoder for statistical machine translation[J].arXiv Preprint,2014,arXiv:1406.1078. [4] BYEON W,BREUEL T M,RAUE F,et al.Scene labeling with LSTM recurrent neural networks[A].Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition[C].Boston,Massachusetts,USA:IEEE.2015.3547-3555. [5] HOCHREITER S,SCHMIDHUBER.J.Long short-term memory[J].Neural Computation,1997,9(8):1735-1780. [6] LI Z,WANG S,DING C,et al.Efficient recurrent neural networks using structured matrices in FPGAs[J].arXiv Preprint,2018,arXiv:1803.07661. [7] HAN S,KANG J,MAO H,et al.Ese:Efficient speech recognition engine with sparse LSTM on FPGA[A].Proceedings of the 2017 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays[C].Monterey,California,USA:ACM,2017.75-84. [8] ALI S M,WANG S J,MA N,et al.A bandwidth in-sensitive low stall sparse matrix vector multiplication architecture on reconfigurable FPGA platform[A].2017 13th IEEE International Conference on.IEEE[C].Yangzhou,China:Electronic Measurement & Instruments (ICEMI),2017.171-176. [9] NEIL D,LEE J H,DELBRUCK T,et al.Delta networks for optimized recurrent network computation[A].Proceedings of the 34th International Conference on Machine Learning[C].Australia:JMLR,2017.2584-2593. [10] HOPFIELD J J.Neural networks and physical systems with emergent collective computational abilities[J].Proceedings of the National Academy of Sciences,1982,79(8):2554-2558 [11] Williams S,Waterman A,Patterson D A,et al.Roofline:An insightful visual performance model for multicore architectures[J].Communications of the ACM,2009,52(4):65-76. [12] AYAT S O,MOHAMED K-H,AB R A A H.Optimizing FPGA-based CNN accelerator for energy efficiency with an extended Roofline model[J].Turkish Journal of Electrical Engineering & Computer Sciences,2018,26(2):919-935. [13] LECUN Y,BOTTOU L,BENGIO Y,et al.Gradient-based learning applied to document recognition[J].Proceedings of the IEEE,1998,86(11):2278-2324. [14] 宋翔,周凡,陈耀武,等.基于FPGA的实时双精度浮点矩阵乘法器设计[J].浙江大学学报(工学版),2008,42(9):1611-1615. SONG X,ZHOU F,CHEN Y W,et al.Design of real time double precision floating point matrix multiplier based on FPGA[J].Journal of Zhejiang University,2008,42(9):1611-1615.(in Chinese) |