[1] 鲍长春.数字语音编码原理[M].西安:西安电子科技大学出版社.2007. Bao Chang-chun.Principles of Digital Speech Coding[M].Xi'an,China:Xidian University Press.2007.(in Chinese)
[2] Xiao-ming Li,Chang-chun Bao,W Bastiaan Kleijn.Speech coding based on pitch synchrony and two-stage transformation[A].Proceedings of the IEEE International Conference on Acoustics Speech and Signal Processing(ICASSP2013)[C].Vancouver,Canada:IEEE,2013.8159-8163.
[3] Takehiro Moriya.Technologies forspeech and audio coding[A].Proceedings of the IEEE International Symposium on Consumer Electronics[C].Kyoto,Japan:IEEE,2009.148-149.
[4] ITU-T G.729.1.An 8-32 kb/s Scalable Wideband Coder Bit-stream Interoperable with G.729[S].2006-05.
[5] 贾懋,鲍长春.一种符合ITU-T指标要求的嵌入式立体声语音频编码新方法[J].电子学报,2009,37(10):2291-2297. JIA Mao-shen,BAO Chang-chun.An embedded stereo speech and audio coding method meeting the requirements of ITU-T terms of reference[J].Acta Electronica Sinica,2009,37(10):2291-2297.(in Chinese)
[6] ITU-T G.718.Frame Error Robust Narrowband and Wideband Embedded Variable Bit-rate Coding of Speech and Audio from 8-32 kb/s[S].2008.
[7] 3GPP.TS 26.290 V6.3.0.Extended Adaptive Multi-Rate-Wideband(AMR-WB+)Codec[S].2005-6.
[8] H Malvar.Lapped transforms for efficient transform/subband coding[J].IEEE Transactions on Acoustics,Speech and Signal Processing,1990,38(6):969-978.
[9] N Ahmed,T Natarajan,K R Rao.Discretecosine transform[J].IEEE Transactions on Computers,1974,C-23(1):90-93.
[10] 刘靖宇,鲍长春,李如玮.基于离散余弦变换的波形内插语音编码算法[J].电子学报,2009,37(7):1599-1605. LIU Jing-Yu,Bao Chang-chun,LI Ru-wei.Waveform inerpolation speech coding based on DCT[J].Acta Electronica Sinica,2009,37(7):1599-1605.(in Chinese)
[11] Ted Painter,Andreas Spanias.Perceptual coding of digital audio[J].Proceedings of the IEEE,2000,88(4):451-513.
[12] K Brandenburg,G Stoll,Y Dehery,et al.ISO-MPEG-1 Audio:A generic standard for coding of high-quality digital audio[J].AES:Journal of the Audio Engineering Society,1994,42(10):780-792.
[13] Neuendorf Max,Multrus Markus,et al.MPEG unified speech and audio coding-the ISO/MPEG standard for high-efficiency audio coding of all content types[A].Proceedings of the 132nd Audio Engineering Society Convention[C].USA:AES,2012.248-269.
[14] M Wolters,K Kjorling,D Homm,H Purnhagen.A closer look into MPEG-4 High Efficiency AAC[A].Proceedings of the 115th AES Convention[C].New York,USA:AES,2003.5871-5886.
[15] M Nilsson,B Resch,M Y Kim,W B Kleijn.A canonical representation of speech[A].Proceedings of the IEEE International Conference on Acoustics,Speech and Signal Processing[C].USA:IEEE,2007,vol.4.IV849-IV852.
[16] Resch M Nilsson,A Ekman,W B Kleijn.Estimation of the instantaneous pitch of speech[J].IEEE Transactions on Speech Audio Processing,2007,15(3):813-822.
[17] M Unser,A Aldroubi,M Eden.B-spline signal processing:PartⅠ-theory[J].IEEE Transactions on Signal Processing,1993,41(2):821-833.
[18] M Unser,A Aldroubi,M Eden.B-spline signal processing:PartⅡ-efficient design and applications[J].IEEE Transactions on Signal Processing,1993,41(2):834-848.
[19] 薛二娟,鲍长春,李如玮.基于二维非负矩阵分解的1kb/s WI语音编码算法[J].电子学报,2010,38(7):1574-1579. XUE Er-juan,BAO Chang-chun,LI Ru-wei.1kb/s waveform interpolative speech coding based on two-dimensional nonnegative matrix factorization[J].Acta Electronica Sinica,2010,38(7):1574-1579.(in Chinese)
[20] ITU-T Recommendation G.722.1 Low-Complexity Coding at 24 and 32 kbit/s for Hands-Free Operation in Systems with Low Frame Loss[S].Geneva,2005-05.
[21] 3GPP TS 26.171.Adaptive Multi-Rate-Wideband(AMR-WB)Speech Codec;General Description[S].2002.
[22] F Baumgarte,A Lerch.Document 6QI18-E. Implementation of Recommendation lTU-R BS.1387,Delayed Contribution[S].February 2001.
[23] ITU-T Recommendation P.862.Perceptual Evaluation of Speech Quality(PESQ):An Objective Method for End-to-End Speech Quality Assessment of Narrow-Band Telephone Networks and Speech Codes[S].2001. |