[1] Wei S,Hu G P,Hu Y,Wang R H.A new method for mispronunciation detection using support vector machine based on pronunciation space models[J].Speech Communication,2009,51(10):896-905.
[2] Witt S M,Young S J.Phone-level pronunciation scoring and assessment for interactive language learning[J].Speech Communication,2000,30 (2-3):95-108.
[3] 葛凤培,潘复平,董滨,颜永红.汉语发音质量评估的实验研究[J].声学学报,2010,35(2):261-266. GE Feng-pei,PAN Fu-ping,DONG Bin,YAN Yong-hong.Experiment investigation of Putonghua quality assessment[J].Acta Acustica,2010,35(2):261-266.(in Chinese)
[4] Lo W K,Zhang S,Meng H.Automatic derivation of phonological rules for mispronunciation detection in a computer-assisted pronunciation training system[A].Proceedings of Interspeech[C].JAPAN:ISCA,2010.765-768.
[5] Qian X,Soong F,Meng H.Discriminative acoustic model for improving mispronunciation detection and diagnosis in computer-aided pronunciation training (CAPT)[A].Proceedings of Interspeech[C].JAPAN:IEEE,2010.757-760.
[6] Luo D,Yang X,Wang L.Improvement of segmental mispronunciation detection with prior knowledge extracted from large L2 speech corpus[A].Proceedings of Interspeech[C].Italy:ISCA,2011.1593-1596.
[7] Qian X,Meng H,Soong F.The use of DBN-HMMs for mispronunciation detection and diagnosis in L2 English to support computer-aided pronunciation training[A].Proceedings of Interspeech[C].USA:ISCA,2012.775-778.
[8] Lee A,Zhang Y,Glass J.Mispronunciation detection via dynamic time warping on deep belief network-based posteriorgrams[A].Proceedings of ICASSP[C].Canada:IEEE,2013.8227-8231.
[9] Hu W,Qian Y,Soong F.A new DNN-based high quality pronunciation evaluation for computer-aided language learning (CALL)[A].Proceedings of Interspeech[C].France:ISCA,2013.1886-1890.
[10] Juang B H,Katagiri S.Discriminative learning for minimum error classification[J].IEEE Transactions on Signal Processing,1992,40(12):3043-3054.
[11] Bahl L R,Brown P F,Souza P,Mercer R.Maximum mutual information estimation of hidden Markov model parameters for speech recognition[A].Proceedings of ICASSP[C].JAPAN:IEEE,1986.49-52.
[12] Povey D,Woodland P.Minimum phone error and I-smoothing for improved discriminative training[A].Proceedings of ICASSP[C].USA:IEEE,2002.105-108.
[13] Huang H,Wang J,Abudureyimu H.Maximum F1-score discriminative training for automatic mispronunciation detection in computer-assisted language learning[A].Proceedings of Interspeech[C].USA:ISCA.2012.815-818.
[14] Droppo J,Acero A.Maximum mutual information SPLICE transform for seen and unseen conditions[A].Proceedings of Interspeech[C].Portugal:ISCA,2005.989-992.
[15] Povey D,Kingsbury B,Mangu L,et al.fMPE:discriminatively trained features for speech recognition[A].Proceedings of ICASSP[C].USA:IEEE,2005.961-964.
[16] Zhang B,Matsoukas S,Schwartz R.Discriminatively trained region dependent feature transforms for speech recognition[A].Proceedings of ICASSP[C].FRANCE:IEEE,2006.I313-I316.
[17] Nocedal J,Wright S J.Numerical Optimization[M].Germany:Springer,1999.
[18] 竺博.区分性训练和区分性自适应在自动语音识别声学模型优化中的应用[D].安徽合肥:中国科学技术大学,2009. ZHU Bo.Application of discriminative training and discriminative training-adaptation in acoustic modeling of ASR[D].Hefei,Anhui:University of Science and Technology of China,2009.(in Chinese) |