为实现音频信号的低速率编码,提出一种正弦参数量化的新方案.该方案利用人耳掩蔽效应和实时参数统计特性,基于高速率原理建立并显式求解率失真优化问题,得到具有解析形式的参数量化器,实现了在正弦分量间和正弦参数间动态分配编码比特.本量化方案无迭代运算过程,适合实时低速率音频编码应用.与经典方法比较,平均比特速率每正弦减少约17%.当速率大于15bit/正弦,重建信号的感觉失真小于球形量化方案.
Abstract
In order to achieve low bit rate coding of audio signals,this paper presents a new scheme for quantization of sinusoidal parameters.By using the masking effect of the human auditory system and real-time parameter statistical properties,the scheme based on high-rate theory to establish a rate-distortion optimization problem that is solved explicitly,derives the parameter quantizers which have analytical expressions,and dynamically implements the coding rate distribution between sinusoidal components and sinusoidal parameters.The quantization scheme without iterative calculation processes,is suitable for real-time low-rate audio coding applications.Compared with the classical method,the average bit rate of the proposed scheme is reduced 17% per sinusoid.When the rate is larger than 15 bit per sinusoid,the perceptual distortion of the reconstructed signals is smaller than the spherical quantization scheme's perceptual distortion of the reconstructed signals.
关键词
高速率原理 /
正弦编码 /
匹配追踪 /
自适应联合量化
{{custom_keyword}} /
Key words
high-rate theory /
sinusoidal coding /
matching pursuit /
adaptive joint quantization
{{custom_keyword}} /
中图分类号:
TN911.21
{{custom_clc.code}}
({{custom_clc.text}})
{{custom_sec.title}}
{{custom_sec.title}}
{{custom_sec.content}}
参考文献
[1] Christensen M G,van de Par S.Efficient parametric coding of transients[J].IEEE Transactions on Audio,Speech,and Language Processing,2006,14(4):1340-1351.
[2] Purnhagen H.Very low bit rate parametric audio coding[D].Hannover,Germany:PhD dissertation,Hannover University,2008.
[3] Korten P E L,Jensen J,Heusdens R.High rate spherical quantization of sinusoidal parameters[A].12th European Signal Processing Conference[C].Vienna,Austria:EURASIP,2004.1757-1760.
[4] Vafin R,Prakash D,Kleijn W B.On frequency quantization in sinusoidal audio coding[J].IEEE Signal Processing Letters,2005,12(3):210-213.
[5] Vafin R,Kleijn W B.Entropy-constrained polar quantization and its application to audio coding[J].IEEE Transactions on Speech and Audio Processing,2005,13(2):220-232.
[6] Gray R M.Source Coding Theory[M].Norwell,MA:Kluwer,1990.
[7] Gray R M,Neuhoff D L.Quantization[J].IEEE Transactions on Information Theory,1998,44(6):2325-2383.
[8] Wilson G.Magnitude/phase quantization of independent Gaussian variates[J].IEEE Transactions on Communications,1980,28(11):1924-1929.
[9] Swaszek P F,Ku T W.Asymptotic performance of unrestricted polar quantizer[J].IEEE Transactions on Information Theory,1986,32(2):330-333.
[10] Korten P,Jensen J,Heusdens R.High-resolution spherical quantization of sinusoidal parameters[J].IEEE Transactions on Audio,Speech,and Language Processing,2007,15(3):966-981.
[11] Korten P,Jensen J,Heusdens R.High resolution spherical quantization of sinusoidal parameters using a erceptual distortion measure[A].International Conference on Acoustics,Speech,and Signal Processing[C].Philadelphia,USA:IEEE,2005.177-180.
[12] Moore B C J.An Introduction to the Psychology of Hearing (fourth edition)[M].London:Academic Press,1997.
[13] van de Par S,Kohlrausch A,et al.A perceptual model for sinusoidal audio coding based on spectral integration[J].EURASIP Journal on Applied Signal Processing,Special Issue on Anthropomorphic Signal Processing,2005,(9):1292-1304.
[14] Komzsik L.Applied Calculus of Variations for Engineers[M].Boca Raton:CRC Press,2009.
[15] Christensen R A,van de Par S.Rate-distortion efficient amplitude modulated sinusoidal audio coding[A].38th Asilomar Conference on Signals,Systems and Computers[C].USA:IEEE,2004.2280-2284.
[16] Goodwin M.Adaptive signal models:theory,algorithms,and audio applications[D].Berkeley,USA:PhD dissertation of University of California,Berkeley,1997.
[17] Papoulis A,Pillai S U.Probability,Random Variables and Stochastic Processes[M].New York:McGraw-Hill,2002.
[18] European Broadcasting Union.Sound Quality Assessment Material Recordings for Subjective Tests[DB].Geneva:EBU,1988.
[19] 王嵩,鲍长春.基于组合正弦的音频信号建模[J].信号处理,2009,25(8A):135-138. Wang S,Bao C C.An audio signal modeling based on composite sinusoids[J].Signal Processing,2009,25(8A):135-138.(in Chinese)
[20] Derrien O,Badeau R,Richard G.Entropy-constrained quantization of exponentially damped sinusoids parameters[A].IEEE International Conference on Acoustics,Speech,and Signal Processing[C].Prague,Czech Republic:IEEE,2011.4064-4067.
[21] Derrien O,Badeau R,Richard G.Parametric audio coding with exponentially damped sinusoids[J].IEEE Transactions on Audio,Speech,and Language Processing,2013,21(7):1489-1501.
{{custom_fnGroup.title_cn}}
脚注
{{custom_fn.content}}
基金
北京市教育委员会科技计划 (No.KM201310017009)
{{custom_fund}}