非均匀二次声源分布下的2.5D高阶Ambisonics声场合成算法

doi:10.12263/DZXB.20200010

PDF(2552 KB)

电子学报 ›› 2021, Vol. 49 ›› Issue (1) : 14-22. DOI: 10.12263/DZXB.20200010

学术论文

非均匀二次声源分布下的2.5D高阶Ambisonics声场合成算法

董石¹, 章强¹, 谢夏风¹, 钟睿²

作者信息 +

Non-Uniform Secondary Source Distribution for 2.5D Higher Order Ambisonics Sound Field Synthesis

DONG Shi¹, ZHANG Qiang¹, XIE Xia-feng¹, ZHONG Rui²

Author information +

文章历史 +

摘要

为了提高感兴趣区域音源的声场合成质量，提出了一种非均匀二次声源分布的声场分析与合成方法.首先，将连续二次声源的采样分为两个不同的采样区域，在总扬声器数目一定的情况下，增大感兴趣区域的样点数目并减少感兴趣区域外样点数目.然后，利用采样混叠矩阵的Hermitian特性，使用瑞利熵来估计不同非均匀采样方案的最大误差，以指导非均匀采样方案的选择.最终得到非均匀二次声源驱动信号的频域计算方法.实验结果表明在2.5D圆形区域边界下，所提非均匀排布方法与典型的均匀排布的最佳模态带宽相比，对感兴趣区域音源的无混叠半径提高了12.9%~22.6%，验证了提出方法的有效性.

Abstract

In order to improve the sound field synthesis quality of the sound source in the region of interest, a non-uniform secondary source distribution method for sound field analysis and synthesis is proposed. First, the sampling of the continuous secondary source is divided into two different sampling regions. With a certain number of loudspeakers, the number of samples in the region of interest is increased and the number of samples outside the region of interest is reduced. Then, using the Hermitian characteristic of the sampling aliasing matrix, Rayleigh entropy is used to estimate the maximum error of different non-uniform sampling schemes to guide the selection of non-uniform sampling schemes. Finally, the frequency domain calculation method of the driving signal for the non-uniform secondary source distribution is obtained. The experimental results show that the proposed non-uniform distribution method improves the aliasing-free radius by 12.9%~22.6% for the sound sources from the region of interest under the 2.5D circular boundary condition, compared with the optimal modal bandwidth of the uniform distribution. The effectiveness of the proposed method is verified.

导出引用

董石, 章强, 谢夏风, 钟睿. 非均匀二次声源分布下的2.5D高阶Ambisonics声场合成算法[J]. 电子学报, 2021, 49(1): 14-22. https://doi.org/10.12263/DZXB.20200010

DONG Shi, ZHANG Qiang, XIE Xia-feng, ZHONG Rui. Non-Uniform Secondary Source Distribution for 2.5D Higher Order Ambisonics Sound Field Synthesis[J]. Acta Electronica Sinica, 2021, 49(1): 14-22. https://doi.org/10.12263/DZXB.20200010

中图分类号： TN911

参考文献

[1] Kim S,Lee Y W,Pulkki V.New 10.2-channel vertical su-rround system (10.2-VSS);Comparison study of perceiv-ed audio quality in various multichannel sound systems with height loudspeakers[A].Audio Engineering Society Convention 129[C].New York,USA:Audio Engineering Society,2010.Paper Number 8296.
[2] Ando A.Conversion of multichannel sound signal maintaining physical properties of sound in reproduced sound field[J].IEEE Transactions on Audio,Speech,and Language Processing,2010,19(6):1467-1475.
[3] Herre J,Hilpert J,Kuntz A,et al.MPEG-H audio:the new standard for universal spatial/3D audio coding[J].Journal of the Audio Engineering Society,2014,62(12):821-830.
[4] Politis A,Vilkamo J,Pulkki V,et al.Sector-based parametric sound field reproduction in the spherical harmonic domain[J].IEEE Journal of Selected Topics in Signal Processing,2015,9(5):852-866.
[5] Ward D B,Abhayapala T D.Reproduction of a plane-wave sound field using an array of loudspeakers[J].IEEE Transactions on Speech and Audio Processing,2001,9(6):697-707.
[6] Poletti M A.Three-dimensional surround sound systems based on spherical harmonics[J].Journal of the Audio Engineering Society,2005,53(11):1004-1025.
[7] Gupta A,Abhayapala T D.Three-dimensional sound field reproduction using multiple circular loudspeaker arrays[J].IEEE Transactions on Audio,Speech,and Language Processing,2011,19(5):1149-1159.
[8] Ahrens J,Spors S.A modal analysis of spatial discretization of spherical loudspeaker distributions used for sound field synthesis[J].IEEE Transactions on Audio,Speech,and Language Processing,2012,20(9):2564-2574.
[9] Zhang W,Abhayapala T D.Three dimensional sound field reproduction using multiple circular loudspeaker arrays:functional analysis guided approach[J].IEEE Transactions on Audio,Speech,and Language Processing,2014,22(7):1184-1194.
[10] Winter F,Ahrens J,Spors S,et al.On analytic methods for 2.5D local sound field synthesis using circular distributions of secondary sources[J].IEEE Transactions on Audio,Speech,and Language Processing,2016,24(5):914-926.
[11] Koyama S,Furuya K,Hiwasaki Y,et al.Wave field reconstruction filtering in cylindrical harmonic domain for with-height recording and reproduction[J].IEEE Transactions on Audio,Speech,and Language Processing,2014,22(10):1546-1557.
[12] Firtha G,Fiala P,Schultz F,et al.Improved referencing schemes for 2.5D wave field synthesis driving functions[J].IEEE Transactions on Audio,Speech,and Language Processing,2017,25(5):1117-1127.
[13] Firtha G,Fiala P,Schultz F,et al.On the general relation of wave field synthesis and spectral division method for linear arrays[J].IEEE Transactions on Audio,Speech,and Language Processing,2018,26(12):2393-2403.
[14] Winter F,Wierstorf H,Hold C,et al.Colouration in local wave field synthesis[J].IEEE Transactions on Audio,Speech,and Language Processing,2018,26(10):1913-1924.
[15] Hahn N,Winter F,Spors S,et al.Synthesis of a spatially band-limited plane wave in the time-domain using wave field synthesis[A].The 25th European Signal Processing Conference(EUSIPCO)[C].Kos,Greece:IEEE,2017.673-677.
[16] 周岭松,鲍长春,贾懋珅,步兵.基于高阶Ambisonics的2.5维近场声源合成[J].电子学报,2016,45(3):520-526. ZHOU Ling-song,BAO Chang-chun,JIA Mao-shen,BU Bing.2.5D near-field sound source synthesis using higher order Ambisonics[J].Acta Electronica Sinica,2016,45(3):520-526.(in Chinese)
[17] Jia M,Wang W,Sun J,et al.Sound field reproduction of focused virtual sources with symmetry sigmoid regularization function[J].Chinese Journal of Electronics,2018,27(4):835-844.
[18] 李娟,李军锋,颜永红.波场合成中声像感知距离重建[J].声学学报,2013,38(06):743-748. LI Juan,LI Jun-feng,YAN Yong-hong.Synthesis of perceived distance based on wave field synthesis[J].Acta Acustica,2013,38(06):743-748.(in Chinese)
[19] 李娟,付强,颜永红.波场合成与波场分析的有源房间补偿方法[J].声学学报,2014,39(1):137-144. LI Juan,FU Qiang,YAN Yong-hong.Active listening room compensation based on wave field synthesis and wave field analysis[J].Acta Acustica,2014,39(1):137-144.(in Chinese)
[20] Kleijn W B,Allen A,Skoglund J,et al.Incoherent idempotent ambisonics rendering[A].IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)[C].New Paltz,NY,USA:IEEE,2017.209-213.
[21] Kleijn W B.Directional emphasis in ambisonics[J].IEEE Signal Processing Letters,2018,25(7):1079-1083.
[22] Okamoto T.2.5D higher order ambisonics for a sound field described by angular spectrum coefficients[A].IEEE International Conference on Acoustics,Speech and Signal Processing (ICASSP)[C].Shanghai,China:IEEE,2016.326-330.
[23] Winter F,Schultz F,Firtha G,et al.A geometric model for prediction of spatial aliasing in 2.5D sound field synthesis[J].IEEE/ACM Transactions on Audio,Speech,and Language Processing,2019,27(6):1031-1046.
[24] Kentgens M,Jax P.Space warping based dimensionality reduction of higher order ambisonics signals[A].IEEE International Conference on Acoustics,Speech and Signal Processing (ICASSP)[C].Brighton,United Kingdom,United Kingdom:IEEE,2019.131-135.
[25] Mai H,Xie B,Jiang J.Influence of the number of loudspeakers on the timbre in mixed-order ambisonics reprodution[A].IEEE International Conference on Acoustics,Speech and Signal Processing (ICASSP)[C].Calgary,AB,Canada:IEEE,2018.556-560.
[26] Hold C,Gamper H,Pulkki V,et al.Improving binaural ambisonics decoding by spherical harmonics domain tape-ring and coloration compensation[A].IEEE International Conference on Acoustics,Speech and Signal Processing (ICASSP)[C].Brighton,United Kingdom:IEEE,2019.261-265.
[27] Menzies D,Fazi F M.Ambisonic decoding for compensated amplitude panning[J].IEEE Signal Processing Letters,2019,26(3):470-474.
[28] Okamoto T,Cui Z L,Iwaya Y,et al.Implementation of a high-definition 3D audio-visual display based on higher-order ambisonics using a 157-loudspeaker array combined with a 3D projection display[A].The 2nd IEEE International Conference on Network Infrastructure and Digital Content[C].Beijing,China:IEEE,2010.179-183.
[29] André C R,Corteel E,Embrechts J J,et al.Subjective evaluation of the audiovisual spatial congruence in the case of stereoscopic-3D video and wave field synthesis[J].International Journal of Human-Computer Studies,2014,72(1):23-32.
[30] Yen J.On nonuniform sampling of bandwidth-limited signals[J].IRE Transactions on Circuit Theory,1956,3(4):251-257.
[31] Margolis E,Eldar Y C.Nonuniform sampling of periodic bandlimited signals[J].IEEE Transactions on Signal Processing,2008,56(7):2728-2745.
[32] Ahrens J.Analytic Methods of Sound Field Synthesis[M].Berlin,Germany:Springer,2012.
[33] Earl G Williams.Fourier Acoustics[M].USA:Academic Press,1999.183-234.
[34] Horn R A,Johnson C R.Matrix Analysis[M].Cambridge,United Kingdom:Cambridge University Press,2012.
[35] Rabenstein R,Spors S.Spatial aliasing artifacts produced by linear and circular loudspeaker arrays used for wave field synthesis[A].Proceedings of the Audio Engineering Society[C].USA:AES,2006.No.6711.