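Speech and Audio Signal Processing Laboratory, Faculty of Information Technology, Beijing University of Technology, Beijing 100124, China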
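ZHOU Jing, male, born in Guang'an, Sichuan, in 1993. He is currently a Ph.D. candidate at Beijing University of Technology. His main research interest is speech enhancement. E-mail: zhoujing@emails.bjut.edu.cn
BAO Chang-chun (corresponding author), male, born in Chifeng, Inner Mongolia, in 1965. He is a Fellow of the Chinese Institute of Electronics and an IEEE Senior Member. He is currently a professor and doctoral supervisor at Beijing University of Technology. His main research interests are speech coding and speech enhancement. E-mail: chchbao@bjut.edu.cn
ZHANG Xu, female, born in Ulanqab, Inner Mongolia, in 1995. She is currently a Ph.D. candidate at Beijing University of Technology. Her main research interest is speech separation. E-mail: zhangxu@emails.bjut.edu.cn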
Received: 2022-03-01
Revised: 2022-10-03
Published in print: 2023-01-25
ZHOU Jing, BAO Chang-chun, ZHANG Xu. Suppression Method of the Interference Sound Sources by Estimated Steering Vector Based on the Focusing Signal Subspace[J]. ACTA ELECTRONICA SINICA, 2023, 51(01): 76-85. DOI: 10.12263/DZXB.20220210.
To address the sensitivity of the minimum variance distortionless response (MVDR) beamformer to steering vector mismatch, this paper proposes an effective method for suppressing interfering sound sources. First, the frequency band of the speech signal is divided into multiple sub-bands, the direction of arrival (DOA) of the sound sources in each sub-band is estimated by the focusing signal subspace method, and the initial DOA of each sound source is estimated from a statistical histogram. Second, to reduce steering vector mismatch, the spatial sparsity of the sound sources is exploited: a cost function for estimating the steering vector of the target sound source is constructed from the Capon power, which constrains the target steering vector away from the space of the interfering sound sources. Finally, the interference-plus-noise covariance matrix is estimated from the estimated steering vector to obtain the weights of the MVDR beamformer. Experimental results on the TIMIT corpus show that the proposed method outperforms the reference methods in output signal-to-interference-plus-noise ratio (SINR) and perceptual evaluation of speech quality (PESQ), and is more robust to steering vector mismatch.
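The two standard quantities the abstract builds on, the Capon power and the MVDR weights, can be illustrated with a minimal sketch. This is not the paper's steering vector estimation procedure: it uses the textbook formulas P = 1 / (a^H R^{-1} a) and w = R^{-1} a / (a^H R^{-1} a) for a single frequency bin. The uniform linear array geometry, the function names, and all numeric values (microphone count, spacing, frequency, angles, powers) are assumptions chosen only for illustration.

```python
import numpy as np


def steering_vector(theta_deg, n_mics, spacing, freq, c=343.0):
    """Far-field steering vector of a uniform linear array (assumed geometry)."""
    theta = np.deg2rad(theta_deg)
    # Relative propagation delay at each microphone, in seconds.
    delays = np.arange(n_mics) * spacing * np.cos(theta) / c
    return np.exp(-2j * np.pi * freq * delays)


def capon_power(R, a):
    """Capon spatial power 1 / (a^H R^{-1} a) for covariance R and steering vector a."""
    return 1.0 / np.real(a.conj() @ np.linalg.solve(R, a))


def mvdr_weights(R_in, a):
    """MVDR weights w = R_in^{-1} a / (a^H R_in^{-1} a)."""
    Ri_a = np.linalg.solve(R_in, a)
    return Ri_a / (a.conj() @ Ri_a)


# Illustrative scenario (all values are assumptions, not taken from the paper).
n_mics, spacing, freq = 6, 0.04, 2000.0
a_target = steering_vector(90.0, n_mics, spacing, freq)
a_interf = steering_vector(40.0, n_mics, spacing, freq)

# Interference-plus-noise covariance: one interferer plus white sensor noise.
R_in = 10.0 * np.outer(a_interf, a_interf.conj()) + 0.01 * np.eye(n_mics)

w = mvdr_weights(R_in, a_target)
print("response toward target:    ", abs(np.vdot(w, a_target)))
print("response toward interferer:", abs(np.vdot(w, a_interf)))
print("Capon power at interferer: ", capon_power(R_in, a_interf))
```

With an accurate target steering vector the beamformer keeps unit gain toward the target while placing a deep null on the interferer. If `a_target` is mismatched, the distortionless constraint is applied in the wrong direction, which is the sensitivity the proposed method is designed to reduce.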