电子学报 ›› 2013, Vol. 41 ›› Issue (2): 295-300.DOI: 10.3969/j.issn.0372-2112.2013.02.014
颜鑫, 李应
收稿日期:
2012-05-17
修回日期:
2012-09-28
出版日期:
2013-02-25
作者简介:
基金资助:
YAN Xin, LI Ying
Received:
2012-05-17
Revised:
2012-09-28
Online:
2013-02-25
Published:
2013-02-25
Supported by:
摘要: 针对真实环境中各种背景噪声下的鸟类声音识别问题,提出了一种基于新型抗噪特征提取的鸟类声音识别技术.首先,根据适用于高度非平稳环境下的噪声估计算法求出噪声功率谱.其次,使用多频带谱减法对声音功率谱进行降噪处理.接着,结合降噪的声音功率谱提取抗噪幂归一化倒谱系数(APNCC).最后,采用支持向量机(SVM)分别对提取的APNCC,幂归一化倒谱系数(PNCC)和Mel频率倒谱系数(MFCC)对34种鸟类声音进行不同环境和信噪比情况下的对比实验.实验表明,提取的APNCC具有较好的平均识别效果及较强的噪声鲁棒性,更适用于信噪比低于30dB环境下的鸟类声音识别.
中图分类号:
颜鑫, 李应. 利用抗噪幂归一化倒谱系数的鸟类声音识别[J]. 电子学报, 2013, 41(2): 295-300.
YAN Xin, LI Ying . Anti-Noise Power Normalized Cepstral Coefficients in Bird Sounds Recognition[J]. Acta Electronica Sinica, 2013, 41(2): 295-300.
[1] Somervuo P,Harma A.Bird song recognition based on syllable pair histograms[A].IEEE International Conference on Acoustics,Speech,and Signal Processing[C].Montreal,Canada:IEEE Press,2004:825-828. [2] Cheng J,Sun Y,Ji L.A call-independent and automatic acoustic system for the individual recognition of animals:a novel model using four passerines[J].Pattern Recognition,2010,43(11):3846-3852. [3] 冯霞,龚晓峰,张利丹,武瑞娟.基于纹理特征的背景噪声提取的应用研究[J].电子学报,2009,37(9):2092-2095. Feng Xia,Gong Xiao-feng,Zhang Li-dan,Wu Rui-juan.Research of background noise extraction based on texture feature[J].Acta Electronica Sinica,2009,37(9):2092-2095.(in Chinese) [4] Chu W,et al.Noise robust bird song detection using syllable pattern-based hidden markov models[A].IEEE International Conference on Acoustics,Speech,and Signal Processing[C].Prague,Czech Republic:IEEE Press,2011:345-348. [5] Bardeli R,Wolff D,Kurth F,et al.Detecting bird sounds in a complex acoustic environment and application to bioacoustic monitoring[J].Pattern Recognition Letters,2010,31(12):1524-1534. [6] Kim C,Stern R.Feature extraction for robust speech recognition based on maximizing the sharpness of the power distribution and on power flooring[A].IEEE International Conference on Acoustics,Speech,and Signal Processing[C].Dallas,TX:IEEE Press,2010.4574-4577. [7] Rangachari S,Loizou P C.A noise estimation algorithm for highly non-stationary environments[J].Speech Communication,2006,48(2):220-231. [8] Kamath S,et al.A multi-band spectral subtraction method for enhancing speech corrupted by colored noise [A].IEEE International Conference on Acoustics,Speech,and Signal Processing[C].Orlando,FL:IEEE Press,2002.IV-4164-IV-4164. [9] 王,钱志鸿,王雪,程光明.基于伽马通滤波器组的听觉特征提取算法研究[J].电子学报,2010,38(3):525-528. Wang Yue,Qian Zhi-hong,Wang Xue,Cheng Guang-ming.An auditory feature extraction algorithm based on γ-tone filter-banks[J].Acta Electronica Sinica,2010,38(3):525-528.(in Chinese) [10] Slaney M.Auditory toolbox version 2[CP/OL].https://engineering.purdue.edu/~malcolm/interval/1998-010/AuditoryToolbox.zip,2012-5-14. [11] Universitat Pompeu Fabra.Repository of sound under the creative commons license,Freesound.org[CP/OL].http://www.freesound.org,2012-5-14. [12] Chang C C,Lin C J.Libsvm version 3.12[CP/OL].http://www.csie.ntu.edu.tw/~cjlin/libsvm/ libsvm-3.12.zip,2012-5-14. |
[1] | 彭锦佳, 王辉兵. 基于异构卷积神经网络集成的无监督行人重识别方法[J]. 电子学报, 2023, (): 1-13. |
[2] | 郭凯红, 崔明茜, 刘婷婷. 模糊知识测度下图像脉冲噪声去除方法[J]. 电子学报, 2023, (): 1-14. |
[3] | 吕杭, 蒋明峰, 李杨, 张鞠成, 王志康. 基于混合时频域特征的卷积神经网络心律失常分类方法的研究[J]. 电子学报, 2023, 51(3): 701-711. |
[4] | 张晶, 王翌歆, 任永功. 统一全局空间表达的脑电信号跨被试情感识别[J]. 电子学报, 2023, (): 1-9. |
[5] | 但志平, 方帅领, 孙航, 李晶, 万俊. 基于双判别器异构CycleGAN框架下多阶通道注意力校准的室外图像去雾[J]. 电子学报, 2023, (): 1-14. |
[6] | 张智, 易华挥, 郑锦. 聚焦小目标的航拍图像目标检测算法[J]. 电子学报, 2023, (): 1-12. |
[7] | 姚睿, 朱享彬, 周勇, 王鹏, 张艳宁, 赵佳琦. 基于重要特征的视觉目标跟踪可迁移黑盒攻击方法[J]. 电子学报, 2023, (): 1-9. |
[8] | 孙锐, 张磊, 余益衡, 张旭东. 基于局部异构聚合图卷积网络的跨模态行人重识别[J]. 电子学报, 2023, (): 1-16. |
[9] | 张杰, 廖盛斌, 张浩峰, 陈得宝. 基于类别扩展的广义零样本图像分类方法[J]. 电子学报, 2023, (): 1-13. |
[10] | 吴晓雨, 蒲禹江, 王生进, 刘子豪. 基于语义嵌入学习的特类视频识别[J]. 电子学报, 2023, (): 1-13. |
[11] | 苏天康, 宋慧慧, 樊佳庆, 张开华. 深度信号引导学习混合变换器的高性能无监督视频目标分割[J]. 电子学报, 2023, (): 1-8. |
[12] | 林彬, 王华通, 封全喜. 基于双模型竞争机制的目标跟踪算法[J]. 电子学报, 2023, (): 1-7. |
[13] | 王子为, 鲁继文, 周杰. 基于自适应梯度优化的二值神经网络[J]. 电子学报, 2023, 51(2): 257-266. |
[14] | 唐利明, 熊点华, 方壮. 基于比尔朗伯定律的变分水平集模型[J]. 电子学报, 2023, 51(2): 416-426. |
[15] | 杨利平, 侯振威, 辜小花, 郝峻永. 弱标签声音事件检测的空间-通道特征表征与自注意池化[J]. 电子学报, 2023, 51(2): 297-306. |
阅读次数 | ||||||
全文 |
|
|||||
摘要 |
|
|||||