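1. School of Computer Science and Information Engineering, Hefei University of Technology, Hefei, Anhui 230601, China
2. Key Laboratory of Knowledge Engineering with Big Data (Hefei University of Technology), Ministry of Education, Hefei, Anhui 230601, China
3. Intelligent Interconnected Systems Laboratory of Anhui Province (Hefei University of Technology), Hefei, Anhui 230009, China
4. Anhui Province Key Laboratory of Industry Safety and Emergency Technology (Hefei University of Technology), Hefei, Anhui 230601, China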
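SU Zhao-pin, female, was born in Heze, Shandong Province, in August 1983. She is an associate professor and master's supervisor at the School of Computer Science and Information Engineering, Hefei University of Technology. She has received one Anhui Provincial Natural Science Award and has published more than 40 academic papers in China and abroad. Chinese Institute of Electronics member number: E190027825M. E-mail: szp@hfut.edu.cn
ZHANG Ling, female, was born in Wuwei, Gansu Province, in April 1995. She is a master's student whose main research interests are audio steganography and steganalysis. E-mail: 1772950753@qq.com
ZHANG Guo-fu (corresponding author), male, was born in Hefei, Anhui Province, in March 1979. He is a professor and master's supervisor at the School of Computer Science and Information Engineering, Hefei University of Technology. His main research interests are coalitional games, evolutionary computation, and audio security.
YUE Feng, male, was born in Hefei, Anhui Province, in February 1981. He is an associate research fellow and master's supervisor at the School of Computer Science and Information Engineering, Hefei University of Technology. His main research interests are software engineering and information security. E-mail: yuefeng@hfut.edu.cn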
Received: 2022-05-14
Revised: 2022-06-24
Published in print: 2023-05-25
SU Zhao-pin, ZHANG Ling, ZHANG Guo-fu, et al. A Speech Steganalysis Algorithm Based on Multi-Feature Fusion and BiLSTM[J]. Acta Electronica Sinica, 2023, 51(05): 1300-1309. DOI: 10.12263/DZXB.20220553.
Existing work on speech steganography for the internet low bit rate codec (iLBC) concentrates on a single stage, namely line spectral frequency (LSF) coefficient vector quantization, codebook search vector quantization, or gain quantization, and therefore copes poorly with joint steganalysis across multiple stages. To address this, an iLBC speech steganalysis algorithm based on multi-feature fusion and a bi-directional long short-term memory (BiLSTM) network is proposed. Specifically, the impact of steganography on iLBC parameters is first analyzed in the LSF coefficient vector quantization process, the dynamic codebook search process, and the gain quantization process. Multiple steganographic features are then extracted from these three stages and fed to three separate BiLSTM-based detection models, one per stage. Finally, a fusion strategy merges the detection results of the individual models into the final decision. Experimental results show that the proposed algorithm achieves multi-stage joint steganalysis and maintains good detection performance even when the speech duration is short, with an average detection accuracy above 90%.
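The final step, merging the outputs of the three stage-specific detectors, can be illustrated with a minimal sketch. The abstract does not state which fusion rule is used, so the averaging rule, the 0.5 threshold, the stage names in `STAGES`, and the function `fuse_scores` below are all assumptions made for illustration, not the authors' method. Each detector is assumed to output the probability that an utterance contains hidden data.

```python
from statistics import mean

# Hypothetical stage names, one detector per iLBC quantization stage.
STAGES = ("lsf", "codebook", "gain")


def fuse_scores(stage_probs, threshold=0.5):
    """Average per-stage stego probabilities and threshold the result.

    stage_probs maps each stage name to the probability (0..1) that the
    corresponding detector assigns to the "stego" class. Plain averaging
    is an assumed fusion rule, not the one described in the paper.
    """
    missing = [s for s in STAGES if s not in stage_probs]
    if missing:
        raise ValueError(f"missing stage scores: {missing}")
    fused = mean(stage_probs[s] for s in STAGES)
    # Return the fused probability and the binary stego/cover decision.
    return fused, fused >= threshold


if __name__ == "__main__":
    # Example: two detectors flag the utterance, one does not.
    scores = {"lsf": 0.9, "codebook": 0.2, "gain": 0.7}
    fused, is_stego = fuse_scores(scores)
    print(f"fused probability = {fused:.2f}, stego = {is_stego}")
```

Averaging lets a strong response from one stage outweigh weak responses from the others, which suits the multi-stage setting where hidden data may be embedded in only some stages; a weighted average or a learned fusion layer would be equally plausible alternatives.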