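1. College of Computer Science and Technology, Huaqiao University, Xiamen, Fujian 361021, China
2. Xiamen Key Laboratory of Data Security and Blockchain Technology, Xiamen, Fujian 361021, China
3. Fujian Key Laboratory of Big Data Intelligence and Security, Xiamen, Fujian 361021, China
4. College of Mechanical Engineering and Automation, Huaqiao University, Xiamen, Fujian 361021, China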
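TIAN Hui: male, born October 1982 in Chibi, Hubei. Ph.D., professor, and doctoral supervisor. His main research areas include network and information security, data security, artificial intelligence security, information hiding and its detection, and digital forensics. E-mail: htian@hqu.edu.cn
YAN Yan: female, born February 1997 in Ganzhou, Jiangxi. Master's student at the College of Computer Science and Technology, Huaqiao University. Her main research interests are information hiding and its detection, and deep learning.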
Received: 2021-11-01
Revised: 2022-09-02
Published in print: 2023-01-25
TIAN Hui,YAN Yan,TANG Li-li,et al.Speech Steganography Based on Dynamic Search of Fractional Pitch Delay[J].ACTA ELECTRONICA SINICA,2023,51(01):67-75. DOI: 10.12263/DZXB.20211473.
In this paper, we present a speech steganography algorithm based on dynamic search of fractional pitch delay. According to the required hiding capacity (x bits per subframe), the algorithm divides the set of candidate fractional pitch delay values into 2^x subsets, each of which represents a distinct x-bit message. During the closed-loop pitch search, the algorithm selects for each subframe the fractional pitch delay candidate that both represents the secret bits to be embedded and maximizes the interpolated normalized correlation coefficient. In this way, the impact of the steganographic operation on the original carrier is effectively reduced. Taking the adaptive multi-rate (AMR) speech codec, which is widely used in current Voice-over-IP systems, as an example, we evaluate the algorithm in terms of hiding capacity, imperceptibility, and resistance to detection, and compare it with related work. Experimental results show that, compared with existing steganographic methods based on pitch delay, the proposed algorithm achieves better steganographic security (that is, better resistance to detection and better imperceptibility) while maintaining relatively high steganographic capacity.
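The selection rule described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The abstract does not say how the candidate set is partitioned, so the round-robin split below is an assumption, as are the function names, the integer encoding of candidate delays, and the toy correlation function that stands in for the codec's interpolated normalized correlation.

```python
from typing import Callable, List, Sequence


def partition_candidates(candidates: Sequence[int], x: int) -> List[List[int]]:
    """Split the candidate delays into 2**x subsets; subset k stands for the x-bit value k.

    Assumption: round-robin assignment by position. The paper's actual
    partitioning rule is not given in the abstract.
    """
    if x < 1:
        raise ValueError("x must be at least 1")
    n_subsets = 1 << x
    if len(candidates) < n_subsets:
        raise ValueError("need at least 2**x candidates")
    subsets: List[List[int]] = [[] for _ in range(n_subsets)]
    for position, delay in enumerate(candidates):
        subsets[position % n_subsets].append(delay)
    return subsets


def embed_bits(candidates: Sequence[int], x: int, secret_bits: str,
               correlation: Callable[[int], float]) -> int:
    """Return the candidate that encodes secret_bits and has the highest correlation."""
    if len(secret_bits) != x:
        raise ValueError("secret_bits must contain exactly x bits")
    subsets = partition_candidates(candidates, x)
    allowed = subsets[int(secret_bits, 2)]
    return max(allowed, key=correlation)


def extract_bits(candidates: Sequence[int], x: int, chosen_delay: int) -> str:
    """Recover the x bits from the subset that contains the received delay."""
    subsets = partition_candidates(candidates, x)
    for label, subset in enumerate(subsets):
        if chosen_delay in subset:
            return format(label, "0{}b".format(x))
    raise ValueError("delay is not in the candidate set")


if __name__ == "__main__":
    # Hypothetical candidate delays, encoded as integer indices.
    demo_candidates = list(range(8))

    # Toy stand-in for the interpolated normalized correlation (peaks at index 5).
    def toy_correlation(delay: int) -> float:
        return -abs(delay - 5)

    chosen = embed_bits(demo_candidates, 2, "10", toy_correlation)
    print(chosen, extract_bits(demo_candidates, 2, chosen))
```

In this sketch the encoder gives up the globally best candidate only when it falls outside the subset selected by the secret bits, which reflects the abstract's argument for why the distortion introduced by embedding stays small. The decoder needs only the shared partition, not the correlation values.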