

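1. College of Computer and Communication Engineering, Zhengzhou University of Light Industry, Zhengzhou, Henan 450000
2. School of Computer Science and Artificial Intelligence, Jiangnan University, Wuxi, Jiangsu 214000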
Received: 16 June 2021
Revised: 12 February 2022
Published: 25 April 2022
YU Jun, HUANG Wei, ZHANG Xiao-bo, et al. Multimodal Fusion Hash Learning Method Based on Relaxed Hadamard Matrix[J]. ACTA ELECTRONICA SINICA, 2022, 50(04): 909-920. DOI: 10.12263/DZXB.20210760.
Hashing is an effective data representation technique that plays an important role in coping with the explosive growth of multimedia data. Owing to its low storage cost and high efficiency, it has attracted increasing attention in multimedia retrieval, and multimodal hashing methods have been studied extensively for this task. However, most existing methods preserve the structure of the original data by reconstructing pairwise similarities from the inner products of the hash codes, which leads to a complex optimization problem. In addition, some models lack discriminative power, which limits further gains in retrieval performance. To overcome these problems, this paper proposes a new multimodal fusion hashing method. Under the supervision of class labels, a Hadamard matrix is used to generate target codes for the data; relaxing the strict binary constraints enlarges the margin between classes, while graph embedding promotes compactness within each class. The proposed method thus gives the model strong discriminative ability and also simplifies the optimization. Experiments on three public datasets show that the method is effective for multimedia data retrieval, improving average performance by 8.47% over the best competing method.
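The target-code step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names (`sylvester_hadamard`, `class_target_codes`, `relax_targets`), the choice to skip the all-ones first row, and the particular form of relaxation (moving each entry away from zero by a non-negative slack) are assumptions made purely for illustration. The sketch shows only how rows of a Hadamard matrix can serve as class-wise target codes and how a sign-preserving relaxation keeps those codes recoverable.

```python
import numpy as np


def sylvester_hadamard(order: int) -> np.ndarray:
    """Build an order x order Hadamard matrix (order must be a power of two)."""
    if order < 1 or order & (order - 1):
        raise ValueError("order must be a power of two")
    h = np.array([[1]], dtype=np.int64)
    while h.shape[0] < order:
        # Sylvester doubling step: [[H, H], [H, -H]]
        h = np.block([[h, h], [h, -h]])
    return h


def class_target_codes(num_classes: int, code_length: int) -> np.ndarray:
    """Assign one +/-1 Hadamard row to each class.

    Assumption: the all-ones first row is skipped, so at most
    code_length - 1 classes can be encoded.
    """
    if num_classes > code_length - 1:
        raise ValueError("code_length too short for this many classes")
    return sylvester_hadamard(code_length)[1:num_classes + 1]


def relax_targets(codes: np.ndarray, slack) -> np.ndarray:
    """Hypothetical relaxation: push each +/-1 entry away from zero.

    The slack must be non-negative, so the sign of every entry
    (and hence the binary code) is preserved.
    """
    slack = np.asarray(slack, dtype=float)
    if np.any(slack < 0):
        raise ValueError("slack must be non-negative")
    return codes + codes * slack


# Example: 3 classes, 8-bit codes, 4 labelled samples.
labels = np.array([0, 2, 1, 2])
codes = class_target_codes(num_classes=3, code_length=8)
targets = codes[labels]                    # one target code per sample
relaxed = relax_targets(targets, slack=0.5)
```

Because the rows of a Hadamard matrix are mutually orthogonal, any two distinct class codes of length K differ in exactly K/2 bits, which is what makes them attractive as well-separated targets for discriminative hashing.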