1.西安电子科技大学人工智能学院, 陕西西安 710071
2.鹏城实验室,广东深圳 518055
3.西安电子科技大学生命科学技术学院, 陕西西安 710071
[ "石光明 男, 1965年06月出生于江西省南昌市. 长江学者特聘教授, IEEE Fellow, IET Fellow. 现任西安电子科技大学人工智能学院教授. 主要研究方向为人工智能、语义通信.E-mail: gmshi@xidian.edu.cn" ]
[ "高大化 男, 1979年08月出生于河南省开封市. 现任西安电子科技大学人工智能学院教授. 主要研究方向为智能信息处理、智能感知. E-mail: dhgao@xidian.edu.cn" ]
[ "杨旻曦 男, 1996年10月出生于四川省成都市. 现为西安电子科技大学人工智能学院博士研究生. 主要研究方向为表征学习、语义通信.E-mail: mxyang@stu.xidian.edu.cn" ]
[ "谢雪梅 女, 1967年01月出生于陕西省西安市. 现任西安电子科技大学人工智能学院教授. 主要研究方向为场景理解与视频分析、多模态融合." ]
[ "董明皓 男, 1984年05月出生于陕西省西安市. 现为西安电子科技大学分子与神经影像教育部工程研究中心副教授. 主要研究方向为脑机混合智能、人体效能增强.E-mail: dminghao@xidian.edu.cn" ]
[ "李雷达 男, 1982年10月出生于江苏省徐州市. 现任西安电子科技大学人工智能学院教授. 主要研究方向为图像感知质量评价.E-mail: ldli@xidian.edu.cn" ]
[ "于 凯 男,1996年11月出生于山东省日照市莒县. 2018至2021年于西安电子科技大学人工智能学院攻读硕士学位. 主要研究方向为图像识别、增量学习.E-mail: yukai_nathan@163.com" ]
收稿:2021-02-09,
修回:2022-02-02,
纸质出版:2022-09-25
移动端阅览
石光明,高大化,杨旻曦等.信号的语义刻画与度量[J].电子学报,2022,50(09):2068-2078.
SHI Guang-ming,GAO Da-hua,YANG Min-xi,et al.Semantic Characterization and Measurement of Signals[J].ACTA ELECTRONICA SINICA,2022,50(09):2068-2078.
石光明,高大化,杨旻曦等.信号的语义刻画与度量[J].电子学报,2022,50(09):2068-2078. DOI: 10.12263/DZXB.20210238.
SHI Guang-ming,GAO Da-hua,YANG Min-xi,et al.Semantic Characterization and Measurement of Signals[J].ACTA ELECTRONICA SINICA,2022,50(09):2068-2078. DOI: 10.12263/DZXB.20210238.
相比于基于比特数据的信息处理及通信技术,人类通过语义处理和传递信息的方式,在面对智能体间传递处理海量信息这一问题时显得更为高效和自然.然而由于目前缺乏关于语义度量和刻画的数学描述,涉及语义的应用无法兼顾可解释性和泛化性,无法发挥语义的高效自然的优势.本文围绕语义的度量和刻画,首先依据信息科学和神经科学相关结论,讨论了语义的内涵,并指出语义具有模块化、多模态、层级化的特点;接着提出了一种多模态信号的语义刻画和度量的数学描述;然后为了验证所提信号语义的刻画和度量的可行性和有效性,在MNIST(Mixed National Institute of Standards and Technology database)手写数字识别和水声目标识别两个应用中进行了实验,获得比传统深度学习更好的性能;最后将语义用于视频编码,实现了远超传统方法的压缩比,展现了语义在通信领域的实用价值.这为未来建立以语义为基础的新型信息处理与通信技术奠定了理论和实践基础.
Compared with the modern information processing and communication technology based on bits
the semantic-based way of processing and transmitting information used by humans is more efficient and natural in the face of the massive information that needs to be sent between agents. However
due to the lack of mathematical description of semantic measurement and characterization
semantics' applications cannot consider both interpretability and generalization. Therefore
they cannot give play to the advantages of efficient and natural semantics. This paper focuses on the measurement and characterization of semantics. Firstly
we discuss the connotation of semantics based on the relevant conclusions of information science and neuroscience
and concludes that semantics has the characteristics of modularity
multi-mode
and hierarchy. Then
a semantic description of multimodal signals and a mathematical description of their measurement are proposed. Next
to verify the feasibility and effectiveness of the characterization and measurement of the proposed signal semantics
experiments are carried out in two applications: MNIST (Mixed National Institute of Standards and Technology database) handwritten digital recognition and underwater acoustic target recognition
and the results are better than those of the traditional deep learning. Finally
the semantics is used for video coding
and a compression ratio far exceeding that of traditional methods is achieved. This lays a theoretical and practical foundation for establishing new information processing and communication technology based on semantics.
KIFER M , LAUSEN G . F-logic: a higher-order language for reasoning about objects, inheritance, and scheme [J]. ACM SIGMOD Record , 1989 , 18 ( 2 ): 134 - 146 .
BAADER F . The Description Logic Handbook: Theory, Implementation, and Applications [M]. Cambridge, UK : Cambridge University Press , 2003
BOLLACKER K , COOK R , TUFTS P . Freebase: a shared database of structured general human knowledge [C]// AAAI'07: Proceedings of the 22nd national conference on Artificial intelligence - Volume 2 . Menlo Park, CA : AAAI Press , 2007 : 1962 - 1963 .
LI Z Y , DING X , LIU T . Constructing narrative event evolutionary graph for script event prediction [C]// IJCAI'18: Proceedings of the 27th International Joint Conference on Artificial Intelligence . Menlo Park, CA : AAAI Press , 2018 : 4201 - 4207 .
KIPF T N , WELLING M . Semi-supervised classification with graph convolutional networks [EB/OL]. ( 2016-09-09 ). https://arxiv.org/abs/1609.02907 https://arxiv.org/abs/1609.02907 .
VELIČKOVIĆ P , CUCURULL G , CASANOVA A , et al . Graph attention networks [EB/OL]. ( 2017-10-30 ). https://arxiv.org/abs/1710.10903 https://arxiv.org/abs/1710.10903 .
LAFFERTY J D , MCCALLUM A , PEREIRA F C N . Conditional random fields: Probabilistic models for segmenting and labeling sequence data [C]// ICML'01: Proceedings of the Eighteenth International Conference on Machine Learning . San Francisco : Morgan Kaufmann Publishers Inc , 2001 : 282 - 289 .
TERENT'YEV A N , BIDYUK P I . Method of probabilistic inference from learning data in Bayesian networks [J]. Cybernetics and Systems Analysis , 2007 , 43 ( 3 ): 391 - 396 .
MIKOLOV T , KARAFIÁT M , BURGET L , et al . Recurrent neural network based language model [C]// Interspeech 2010 . Chiba : ISCA , 2010 : 1045 - 1048 .
HOCHREITER S , SCHMIDHUBER J . Long short-term memory [J]. Neural Computation , 1997 , 9 ( 8 ): 1735 - 1780 .
GOLDBERG Y , LEVY O . word2vec Explained: deriving Mikolov et al.' s negative-sampling word-embedding method[EB/OL].( 2014-02-15 ). http://arxiv.org/abs/1402.3722 http://arxiv.org/abs/1402.3722 .
VASWANI A , SHAZEER N , PARMAR N , et al . Attention is all You need [C]// NIPS'17: Proceedings of the 31st International Conference on Neural Information Processing Systems . Red Hook, NY : Curran Associates Inc , 2017 : 6000 - 6010 .
DEVLIN J , CHANG M-W , LEE K , et al . BERT: Pre-training of deep bidirectional transformers for language understanding [EB/OL]. ( 2018-10-11 ). https://arxiv.org/abs/1810.04805v2 https://arxiv.org/abs/1810.04805v2 .
LOWE D G . Object recognition from local scale-invariant features [C]// Proceedings of the Seventh IEEE International Conference on Computer Vision . Piscataway : IEEE , 1999 : 1150 - 1157 .
FELZENSZWALB P F , GIRSHICK R B , MCALLESTER D , et al . Object detection with discriminatively trained part-based models [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence , 2010 , 32 ( 9 ): 1627 - 1645 .
KOH P W , NGUYEN T , TANG Y S , et al . Concept bottleneck models [EB/OL]. ( 2020-07-07 ). https://arxiv.org/abs/2007.04612 https://arxiv.org/abs/2007.04612
LU C W , KRISHNA R , BERNSTEIN M , et al . Visual relationship detection with language priors [C]// Computer Vision–ECCV 2016 . Cham : Springer International Publishing , 2016 : 852 - 869 .
WANG X L , YE Y F , GUPTA A . Zero-shot recognition via semantic embeddings and knowledge graphs [C]// 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway : IEEE , 2018 : 6857 - 6866 .
CARON M , BOJANOWSKI P , JOULIN A , et al . Deep Clustering for Unsupervised Learning of Visual Features [C]// Computer Vision–ECCV 2018 . Cham : Springer International Publishing , 2018 : 139 - 156 .
LI O , LIU H , CHEN C , et al . Deep learning for case-based reasoning through prototypes: a neural network that explains its predictions [C] // Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence . Menlo Park, CA : AAAI Press , 2018 : 3530 - 3537 .
GIRSHICK R , DONAHUE J , DARRELL T , et al . Rich feature hierarchies for accurate object detection and semantic segmentation [C]// 2014 IEEE Conference on Computer Vision and Pattern Recognition . Piscataway : IEEE , 2014 : 580 - 587 .
CHUTE C G . Classification and retrieval of patient records using natural language: An experimental application of latent semantic analysis [C]// Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society Volume 13 . Piscataway : IEEE , 1991 : 1162 - 1163 .
TURNEY P D , PANTEL P . From frequency to meaning: Vector space models of semantics [J]. Journal of Artificial Intelligence Research , 2010 , 37 : 141 - 188 .
MATSUNO K . Semantic commitments as a mode of non-programmable computation in the brain [J]. Bio Systems , 1992 , 27 ( 4 ): 235 - 239 .
SHANNON C E . A mathematical theory of communication [J]. The Bell System Technical Journal , 1948 , 27 ( 3 ): 379 - 423 .
SHANNON C E , WEAVER W . The Mathematical Theory of Communication [M]. Urbana : University of Illinois Press , 1949
GÜLER B , YENER A , SWAMI A . The semantic communication game [J]. IEEE Transactions on Cognitive Communications and Networking , 2018 , 4 ( 4 ): 787 - 802 .
BAO J , BASU P , DEAN M K , et al . Towards a theory of semantic communication [C]// 2011 IEEE Network Science Workshop . Piscataway : IEEE , 2011 : 110 - 117 .
BASU P , BAO J , DEAN M , et al . Preserving quality of information by using semantic relationships [J]. Pervasive and Mobile Computing , 2014 , 11 : 188 - 202 .
WILLEMS F M J , KALKER T . Semantic compaction, transmission, and compression codes [C]// Proceedings of International Symposium on Information Theory , 2005 ISIT . Piscataway : IEEE , 2005 : 214 - 218 .
HUBEL D H , WIESEL T N . Receptive fields, binocular interaction and functional architecture in the cat's visual cortex [J]. The Journal of Physiology , 1962 , 160 ( 1 ): 106 - 154 .
HUTH A G , NISHIMOTO S , VU A T , et al . A continuous semantic space describes the representation of thousands of object and action categories across the human brain [J]. Neuron , 2012 , 76 ( 6 ): 1210 - 1224 .
HANDJARAS G , RICCIARDI E , LEO A , et al . How concepts are encoded in the human brain: A modality independent, category-based cortical organization of semantic knowledge [J]. NeuroImage , 2016 , 135 : 232 - 242 .
DEFELIPE J , MARKRAM H , ROCKLAND K S . The neocortical column [J]. Frontiers in Neuroanatomy , 2012 , 6 : 22 .
MOUNTCASTLE V B . The columnar organization of the neocortex [J]. Brain: a Journal of Neurology , 1997 , 120 ( Pt 4 ): 701 - 722 .
ROCKLAND K S . Five points on columns [J]. Frontiers in Neuroanatomy , 2010 , 4 : 22 .
BI Y C , WANG X Y , CARAMAZZA A . Object domain and modality in the ventral visual pathway [J]. Trends in Cognitive Sciences , 2016 , 20 ( 4 ): 282 - 290 .
HUTH A G , DE HEER W A , GRIFFITHS T L , et al . Natural speech reveals the semantic maps that tile human cerebral cortex [J]. Nature , 2016 , 532 ( 7600 ): 453 - 458 .
SHI G M , ZHANG Z Q , GAO D H , et al . Knowledge-guided semantic computing network [J]. Neurocomputing , 2021 , 426 : 70 - 84 .
LECUN Y , BOTTOU L , BENGIO Y , et al . Gradient-based learning applied to document recognition [J]. Proceedings of the IEEE , 1998 , 86 ( 11 ): 2278 - 2324 .
0
浏览量
9
下载量
2
CSCD
关联资源
相关文章
相关作者
相关机构
京公网安备11010802024621