

浏览全部资源
扫码关注微信
1.西安电子科技大学人工智能学院, 陕西西安 710071
2.鹏城实验室,广东深圳 518055
3.西安电子科技大学生命科学技术学院, 陕西西安 710071
Received:09 February 2021,
Revised:2022-02-02,
Published:25 September 2022
移动端阅览
石光明,高大化,杨旻曦等.信号的语义刻画与度量[J].电子学报,2022,50(09):2068-2078.
SHI Guang-ming,GAO Da-hua,YANG Min-xi,et al.Semantic Characterization and Measurement of Signals[J].ACTA ELECTRONICA SINICA,2022,50(09):2068-2078.
石光明,高大化,杨旻曦等.信号的语义刻画与度量[J].电子学报,2022,50(09):2068-2078. DOI: 10.12263/DZXB.20210238.
SHI Guang-ming,GAO Da-hua,YANG Min-xi,et al.Semantic Characterization and Measurement of Signals[J].ACTA ELECTRONICA SINICA,2022,50(09):2068-2078. DOI: 10.12263/DZXB.20210238.
相比于基于比特数据的信息处理及通信技术,人类通过语义处理和传递信息的方式,在面对智能体间传递处理海量信息这一问题时显得更为高效和自然.然而由于目前缺乏关于语义度量和刻画的数学描述,涉及语义的应用无法兼顾可解释性和泛化性,无法发挥语义的高效自然的优势.本文围绕语义的度量和刻画,首先依据信息科学和神经科学相关结论,讨论了语义的内涵,并指出语义具有模块化、多模态、层级化的特点;接着提出了一种多模态信号的语义刻画和度量的数学描述;然后为了验证所提信号语义的刻画和度量的可行性和有效性,在MNIST(Mixed National Institute of Standards and Technology database)手写数字识别和水声目标识别两个应用中进行了实验,获得比传统深度学习更好的性能;最后将语义用于视频编码,实现了远超传统方法的压缩比,展现了语义在通信领域的实用价值.这为未来建立以语义为基础的新型信息处理与通信技术奠定了理论和实践基础.
Compared with the modern information processing and communication technology based on bits
the semantic-based way of processing and transmitting information used by humans is more efficient and natural in the face of the massive information that needs to be sent between agents. However
due to the lack of mathematical description of semantic measurement and characterization
semantics' applications cannot consider both interpretability and generalization. Therefore
they cannot give play to the advantages of efficient and natural semantics. This paper focuses on the measurement and characterization of semantics. Firstly
we discuss the connotation of semantics based on the relevant conclusions of information science and neuroscience
and concludes that semantics has the characteristics of modularity
multi-mode
and hierarchy. Then
a semantic description of multimodal signals and a mathematical description of their measurement are proposed. Next
to verify the feasibility and effectiveness of the characterization and measurement of the proposed signal semantics
experiments are carried out in two applications: MNIST (Mixed National Institute of Standards and Technology database) handwritten digital recognition and underwater acoustic target recognition
and the results are better than those of the traditional deep learning. Finally
the semantics is used for video coding
and a compression ratio far exceeding that of traditional methods is achieved. This lays a theoretical and practical foundation for establishing new information processing and communication technology based on semantics.
KIFER M , LAUSEN G . F-logic: a higher-order language for reasoning about objects, inheritance, and scheme [J]. ACM SIGMOD Record , 1989 , 18 ( 2 ): 134 - 146 .
BAADER F . The Description Logic Handbook: Theory, Implementation, and Applications [M]. Cambridge, UK : Cambridge University Press , 2003
BOLLACKER K , COOK R , TUFTS P . Freebase: a shared database of structured general human knowledge [C]// AAAI'07: Proceedings of the 22nd national conference on Artificial intelligence - Volume 2 . Menlo Park, CA : AAAI Press , 2007 : 1962 - 1963 .
LI Z Y , DING X , LIU T . Constructing narrative event evolutionary graph for script event prediction [C]// IJCAI'18: Proceedings of the 27th International Joint Conference on Artificial Intelligence . Menlo Park, CA : AAAI Press , 2018 : 4201 - 4207 .
KIPF T N , WELLING M . Semi-supervised classification with graph convolutional networks [EB/OL]. ( 2016-09-09 ). https://arxiv.org/abs/1609.02907 https://arxiv.org/abs/1609.02907 .
VELIČKOVIĆ P , CUCURULL G , CASANOVA A , et al . Graph attention networks [EB/OL]. ( 2017-10-30 ). https://arxiv.org/abs/1710.10903 https://arxiv.org/abs/1710.10903 .
LAFFERTY J D , MCCALLUM A , PEREIRA F C N . Conditional random fields: Probabilistic models for segmenting and labeling sequence data [C]// ICML'01: Proceedings of the Eighteenth International Conference on Machine Learning . San Francisco : Morgan Kaufmann Publishers Inc , 2001 : 282 - 289 .
TERENT'YEV A N , BIDYUK P I . Method of probabilistic inference from learning data in Bayesian networks [J]. Cybernetics and Systems Analysis , 2007 , 43 ( 3 ): 391 - 396 .
MIKOLOV T , KARAFIÁT M , BURGET L , et al . Recurrent neural network based language model [C]// Interspeech 2010 . Chiba : ISCA , 2010 : 1045 - 1048 .
HOCHREITER S , SCHMIDHUBER J . Long short-term memory [J]. Neural Computation , 1997 , 9 ( 8 ): 1735 - 1780 .
GOLDBERG Y , LEVY O . word2vec Explained: deriving Mikolov et al.' s negative-sampling word-embedding method[EB/OL].( 2014-02-15 ). http://arxiv.org/abs/1402.3722 http://arxiv.org/abs/1402.3722 .
VASWANI A , SHAZEER N , PARMAR N , et al . Attention is all You need [C]// NIPS'17: Proceedings of the 31st International Conference on Neural Information Processing Systems . Red Hook, NY : Curran Associates Inc , 2017 : 6000 - 6010 .
DEVLIN J , CHANG M-W , LEE K , et al . BERT: Pre-training of deep bidirectional transformers for language understanding [EB/OL]. ( 2018-10-11 ). https://arxiv.org/abs/1810.04805v2 https://arxiv.org/abs/1810.04805v2 .
LOWE D G . Object recognition from local scale-invariant features [C]// Proceedings of the Seventh IEEE International Conference on Computer Vision . Piscataway : IEEE , 1999 : 1150 - 1157 .
FELZENSZWALB P F , GIRSHICK R B , MCALLESTER D , et al . Object detection with discriminatively trained part-based models [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence , 2010 , 32 ( 9 ): 1627 - 1645 .
KOH P W , NGUYEN T , TANG Y S , et al . Concept bottleneck models [EB/OL]. ( 2020-07-07 ). https://arxiv.org/abs/2007.04612 https://arxiv.org/abs/2007.04612
LU C W , KRISHNA R , BERNSTEIN M , et al . Visual relationship detection with language priors [C]// Computer Vision–ECCV 2016 . Cham : Springer International Publishing , 2016 : 852 - 869 .
WANG X L , YE Y F , GUPTA A . Zero-shot recognition via semantic embeddings and knowledge graphs [C]// 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway : IEEE , 2018 : 6857 - 6866 .
CARON M , BOJANOWSKI P , JOULIN A , et al . Deep Clustering for Unsupervised Learning of Visual Features [C]// Computer Vision–ECCV 2018 . Cham : Springer International Publishing , 2018 : 139 - 156 .
LI O , LIU H , CHEN C , et al . Deep learning for case-based reasoning through prototypes: a neural network that explains its predictions [C] // Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence . Menlo Park, CA : AAAI Press , 2018 : 3530 - 3537 .
GIRSHICK R , DONAHUE J , DARRELL T , et al . Rich feature hierarchies for accurate object detection and semantic segmentation [C]// 2014 IEEE Conference on Computer Vision and Pattern Recognition . Piscataway : IEEE , 2014 : 580 - 587 .
CHUTE C G . Classification and retrieval of patient records using natural language: An experimental application of latent semantic analysis [C]// Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society Volume 13 . Piscataway : IEEE , 1991 : 1162 - 1163 .
TURNEY P D , PANTEL P . From frequency to meaning: Vector space models of semantics [J]. Journal of Artificial Intelligence Research , 2010 , 37 : 141 - 188 .
MATSUNO K . Semantic commitments as a mode of non-programmable computation in the brain [J]. Bio Systems , 1992 , 27 ( 4 ): 235 - 239 .
SHANNON C E . A mathematical theory of communication [J]. The Bell System Technical Journal , 1948 , 27 ( 3 ): 379 - 423 .
SHANNON C E , WEAVER W . The Mathematical Theory of Communication [M]. Urbana : University of Illinois Press , 1949
GÜLER B , YENER A , SWAMI A . The semantic communication game [J]. IEEE Transactions on Cognitive Communications and Networking , 2018 , 4 ( 4 ): 787 - 802 .
BAO J , BASU P , DEAN M K , et al . Towards a theory of semantic communication [C]// 2011 IEEE Network Science Workshop . Piscataway : IEEE , 2011 : 110 - 117 .
BASU P , BAO J , DEAN M , et al . Preserving quality of information by using semantic relationships [J]. Pervasive and Mobile Computing , 2014 , 11 : 188 - 202 .
WILLEMS F M J , KALKER T . Semantic compaction, transmission, and compression codes [C]// Proceedings of International Symposium on Information Theory , 2005 ISIT . Piscataway : IEEE , 2005 : 214 - 218 .
HUBEL D H , WIESEL T N . Receptive fields, binocular interaction and functional architecture in the cat's visual cortex [J]. The Journal of Physiology , 1962 , 160 ( 1 ): 106 - 154 .
HUTH A G , NISHIMOTO S , VU A T , et al . A continuous semantic space describes the representation of thousands of object and action categories across the human brain [J]. Neuron , 2012 , 76 ( 6 ): 1210 - 1224 .
HANDJARAS G , RICCIARDI E , LEO A , et al . How concepts are encoded in the human brain: A modality independent, category-based cortical organization of semantic knowledge [J]. NeuroImage , 2016 , 135 : 232 - 242 .
DEFELIPE J , MARKRAM H , ROCKLAND K S . The neocortical column [J]. Frontiers in Neuroanatomy , 2012 , 6 : 22 .
MOUNTCASTLE V B . The columnar organization of the neocortex [J]. Brain: a Journal of Neurology , 1997 , 120 ( Pt 4 ): 701 - 722 .
ROCKLAND K S . Five points on columns [J]. Frontiers in Neuroanatomy , 2010 , 4 : 22 .
BI Y C , WANG X Y , CARAMAZZA A . Object domain and modality in the ventral visual pathway [J]. Trends in Cognitive Sciences , 2016 , 20 ( 4 ): 282 - 290 .
HUTH A G , DE HEER W A , GRIFFITHS T L , et al . Natural speech reveals the semantic maps that tile human cerebral cortex [J]. Nature , 2016 , 532 ( 7600 ): 453 - 458 .
SHI G M , ZHANG Z Q , GAO D H , et al . Knowledge-guided semantic computing network [J]. Neurocomputing , 2021 , 426 : 70 - 84 .
LECUN Y , BOTTOU L , BENGIO Y , et al . Gradient-based learning applied to document recognition [J]. Proceedings of the IEEE , 1998 , 86 ( 11 ): 2278 - 2324 .
0
Views
9
下载量
2
CSCD
Publicity Resources
Related Articles
Related Author
Related Institution
京公网安备11010802024621