National Natural Science Foundation of China (No.61762078, No.61363058, No.61663004);Supproted by Guangxi Key Laboratory Foundation for Multi-source Information Mining and Security (No.MIMS18-08);Research Project of Guangxi Key Laboratory of Trusted Software (No.KX201705)
Combining Coupled Distance Discrimination and Strong Classifica-tion Features for Short Text Similarity Calculation[J]. Acta Electronica Sinica, 2019, 47(6): 1331-1336.
DOI:
Combining Coupled Distance Discrimination and Strong Classifica-tion Features for Short Text Similarity Calculation[J]. Acta Electronica Sinica, 2019, 47(6): 1331-1336. DOI: 10.3969/j.issn.0372-2112.2019.06.021.
Combining Coupled Distance Discrimination and Strong Classifica-tion Features for Short Text Similarity Calculation
Text similarity measures play a vital role in text related applications in tasks such as social networks
text mining
natural language processing
and others.The typical characteristics of short texts demonstrate severe sparseness and high dimension while the traditional short texts similarity calculation always ignores category information.A coupled distance discrimination and strong classification features based approach for short text similarity calculation
CDDCF
is presented.On the one hand
co-occurrence distance between terms are considered in each text to determine the co-occurrence distance correlation
based on which the weight for each term can be determined and the intra and inter relations between words are established.The similarity of coupling distance discrimination on short text can be captured.On the other hand
strong classification features are extracted via labeled texts.The similarity between two short texts is measured by using the common number of strong discrimination features with the same context.Finally
the distance discrimination and strong classification features are unified into a joint framework to measure the similarity of short texts.Experimental results show that CDDCF performs better compared to baseline algorithms in term of its performance and efficiency of similarity computation.
Pseudo-Relevance Feedback Query Expansion Based on the Fusion of Association Pattern Mining and Word Embedding Learning
Vietnamese-English Cross Language Query Post-Translation Expansion Based on All-Weighted Positive and Negative Association Patterns Mining
Cross Language Query Expansion Based on Item Weight Sorting Mining
Research on Dynamic Continuous Emotional Recognition of EEG Based on Improved TCNN Algorithm
Chinese Long Text Summarization with Guided Attention
Related Author
HUANG Ming-xuan
HUANG Ming-xuan
JIANG Cao-qing
HUANG Ming-xuan
JIANG Cao-qing
JIE Li-lin
LIU Yong
WANG Ming-xun
Related Institution
Guangxi Key Laboratory of Cross-border E-commerce Intelligent Information Processing, Guangxi University of Finance and Economics
School of Information and Statistics, Guangxi University of Finance and Economics
Guangxi Key Laboratory Cultivation Base of Cross-border E-commerce Intelligent Information Processing, Guangxi University of Finance and Economics
School of Information and Statistics, Guangxi University of Finance and Economics
Guangxi Key Laboratory Cultivation Base of Cross-border E-commerce Intelligent Information Processing Guangxi University of Finance and Economics Nanning Guangxi China