纸质出版:2021
移动端阅览
张昱, 刘开峰, 张全新, 等. 基于组合-卷积神经网络的中文新闻文本分类[J]. 电子学报, 2021,49(6):1059-1067.
张昱, 刘开峰, 张全新, et al. A Combined-Convolutional Neural Network for Chinese News Text Classification[J]. Acta Electronica Sinica, 2021, 49(6): 1059-1067.
张昱, 刘开峰, 张全新, 等. 基于组合-卷积神经网络的中文新闻文本分类[J]. 电子学报, 2021,49(6):1059-1067. DOI: 10.12263/DZXB.20200134.
张昱, 刘开峰, 张全新, et al. A Combined-Convolutional Neural Network for Chinese News Text Classification[J]. Acta Electronica Sinica, 2021, 49(6): 1059-1067. DOI: 10.12263/DZXB.20200134.
目前的新闻分类研究以英文居多,而且常用的传统机器学习方法在长文本处理方面,存在局部文本块特征提取不完善的问题.为了解决中文新闻分类缺乏专门术语集的问题,采用构造数据索引的方法,制作了适合中文新闻分类的词汇表,并结合word2vec预训练词向量进行文本特征构建.为了解决特征提取不完善的问题,通过改进经典卷积神经网络模型结构,研究不同的卷积和池化操作对分类结果的影响.为提高新闻文本分类的精确率,本文提出并实现了一种组合-卷积神经网络模型,设计了有效的模型正则化和优化方法.实验结果表明,组合-卷积神经网络模型对中文新闻文本分类的精确率达到93.69%,相比最优的传统机器学习方法和经典卷积神经网络模型精确率分别提升6.34%和1.19%,并在召回率和F值两项指标上均优于对比模型.
At present
most of the researches on news classification are in English
and the traditional machine learning methods have a problem of incomplete extraction of local text block features in long text processing. In order to solve the problem of lack of special term set for Chinese news classification
a vocabulary suitable for Chinese text classification is made by constructing a data index method
and the text feature construction is combined with word2vec pre-trained word vector. In order to solve the problem of incomplete feature extraction
the effects of different convolution and pooling operations on the classification results are studied by improving the structure of classical convolution neural network model. In order to improve the precision of Chinese news text classification
this paper proposes and implements a combined-convolution neural network model
and designs an effective method of model regularization and optimization. The experimental results show that the precision of the combined-convolutional neural network model for Chinese news text classification reaches 93.69%
which is 6.34% and 1.19% higher than the best traditional machine learning method and classic convolutional neural network model
and it is better than the comparison model in recall and F-measure.
0
浏览量
4
下载量
11
CSCD
关联资源
相关文章
相关作者
相关机构
京公网安备11010802024621