

浏览全部资源
扫码关注微信
1. 信息物理社会可信服务计算教育部重点实验室, 重庆大学,重庆,400030
2. 重庆大学计算机学院,重庆,400030
3. 重庆电子工程职业学院,重庆,401331
4. 昆士兰大学信息技术与电子工程学院,布里斯班,澳大利亚,4072
5. 信息物理社会可信服务计算教育部重点实验室 重庆大学,重庆,400030
6. 重庆大学计算机学院,重庆,400030
7. 重庆电子工程职业学院,重庆,401331
8. 昆士兰大学信息技术与电子工程学院,布里斯班,澳大利亚,4072
Published Online:25 September 2018,
Published:2018
移动端阅览
ZHONG Jiang, ZHANG Shu-fang, GUO Wei-li, et al. TFLA: A Quality Analysis Framework for User Generated Contents[J]. Acta Electronica Sinica, 2018, 46(9): 2201-2206.
ZHONG Jiang, ZHANG Shu-fang, GUO Wei-li, et al. TFLA: A Quality Analysis Framework for User Generated Contents[J]. Acta Electronica Sinica, 2018, 46(9): 2201-2206. DOI: 10.3969/j.issn.0372-2112.2018.09.022.
本文设计了一种用户生成文本的质量分析框架.首先,基于主题分析构建商品类别主题特征集合.其次,利用主题特征与商品分类的强关联关系,构建形式化概念分析的形式背景,将分类-主题概念格化简并生成主题特征格,以此构建五个质量特征并生成质量评估模型.最后,在真实评论数据上的实验结果表明新方法具有更高预测精度.
In this paper
we design a topic-features lattices analysis (TFLA) framework based on objectivity quality dimensions. Firstly
we apply the latent Dirichlet allocation (LDA) approach to get latent topics as topic-features for each goods categories. Secondly
we construct formal background based on the strong relationship between goods categories and topic-features. So we could get generalization and instantiation relationship among the topic-features through formal concept analysis (FCA). We employ domain knowledge and relationships among topic-features to define five objective quality features. Also
we use machine learning methods to build quality evaluation models based on these quality features. Experiment results on actual comment data sets show that our new quality models' prediction results are in agreement with the artificial quality tags in most cases. The best performances could get that the mean absolute error (MAE) is 0.7 and F-measure is 0.5
which is significantly better than the conventional quality prediction model based on support vector machine (SVM) classification.
0
Views
415
下载量
2
CSCD
Publicity Resources
Related Articles
Related Author
Related Institution
京公网安备11010802024621