1. 南京大学计算机软件新技术国家重点实验室,江苏,南京,210023
2. 安徽工业大学计算机学院,安徽,马鞍山,243002
3. 安徽工程大学机电学院,安徽,芜湖,241000
4. 南京大学计算机软件新技术国家重点实验室,江苏,南京,210023
5. 安徽工业大学计算机学院,安徽,马鞍山,243002
6. 安徽工程大学机电学院,安徽,芜湖,241000
纸质出版:2014
移动端阅览
杨思春, 高超, 戴新宇, 等. 基于差异性和重要性的问句特征组合[J]. 电子学报, 2014,42(5):918-924.
YANG Si-chun, GAO Chao, DAI Xin-yu, et al. Combining Features of Question Based on Diversity and Importance[J]. Acta Electronica Sinica, 2014, 42(5): 918-924.
杨思春, 高超, 戴新宇, 等. 基于差异性和重要性的问句特征组合[J]. 电子学报, 2014,42(5):918-924. DOI: 10.3969/j.issn.0372-2112.2014.05.013.
YANG Si-chun, GAO Chao, DAI Xin-yu, et al. Combining Features of Question Based on Diversity and Importance[J]. Acta Electronica Sinica, 2014, 42(5): 918-924. DOI: 10.3969/j.issn.0372-2112.2014.05.013.
在问答系统问句分类研究中,对问句特征进行组合有助于构造高效的问句分类器.针对当前问句分类中的特征组合问题,提出一种基于差异性和重要性的特征组合 (Diversity and Importance based Feature Combination
DIFC)方法.通过计算待组合特征与当前特征组合的错分差异度和正分差异度,以及待组合特征本身的重要度,从候选特征集中动态获取优化的特征组合.在哈工大中文问句集上对词袋绑定特征进行组合的实验结果表明
与其他特征组合方法相比,DIFC方法灵活高效,准确率更高.
In research on question classification in question answering system
combining features can greatly help construct efficient question classifier.In order to deal with the problem of low performance of existing methods
a new method of diversity and importance based feature combination(DIFC) is proposed.By calculating the diversity between candidate feature and current combination for error and correct classification respectively
and the importance of candidate feature
features can be dynamically selected from candidate feature set.The experimental results of bag-of-words binding features on the HIT Chinese question set show that
compared with other methods
the new method is flexible and efficient
and gets more optimal feature combination.
0
浏览量
1903
下载量
3
CSCD
关联资源
相关文章
相关作者
相关机构
京公网安备11010802024621