1. 湖南大学计算机与通信学院,湖南,长沙,410082
2. 湖南大学电气与信息工程学院,湖南,长沙,410082
3. 湖南大学计算机与通信学院湖南长沙,410082
4. 湖南大学电气与信息工程学院湖南长沙,410082
纸质出版:2007
移动端阅览
周顺先, 林亚平, 王耀南, 等. 基于二阶隐马尔可夫模型的文本信息抽取[J]. 电子学报, 2007,35(11):2226-2231.
ZHOU Shun-xian, LIN Ya-ping, WANG Yao-nan, et al. Text Information Extraction Based on the Second-Order Hidden Markov Model[J]. Acta Electronica Sinica, 2007, 35(11): 2226-2231.
隐马尔可夫模型是文本信息抽取的重要方法之一.在一阶隐马尔可夫模型中
假设状态转移概率和观察值输出概率仅依赖于模型当前的状态
一定程度降低了信息抽取的精确度.而二阶隐马尔可夫模型合理地考虑了概率和模型历史状态的关联性
对错误信息有更强的识别能力.提出了基于二阶隐马尔可夫模型的文本信息抽取算法;分析了二阶隐马尔可夫模型在文本信息抽取中的有效性;仿真实验表明
新的算法比基于一阶隐马尔可夫模型的算法具有更高的抽取精确度.
Hidden Markov model is one of important approaches for text information extraction.In the first-order hidden Markov model
there is the hypothesis that the transition probability of state and the output probability of observation are only dependent on the current state of the model
which debases the precision of information extraction comparatively.The relationship between the probability and the model’s historical states is considered reasonably in the second-order hidden Markov model which has stronger performance of recognition for incorrect information.An algorithm of text information extraction based on the second-order hidden Markov model is proposed.The validity of the second-order hidden Markov model in information extraction is analyzed.Simulation Experiments show that the new algorithm has higher precision than the algorithm based on the first-order hidden Markov model.
0
浏览量
2625
下载量
14
CSCD
关联资源
相关文章
相关作者
相关机构
京公网安备11010802024621