基于二阶隐马尔可夫模型的文本信息抽取

周顺先; 林亚平; 王耀南; 易叶青

您当前的位置：

首页 >

文章列表页 >

基于二阶隐马尔可夫模型的文本信息抽取

科研通信 | 更新时间：2025-07-16

- 基于二阶隐马尔可夫模型的文本信息抽取
- Text Information Extraction Based on the Second-Order Hidden Markov Model
- 电子学报 2007年35卷第11期页码：2226-2231
- 作者机构：
  
  1. 湖南大学计算机与通信学院,湖南,长沙,410082
  2. 湖南大学电气与信息工程学院,湖南,长沙,410082
  3. 湖南大学计算机与通信学院湖南长沙,410082
  4. 湖南大学电气与信息工程学院湖南长沙,410082
- 作者简介：
- 基金信息：
  
  国家863高技术研究发展计划 (No.2006AA01Z227);湖南省重点自然科学基金 (No.06JJ20049)
- DOI：
  中图分类号： TP391
- 纸质出版：2007
- 稿件说明：
移动端阅览
周顺先, 林亚平, 王耀南, 等. 基于二阶隐马尔可夫模型的文本信息抽取[J]. 电子学报, 2007,35(11):2226-2231.

ZHOU Shun-xian, LIN Ya-ping, WANG Yao-nan, et al. Text Information Extraction Based on the Second-Order Hidden Markov Model[J]. Acta Electronica Sinica, 2007, 35(11): 2226-2231.
周顺先, 林亚平, 王耀南, 等. 基于二阶隐马尔可夫模型的文本信息抽取[J]. 电子学报, 2007,35(11):2226-2231. DOI：

ZHOU Shun-xian, LIN Ya-ping, WANG Yao-nan, et al. Text Information Extraction Based on the Second-Order Hidden Markov Model[J]. Acta Electronica Sinica, 2007, 35(11): 2226-2231. DOI：

摘要

隐马尔可夫模型是文本信息抽取的重要方法之一.在一阶隐马尔可夫模型中

假设状态转移概率和观察值输出概率仅依赖于模型当前的状态

一定程度降低了信息抽取的精确度.而二阶隐马尔可夫模型合理地考虑了概率和模型历史状态的关联性

对错误信息有更强的识别能力.提出了基于二阶隐马尔可夫模型的文本信息抽取算法;分析了二阶隐马尔可夫模型在文本信息抽取中的有效性;仿真实验表明

新的算法比基于一阶隐马尔可夫模型的算法具有更高的抽取精确度.

Abstract

Hidden Markov model is one of important approaches for text information extraction.In the first-order hidden Markov model

there is the hypothesis that the transition probability of state and the output probability of observation are only dependent on the current state of the model

which debases the precision of information extraction comparatively.The relationship between the probability and the model’s historical states is considered reasonably in the second-order hidden Markov model which has stronger performance of recognition for incorrect information.An algorithm of text information extraction based on the second-order hidden Markov model is proposed.The validity of the second-order hidden Markov model in information extraction is analyzed.Simulation Experiments show that the new algorithm has higher precision than the algorithm based on the first-order hidden Markov model.

关键词

Keywords

references

浏览量

2625

下载量

CSCD

文章被引用时，请邮件提醒。

提交

工具集

关联资源

基于最大熵的隐马尔可夫模型文本信息抽取