Anaphora Resolution of Uyghur Personal Pronouns Based on Bi-LSTM

TIAN Sheng-wei; QIN Yue; YU Long; Turgun Ibrahim; FENG Guan-jun

doi:10.3969/j.issn.0372-2112.2018.07.022

您当前的位置：

首页 >

文章列表页 >

Anaphora Resolution of Uyghur Personal Pronouns Based on Bi-LSTM

更新时间：2025-07-16

- Anaphora Resolution of Uyghur Personal Pronouns Based on Bi-LSTM
- Acta Electronica Sinica Vol. 46, Issue 7, Pages: 1691-1699(2018)
- 作者机构：
  
  1. 新疆大学软件学院,新疆,乌鲁木齐,830008
  2. 新疆大学信息科学与工程学院,新疆,乌鲁木齐,830046
  3. 新疆大学网络中心,新疆,乌鲁木齐,830046
  4. 新疆大学人文学院,新疆,乌鲁木齐,830046
- 作者简介：
- 基金信息：
  
  National Natural Science Foundation of China (No.61563051, No.61662074, No.61262064);Key Program of National Natural Science Foundation of China (No.61331011);Science and Technology Talents Cultivation Program of Xinjiang Uygur Autonomous Region (No.QN2016YX0051)
- DOI：10.3969/j.issn.0372-2112.2018.07.022
  CLC： TP391
- Published Online：25 July 2018，
  
  Published：2018
- 稿件说明：
移动端阅览
TIAN Sheng-wei, QIN Yue, YU Long, et al. Anaphora Resolution of Uyghur Personal Pronouns Based on Bi-LSTM[J]. Acta Electronica Sinica, 2018, 46(7): 1691-1699.
DOI：

TIAN Sheng-wei, QIN Yue, YU Long, et al. Anaphora Resolution of Uyghur Personal Pronouns Based on Bi-LSTM[J]. Acta Electronica Sinica, 2018, 46(7): 1691-1699. DOI： 10.3969/j.issn.0372-2112.2018.07.022.

摘要

针对维吾尔语人称代词指代现象，提出利用双向长短时记忆网络（Bi-directional long short term memory，Bi-LSTM）的深度学习机制进行基于深层语义信息的维吾尔语人称代词指代消解.首先将富含语义和句法信息的word embedding向量作为Bi-LSTM的输入，挖掘维吾尔语隐含的上下文语义层面特征；其次对维吾尔语人称代词指代现象进行探索，提取针对人称代词指代研究的24个hand-crafted特征；然后利用多层感知器（multilayer perception，MLP）融合Bi-LSTM学习到的上下文语义层面特征与hand-crafted特征；最后使用融合的两类特征训练softmax分类器完成维吾尔语人称代词指代消解任务.实验结果表明，充分利用两类特征的优势，维吾尔语人称代词指代消解的F1值达到76.86%.实验验证了Bi-LSTM与单向LSTM、浅层机器学习算法的SVM和ANN相比更具备挖掘隐含上下文深层语义信息的能力，而hand-crafted层面特征的引入，则有效提高指代消解性能.

Abstract

Specific to the anaphora phenomenon of Uyghur personal pronouns

a deep learning mechanism of Bi-LSTM (Bi-directional long short term memory) network is proposed

which is based on the deep semantic information to resolve anaphora resolution problem in Uyghur personal pronouns.Firstly

make the word embedding which contain semantic and syntactic information as the input of Bi-LSTM

to excavate the implicit semantic features of Uyghur.Secondly

explore the anaphora phenomenon in Uyghur and extract 24 hand-crafted features.Then

use multilayer perception(MLP)to concatenate hand-crafted features and context semantic features.Finally

two types of features are used to train the softmax classifier to complete the task.The experimental results show that

on the basis of full utilization of the advantages of two types of features

the F1 value of anaphora resolution is 76.86%.It is proved that Bi-LSTM is more capable of mining implicit context semantic information than LSTM、SVM as well as ANN

and the introduction of hand-crafted features can effectively improve the performance.

关键词

Keywords

references

Views

203

下载量

CSCD

Alert me when the article has been cited

提交

Tools

Publicity Resources

Anaphora Resolution of Uyghur Personal Pronouns Based on Deep Reinforcement Learning

Research and Development of Named Entity Recognition in Chinese Electronic Medical Record

SMGN: A State Memory Graph Network for Dialogue State Tracking

Related Author

YANG Qi-meng

YU Long

TIAN Sheng-wei

Aishan Wumaier

吾买尔

FENG Song

DU Jin-hua

YIN Hao

Related Institution

School of Information Science and Engineering, Xinjiang University

Network Center, Xinjiang University

Software College, Xinjiang University

School of Information Science and Engineering Xinjiang University Urumqi Xinjiang China

Network Center Xinjiang University Urumqi Xinjiang China

⁰