

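1. School of Information, Central University of Finance and Economics, Beijing 102206, China
2. Engineering Research Center of State Financial Security, Ministry of Education, Beijing 102206, China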
Published Online: 04 July 2023
WANG Xiu-li, JIN Fang-yan. Implicit Discourse Relation Recognition Integrating Feature Coding and Phrase Interaction Perception[J/OL]. ACTA ELECTRONICA SINICA, 2023, 1-15.
Implicit discourse relation recognition is a highly challenging task, because implicit relations are both hard to identify and pervasive. Approaching the problem from the perspectives of argument encoding and argument interaction, this paper proposes an implicit discourse relation recognition model that integrates feature encoding and phrase interaction perception. The model accounts for both the features of each argument and the interaction features between arguments, and optimizes each separately. The argument encoding component combines a bidirectional long short-term memory network (BiLSTM) with a recurrent attention convolution neural network (RACNN), which captures the global and local features of the arguments more comprehensively. The argument interaction component models the semantic relationship between arguments at the phrase level: it constructs a phrase-level interactive attention mechanism and uses a neural tensor network (NTN) to mine the relational patterns, which better reflects the deeper latent associations between arguments. Experimental results on the Penn Discourse Treebank (PDTB) dataset show that the model achieves higher F1 scores than the comparison models.
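The argument interaction step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract gives no formulas, so the n-gram averaging used to form phrases, the dot-product attention scoring, the standard neural tensor layer form tanh(aᵀ W[k] b + V[k]·[a; b] + bias[k]), and every name and dimension below are assumptions chosen for illustration. Random vectors stand in for the BiLSTM and RACNN argument encodings.

```python
import math
import random


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def softmax(xs):
    m = max(xs)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def phrase_vectors(word_vecs, n=2):
    """Assumed phrase encoder: average each window of n consecutive word vectors."""
    dim = len(word_vecs[0])
    return [
        [sum(w[d] for w in word_vecs[i:i + n]) / n for d in range(dim)]
        for i in range(len(word_vecs) - n + 1)
    ]


def interactive_pool(phrases, other_phrases):
    """Pool `phrases` using attention weights scored against the mean of `other_phrases`."""
    dim = len(phrases[0])
    other_mean = [
        sum(p[d] for p in other_phrases) / len(other_phrases) for d in range(dim)
    ]
    weights = softmax([dot(p, other_mean) for p in phrases])
    pooled = [sum(w * p[d] for w, p in zip(weights, phrases)) for d in range(dim)]
    return pooled, weights


def ntn(a, b, W, V, bias):
    """Neural tensor layer: one tanh(a^T W[k] b + V[k] . [a; b] + bias[k]) per slice k."""
    ab = a + b  # list concatenation gives [a; b]
    out = []
    for W_k, V_k, bias_k in zip(W, V, bias):
        bilinear = sum(
            a[i] * W_k[i][j] * b[j] for i in range(len(a)) for j in range(len(b))
        )
        out.append(math.tanh(bilinear + dot(V_k, ab) + bias_k))
    return out


# Toy inputs: random vectors stand in for encoded words of the two arguments.
rng = random.Random(0)
dim, slices = 4, 3
arg1 = [[rng.uniform(-1, 1) for _ in range(dim)] for _ in range(5)]
arg2 = [[rng.uniform(-1, 1) for _ in range(dim)] for _ in range(6)]

p1, p2 = phrase_vectors(arg1), phrase_vectors(arg2)
r1, w1 = interactive_pool(p1, p2)  # argument 1 attended with respect to argument 2
r2, w2 = interactive_pool(p2, p1)  # argument 2 attended with respect to argument 1

W = [[[rng.uniform(-0.5, 0.5) for _ in range(dim)] for _ in range(dim)]
     for _ in range(slices)]
V = [[rng.uniform(-0.5, 0.5) for _ in range(2 * dim)] for _ in range(slices)]
bias = [0.0] * slices

features = ntn(r1, r2, W, V, bias)  # interaction features for a downstream classifier
print(len(p1), len(p2), len(features))
```

In a trained model, `W`, `V`, and `bias` would be learned parameters and `features` would feed a relation classifier; here they are random and serve only to show how the shapes fit together.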