

浏览全部资源
扫码关注微信
1.清华大学北京信息科学与技术国家研究中心,北京 100084
2.中南大学湘雅医院,湖南长沙 410008
Received:29 April 2022,
Revised:2022-08-25,
Published:25 December 2022
移动端阅览
杜晋华,尹浩,冯嵩.中文电子病历命名实体识别的研究与进展[J].电子学报,2022,50(12):3030-3053.
DU Jin-hua,YIN Hao,FENG Song.Research and Development of Named Entity Recognition in Chinese Electronic Medical Record[J].ACTA ELECTRONICA SINICA,2022,50(12):3030-3053.
杜晋华,尹浩,冯嵩.中文电子病历命名实体识别的研究与进展[J].电子学报,2022,50(12):3030-3053. DOI: 10.12263/DZXB.20220485.
DU Jin-hua,YIN Hao,FENG Song.Research and Development of Named Entity Recognition in Chinese Electronic Medical Record[J].ACTA ELECTRONICA SINICA,2022,50(12):3030-3053. DOI: 10.12263/DZXB.20220485.
海量电子病历(Electronic Medical Record,EMR)数据是支撑医疗智能化研究的重要原料,然而电子病历文本数据的半结构化甚至无结构化特点,造成后续对其分析利用的极大困难.虽然近年来基于深度学习的命名实体识别(Named Entity Recognition,NER)成为对电子病历进行自动化信息抽取的核心技术,但鉴于中文电子病历(Chinese Electronic Medical Record,CEMR)具有包括病历文本的非规范性与专业性、医疗实体的独特性和标注语料的稀缺性在内的独特文本数据特征,该研究目前仍存在诸多挑战.本文对中文电子病历命名实体识别的研究与进展进行了综述,系统梳理了命名实体识别的概念、相关理论模型以及制约中文电子病历命名实体识别准确率和识别效率的主要原因;从技术发展角度详细分析了中文电子病历命名实体识别方法的变革历程;并对中文电子病历命名实体识别效果做了实验验证与深入分析,指出了现有模型的不足与改进方向.鉴于国内近年来与中文信息学处理相关的测评会议CCKS持续关注中文电子病历命名实体识别,本文特别对CCKS在该领域五年来的全部代表性测评论文做了纵横对比分析,并通过在主流模型上的深入实验与研究,为后续该领域的继续推进寻求了思路.
Massive electronic medical record(EMR) data is an important raw material to support the research of medical intelligence
but the semi-structured or even unstructured characteristics of EMR text data make it extremely difficult to analyze and utilize them subsequently. Although named entity recognition(NER) based on deep learning has become a core technology for automated information extraction from electronic medical records in recent years
there are still many challenges in this research given the unique textual data characteristics of Chinese electronic medical record(CEMR)
including the non-normative and specialized nature of medical record text
the uniqueness of medical entities and the scarcity of annotated corpus.
This paper provides an overview of the research and progress of named entity recognition in Chinese electronic medical records
systematically sorting out the concept of named entity recognition
related theoretical models and the main reasons limiting the accuracy and efficiency of named entity recognition in Chinese electronic medical records; analyzes in detail the change history of named entity recognition methods in Chinese electronic medical records from the perspective of technical development; and makes an experimental verification and in-depth analysis of the effect of named entity recognition in Chinese electronic medical records
and points out the shortcomings and improvement directions of existing models.
In view of the fact that CCKS
a domestic evaluation conference related to Chinese informatics processing
has continued to focus on the recognition of named entities in Chinese electronic medical records in recent years
this paper presents a longitudinal and cross-sectional analysis of all the representative evaluation papers of CCKS in this field over the past five years
and seeks ideas for the continued advancement of this field through in-depth experiments and research on the mainstream model.
国家卫健委 . 关于印发电子病历应用管理规范(试行)的通知 [EB/OL ] . ( 2017-02-22 )[ 2022-01-02 ] . http://www.nhc.gov.cn/yzygj/s3593/201702/22bb2525318f496f846e8566754876a1.shtml http://www.nhc.gov.cn/yzygj/s3593/201702/22bb2525318f496f846e8566754876a1.shtml .
马欢欢 , 孔繁之 , 高建强 . 中文电子病历命名实体识别方法研究 [J ] . 医学信息学杂志 , 2020 , 41 ( 4 ): 24 - 29 .
MA H H , KONG F Z , GAO J Q . Study on named entity recognition method of Chinese electronic medical records [J ] . Journal of Medical Informatics , 2020 , 41 ( 4 ): 24 - 29 . (in Chinese)
辛海燕 , 李鹏 , 张国庆 . 医院医疗科研大数据平台的建设与应用 [J ] . 中国卫生信息管理杂志 , 2019 , 16 ( 2 ): 206 - 209 .
XIN H Y , LI P , ZHANG G Q . Construction and application of medical research big data platform in hospital [J ] . Chinese Journal of Health Informatics and Management , 2019 , 16 ( 2 ): 206 - 209 . (in Chinese)
崔博文 , 金涛 , 王建民 . 自由文本电子病历信息抽取综述 [J ] . 计算机应用 , 2021 , 41 ( 4 ): 1055 - 1063 .
CUI B W , JIN T , WANG J M . Overview of information extraction of free-text electronic medical records [J ] . Journal of Computer Applications , 2021 , 41 ( 4 ): 1055 - 1063 . (in Chinese)
付秀 , 陈麒麟 , 李杰 , 等 . 基于智能预问诊的全景多学科会诊平台的设计与应用 [J ] . 中国数字医学 , 2021 , 16 ( 10 ): 79 - 82 .
FU X , CHEN Q L , LI J , et al . Design and application of the panoramic multi-disciplinary treatment platform based on intelligent pre-consultation [J ] . China Digital Medicine , 2021 , 16 ( 10 ): 79 - 82 . (in Chinese)
吴宗友 , 白昆龙 , 杨林蕊 , 等 . 电子病历文本挖掘研究综述 [J ] . 计算机研究与发展 , 2021 , 58 ( 3 ): 513 - 527 .
WU Z Y , BAI K L , YANG L R , et al . Review on text mining of electronic medical record [J ] . Journal of Computer Research and Development , 2021 , 58 ( 3 ): 513 - 527 . (in Chinese)
杨锦锋 , 于秋滨 , 关毅 , 等 . 电子病历命名实体识别和实体关系抽取研究综述 [J ] . 自动化学报 , 2014 , 40 ( 8 ): 1537 - 1562 .
YANG J F , YU Q B , GUAN Y , et al . An overview of research on electronic medical record oriented named entity recognition and entity relation extraction [J ] . Acta Automatica Sinica , 2014 , 40 ( 8 ): 1537 - 1562 . (in Chinese)
全国知识图谱与语义计算大会 . CCKS 2021评测二: 电子病历命名实体识别 [EB/OL ] . ( 2021-5-31 )[ 2022-01-02 ] . https://www.biendata.xyz/competition/ccks_2021_clinic/ https://www.biendata.xyz/competition/ccks_2021_clinic/ .
程楠 , 侯豪 , 牛亚军 , 等 . 基于NLP技术后结构化处理的电子病历应用 [J ] . 河南医学研究 , 2021 , 30 ( 24 ): 4510 - 4513 .
CHENG N , HOU H , NIU Y J , et al . Application of post-structured electronic medical record based on NLP technology [J ] . Henan Medical Research , 2021 , 30 ( 24 ): 4510 - 4513 . (in Chinese)
NADEAU D , SEKINE S . A survey of named entity recognition and classification [J ] . Lingvisticæ Investigationes , 2007 , 30 : 3 - 26 .
CORTES C , VAPNIK V . Support-vector networks [J ] . Machine Learning , 1995 , 20 ( 3 ): 273 - 297 .
LAFFERTY J D , MCCALLUM A , PEREIRA F C N . Conditional random fields: Probabilistic models for segmenting and labeling sequence data [C ] // International Conference on Machine Learning . San Francisco : Morgan Kaufmann Publishers Inc. , 2001 : 282 - 289 .
KE X , LI S Z . Chinese organization name recognition based on co-training algorithm [C ] // 2008 3rd International Conference on Intelligent System and Knowledge Engineering . Xiamen : IEEE , 2008 : 771 - 777 .
ANDO R , ZHANG T . A framework for learning predictive structures from multiple tasks and unlabeled data [J ] . Journal of Machine Learning Research , 2005 , 6 : 1817 - 1853 .
HOCHREITER S , SCHMIDHUBER J . Long short-term memory [J ] . Neural Computation , 1997 , 9 ( 8 ): 1735 - 1780 .
WANG Z H , YANG B . Attention-based bidirectional long short-term memory networks for relation classification using knowledge distillation from BERT [C ] // 2020 IEEE Intl Conf on Dependable, Autonomic and Secure Computing, Intl Conf on Pervasive Intelligence and Computing, Intl Conf on Cloud and Big Data Computing, Intl Conf on Cyber Science and Technology Congress . Calgary : IEEE , 2020 : 562 - 568 .
曹春萍 , 关鹏举 . 基于E-CNN和BLSTM-CRF的临床文本命名实体识别 [J ] . 计算机应用研究 , 2019 , 36 ( 12 ): 3748 - 3751 .
CAO C P , GUAN P J . Clinical text named entity recognition based on E-CNN and BLSTM-CRF [J ] . Application Research of Computers , 2019 , 36 ( 12 ): 3748 - 3751 . (in Chinese)
STRUBELL E , VERGA P , BELANGER D , et al . Fast and accurate entity recognition with iterated dilated convolutions [C ] // Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing . Copenhagen : Association for Computational Linguistics , 2017 : 2670 - 2680 .
许力 , 李建华 . 基于BERT和BiLSTM-CRF的生物医学命名实体识别 [J ] . 计算机工程与科学 , 2021 , 43 ( 10 ): 1873 - 1879 .
XU L , LI J H . Biomedical named entity recognition based on BERT and BiLSTM-CRF [J ] . Computer Engineering & Science , 2021 , 43 ( 10 ): 1873 - 1879 . (in Chinese)
MIKOLOV T , CHEN K , CORRADO G , et al . Efficient estimation of word representations in vector space [C ] // International Conference on Learning Representations . Scottsdale : ICLR , 2013 : 1 - 12 .
PENNINGTON J , SOCHER R , MANNING C . Glove: Global vectors for word representation [C ] // Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing . Doha : Association for Computational Linguistics , 2014 : 1532 - 1543 .
DEVLIN J , CHANG M W , LEE K , et al . BERT: Pre-training of deep bidirectional transformers for language understanding [EB/OL ] . ( 2018-10-11 )[ 2022-01-05 ] . https://arxiv.org/abs/1810.04805 https://arxiv.org/abs/1810.04805 .
WU Y , HUANG J , XU C E , et al . Research on named entity recognition of electronic medical records based on RoBERTa and radical-level feature [J ] . Wireless Communications and Mobile Computing , 2021 , 2021 : 2489754 .
杨锦锋 , 关毅 , 何彬 , 等 . 中文电子病历命名实体和实体关系语料库构建 [J ] . 软件学报 , 2016 , 27 ( 11 ): 2725 - 2746 .
YANG J F , GUAN Y , HE B , et al . Corpus construction for named entities and entity relations on Chinese electronic medical records [J ] . Journal of Software , 2016 , 27 ( 11 ): 2725 - 2746 . (in Chinese)
全国知识图谱与语义计算大会 . 任务二: 电子病历命名实体识别 [EB/OL ] . ( 2017 )[ 2022-01-05 ] . http://www.sigkg.cn/ccks2017/?page_id=51 http://www.sigkg.cn/ccks2017/?page_id=51 .
National Knowledge Graph and Semantic Computing Conference . Task 2: Electronic medical record named entity recognition [EB/OL ] . ( 2017 )[ 2022-01-05 ] . http://www.sigkg.cn/ccks2017/?page_id=51. http://www.sigkg.cn/ccks2017/?page_id=51. (in Chinese)
全国知识图谱与语义计算大会 . 任务一: 面向中文电子病历的命名实体识别 [EB/OL ] . ( 2018 )[ 2022-01-05 ] . http://www.sigkg.cn/ccks2018/?page_id=16 http://www.sigkg.cn/ccks2018/?page_id=16 .
National Knowledge Graph and Semantic Computing Conference . Task 1: Named entity recognition for Chinese electronic medical Record [EB/OL ] . ( 2018 )[ 2022-01-05 ] . http://www.sigkg.cn/ccks2018/?page_id=16. http://www.sigkg.cn/ccks2018/?page_id=16. (in Chinese)
全国知识图谱与语义计算大会 . 任务一: 面向中文电子病历的命名实体识别 [EB/OL ] . ( 2019 )[ 2022-01-05 ] . http://www.sigkg.cn/ccks2019/?page_id=62 http://www.sigkg.cn/ccks2019/?page_id=62 .
National Knowledge Graph and Semantic Computing Conference . Task 1: Named entity recognition for Chinese electronic medical record [EB/OL ] . ( 2019 )[ 2022-01-05 ] . http://www.sigkg.cn/ccks2019/?page_id=62. http://www.sigkg.cn/ccks2019/?page_id=62. (in Chinese)
全国知识图谱与语义计算大会 . 任务三: 面向中文电子病历的医疗实体及事件抽取 [EB/OL ] . ( 2020 )[ 2022-01-05 ] . http://sigkg.cn/ccks2020/?page_id=69 http://sigkg.cn/ccks2020/?page_id=69 .
National Knowledge Graph and Semantic Computing Conference . Task three: For electronic medical records in Chinese medical entities and event extraction [EB/OL ] . ( 2020 )[ 2022-01-05 ] . http://sigkg.cn/ccks2020/?page_id=69. http://sigkg.cn/ccks2020/?page_id=69. (in Chinese)
全国知识图谱与语义计算大会 . 任务四: 面向中文电子病历的医疗实体及事件抽取 [EB/OL ] . ( 2021 )[ 2022-01-05 ] . http://sigkg.cn/ccks2021/?page_id=27 http://sigkg.cn/ccks2021/?page_id=27 .
National Knowledge Graph and Semantic Computing Conference . Task 4: For electronic medical records in Chinese medical entity and event extraction [EB/OL ] . ( 2021 )[ 2022-01-05 ] . http://sigkg.cn/ccks2021/?page_id=27. http://sigkg.cn/ccks2021/?page_id=27. (in Chinese)
王正宏 . 区域健康医疗数据集成模式研究与实现 [D ] . 合肥 : 合肥工业大学 , 2020 .
WANG Z H . Research and Implementation of Regional Health Medical Data Integration Model [D ] . Hefei : Hefei University of Technology , 2020 . (in Chinese)
韩丽珍 . PDCA循环法应用前后肿瘤科病案缺陷状况对比分析 [J ] . 中国卫生统计 , 2019 , 36 ( 5 ): 745 - 747 .
HAN L Z . Comparative analysis of medical record defects in oncology department before and after application of PDCA cycle [J ] . Chinese Journal of Health Statistics , 2019 , 36 ( 5 ): 745 - 747 . (in Chinese)
邱炎龙 . 基于电子病历的心血管疾病预测技术研究 [D ] . 兰州 : 西北师范大学 , 2021 .
QIU Y L . Research on Cardiovascular Disease Prediction Technology Based on Electronic Medical Records [D ] . Lanzhou : Northwest Normal University , 2021 . (in Chinese)
余健 , 胡孔法 , 丁有伟 . 一种面向中医药数据的高效脱敏算法 [J ] . 世界科学技术-中医药现代化 , 2020 , 22 ( 12 ): 4169 - 4174 .
YU J , HU K F , DING Y W . An efficient desensitization algorithm for Chinese medicine data [J ] . Modernization of Traditional Chinese Medicine and Materia Medica-World Science and Technology , 2020 , 22 ( 12 ): 4169 - 4174 . (in Chinese)
唐观根 . 中文电子病历命名实体识别研究 [D ] . 杭州 : 杭州电子科技大学 , 2020 .
TANG G G . Research on Named Entity Recognition of Chinese Electronic Medical Records [D ] . Hangzhou : Hangzhou Dianzi University , 2020 . (in Chinese)
GRISHMAN R , SUNDHEIM B . Message Understanding Conference-6: A brief history [C ] // Proceedings of the 16th Conference on Computational linguistics-Volume 1 . Copenhagen : Association for Computational Linguistics , 1996 : 466 - 471 .
DODDINGTON G , MITCHELL A , PRZYBOCKI M A , et al . The automatic content extraction(ACE) program - tasks, data, and evaluation [C ] // Language Resources and Evaluation Conference . Lisbon : LREC , 2004 : 1 - 4 .
XU L , TONG Y , DONG Q Q , et al . CLUENER2020: Fine-grained named entity recognition dataset and benchmark for Chinese [EB/OL ] . ( 2020-01-13 )[ 2022-01-05 ] . https://arxiv.org/abs/2001.04351 https://arxiv.org/abs/2001.04351 .
ZHANG N Y , CHEN M S , BI Z , et al . CBLUE: A Chinese biomedical language understanding evaluation benchmark [EB/OL ] . ( 2021-01-15 )[ 2022-01-05 ] . https://arxiv.org/abs/2106.08087 https://arxiv.org/abs/2106.08087 .
龚乐君 , 张知菲 . 基于领域词典与CRF双层标注的中文电子病历实体识别 [J ] . 工程科学学报 , 2020 , 42 ( 4 ): 469 - 475 .
GONG L J , ZHANG Z F . Clinical named entity recognition from Chinese electronic medical records using a double-layer annotation model combining a domain dictionary with CRF [J ] . Chinese Journal of Engineering , 2020 , 42 ( 4 ): 469 - 475 . (in Chinese)
GORINSKI P J , WU H H , GROVER C , et al . Named entity recognition for electronic health records: A comparison of rule-based and machine learning approaches [EB/OL ] . ( 2019-03-10 )[ 2022-01-05 ] . https://arxiv.org/abs/1903.03985 https://arxiv.org/abs/1903.03985 .
高冰涛 , 张阳 , 刘斌 . BioTrHMM: 基于迁移学习的生物医学命名实体识别算法 [J ] . 计算机应用研究 , 2019 , 36 ( 1 ): 45 - 48 .
GAO B T , ZHANG Y , LIU B . BioTrHMM: Named entity recognition algorithm based on transfer learning in biomedical texts [J ] . Application Research of Computers , 2019 , 36 ( 1 ): 45 - 48 . (in Chinese)
杨丽静 , 唐俊 , 沈伟富 , 等 . 基于命名实体识别的恶性肿瘤诊断文本信息提取研究 [J ] . 医院管理论坛 , 2020 , 37 ( 8 ): 74 - 77 .
YANG L J , TANG J , SHEN W F , et al . Research on text information extraction of malignant tumor diagnosis based on named entity recognition [J ] . Hospital Management Forum , 2020 , 37 ( 8 ): 74 - 77 . (in Chinese)
张华丽 , 康晓东 , 李博 , 等 . 结合注意力机制的Bi-LSTM-CRF中文电子病历命名实体识别 [J ] . 计算机应用 , 2020 , 40 ( S1 ): 98 - 102 .
ZHANG H L , KANG X D , LI B , et al . Medical name entity recognition based on Bi-LSTM-CRF and attention mechanism [J ] . Journal of Computer Applications , 2020 , 40 ( S1 ): 98 - 102 . (in Chinese)
DOS SANTOS C N , ZADROZNY B . Learning character-level representations for part-of-speech tagging [C ] // Proceedings of the 31st International Conference on International Conference on Machine Learning-Volume 32 . Beijing : JMLR.org , 2014 : II( 1818-1826 .
乔锐 , 杨笑然 , 黄文亢 . 基于BERT与模型融合的医疗命名实体识别 [C ] // 2019年全国知识图谱与语义计算大会 . 杭州 : 中国中文信息学会 , 2019 : 1 - 6 .
曲春燕 , 关毅 , 杨锦锋 , 等 . 中文电子病历命名实体标注语料库构建 [J ] . 高技术通讯 , 2015 , 25 ( 2 ): 143 - 150 .
QU C Y , GUAN Y , YANG J F , et al . The construction of annotated corpora of named entities for Chinese electronic medical records [J ] . Chinese High Technology Letters , 2015 , 25 ( 2 ): 143 - 150 . (in Chinese)
刘一斌 . 中医中文电子病历命名实体语料库构建及研究 [D ] . 广州 : 广州中医药大学 , 2020 .
LIU Y B . Construction and Research of Chinese Electronic Medical Record Named Entity Recognition Corpus [D ] . Guangzhou : Guangzhou University of Chinese Medicine , 2020 . (in Chinese)
陈曙东 , 罗超 , 欧阳小叶 , 等 . 基于动态词典匹配的语义增强中文命名实体识别算法 [J ] . 无线电工程 , 2021 , 51 ( 7 ): 519 - 525 .
CHEN S D , LUO C , OUYANG X Y , et al . A semantic-enhanced Chinese named entity recognition algorithm based on dynamic dictionary matching [J ] . Radio Engineering , 2021 , 51 ( 7 ): 519 - 525 . (in Chinese)
WANG Q , ZHOU Y M , RUAN T , et al . Incorporating dictionaries into deep neural networks for the Chinese clinical named entity recognition [J ] . Journal of Biomedical Informatics , 2019 , 92 : 103133 .
CHEN X L , OUYANG C P , LIU Y B , et al . Improving the named entity recognition of Chinese electronic medical records by combining domain dictionary and rules [J ] . International Journal of Environmental Research and Public Health , 2020 , 17 ( 8 ): 2687 .
JUSTYNA S W , ALEKSANDER W , ALEKSANDER P , et al . Detecting formal thought disorder by deep contextualized word representations [J ] . Psychiatry Research , 2021 , 304 : 114135 .
ZHANG Y , YANG J . Chinese NER using lattice LSTM [C ] // Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics . Melbourne : Association for Computational Linguistics , 2018 : 1554 - 1564 .
ANDREW B F . A Maximum Entropy Approach to Named Entity Recognition [D ] . New York : New York University , 1999 .
陈琛 . 基于BiGRU_CRF模型的医疗领域命名实体识别 [J ] . 电子技术与软件工程 , 2020 ( 14 ): 180 - 182 .
CHEN C . Named entity recognition in medical field based on BiGRU_CRF mode [J ] . Electronic Technology & Software Engineering , 2020 ( 14 ): 180 - 182 . (in Chinese)
FINE S , SINGER Y , TISHBY N . The hierarchical hidden Markov model: Analysis and applications [J ] . Machine Learning , 1998 , 32 ( 1 ): 41 - 62 .
MCCALLUM A , FREITAG D , PEREIRA F C . Maximum entropy Markov models for information extraction and segmentation [C ] // Proceedings of the Seventeenth International Conference on Machine Learning . Stanford : Morgan Kaufmann Publishers Inc. , 2000 : 591 - 598 .
李博 , 康晓东 , 张华丽 , 等 . 采用Transformer-CRF的中文电子病历命名实体识别 [J ] . 计算机工程与应用 , 2020 , 56 ( 5 ): 153 - 159 .
LI B , KANG X D , ZHANG H L , et al . Named entity recognition in Chinese electronic medical records using transformer-CRF [J ] . Computer Engineering and Applications , 2020 , 56 ( 5 ): 153 - 159 . (in Chinese)
KIM Y . Convolutional neural networks for sentence classification [EB/OL ] . ( 2014-08-25 )[ 2022-01-05 ] . https://arxiv.org/abs/1408.5882 https://arxiv.org/abs/1408.5882 .
CHO K , VAN MERRIENBOER B , GULCEHRE C , et al . Learning phrase representations using RNN encoder-decoder for statistical machine translation [EB/OL ] . ( 2014-06-03 )[ 2022-01-05 ] . https://arxiv.org/abs/1406.1078 https://arxiv.org/abs/1406.1078 .
CHIU J P , NICHOLS E . Named entity recognition with bidirectional LSTM-CNNs [J ] . Transactions of the Association for Computational Linguistics , 2016 , 4 : 357 - 370 .
WANG Y Q , HUANG M L , ZHU X Y , et al . Attention-based LSTM for aspect-level sentiment classification [C ] // Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing . Austin : Association for Computational Linguistics , 2016 : 606 - 615 .
JI B , LIU R , LI S S , et al . A BiLSTM-CRF method to Chinese electronic medical record named entity recognition [C ] // Proceedings of the 2018 International Conference on Algorithms, Computing and Artificial Intelligence . Sanya : ACM , 2018 : 1 - 6 .
LI X N , YAN H , QIU X P , et al . FLAT: Chinese NER using flat-lattice transformer [C ] // Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics . Virtual Conference : Association for Computational Linguistics , 2020 : 6836 - 6842 .
ADHIKARI A , RAM A , TANG R , et al . DocBERT: BERT for document classification [EB/OL ] . ( 2019-04-17 )[ 2022-01-05 ] . https://arxiv.org/abs/1904.08398 https://arxiv.org/abs/1904.08398 .
ALBERTI C , LEE K , COLLINS M . A BERT baseline for the natural questions [EB/OL ] . ( 2019-01-24 )[ 2022-01-05 ] . https://arxiv.org/abs/1901.08634 https://arxiv.org/abs/1901.08634 .
YANG W , ZHANG H T , LIN J . Simple applications of BERT for ad hoc document retrieval [EB/OL ] . ( 2019-03-26 )[ 2022-01-05 ] . https://arxiv.org/abs/1903.10972 https://arxiv.org/abs/1903.10972 .
RADFORD A , NARASIMHAN K . Improving language understanding by generative pre-training [J ] . Computer Science , 2018 : 1 - 12 .
朱岩 , 张利 , 王煜 . 基于RoBERTa-WWM的中文电子病历命名实体识别 [J ] . 计算机与现代化 , 2021 ( 2 ): 51 - 55 .
ZHU Y , ZHANG L , WANG Y . Named entity recognition on Chinese electronic medical records based on RoBERTa-WWM [J ] . Computer and Modernization , 2021 ( 2 ): 51 - 55 . (in Chinese)
刘司宇 . 基于深度学习的中文命名实体识别方法改进研究 [D ] . 成都 : 成都理工大学 , 2020 : 70 - 71 .
LIU S Y . The Research on Improvement of Chinese Named Entity Recognition Method Based on Deep Learning [D ] . Chengdu : Chengdu University of Technology , 2020 : 70 - 71 . (in Chinese)
MA X Z , HOVY E . End-to-end sequence labeling via Bi-directional LSTM-CNNs-CRF [EB/OL ] .( 2016-03-04 )[ 2022-01-05 ] . https://arxiv.org/abs/1603.01354 https://arxiv.org/abs/1603.01354 .
晏阳天 , 赵新宇 , 吴贤 . 基于BERT与字形字音特征的医疗命名实体识别 [C ] // 2020年全国知识图谱与语义计算大会 . 南昌 : 中国中文信息学会 , 2020 : 1 - 7 .
殷章志 , 李欣子 , 黄德根 , 等 . 融合字词模型的中文命名实体识别研究 [J ] . 中文信息学报 , 2019 , 33 ( 11 ): 95 - 100, 106 .
YIN Z Z , LI X Z , HUANG D G , et al . Chinese named entity recognition ensembled with character [J ] . Journal of Chinese Information Processing , 2019 , 33 ( 11 ): 95 - 100, 106 . (in Chinese)
RONNEBERGER O , FISCHER P , BROX T . U-Net: Convolutional networks for biomedical image segmentation [C ] // International Conference on Medical Image Computing and Computer-Assisted Intervention . Munich : Springer , 2015 : 234 - 241 .
WANG S H , JIANG J . Machine comprehension using match-LSTM and answer pointer [EB/OL ] . ( 2016-08-29 )[ 2022-01-05 ] . https://arxiv.org/abs/1608.07905 https://arxiv.org/abs/1608.07905 .
CUI Y M , CHE W X , LIU T , et al . Pre-training with whole word masking for Chinese BERT [EB/OL ] . ( 2019-06-19 )[ 2022-01-05 ] . https://arxiv.org/abs/1906.08101 https://arxiv.org/abs/1906.08101 .
HU J L , SHI X , LIU Z J , et al . HITSZ_CNER: A hybrid system for entity recognition from Chinese clinical text [C ] // Proceedings of the Evaluation Tasks at the China Conference on Knowledge Graph and Semantic Computing . Chendu : Springer , 2017 : 1 - 6 .
LU N J , ZHENG J , WU W , et al . Chinese clinical named entity recognition with word-level information incorporating dictionaries [C ] // 2019 International Joint Conference on Neural Networks . Budapest : IEEE , 2019 : 1 - 8 .
罗凌 , 杨志豪 , 宋雅文 , 等 . 基于笔画ELMo和多任务学习的中文电子病历命名实体识别研究 [J ] . 计算机学报 , 2020 , 43 ( 10 ): 1943 - 1957 .
LUO L , YANG Z H , SONG Y W , et al . Chinese clinical named entity recognition based on stroke ELMo and multi-task learning [J ] . Chinese Journal of Computers , 2020 , 43 ( 10 ): 1943 - 1957 . (in Chinese)
LI X Y , ZHANG H , ZHOU X H . Chinese clinical named entity recognition with variant neural structures based on BERT methods [J ] . Journal of Biomedical Informatics , 2020 , 107 : 103422 .
唐国强 , 高大启 , 阮彤 , 等 . 融入语言模型和注意力机制的临床电子病历命名实体识别 [J ] . 计算机科学 , 2020 , 47 ( 3 ): 211 - 216 .
TANG G Q , GAO D Q , RUAN T , et al . Clinical electronic medical record named entity recognition incorporating language model [J ] . Computer Science , 2020 , 47 ( 3 ): 211 - 216 . (in Chinese)
QIU J H , ZHOU Y M , WANG Q , et al . Chinese clinical named entity recognition using residual dilated convolutional neural network with conditional random field [J ] . IEEE Transactions on NanoBioscience , 2019 , 18 ( 3 ): 306 - 315 .
JI B , LIU R , LI S S , et al . A hybrid approach for named entity recognition in Chinese electronic medical record [J ] . BMC Medical Informatics and Decision Making , 2019 , 19 ( Suppl 2 ): 64 .
WANG C Y , WANG H , ZHUANG H , et al . Chinese medical named entity recognition based on multi-granularity semantic dictionary and multimodal tree [J ] . Journal of Biomedical Informatics , 2020 , 111 : 103583 .
潘璀然 , 王青华 , 汤步洲 , 等 . 基于句子级Lattice-长短记忆神经网络的中文电子病历命名实体识别 [J ] . 第二军医大学学报 , 2019 , 40 ( 5 ): 497 - 506 .
PAN C R , WANG Q H , TANG B Z , et al . Chinese electronic medical record named entity recognition based on sentence-level Lattice-long short-term memory neural network [J ] . Academic Journal of Second Military Medical University , 2019 , 40 ( 5 ): 497 - 506 . (in Chinese)
YANG X R , HUANG W K . A conditional random fields approach to clinical name entity recognition [C ] // China Conference on Knowledge Graph and Semantic Computing . Tianjin : Springer , 2018 : 1 - 6 .
LUO L , LI N , LI S , et al . DUTIR at the CCKS-2018 Task1: A neural network ensemble approach for Chinese clinical named entity recognition [C ] // China Conference on Knowledge Graph and Semantic Computing . Tianjin : Springer , 2018 : 7 - 12 .
何云琪 , 刘苏文 , 钱龙华 , 周国栋 . 基于句法和语义特征的疾病名称识别 [J ] . 中国科学: 信息科学 , 2018 , 48 ( 11 ): 1546 - 1557 .
盛剑 , 向政鹏 , 秦兵 , 等 . 多场景文本的细粒度命名实体识别 [J ] . 中文信息学报 , 2019 , 33 ( 6 ): 80 - 87 .
SHENG J , XIANG Z P , QIN B , et al . Fine-grained named entity recognition for multi-scenario [J ] . Journal of Chinese Information Processing , 2019 , 33 ( 6 ): 80 - 87 . (in Chinese)
LIU M L , ZHOU X S , CAO Z , et al . Team MSIIP at CCKS 2019 Task 1 [C ] // 2019 China Conference on Knowledge Graph and Semantic Computing . Hangzhou : Chinese Information Processing Society of China , 2019 : 1 - 11 .
LI N , LUO L , DING Z , et al . DUTIR at the CCKS-2019 Task1: Improving Chinese clinical named entity recognition using stroke ELMo and transfer learning [C ] // Proceedings of the 4th China Conference on Knowledge Graph and Semantic Computing . Hangzhou : Chinese Information Processing Society of China , 2019 : 24 - 27 .
赵刚 , 张腾 , 王晨骁 , 等 . Team MSIIP at CCKS 2019 Task 2 [EB/OL ] . ( 2019 )[ 2022-01-05 ] . https://conference.bj.bcebos.com/ccks2019/eval/webpage/pdfs/eval_paper_1_2_2.pdf https://conference.bj.bcebos.com/ccks2019/eval/webpage/pdfs/eval_paper_1_2_2.pdf .
JI B , LI S S , YU J , et al . Research on Chinese medical named entity recognition based on collaborative cooperation of multiple neural network models [J ] . Journal of Biomedical Informatics , 2020 , 104 : 103395 .
LI Z C , GAN Z , ZHANG B L , et al . Semi-supervised noisy label learning for Chinese clinical named entity recognition [J ] . Data Intelligence 2021 , 3 ( 3 ): 389 - 401
杨文明 , 毕金良 , 邹佳丽 , 等 . 基于ChiEHRBert与多模型融合的医疗命名实体识别 [C ] // 2020年全国知识图谱与语义计算大会 . 南昌 : 中国中文信息学会 , 2020 : 1 - 9 .
ZHENG H Y , QIN B , XU M . Chinese medical named entity recognition using CRF-MT-Adapt and NER-MRC [C ] // 2021 2nd International Conference on Computing and Data Science . Stanford : IEEE , 2021 : 362 - 365 .
温超杰 , 陈涛 , 朱江 . 基于预训练模型和领域词典的医疗命名实体识别方法研究 [C ] // 2020年全国知识图谱与语义计算大会 . 南昌 : 中国中文信息学会 , 2020 : 1 - 11 .
MA C , HUANG W K . Named entity recognition and event extraction in Chinese electronic medical records [C ] // China Conference on Knowledge Graph and Semantic Computing . Qinhuangdao : Springer , 2022 : 133 - 138 .
GAN Z , LI Z C , ZHANG B L , et al . Enhance both text and label: Combination strategies for improving the generalization ability of medical entity extraction [C ] // China Conference on Knowledge Graph and Semantic Computing . Qinhuangdao : Springer , 2022 : 92 - 101 .
晏阳天 , 张昕楠 , 吴喆 , 等 . 基于多特征融合的预训练医疗实体和事件抽取模型 [C ] // 2021年全国知识图谱与语义计算大会 . 广州 : 中国中文信息学会 , 2021 : 1 - 8 .
WANG Y , LI Y , TONG H H , et al . HIT: Nested named entity recognition via head-tail pair and token interaction [C ] // Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing . Virtual Conference : Association for Computational Linguistics , 2020 : 6027 - 6036 .
BEKOULIS G , DELEU J , DEMEESTER T , et al . Joint entity recognition and relation extraction as a multi-head selection problem [EB/OL ] .( 2018-04-20 )[ 2022-01-05 ] . https://arxiv.org/abs/1804.07847 https://arxiv.org/abs/1804.07847 .
EBERTS M , ULGES A . Span-based joint entity and relation extraction with transformer pre-training [EB/OL ] . ( 2019-09-17 )[ 2022-01-05 ] . https://arxiv.org/abs/1909.07755 https://arxiv.org/abs/1909.07755 .
WADDEN D , WENNBERG U , LUAN Y , et al . Entity, relation, and event extraction with contextualized span representations [EB/OL ] . ( 2019-09-08 )[ 2022-01-05 ] . https://arxiv.org/abs/1909.03546 https://arxiv.org/abs/1909.03546 .
KINGMA D P , BA J . Adam: A method for stochastic optimization [EB/OL ] . ( 2014-10-22 )[ 2022-01-05 ] . https://arxiv.org/abs/1412.6980 https://arxiv.org/abs/1412.6980 .
LOSHCHILOV I , HUTTER F . Fixing weight decay regularization in adam [C ] // 2018 International Conference on Learning Representations . Vancouver : ICLR , 2018 : 1 - 14 .
卢宁杰 . 结合主动学习的中文医疗命名实体识别研究 [D ] . 上海 : 华东师范大学 , 2020 .
LU N J . Research on Chinese Medical Named Entity Recognition Combined with Active Learning [D ] . Shanghai : East China Normal University , 2020 . (in Chinese)
钟志农 , 刘方驰 , 吴烨 , 等 . 主动学习与自学习的中文命名实体识别 [J ] . 国防科技大学学报 , 2014 , 36 ( 4 ): 82 - 88 .
ZHONG Z N , LIU F C , WU Y , et al . Chinese named entity recognition combined active learning with self-training [J ] . Journal of National University of Defense Technology , 2014 , 36 ( 4 ): 82 - 88 . (in Chinese)
李猛 , 李艳玲 , 林民 . 命名实体识别的迁移学习研究综述 [J ] . 计算机科学与探索 , 2021 , 15 ( 2 ): 206 - 218 .
LI M , LI Y L , LIN M . Review of transfer learning for named entity recognition [J ] . Journal of Frontiers of Computer Science and Technology , 2021 , 15 ( 2 ): 206 - 218 . (in Chinese)
王浩畅 , 李钰 , 赵铁军 . 面向生物医学命名实体识别的多Agent元学习框架 [J ] . 计算机学报 , 2010 , 33 ( 7 ): 1256 - 1262 .
WANG H C , LI Y , ZHAO T J . Biomedical named entity recognition through a multi-agent meta-learning framework [J ] . Chinese Journal of Computers , 2010 , 33 ( 7 ): 1256 - 1262 . (in Chinese)
SUI D B , CHEN Y B , ZHAO J , et al . Feded: Federated learning via ensemble distillation for medical relation extraction [C ] // Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing . Seattle : Association for Computational Linguistics , 2020 : 2118 - 2128 .
0
Views
9
下载量
6
CSCD
Publicity Resources
Related Articles
Related Author
Related Institution
京公网安备11010802024621