National Natural Science Foundation of China Outstanding State Key Laboratory Research Project (No.61223003);Youth Fund of National Natural Science Foundation of China (No.61300158)
YIN Cun-yan, HUANG Shu-jian, DAI Xin-yu, et al. Optimization of Chinese Word Segmentation in Named Entity Recognition and Word Alignment[J]. Acta Electronica Sinica, 2015, 43(8): 1481-1487.
DOI:
YIN Cun-yan, HUANG Shu-jian, DAI Xin-yu, et al. Optimization of Chinese Word Segmentation in Named Entity Recognition and Word Alignment[J]. Acta Electronica Sinica, 2015, 43(8): 1481-1487. DOI: 10.3969/j.issn.0372-2112.2015.08.003.
Optimization of Chinese Word Segmentation in Named Entity Recognition and Word Alignment
Bilingual named entity recognition and alignment are important for many natural language processing.Named entity translation can improve a lot the performance of the system like statistical machine translation or cross-language information retrieval.Quality of Chinese word segmentation does have a big impact over named entity (NE) recognition and bilingual NE extraction.Bilingual alignment information provides indications for NE recognition and word segmentation.Accordingly
based on the characteristics of NE recognition
NE alignment
and word segmentation
this paper proposes an optimization algorithm of Chinese word segmentation.By correcting word segmentation error and adjusting word segmentation granularity
the optimization algorithm can enhance extraction effect of Chinese-English NE translation and performance of statistical machine translation.The experimental result on Chinese-English news corpus shows the efficiency of our algorithm.