LI Xiang-yang, LU Jian-jiang, ZHANG Ya-fei. Web Information Extraction by Competing Classification[J]. Acta Electronica Sinica, 2004, 32(11): 1915-1917.
Web Information Extraction by Competing Classification
A competing classification method is presented for extracting Web information. The method uses the similarity between information fragments and tagged samples as each fragment's competing ability. Fragments compete for template slots, which both classifies them and filters out noise. The method needs far fewer tagged samples than rule-based extraction approaches. Experiments show that it maintains high extraction precision without any feature clues provided by users, which makes it adaptive. It is also robust to data sources with missing items and items in varying order.
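The competition for template slots can be illustrated with a minimal Python sketch. The abstract does not specify the similarity measure or the exact competition rule, so everything here is an assumption made for illustration: character-bigram Jaccard similarity stands in for the paper's measure, a greedy highest-score-first assignment stands in for its competition procedure, and the names `similarity`, `compete`, `SAMPLES`, `FRAGMENTS`, and the threshold of 0.3 are hypothetical.

```python
def bigrams(text):
    """Set of lowercase character bigrams of a string."""
    s = text.lower()
    return {s[i:i + 2] for i in range(len(s) - 1)}


def similarity(a, b):
    """Jaccard similarity over character bigrams (an assumed stand-in measure)."""
    x, y = bigrams(a), bigrams(b)
    if not x or not y:
        return 0.0
    return len(x & y) / len(x | y)


def compete(fragments, samples, threshold=0.3):
    """Let fragments compete for slots.

    A fragment's competing ability for a slot is its best similarity to
    that slot's tagged samples. Candidate (fragment, slot) pairs are
    visited from strongest to weakest; a pair wins if its score reaches
    the threshold and neither the fragment nor the slot is already taken.
    Fragments that win no slot are treated as noise.
    """
    scored = []
    for frag in fragments:
        for slot, examples in samples.items():
            ability = max(similarity(frag, ex) for ex in examples)
            scored.append((ability, frag, slot))
    scored.sort(key=lambda item: item[0], reverse=True)

    assignment = {slot: None for slot in samples}  # None marks a missing item
    used = set()
    for ability, frag, slot in scored:
        if ability < threshold:
            break  # every remaining pair is weaker still
        if assignment[slot] is None and frag not in used:
            assignment[slot] = frag
            used.add(frag)

    noise = [frag for frag in fragments if frag not in used]
    return assignment, noise


# Hypothetical tagged samples: a few examples per template slot.
SAMPLES = {
    "title": ["Introduction to Data Mining", "Data Mining Concepts"],
    "price": ["Price: $35.00", "Price: $12.50"],
    "author": ["by John Smith"],
}

# Hypothetical fragments from one page: no author is present, and the
# order differs from the slot order.
FRAGMENTS = [
    "Price: $27.00",
    "Copyright 2004 All rights reserved",
    "Advanced Data Mining Methods",
]

assignment, noise = compete(FRAGMENTS, SAMPLES)
print(assignment)
print(noise)
```

In this sketch the price and title fragments win their slots regardless of their order on the page, the author slot stays empty because no fragment is similar enough to its sample, and the copyright line wins nothing and is discarded as noise. That mirrors, in a simplified way, the handling of missing items and varying item order described in the abstract.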