Gabor Filter Based Text Extraction from Digital Document Images

FU Ping, LI Meng, YIN Hong-tao

ACTA ELECTRONICA SINICA ›› 2006, Vol. 34 ›› Issue (S1) : 2387-2390.

Abstract

This paper presents an algorithm that automatically detects and extracts text in digital document images. First, Gabor-filtered images at different orientations and scales are processed and fused to obtain an image that reflects the layout of the document. Then, potential text regions are extracted directly from the resulting image. Finally, two criteria, one based on geometric properties and one on high-frequency content, are applied to eliminate non-text regions. Experiments are performed on representative images with different styles and with text in different languages and fonts. The experimental results show that the algorithm works well on document images from a wide variety of sources.
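
The first step described in the abstract (filtering at several orientations and scales, then fusing the responses) can be illustrated with a minimal sketch. The abstract does not give the filter parameters or the fusion rule, so everything below is an assumption made for illustration: the kernel size, the wavelengths, the number of orientations, fusion by summing response magnitudes, and the half-of-peak threshold. The function names are hypothetical and are not taken from the paper.

```python
import math

def gabor_kernel(size, wavelength, theta, sigma):
    """Even-symmetric Gabor kernel: Gaussian envelope times a cosine carrier."""
    half = size // 2
    kernel = []
    for y in range(-half, half + 1):
        row = []
        for x in range(-half, half + 1):
            # Rotate coordinates so the carrier oscillates along direction theta.
            xr = x * math.cos(theta) + y * math.sin(theta)
            envelope = math.exp(-(x * x + y * y) / (2.0 * sigma * sigma))
            row.append(envelope * math.cos(2.0 * math.pi * xr / wavelength))
        kernel.append(row)
    # Subtract the mean so that uniform regions produce no response.
    mean = sum(map(sum, kernel)) / float(size * size)
    return [[v - mean for v in row] for row in kernel]

def filter_magnitude(image, kernel):
    """Absolute filter response; pixels outside the image are treated as zero."""
    h, w = len(image), len(image[0])
    half = len(kernel) // 2
    out = [[0.0] * w for _ in range(h)]
    for i in range(h):
        for j in range(w):
            acc = 0.0
            for ky, krow in enumerate(kernel):
                yy = i + ky - half
                if not 0 <= yy < h:
                    continue
                for kx, kv in enumerate(krow):
                    xx = j + kx - half
                    if 0 <= xx < w:
                        acc += kv * image[yy][xx]
            out[i][j] = abs(acc)
    return out

def fused_response(image, wavelengths=(4.0, 8.0), n_orientations=4, size=7):
    """Sum response magnitudes over all scales and orientations (assumed fusion rule)."""
    h, w = len(image), len(image[0])
    fused = [[0.0] * w for _ in range(h)]
    for wavelength in wavelengths:
        for k in range(n_orientations):
            theta = k * math.pi / n_orientations
            kernel = gabor_kernel(size, wavelength, theta, sigma=0.5 * wavelength)
            response = filter_magnitude(image, kernel)
            for i in range(h):
                for j in range(w):
                    fused[i][j] += response[i][j]
    return fused

# Toy "document": blank background with one striped patch standing in for text.
image = [[0.0] * 24 for _ in range(24)]
for r in range(8, 16):
    for c in range(8, 16):
        image[r][c] = 1.0 if c % 4 < 2 else 0.0

fused = fused_response(image)
peak = max(max(row) for row in fused)
# Candidate text mask: pixels whose fused response exceeds half the peak (assumed threshold).
mask = [[v > 0.5 * peak for v in row] for row in fused]
```

In this toy case the fused response is zero in the blank corners and positive inside the striped patch, which is the property that lets candidate regions be read directly from the fused image. The two rejection criteria mentioned in the abstract (geometric properties and high-frequency content) are not sketched here because the abstract does not specify them.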

Key words

text extraction / Gabor filter / digital document images

Cite this article

FU Ping, LI Meng, YIN Hong-tao. Gabor Filter Based Text Extraction from Digital Document Images[J]. Acta Electronica Sinica, 2006, 34(S1): 2387-2390.
