Natural Scene Text Detection Based on Adaptive Color Clustering and Context Information
ZOU Bei-ji1,2, GUO Jian-jing1,2, ZHU Cheng-zhang1,2, YANG Wen-jun1,2, XU Zi-wen1,2
1. School of Information Science and Engineering, Central South University, Changsha, Hunan 410083, China;
2. Center for Ophthalmic Imaging Research, Central South University, Changsha, Hunan 410083, China
Abstract: Natural scene text detection is an important task in image analysis and understanding. This paper proposes a natural scene text detection method based on adaptive color clustering and context information analysis. First, by combining hierarchical clustering with a self-learning strategy, we design an adaptive color clustering method that learns its clustering weights automatically and yields high character recall. Then, since text in images usually contains several characters, we propose a character verification strategy based on image context information, which preserves high character recall while removing non-text components. Finally, characters are merged into text lines, and further post-processing is applied to generate the final text detection results. Experiments on the publicly available ICDAR2013 dataset show that the method obtains a recall of 74.17%, a precision of 83.40%, and an F-score of 78.52%. Compared with other text detection methods, our method achieves better text detection performance, indicating the superiority of the proposed method.
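As a quick consistency check on the reported figures, the sketch below recomputes the F-score from the stated precision and recall. It assumes the F-score is the balanced F1 measure (the harmonic mean of precision and recall); the abstract does not state which variant or evaluation protocol was used, and the function name `f_score` is introduced here only for illustration.

```python
def f_score(precision: float, recall: float) -> float:
    """Balanced F1 measure: harmonic mean of precision and recall.

    Assumption: the paper's F-score is the balanced F1 variant.
    """
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Figures quoted in the abstract, expressed as fractions.
reported_recall = 0.7417
reported_precision = 0.8340
reported_f = 0.7852

computed_f = f_score(reported_precision, reported_recall)
difference = abs(computed_f - reported_f)

print(f"computed F-score: {computed_f:.4f}")
print(f"difference from reported value: {difference:.4f}")
```

Under this assumption the recomputed value agrees with the reported 78.52% to within about 0.01 percentage points, a gap that is plausibly explained by rounding of the published precision and recall.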
ZOU Bei-ji, GUO Jian-jing, ZHU Cheng-zhang, YANG Wen-jun, XU Zi-wen. Natural Scene Text Detection Based on Adaptive Color Clustering and Context Information[J]. Acta Electronica Sinica, 2018, 46(6): 1436-1444.