Arabic text detection using ensemble machine learning

Abstract

The automatic detection and recognition of text zones in natural images remains indispensable because text is omnipresent in daily human life. This field has given rise to many applications, especially for English, for which many systems have been implemented and have proved their efficiency. Arabic remains a real challenge because of its cursive nature and rich vocabulary. The first step of our work is inspired by the work of Gomez and Karatzas [7] on multi-script detection using Gestalt theory. For the second step, we implemented three classifiers, namely a Neural Network (NN), a Support Vector Machine (SVM) and AdaBoost. These classifiers were deployed to classify the region groups in images as text or non-text. To improve system performance, an ensemble method based on majority voting was applied, in which the outputs of the three classifiers were fused. Experiments were conducted using our own image database and ground truth, and the empirical results illustrate that the proposed method is efficient.
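As an illustration of the fusion step, the following minimal Python sketch shows majority voting over the outputs of three binary classifiers. It is not the implementation used in this work: the function name `majority_vote`, the label encoding (1 for text, 0 for non-text) and the sample predictions are assumptions made for the example.

```python
from collections import Counter
from typing import List, Sequence


def majority_vote(predictions: Sequence[Sequence[int]]) -> List[int]:
    """Fuse per-classifier label lists into one label per region.

    predictions[i][j] is the label that classifier i assigned to region j
    (assumed encoding: 1 = text, 0 = non-text).
    """
    if not predictions:
        return []
    n_regions = len(predictions[0])
    if any(len(p) != n_regions for p in predictions):
        raise ValueError("every classifier must label the same regions")
    fused = []
    for votes in zip(*predictions):
        # Keep the label with the highest vote count. With three binary
        # voters one label always has at least two votes, so no tie occurs.
        fused.append(Counter(votes).most_common(1)[0][0])
    return fused


# Hypothetical outputs of the three classifiers on four candidate regions.
nn_labels = [1, 0, 1, 0]
svm_labels = [1, 1, 0, 0]
ada_labels = [0, 1, 1, 0]
fused_labels = majority_vote([nn_labels, svm_labels, ada_labels])
```

With an odd number of binary voters, a region is labelled as text only when at least two of the three classifiers agree, so a single misfiring classifier cannot change the fused decision.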

Keywords

Arabic text detection, Gestalt theory, neural network, AdaBoost, support vector machine

References

1. Yin X.-C., Yin, Huang and Hao H.-W., Robust text detection in natural scene images, IEEE Transactions on Pattern Analysis and Machine Intelligence, 2014, pp. 970–983.

2. Neumann and Matas, Real-time scene text localization and recognition, In CVPR 2012, 2012, pp. 3538–3545.

3. Jaderberg, Simonyan, Vedaldi and Zisserman, Synthetic data and artificial neural networks for natural scene text recognition, Proc. NIPS Workshops, 2014.

4. Halima, Karray and Alimi, Arabic Text Recognition in Video Sequences, In Proceeding of International Conference on Informatics, Cybernetics and Computer Applications, Bangalore, 2010, pp. 603–608.

5. Moradi and Mozaffari, Hybrid Approach for Farsi/Arabic Text Detection and Localisation in Video Frames. Processing, 7(2), 2013.

6. Epshtein, Ofek and Wexler, Detecting text in natural scenes with stroke width transform, In CVPR, 2010.

7. Gomez and Karatzas, Multi-script text extraction from Natural scenes, In ICDAR, 2013.

8. Matas, Chum, Urban and Pajdla, Robust wide baseline stereo from maximally stable extremal regions, In Proc. BMVC, 2002, pp. 384–393.

9. Al-Muhtaseb, Mahmoud and Qahwaji, A novel minimal Arabic script for preparing databases and benchmarks for Arabic text recognition research, Paper presented at the 8th WSEAS International Conference on Signal Processing (SIP), May 30–June 1, 2009.

10. Slimane, Ingold, Alimi M.A. and Hennebert, Duration Models for Arabic Text Recognition using Hidden Markov Models, CIMCA 2008, Vienna, Austria, 2008.

11. Slimane, Ingold, Kanoun, Alimi M.A. and Hennebert, Database and Evaluation Protocols for Arabic Printed Text Recognition, Internal Research Report, DIUF, University of Fribourg, Switzerland, 2009.

12. Wang, Some fundamental issues in ensemble methods, In World Congress on Computational Intelligence, Hong Kong, IEEE Los Alamitos, 2008, pp. 2244–2251.

13. Nikulin, McLachlan and Ng, Ensemble Approach for Classification of Imbalanced Data, Proceedings of the 22nd Australian Joint Conference on Advances in Artificial Intelligence, Springer-Verlag, 2009.

14. Gasparini, Corchs and Schettini, Recall or precision-oriented strategies for binary classification of skin pixels, Journal of Electronic Imaging 17(2) (2008), 023017.

15. Märgner, El Abed and Pechwitz, Offline Handwritten Arabic Word Recognition Using HMM – a Character Based Approach without Explicit Segmentation, In the 9th Colloque International Francophone sur l’Ecrit et le Document, CIFED 2006, Sep 18–21, 2006.

16. Jain, Mathew and Jawahar C.V., Unconstrained Scene Text and Video Text Recognition for Arabic Script, ASAR, 2017.

17. Yousfi, Berrani S.-A. and Garcia, Arabic text detection in videos using neural and boosting-based approaches: application to video indexing, In International Conference on Image Processing, Paris, France, 2014, pp. 3028–3032.

18. Zhang, Neural networks for classification: a survey, IEEE Transactions on Systems, Man, and Cybernetics, Part C 30(4) (2000), 451–462.

19. Muller K.-R., Mika, Ratsch, Tsuda and Scholkopf, An introduction to kernel-based learning algorithms, IEEE Trans. on Neural Networks 12 (2001), 181–201.

20. Desolneux, Moisan and Morel J.-M., A grouping principle and four applications, IEEE Trans. PAMI, 2003.