TY - JOUR
T1 - Offline handwritten Arabic cursive text recognition using hidden markov models and re-ranking
AU - Alkhateeb, Jawad H.
AU - Ren, Jinchang
AU - Jiang, Jianmin
AU - Al-Muhtaseb, Husni
PY - 2011/6/1
Y1 - 2011/6/1
N2 - Recognition of handwritten Arabic cursive texts is a complex task due to the similarities between letters under different writing styles. In this paper, a word-based off-line recognition system is proposed, using Hidden Markov Models (HMMs). The method employed involves three stages, namely preprocessing, feature extraction and classification. First, words from input scripts are segmented and normalized. Then, a set of intensity features are extracted from each of the segmented words, which is based on a sliding window moving across each mirrored word image. Meanwhile, structure-like features are also extracted including number of subwords and diacritical marks. Finally, these features are applied in a combined scheme for classification. Intensity features are used to train a HMM classifier, whose results are re-ranked using structure-like features for improved recognition rate. In order to validate the proposed techniques, extensive experiments were carried out using the IFN/ENIT database which contains 32,492 handwritten Arabic words. The proposed algorithm yields superior results of improved accuracy in comparison with several typical methods.
AB - Recognition of handwritten Arabic cursive texts is a complex task due to the similarities between letters under different writing styles. In this paper, a word-based off-line recognition system is proposed, using Hidden Markov Models (HMMs). The method employed involves three stages, namely preprocessing, feature extraction and classification. First, words from input scripts are segmented and normalized. Then, a set of intensity features are extracted from each of the segmented words, which is based on a sliding window moving across each mirrored word image. Meanwhile, structure-like features are also extracted including number of subwords and diacritical marks. Finally, these features are applied in a combined scheme for classification. Intensity features are used to train a HMM classifier, whose results are re-ranked using structure-like features for improved recognition rate. In order to validate the proposed techniques, extensive experiments were carried out using the IFN/ENIT database which contains 32,492 handwritten Arabic words. The proposed algorithm yields superior results of improved accuracy in comparison with several typical methods.
KW - offline
KW - handwritten
KW - Arabic
KW - cursive text
KW - recognition
KW - hidden Markov model
KW - re-ranking
UR - http://www.scopus.com/inward/record.url?scp=79953042413&partnerID=8YFLogxK
U2 - 10.1016/j.patrec.2011.02.006
DO - 10.1016/j.patrec.2011.02.006
M3 - Article
AN - SCOPUS:79953042413
SN - 0167-8655
VL - 32
SP - 1081
EP - 1088
JO - Pattern Recognition Letters
JF - Pattern Recognition Letters
IS - 8
ER -