A Novel Baseline Detection Method of Handwritten Arabic-Script Documents Based on Sub-Words

Tarik Abu-Ain, Siti Norul Huda Sheikh Abdullah, Bilal Bataineh, Khairuddin Omar, Ashraf Abu-Ein

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

8 Citations (Scopus)

Abstract

Baseline detection is an important process in document image analysis and recognition systems. It is used extensively in many preprocessing stages, such as text normalization, skew correction, character segmentation, and slant and slope correction, as well as in feature extraction. In this work, we propose a new baseline detection method for Arabic script based on the horizontal projection histogram and the direction features of the sub-word skeleton. Sub-words form the main component of the text; each consists of at least one letter, in addition to diacritics and dots. The efficiency of the proposed method has been proven by experimental results on the IFN/ENIT Arabic benchmark dataset.
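
As an illustration of the horizontal projection histogram mentioned in the abstract, the following is a minimal sketch, not the authors' method. It assumes a binarized image given as rows of 0/1 values (1 = ink) and takes the row with the most ink pixels as a rough baseline estimate. It does not use the skeleton direction features or the sub-word extraction that the paper combines with the histogram, and the function names are hypothetical.

```python
from typing import List, Sequence


def horizontal_projection(binary: Sequence[Sequence[int]]) -> List[int]:
    """Count the ink (non-zero) pixels in each row of a binary image."""
    return [sum(1 for px in row if px) for row in binary]


def estimate_baseline_row(binary: Sequence[Sequence[int]]) -> int:
    """Return the index of the row with the most ink pixels.

    Assumption: the baseline coincides with the histogram peak. Ties are
    resolved in favor of the topmost row. Raises ValueError on an empty image.
    """
    profile = horizontal_projection(binary)
    return max(range(len(profile)), key=profile.__getitem__)


# Toy 7x10 image: one long horizontal stroke with an ascender and a descender.
img = [[0] * 10 for _ in range(7)]
for col in range(1, 9):
    img[4][col] = 1          # long horizontal stroke on row 4
for row in range(1, 4):
    img[row][2] = 1          # ascender above the stroke
for row in range(5, 7):
    img[row][6] = 1          # descender below the stroke

profile = horizontal_projection(img)
baseline = estimate_baseline_row(img)
print(profile, baseline)
```

On real handwriting the peak row alone is often unreliable for short or skewed text, which is the kind of case where additional cues such as the direction features named in the abstract would matter.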

Original language: English
Title of host publication: Communications in Computer and Information Science
Publisher: Springer Verlag
Pages: 67-77
Number of pages: 11
Volume: 378 CCIS
ISBN (Print): 9783642405662
DOIs: https://doi.org/10.1007/978-3-642-40567-9_6
Publication status: Published - 2013
Event: 2nd International Multi-Conference on Artificial Intelligence Technology, M-CAIT 2013 - Shah Alam
Duration: 28 Aug 2013 to 29 Aug 2013

Publication series

Name: Communications in Computer and Information Science
Volume: 378 CCIS
ISSN (Print): 1865-0929

Other

Other: 2nd International Multi-Conference on Artificial Intelligence Technology, M-CAIT 2013
City: Shah Alam
Period: 28/8/13 to 29/8/13

Fingerprint

Image recognition
Image analysis
Feature extraction
Experiments

Keywords

  • Arabic handwriting
  • Baseline detection
  • Preprocessing
  • Sub-word extraction
  • Text normalization

ASJC Scopus subject areas

  • Computer Science(all)

Cite this

Abu-Ain, T., Sheikh Abdullah, S. N. H., Bataineh, B., Omar, K., & Abu-Ein, A. (2013). A Novel Baseline Detection Method of Handwritten Arabic-Script Documents Based on Sub-Words. In Communications in Computer and Information Science (Vol. 378 CCIS, pp. 67-77). (Communications in Computer and Information Science; Vol. 378 CCIS). Springer Verlag. https://doi.org/10.1007/978-3-642-40567-9_6

@inproceedings{56e0ef3caaae4ddeb38169ff0c8a3f9c,
title = "A Novel Baseline Detection Method of Handwritten Arabic-Script Documents Based on Sub-Words",
abstract = "Baseline detection is an important process in document image analysis and recognition systems. It is used extensively in many preprocessing stages, such as text normalization, skew correction, character segmentation, and slant and slope correction, as well as in feature extraction. In this work, we propose a new baseline detection method for Arabic script based on the horizontal projection histogram and the direction features of the sub-word skeleton. Sub-words form the main component of the text; each consists of at least one letter, in addition to diacritics and dots. The efficiency of the proposed method has been proven by experimental results on the IFN/ENIT Arabic benchmark dataset.",
keywords = "Arabic handwriting, Baseline detection, Preprocessing, Sub-word extraction, Text normalization",
author = "Tarik Abu-Ain and {Sheikh Abdullah}, {Siti Norul Huda} and Bilal Bataineh and Khairuddin Omar and Ashraf Abu-Ein",
year = "2013",
doi = "10.1007/978-3-642-40567-9_6",
language = "English",
isbn = "9783642405662",
volume = "378 CCIS",
series = "Communications in Computer and Information Science",
publisher = "Springer Verlag",
pages = "67--77",
booktitle = "Communications in Computer and Information Science",

}

TY - GEN

T1 - A Novel Baseline Detection Method of Handwritten Arabic-Script Documents Based on Sub-Words

AU - Abu-Ain, Tarik

AU - Sheikh Abdullah, Siti Norul Huda

AU - Bataineh, Bilal

AU - Omar, Khairuddin

AU - Abu-Ein, Ashraf

PY - 2013

Y1 - 2013

N2 - Baseline detection is an important process in document image analysis and recognition systems. It is used extensively in many preprocessing stages, such as text normalization, skew correction, character segmentation, and slant and slope correction, as well as in feature extraction. In this work, we propose a new baseline detection method for Arabic script based on the horizontal projection histogram and the direction features of the sub-word skeleton. Sub-words form the main component of the text; each consists of at least one letter, in addition to diacritics and dots. The efficiency of the proposed method has been proven by experimental results on the IFN/ENIT Arabic benchmark dataset.

AB - Baseline detection is an important process in document image analysis and recognition systems. It is used extensively in many preprocessing stages, such as text normalization, skew correction, character segmentation, and slant and slope correction, as well as in feature extraction. In this work, we propose a new baseline detection method for Arabic script based on the horizontal projection histogram and the direction features of the sub-word skeleton. Sub-words form the main component of the text; each consists of at least one letter, in addition to diacritics and dots. The efficiency of the proposed method has been proven by experimental results on the IFN/ENIT Arabic benchmark dataset.

KW - Arabic handwriting

KW - Baseline detection

KW - Preprocessing

KW - Sub-word extraction

KW - Text normalization

UR - http://www.scopus.com/inward/record.url?scp=84901367844&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84901367844&partnerID=8YFLogxK

U2 - 10.1007/978-3-642-40567-9_6

DO - 10.1007/978-3-642-40567-9_6

M3 - Conference contribution

AN - SCOPUS:84901367844

SN - 9783642405662

VL - 378 CCIS

T3 - Communications in Computer and Information Science

SP - 67

EP - 77

BT - Communications in Computer and Information Science

PB - Springer Verlag

ER -