Character and object recognition based on global feature extraction

Maryam Naeimizaghiani, Siti Norul Huda Sheikh Abdullah, Farshid Pirahansiah, Bilal Bataineh

Research output: Contribution to journalArticle

3 Citations (Scopus)

Abstract

Optical character recognition (OCR) is one of the most important fields in pattern recognition world which is able to recognize handwritten characters, irregular characters and machine printed characters. Optical character recognition system consists of five major tasks which are involved pre-processing, segmentation, feature extraction, classification and recognition. Generally, less discriminative features in global feature approach leads to reduce in recognition rate. By proposing a global approach that produces more discriminative features and less dimensionality of data, these problems are overcome. Two feature extraction methods are studied namely Gray Level Co-occurrence Matrix (GLCM) and edge direction matrix (EDMS) and combination of two popular feature extraction methods is proposed. The most important problem of EDMS is the number of produced features with this method which is just 18 features and is not enough for feature extraction purpose and it causes reducing the recognition rate. The aim of this research is improving the recognition rate of EDMS by combining with a global feature extraction method in order to increase the number of extracted feature and produce better recognition rate. The proposed method is a combination of GLCM and EDMS method with and without feature selection method called gain ratio and ranker search which applied to reduce the dimensionality of data. They have been tested onto four different datasets involving manually and automatically cropped licence plate and font style images amounting 3520 images from 0 to 9 and capital letter A to Z with various size and shape. Another dataset is large binary images of shapes which involve 300 images of objects. Then, gain ratio and ranker search are used to select discriminative features whereby the features reduced from 58 to 34 numbers of features. The proposed combinatory method, EDMS and GLCM methods are classified using neural network, Bayes network and decision tree. The experimental results for character recognition indicate that the proposed combinatorial method obtain better average accuracy rate about 85.99% whereas EDMS, GLCM and combination without feature selection achieved 80.19%, 38.84%, and 58.78% subsequently and the experimental results for object recognition indicates that proposed method before and after feature selection outperformed other methods such as EDMS and GLCM with 90.83% and 92.5% accuracy rate respectively with NN as the classifier. Consequently global and spatial approaches are compared in recognition of objects and characters. The experimental results show the better performance of proposed method as a global feature extraction method for object recognition purpose with 92.5% accuracy rate with NN while Robinson filter as a spatial feature extraction method outperformed the global feature extraction methods for character recognition purpose with 100% accuracy rate with NN. Also global approach obtains smaller processing time comparing to spatial approach.

Original languageEnglish
Pages (from-to)160-172
Number of pages13
JournalJournal of Theoretical and Applied Information Technology
Volume52
Issue number2
Publication statusPublished - 2013

Fingerprint

Character recognition
Character Recognition
Object recognition
Object Recognition
Feature Extraction
Feature extraction
Gray Level Co-occurrence Matrix
Optical character recognition
Feature Selection
Dimensionality
Experimental Results
Binary images
Decision trees
Processing
Pattern recognition
Binary Image
Classifiers
Matrix Method
Bayes
Decision tree

Keywords

  • Character recognition
  • Feature extraction
  • Global feature extraction
  • Image processing
  • Ocr

ASJC Scopus subject areas

  • Computer Science(all)
  • Theoretical Computer Science

Cite this

Character and object recognition based on global feature extraction. / Naeimizaghiani, Maryam; Sheikh Abdullah, Siti Norul Huda; Pirahansiah, Farshid; Bataineh, Bilal.

In: Journal of Theoretical and Applied Information Technology, Vol. 52, No. 2, 2013, p. 160-172.

Research output: Contribution to journalArticle

Naeimizaghiani, Maryam ; Sheikh Abdullah, Siti Norul Huda ; Pirahansiah, Farshid ; Bataineh, Bilal. / Character and object recognition based on global feature extraction. In: Journal of Theoretical and Applied Information Technology. 2013 ; Vol. 52, No. 2. pp. 160-172.
@article{dac3316f0a5744c4b9de290e44e5a5c8,
title = "Character and object recognition based on global feature extraction",
abstract = "Optical character recognition (OCR) is one of the most important fields in pattern recognition world which is able to recognize handwritten characters, irregular characters and machine printed characters. Optical character recognition system consists of five major tasks which are involved pre-processing, segmentation, feature extraction, classification and recognition. Generally, less discriminative features in global feature approach leads to reduce in recognition rate. By proposing a global approach that produces more discriminative features and less dimensionality of data, these problems are overcome. Two feature extraction methods are studied namely Gray Level Co-occurrence Matrix (GLCM) and edge direction matrix (EDMS) and combination of two popular feature extraction methods is proposed. The most important problem of EDMS is the number of produced features with this method which is just 18 features and is not enough for feature extraction purpose and it causes reducing the recognition rate. The aim of this research is improving the recognition rate of EDMS by combining with a global feature extraction method in order to increase the number of extracted feature and produce better recognition rate. The proposed method is a combination of GLCM and EDMS method with and without feature selection method called gain ratio and ranker search which applied to reduce the dimensionality of data. They have been tested onto four different datasets involving manually and automatically cropped licence plate and font style images amounting 3520 images from 0 to 9 and capital letter A to Z with various size and shape. Another dataset is large binary images of shapes which involve 300 images of objects. Then, gain ratio and ranker search are used to select discriminative features whereby the features reduced from 58 to 34 numbers of features. The proposed combinatory method, EDMS and GLCM methods are classified using neural network, Bayes network and decision tree. The experimental results for character recognition indicate that the proposed combinatorial method obtain better average accuracy rate about 85.99{\%} whereas EDMS, GLCM and combination without feature selection achieved 80.19{\%}, 38.84{\%}, and 58.78{\%} subsequently and the experimental results for object recognition indicates that proposed method before and after feature selection outperformed other methods such as EDMS and GLCM with 90.83{\%} and 92.5{\%} accuracy rate respectively with NN as the classifier. Consequently global and spatial approaches are compared in recognition of objects and characters. The experimental results show the better performance of proposed method as a global feature extraction method for object recognition purpose with 92.5{\%} accuracy rate with NN while Robinson filter as a spatial feature extraction method outperformed the global feature extraction methods for character recognition purpose with 100{\%} accuracy rate with NN. Also global approach obtains smaller processing time comparing to spatial approach.",
keywords = "Character recognition, Feature extraction, Global feature extraction, Image processing, Ocr",
author = "Maryam Naeimizaghiani and {Sheikh Abdullah}, {Siti Norul Huda} and Farshid Pirahansiah and Bilal Bataineh",
year = "2013",
language = "English",
volume = "52",
pages = "160--172",
journal = "Journal of Theoretical and Applied Information Technology",
issn = "1992-8645",
publisher = "Asian Research Publishing Network (ARPN)",
number = "2",

}

TY - JOUR

T1 - Character and object recognition based on global feature extraction

AU - Naeimizaghiani, Maryam

AU - Sheikh Abdullah, Siti Norul Huda

AU - Pirahansiah, Farshid

AU - Bataineh, Bilal

PY - 2013

Y1 - 2013

N2 - Optical character recognition (OCR) is one of the most important fields in pattern recognition world which is able to recognize handwritten characters, irregular characters and machine printed characters. Optical character recognition system consists of five major tasks which are involved pre-processing, segmentation, feature extraction, classification and recognition. Generally, less discriminative features in global feature approach leads to reduce in recognition rate. By proposing a global approach that produces more discriminative features and less dimensionality of data, these problems are overcome. Two feature extraction methods are studied namely Gray Level Co-occurrence Matrix (GLCM) and edge direction matrix (EDMS) and combination of two popular feature extraction methods is proposed. The most important problem of EDMS is the number of produced features with this method which is just 18 features and is not enough for feature extraction purpose and it causes reducing the recognition rate. The aim of this research is improving the recognition rate of EDMS by combining with a global feature extraction method in order to increase the number of extracted feature and produce better recognition rate. The proposed method is a combination of GLCM and EDMS method with and without feature selection method called gain ratio and ranker search which applied to reduce the dimensionality of data. They have been tested onto four different datasets involving manually and automatically cropped licence plate and font style images amounting 3520 images from 0 to 9 and capital letter A to Z with various size and shape. Another dataset is large binary images of shapes which involve 300 images of objects. Then, gain ratio and ranker search are used to select discriminative features whereby the features reduced from 58 to 34 numbers of features. The proposed combinatory method, EDMS and GLCM methods are classified using neural network, Bayes network and decision tree. The experimental results for character recognition indicate that the proposed combinatorial method obtain better average accuracy rate about 85.99% whereas EDMS, GLCM and combination without feature selection achieved 80.19%, 38.84%, and 58.78% subsequently and the experimental results for object recognition indicates that proposed method before and after feature selection outperformed other methods such as EDMS and GLCM with 90.83% and 92.5% accuracy rate respectively with NN as the classifier. Consequently global and spatial approaches are compared in recognition of objects and characters. The experimental results show the better performance of proposed method as a global feature extraction method for object recognition purpose with 92.5% accuracy rate with NN while Robinson filter as a spatial feature extraction method outperformed the global feature extraction methods for character recognition purpose with 100% accuracy rate with NN. Also global approach obtains smaller processing time comparing to spatial approach.

AB - Optical character recognition (OCR) is one of the most important fields in pattern recognition world which is able to recognize handwritten characters, irregular characters and machine printed characters. Optical character recognition system consists of five major tasks which are involved pre-processing, segmentation, feature extraction, classification and recognition. Generally, less discriminative features in global feature approach leads to reduce in recognition rate. By proposing a global approach that produces more discriminative features and less dimensionality of data, these problems are overcome. Two feature extraction methods are studied namely Gray Level Co-occurrence Matrix (GLCM) and edge direction matrix (EDMS) and combination of two popular feature extraction methods is proposed. The most important problem of EDMS is the number of produced features with this method which is just 18 features and is not enough for feature extraction purpose and it causes reducing the recognition rate. The aim of this research is improving the recognition rate of EDMS by combining with a global feature extraction method in order to increase the number of extracted feature and produce better recognition rate. The proposed method is a combination of GLCM and EDMS method with and without feature selection method called gain ratio and ranker search which applied to reduce the dimensionality of data. They have been tested onto four different datasets involving manually and automatically cropped licence plate and font style images amounting 3520 images from 0 to 9 and capital letter A to Z with various size and shape. Another dataset is large binary images of shapes which involve 300 images of objects. Then, gain ratio and ranker search are used to select discriminative features whereby the features reduced from 58 to 34 numbers of features. The proposed combinatory method, EDMS and GLCM methods are classified using neural network, Bayes network and decision tree. The experimental results for character recognition indicate that the proposed combinatorial method obtain better average accuracy rate about 85.99% whereas EDMS, GLCM and combination without feature selection achieved 80.19%, 38.84%, and 58.78% subsequently and the experimental results for object recognition indicates that proposed method before and after feature selection outperformed other methods such as EDMS and GLCM with 90.83% and 92.5% accuracy rate respectively with NN as the classifier. Consequently global and spatial approaches are compared in recognition of objects and characters. The experimental results show the better performance of proposed method as a global feature extraction method for object recognition purpose with 92.5% accuracy rate with NN while Robinson filter as a spatial feature extraction method outperformed the global feature extraction methods for character recognition purpose with 100% accuracy rate with NN. Also global approach obtains smaller processing time comparing to spatial approach.

KW - Character recognition

KW - Feature extraction

KW - Global feature extraction

KW - Image processing

KW - Ocr

UR - http://www.scopus.com/inward/record.url?scp=84879356230&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84879356230&partnerID=8YFLogxK

M3 - Article

VL - 52

SP - 160

EP - 172

JO - Journal of Theoretical and Applied Information Technology

JF - Journal of Theoretical and Applied Information Technology

SN - 1992-8645

IS - 2

ER -