Classifiers combination to Arabic MorphoSyntactic Disambiguation

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Citation (Scopus)

Abstract

Parts of speech tagging forms the important preprocessing step in many of the natural language processing applications like text summarization, question answering and information retrieval system. MorphoSyntactic disambiguation (part of speech tagging) is the process of classifying every word in a given context to its appropriate part of speech. In this paper, we first review all the supervised machine learning approaches that have been used in the part of speech tagging. Then we review all the Arabic works to compare and to confirm our need to develop an accurate and efficient Arabic MorphoSyntactic Disambiguation system. Finally we propose a classifiers combination experimental framework for Arabic part of speech tagger in which three diverse probabilistic classifiers (Hidden Markov, Maximum Entropy and Transformation Based Learning) are combined using many different combination strategies to exploit their advantages.

Original languageEnglish
Title of host publicationProceedings of the 2009 International Conference on Electrical Engineering and Informatics, ICEEI 2009
Pages163-171
Number of pages9
Volume1
DOIs
Publication statusPublished - 2009
Event2009 International Conference on Electrical Engineering and Informatics, ICEEI 2009 - Selangor
Duration: 5 Aug 20097 Aug 2009

Other

Other2009 International Conference on Electrical Engineering and Informatics, ICEEI 2009
CitySelangor
Period5/8/097/8/09

Fingerprint

Classifiers
Information retrieval systems
Learning systems
Entropy
Processing

Keywords

  • Machine learning
  • MorphoSyntactic disambiguation
  • Natural language processing

ASJC Scopus subject areas

  • Information Systems
  • Software
  • Energy Engineering and Power Technology
  • Electrical and Electronic Engineering

Cite this

Albared, M., Omar, N., & Ab Aziz, M. J. (2009). Classifiers combination to Arabic MorphoSyntactic Disambiguation. In Proceedings of the 2009 International Conference on Electrical Engineering and Informatics, ICEEI 2009 (Vol. 1, pp. 163-171). [5254797] https://doi.org/10.1109/ICEEI.2009.5254797

Classifiers combination to Arabic MorphoSyntactic Disambiguation. / Albared, Mohammed; Omar, Nazlia; Ab Aziz, Mohd Juzaiddin.

Proceedings of the 2009 International Conference on Electrical Engineering and Informatics, ICEEI 2009. Vol. 1 2009. p. 163-171 5254797.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Albared, M, Omar, N & Ab Aziz, MJ 2009, Classifiers combination to Arabic MorphoSyntactic Disambiguation. in Proceedings of the 2009 International Conference on Electrical Engineering and Informatics, ICEEI 2009. vol. 1, 5254797, pp. 163-171, 2009 International Conference on Electrical Engineering and Informatics, ICEEI 2009, Selangor, 5/8/09. https://doi.org/10.1109/ICEEI.2009.5254797
Albared M, Omar N, Ab Aziz MJ. Classifiers combination to Arabic MorphoSyntactic Disambiguation. In Proceedings of the 2009 International Conference on Electrical Engineering and Informatics, ICEEI 2009. Vol. 1. 2009. p. 163-171. 5254797 https://doi.org/10.1109/ICEEI.2009.5254797
Albared, Mohammed ; Omar, Nazlia ; Ab Aziz, Mohd Juzaiddin. / Classifiers combination to Arabic MorphoSyntactic Disambiguation. Proceedings of the 2009 International Conference on Electrical Engineering and Informatics, ICEEI 2009. Vol. 1 2009. pp. 163-171
@inproceedings{b05eb4e2e4c04fceba501cde89030e83,
title = "Classifiers combination to Arabic MorphoSyntactic Disambiguation",
abstract = "Parts of speech tagging forms the important preprocessing step in many of the natural language processing applications like text summarization, question answering and information retrieval system. MorphoSyntactic disambiguation (part of speech tagging) is the process of classifying every word in a given context to its appropriate part of speech. In this paper, we first review all the supervised machine learning approaches that have been used in the part of speech tagging. Then we review all the Arabic works to compare and to confirm our need to develop an accurate and efficient Arabic MorphoSyntactic Disambiguation system. Finally we propose a classifiers combination experimental framework for Arabic part of speech tagger in which three diverse probabilistic classifiers (Hidden Markov, Maximum Entropy and Transformation Based Learning) are combined using many different combination strategies to exploit their advantages.",
keywords = "Machine learning, MorphoSyntactic disambiguation, Natural language processing",
author = "Mohammed Albared and Nazlia Omar and {Ab Aziz}, {Mohd Juzaiddin}",
year = "2009",
doi = "10.1109/ICEEI.2009.5254797",
language = "English",
isbn = "9781424449132",
volume = "1",
pages = "163--171",
booktitle = "Proceedings of the 2009 International Conference on Electrical Engineering and Informatics, ICEEI 2009",

}

TY - GEN

T1 - Classifiers combination to Arabic MorphoSyntactic Disambiguation

AU - Albared, Mohammed

AU - Omar, Nazlia

AU - Ab Aziz, Mohd Juzaiddin

PY - 2009

Y1 - 2009

N2 - Parts of speech tagging forms the important preprocessing step in many of the natural language processing applications like text summarization, question answering and information retrieval system. MorphoSyntactic disambiguation (part of speech tagging) is the process of classifying every word in a given context to its appropriate part of speech. In this paper, we first review all the supervised machine learning approaches that have been used in the part of speech tagging. Then we review all the Arabic works to compare and to confirm our need to develop an accurate and efficient Arabic MorphoSyntactic Disambiguation system. Finally we propose a classifiers combination experimental framework for Arabic part of speech tagger in which three diverse probabilistic classifiers (Hidden Markov, Maximum Entropy and Transformation Based Learning) are combined using many different combination strategies to exploit their advantages.

AB - Parts of speech tagging forms the important preprocessing step in many of the natural language processing applications like text summarization, question answering and information retrieval system. MorphoSyntactic disambiguation (part of speech tagging) is the process of classifying every word in a given context to its appropriate part of speech. In this paper, we first review all the supervised machine learning approaches that have been used in the part of speech tagging. Then we review all the Arabic works to compare and to confirm our need to develop an accurate and efficient Arabic MorphoSyntactic Disambiguation system. Finally we propose a classifiers combination experimental framework for Arabic part of speech tagger in which three diverse probabilistic classifiers (Hidden Markov, Maximum Entropy and Transformation Based Learning) are combined using many different combination strategies to exploit their advantages.

KW - Machine learning

KW - MorphoSyntactic disambiguation

KW - Natural language processing

UR - http://www.scopus.com/inward/record.url?scp=70449686424&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=70449686424&partnerID=8YFLogxK

U2 - 10.1109/ICEEI.2009.5254797

DO - 10.1109/ICEEI.2009.5254797

M3 - Conference contribution

SN - 9781424449132

VL - 1

SP - 163

EP - 171

BT - Proceedings of the 2009 International Conference on Electrical Engineering and Informatics, ICEEI 2009

ER -