A hybrid approach to pronominal anaphora resolution in Arabic

Abdullatif Abolohom, Nazlia Omar

Research output: Contribution to journal › Article

3 Citations (Scopus)

Abstract

One of the challenges in natural language processing is determining which referents pronouns point to in the discourse. Anaphora resolution is an important task for a number of natural language processing applications, such as information extraction, question answering and text summarization. Most earlier work on anaphora resolution has addressed English and other languages, whereas Arabic has not been sufficiently studied. This study presents a hybrid approach to resolving pronominal anaphora in Arabic that combines a rule-based component with a machine learning component. Anaphors and their possible antecedents are identified in a rule-based manner that takes morphological information into account. The most probable candidate is then selected as the antecedent of the anaphor by a machine learning method based on k-Nearest Neighbor (k-NN). The study also determines appropriate features for this task and investigates their effect on anaphora resolution performance. Experiments were performed on a corpus of the Quran annotated with pronominal anaphora. The experimental results indicate that the proposed hybrid approach is reasonable and feasible for Arabic pronominal anaphora resolution.
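
The two-stage pipeline described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the `Mention` record, the 10-token window, the single distance feature, the toy training pairs and the transliterated words are all assumptions chosen for illustration. The abstract states only that candidates are collected by rules using morphological information and that the antecedent is then chosen with k-NN.

```python
from collections import namedtuple
from math import dist

# Hypothetical mention record: surface form, token position, gender, number.
Mention = namedtuple("Mention", "text pos gender number")

def candidate_antecedents(pronoun, nouns, window=10):
    """Rule-based step: keep nouns that precede the pronoun within `window`
    tokens and agree with it in gender and number (assumed rules)."""
    return [n for n in nouns
            if 0 < pronoun.pos - n.pos <= window
            and n.gender == pronoun.gender
            and n.number == pronoun.number]

def features(pronoun, cand):
    """Toy feature vector (an assumption): token distance only."""
    return (float(pronoun.pos - cand.pos),)

def knn_score(x, training, k=3):
    """Fraction of the k nearest training vectors labelled 1 (true antecedent)."""
    nearest = sorted(training, key=lambda ex: dist(x, ex[0]))[:k]
    return sum(label for _, label in nearest) / len(nearest)

def resolve(pronoun, nouns, training, k=3):
    """Machine learning step: pick the candidate with the highest k-NN score."""
    cands = candidate_antecedents(pronoun, nouns)
    if not cands:
        return None
    return max(cands, key=lambda c: knn_score(features(pronoun, c), training, k))

# Made-up training pairs: (feature vector, 1 if the pair was a true link).
training = [((1.0,), 1), ((2.0,), 1), ((3.0,), 1),
            ((6.0,), 0), ((7.0,), 0), ((8.0,), 0)]

nouns = [Mention("rajul", 1, "m", "sg"),
         Mention("bint", 3, "f", "sg"),
         Mention("walad", 5, "m", "sg")]
pronoun = Mention("hu", 7, "m", "sg")

best = resolve(pronoun, nouns, training)
print(best.text)  # walad
```

In this toy run the feminine noun is removed by the agreement rule, and k-NN prefers the nearer of the two remaining masculine candidates because short distances dominate the positive training pairs. A realistic feature set would be richer than one distance value.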

Original language: English
Pages (from-to): 764-771
Number of pages: 8
Journal: Journal of Computer Science
Volume: 11
Issue number: 5
DOI: 10.3844/jcssp.2015.764.771
Publication status: Published - 2015

Fingerprint

  • Learning systems
  • Processing
  • Experiments

Keywords

  • Anaphora resolution
  • Machine learning approach
  • Natural language processing
  • Rule-based approach

ASJC Scopus subject areas

  • Software
  • Computer Networks and Communications
  • Artificial Intelligence

Cite this

A hybrid approach to pronominal anaphora resolution in Arabic. / Abolohom, Abdullatif; Omar, Nazlia.

In: Journal of Computer Science, Vol. 11, No. 5, 2015, p. 764-771.


@article{f4a1b888a8eb4466b11a2ea6d5689f45,
title = "A hybrid approach to pronominal anaphora resolution in Arabic",
abstract = "One of the challenges in natural language processing is determining which referents pronouns point to in the discourse. Anaphora resolution is an important task for a number of natural language processing applications, such as information extraction, question answering and text summarization. Most earlier work on anaphora resolution has addressed English and other languages, whereas Arabic has not been sufficiently studied. This study presents a hybrid approach to resolving pronominal anaphora in Arabic that combines a rule-based component with a machine learning component. Anaphors and their possible antecedents are identified in a rule-based manner that takes morphological information into account. The most probable candidate is then selected as the antecedent of the anaphor by a machine learning method based on k-Nearest Neighbor (k-NN). The study also determines appropriate features for this task and investigates their effect on anaphora resolution performance. Experiments were performed on a corpus of the Quran annotated with pronominal anaphora. The experimental results indicate that the proposed hybrid approach is reasonable and feasible for Arabic pronominal anaphora resolution.",
keywords = "Anaphora resolution, Machine learning approach, Natural language processing, Rule-based approach",
author = "Abdullatif Abolohom and Nazlia Omar",
year = "2015",
doi = "10.3844/jcssp.2015.764.771",
language = "English",
volume = "11",
pages = "764--771",
journal = "Journal of Computer Science",
issn = "1549-3636",
publisher = "Science Publications",
number = "5",

}

TY - JOUR

T1 - A hybrid approach to pronominal anaphora resolution in Arabic

AU - Abolohom, Abdullatif

AU - Omar, Nazlia

PY - 2015

Y1 - 2015

AB - One of the challenges in natural language processing is determining which referents pronouns point to in the discourse. Anaphora resolution is an important task for a number of natural language processing applications, such as information extraction, question answering and text summarization. Most earlier work on anaphora resolution has addressed English and other languages, whereas Arabic has not been sufficiently studied. This study presents a hybrid approach to resolving pronominal anaphora in Arabic that combines a rule-based component with a machine learning component. Anaphors and their possible antecedents are identified in a rule-based manner that takes morphological information into account. The most probable candidate is then selected as the antecedent of the anaphor by a machine learning method based on k-Nearest Neighbor (k-NN). The study also determines appropriate features for this task and investigates their effect on anaphora resolution performance. Experiments were performed on a corpus of the Quran annotated with pronominal anaphora. The experimental results indicate that the proposed hybrid approach is reasonable and feasible for Arabic pronominal anaphora resolution.

KW - Anaphora resolution

KW - Machine learning approach

KW - Natural language processing

KW - Rule-based approach

UR - http://www.scopus.com/inward/record.url?scp=84949551391&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84949551391&partnerID=8YFLogxK

U2 - 10.3844/jcssp.2015.764.771

DO - 10.3844/jcssp.2015.764.771

M3 - Article

AN - SCOPUS:84949551391

VL - 11

SP - 764

EP - 771

JO - Journal of Computer Science

JF - Journal of Computer Science

SN - 1549-3636

IS - 5

ER -