Term extraction and hierarchy induction method based on islamic dictionary

Ammar Abdulateef Ali, Saidah Saad

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

A machine readable dictionary (MRD) is an electronic dictionary that enables query processing. One of the common processing tasks that has been widely applied is Concept Hierarchy Induction which aims at identifying concepts with its corresponding taxonomies. The existing concept hierarchy approaches for Islamic domain are using limited linguistic patterns. This study aims to propose an unsupervised concept hierarchy induction for the Islamic domain by extending the patterns and rules. In fact, Term Frequency-Inverse Document Frequency (TF-IDF) was carried out in order to identify the most frequently used concepts. Furthermore, two syntactical features were used including POS tagging and chunk parser in order to identify the tagging for each word (e.g. verb, noun, adjective, etc.) and extracting Noun Phrases (NP). Hence, the proposed extension patterns aim at utilize lexico-syntactic patterns to induce the concept hierarchy. That demonstrates the usefulness of extending patterns for the Islamic domain.

Original languageEnglish
Title of host publication2016 3rd International Conference on Information Retrieval and Knowledge Management, CAMP 2016 - Conference Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages113-117
Number of pages5
ISBN (Electronic)9781509029549
DOIs
Publication statusPublished - 4 Jan 2017
Event3rd International Conference on Information Retrieval and Knowledge Management, CAMP 2016 - Malacca, Malaysia
Duration: 23 Aug 201624 Aug 2016

Other

Other3rd International Conference on Information Retrieval and Knowledge Management, CAMP 2016
CountryMalaysia
CityMalacca
Period23/8/1624/8/16

Fingerprint

Glossaries
Query processing
Taxonomies
Syntactics
Linguistics
Processing
Induction

Keywords

  • concept hierarchy
  • lexico-syntactic patterns
  • terminology extraction

ASJC Scopus subject areas

  • Information Systems
  • Information Systems and Management

Cite this

Ali, A. A., & Saad, S. (2017). Term extraction and hierarchy induction method based on islamic dictionary. In 2016 3rd International Conference on Information Retrieval and Knowledge Management, CAMP 2016 - Conference Proceedings (pp. 113-117). [7806345] Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/INFRKM.2016.7806345

Term extraction and hierarchy induction method based on islamic dictionary. / Ali, Ammar Abdulateef; Saad, Saidah.

2016 3rd International Conference on Information Retrieval and Knowledge Management, CAMP 2016 - Conference Proceedings. Institute of Electrical and Electronics Engineers Inc., 2017. p. 113-117 7806345.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Ali, AA & Saad, S 2017, Term extraction and hierarchy induction method based on islamic dictionary. in 2016 3rd International Conference on Information Retrieval and Knowledge Management, CAMP 2016 - Conference Proceedings., 7806345, Institute of Electrical and Electronics Engineers Inc., pp. 113-117, 3rd International Conference on Information Retrieval and Knowledge Management, CAMP 2016, Malacca, Malaysia, 23/8/16. https://doi.org/10.1109/INFRKM.2016.7806345
Ali AA, Saad S. Term extraction and hierarchy induction method based on islamic dictionary. In 2016 3rd International Conference on Information Retrieval and Knowledge Management, CAMP 2016 - Conference Proceedings. Institute of Electrical and Electronics Engineers Inc. 2017. p. 113-117. 7806345 https://doi.org/10.1109/INFRKM.2016.7806345
Ali, Ammar Abdulateef ; Saad, Saidah. / Term extraction and hierarchy induction method based on islamic dictionary. 2016 3rd International Conference on Information Retrieval and Knowledge Management, CAMP 2016 - Conference Proceedings. Institute of Electrical and Electronics Engineers Inc., 2017. pp. 113-117
@inproceedings{ec1545ab955d4d689ab1c7f2f2035563,
title = "Term extraction and hierarchy induction method based on islamic dictionary",
abstract = "A machine readable dictionary (MRD) is an electronic dictionary that enables query processing. One of the common processing tasks that has been widely applied is Concept Hierarchy Induction which aims at identifying concepts with its corresponding taxonomies. The existing concept hierarchy approaches for Islamic domain are using limited linguistic patterns. This study aims to propose an unsupervised concept hierarchy induction for the Islamic domain by extending the patterns and rules. In fact, Term Frequency-Inverse Document Frequency (TF-IDF) was carried out in order to identify the most frequently used concepts. Furthermore, two syntactical features were used including POS tagging and chunk parser in order to identify the tagging for each word (e.g. verb, noun, adjective, etc.) and extracting Noun Phrases (NP). Hence, the proposed extension patterns aim at utilize lexico-syntactic patterns to induce the concept hierarchy. That demonstrates the usefulness of extending patterns for the Islamic domain.",
keywords = "concept hierarchy, lexico-syntactic patterns, terminology extraction",
author = "Ali, {Ammar Abdulateef} and Saidah Saad",
year = "2017",
month = "1",
day = "4",
doi = "10.1109/INFRKM.2016.7806345",
language = "English",
pages = "113--117",
booktitle = "2016 3rd International Conference on Information Retrieval and Knowledge Management, CAMP 2016 - Conference Proceedings",
publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

TY - GEN

T1 - Term extraction and hierarchy induction method based on islamic dictionary

AU - Ali, Ammar Abdulateef

AU - Saad, Saidah

PY - 2017/1/4

Y1 - 2017/1/4

N2 - A machine readable dictionary (MRD) is an electronic dictionary that enables query processing. One of the common processing tasks that has been widely applied is Concept Hierarchy Induction which aims at identifying concepts with its corresponding taxonomies. The existing concept hierarchy approaches for Islamic domain are using limited linguistic patterns. This study aims to propose an unsupervised concept hierarchy induction for the Islamic domain by extending the patterns and rules. In fact, Term Frequency-Inverse Document Frequency (TF-IDF) was carried out in order to identify the most frequently used concepts. Furthermore, two syntactical features were used including POS tagging and chunk parser in order to identify the tagging for each word (e.g. verb, noun, adjective, etc.) and extracting Noun Phrases (NP). Hence, the proposed extension patterns aim at utilize lexico-syntactic patterns to induce the concept hierarchy. That demonstrates the usefulness of extending patterns for the Islamic domain.

AB - A machine readable dictionary (MRD) is an electronic dictionary that enables query processing. One of the common processing tasks that has been widely applied is Concept Hierarchy Induction which aims at identifying concepts with its corresponding taxonomies. The existing concept hierarchy approaches for Islamic domain are using limited linguistic patterns. This study aims to propose an unsupervised concept hierarchy induction for the Islamic domain by extending the patterns and rules. In fact, Term Frequency-Inverse Document Frequency (TF-IDF) was carried out in order to identify the most frequently used concepts. Furthermore, two syntactical features were used including POS tagging and chunk parser in order to identify the tagging for each word (e.g. verb, noun, adjective, etc.) and extracting Noun Phrases (NP). Hence, the proposed extension patterns aim at utilize lexico-syntactic patterns to induce the concept hierarchy. That demonstrates the usefulness of extending patterns for the Islamic domain.

KW - concept hierarchy

KW - lexico-syntactic patterns

KW - terminology extraction

UR - http://www.scopus.com/inward/record.url?scp=85015959387&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85015959387&partnerID=8YFLogxK

U2 - 10.1109/INFRKM.2016.7806345

DO - 10.1109/INFRKM.2016.7806345

M3 - Conference contribution

SP - 113

EP - 117

BT - 2016 3rd International Conference on Information Retrieval and Knowledge Management, CAMP 2016 - Conference Proceedings

PB - Institute of Electrical and Electronics Engineers Inc.

ER -