Domain specific concept ontologies and text summarization as hierarchical fuzzy logic ranking indicator on Malay text corpus

Shaiful Bakhtiar Bin Rodzman, Normaly Kamal Ismail, Nurazzah Abd Rahman, Syed Ahmad Aljunid, Zulhilmi Mohamed Nor, Ahmad Yunus Mohd Noor

Research output: Contribution to journalArticle

Abstract

Ranking function is a predictive algorithm that is used to establish a simple ordering of documents according to its relevance. This step is critical because the results’ quality of a Domain Specific Information Retrieval (IR) such as Hadith Information Retrieval is fundamentally dependent of the ranking function. A Hierarchical Fuzzy Logic Controller of Mamdani-type Fuzzy Inference System has been built to define the ranking function, based on the Malay Information retrieval’s BM25 Model. The model examines three-inputs (Ontology BM25 Score, Fabrication Rate of Hadith and Shia Rate of Hadith) and four-output values of Final Ranking Score which consist of three triangular membership functions. The proposed system has outperformed the BM25 original score and the Vector Space Model (VM) on 16 queries, while the BM25 original score and Vector Space Model only yield better result in 9 and 2 queries respectively on the P@10, %no measures and MAP. P@10 represent the values of Precision at Rank 10 P@10), %no measures represent the percentage of queries with no relevant documents in the top ten retrieved and MAP represents Mean Average Precision of the queries. The results show the proposed system have capability to demote negative documents and move up the relevant documents in the ranking list and its capability to recall unseen document with the application of ontology in text retrieval. For the future works, the researcher would like to apply the usage of other Malay Semantic elements and another corpus for positive ranking indicator.

Original languageEnglish
Pages (from-to)1527-1534
Number of pages8
JournalIndonesian Journal of Electrical Engineering and Computer Science
Volume15
Issue number3
DOIs
Publication statusPublished - Sep 2019

Fingerprint

Summarization
Ranking Function
Fuzzy Logic
Fuzzy logic
Ontology
Ranking
Information retrieval
Information Retrieval
Query
Vector Space Model
Vector spaces
Type Inference
Text Retrieval
Fuzzy Inference System
Fuzzy Logic Controller
Fuzzy inference
Membership functions
Membership Function
Percentage
Triangular

Keywords

  • BM25 model
  • Fabricated and shia hadith
  • Fuzzy logic
  • Malay text corpus
  • Malay translated hadith
  • Negative ranking indicator
  • Ontology information retrieval
  • Positive ranking indicator

ASJC Scopus subject areas

  • Signal Processing
  • Information Systems
  • Hardware and Architecture
  • Computer Networks and Communications
  • Control and Optimization
  • Electrical and Electronic Engineering

Cite this

Domain specific concept ontologies and text summarization as hierarchical fuzzy logic ranking indicator on Malay text corpus. / Rodzman, Shaiful Bakhtiar Bin; Ismail, Normaly Kamal; Rahman, Nurazzah Abd; Aljunid, Syed Ahmad; Nor, Zulhilmi Mohamed; Noor, Ahmad Yunus Mohd.

In: Indonesian Journal of Electrical Engineering and Computer Science, Vol. 15, No. 3, 09.2019, p. 1527-1534.

Research output: Contribution to journalArticle

Rodzman, Shaiful Bakhtiar Bin ; Ismail, Normaly Kamal ; Rahman, Nurazzah Abd ; Aljunid, Syed Ahmad ; Nor, Zulhilmi Mohamed ; Noor, Ahmad Yunus Mohd. / Domain specific concept ontologies and text summarization as hierarchical fuzzy logic ranking indicator on Malay text corpus. In: Indonesian Journal of Electrical Engineering and Computer Science. 2019 ; Vol. 15, No. 3. pp. 1527-1534.
@article{5b990eae823b466395c5578ed8ff6c92,
title = "Domain specific concept ontologies and text summarization as hierarchical fuzzy logic ranking indicator on Malay text corpus",
abstract = "Ranking function is a predictive algorithm that is used to establish a simple ordering of documents according to its relevance. This step is critical because the results’ quality of a Domain Specific Information Retrieval (IR) such as Hadith Information Retrieval is fundamentally dependent of the ranking function. A Hierarchical Fuzzy Logic Controller of Mamdani-type Fuzzy Inference System has been built to define the ranking function, based on the Malay Information retrieval’s BM25 Model. The model examines three-inputs (Ontology BM25 Score, Fabrication Rate of Hadith and Shia Rate of Hadith) and four-output values of Final Ranking Score which consist of three triangular membership functions. The proposed system has outperformed the BM25 original score and the Vector Space Model (VM) on 16 queries, while the BM25 original score and Vector Space Model only yield better result in 9 and 2 queries respectively on the P@10, {\%}no measures and MAP. P@10 represent the values of Precision at Rank 10 P@10), {\%}no measures represent the percentage of queries with no relevant documents in the top ten retrieved and MAP represents Mean Average Precision of the queries. The results show the proposed system have capability to demote negative documents and move up the relevant documents in the ranking list and its capability to recall unseen document with the application of ontology in text retrieval. For the future works, the researcher would like to apply the usage of other Malay Semantic elements and another corpus for positive ranking indicator.",
keywords = "BM25 model, Fabricated and shia hadith, Fuzzy logic, Malay text corpus, Malay translated hadith, Negative ranking indicator, Ontology information retrieval, Positive ranking indicator",
author = "Rodzman, {Shaiful Bakhtiar Bin} and Ismail, {Normaly Kamal} and Rahman, {Nurazzah Abd} and Aljunid, {Syed Ahmad} and Nor, {Zulhilmi Mohamed} and Noor, {Ahmad Yunus Mohd}",
year = "2019",
month = "9",
doi = "10.11591/ijeecs.v15.i3.pp1527-1534",
language = "English",
volume = "15",
pages = "1527--1534",
journal = "Indonesian Journal of Electrical Engineering and Computer Science",
issn = "2502-4752",
publisher = "Institute of Advanced Engineering and Science (IAES)",
number = "3",

}

TY - JOUR

T1 - Domain specific concept ontologies and text summarization as hierarchical fuzzy logic ranking indicator on Malay text corpus

AU - Rodzman, Shaiful Bakhtiar Bin

AU - Ismail, Normaly Kamal

AU - Rahman, Nurazzah Abd

AU - Aljunid, Syed Ahmad

AU - Nor, Zulhilmi Mohamed

AU - Noor, Ahmad Yunus Mohd

PY - 2019/9

Y1 - 2019/9

N2 - Ranking function is a predictive algorithm that is used to establish a simple ordering of documents according to its relevance. This step is critical because the results’ quality of a Domain Specific Information Retrieval (IR) such as Hadith Information Retrieval is fundamentally dependent of the ranking function. A Hierarchical Fuzzy Logic Controller of Mamdani-type Fuzzy Inference System has been built to define the ranking function, based on the Malay Information retrieval’s BM25 Model. The model examines three-inputs (Ontology BM25 Score, Fabrication Rate of Hadith and Shia Rate of Hadith) and four-output values of Final Ranking Score which consist of three triangular membership functions. The proposed system has outperformed the BM25 original score and the Vector Space Model (VM) on 16 queries, while the BM25 original score and Vector Space Model only yield better result in 9 and 2 queries respectively on the P@10, %no measures and MAP. P@10 represent the values of Precision at Rank 10 P@10), %no measures represent the percentage of queries with no relevant documents in the top ten retrieved and MAP represents Mean Average Precision of the queries. The results show the proposed system have capability to demote negative documents and move up the relevant documents in the ranking list and its capability to recall unseen document with the application of ontology in text retrieval. For the future works, the researcher would like to apply the usage of other Malay Semantic elements and another corpus for positive ranking indicator.

AB - Ranking function is a predictive algorithm that is used to establish a simple ordering of documents according to its relevance. This step is critical because the results’ quality of a Domain Specific Information Retrieval (IR) such as Hadith Information Retrieval is fundamentally dependent of the ranking function. A Hierarchical Fuzzy Logic Controller of Mamdani-type Fuzzy Inference System has been built to define the ranking function, based on the Malay Information retrieval’s BM25 Model. The model examines three-inputs (Ontology BM25 Score, Fabrication Rate of Hadith and Shia Rate of Hadith) and four-output values of Final Ranking Score which consist of three triangular membership functions. The proposed system has outperformed the BM25 original score and the Vector Space Model (VM) on 16 queries, while the BM25 original score and Vector Space Model only yield better result in 9 and 2 queries respectively on the P@10, %no measures and MAP. P@10 represent the values of Precision at Rank 10 P@10), %no measures represent the percentage of queries with no relevant documents in the top ten retrieved and MAP represents Mean Average Precision of the queries. The results show the proposed system have capability to demote negative documents and move up the relevant documents in the ranking list and its capability to recall unseen document with the application of ontology in text retrieval. For the future works, the researcher would like to apply the usage of other Malay Semantic elements and another corpus for positive ranking indicator.

KW - BM25 model

KW - Fabricated and shia hadith

KW - Fuzzy logic

KW - Malay text corpus

KW - Malay translated hadith

KW - Negative ranking indicator

KW - Ontology information retrieval

KW - Positive ranking indicator

UR - http://www.scopus.com/inward/record.url?scp=85073546860&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85073546860&partnerID=8YFLogxK

U2 - 10.11591/ijeecs.v15.i3.pp1527-1534

DO - 10.11591/ijeecs.v15.i3.pp1527-1534

M3 - Article

AN - SCOPUS:85073546860

VL - 15

SP - 1527

EP - 1534

JO - Indonesian Journal of Electrical Engineering and Computer Science

JF - Indonesian Journal of Electrical Engineering and Computer Science

SN - 2502-4752

IS - 3

ER -