The architecture of information extraction for ontology population in contractor selection

Rosmayati Mohemad, Abdul Razak Hamdan, Zulaiha Ali Othamn, Noor Maizura Mohamad Noor

Research output: Contribution to journalArticle

Abstract

The enormous amount of unstructured data presents the biggest challenge to decision makers in eliciting meaningful information to support business decision-making. This study explores the potential use of ontologies in extracting and populating the information from various combinations of unstructured and semi-structured data formats such as tabular, form-based and natural language-based text. The main objective of this study is to propose an architecture of information extraction for ontology population. Contractor selection is chosen as the domain of interest. Thus, this research focuses on the extraction of contractor profiles from tender documents in order to enrich ontological contractor profile by populating the relevant extracted information. The findings are significantly good in precision and recall, in which the performance measures have reached an accuracy of 100% precision and recall for extracting information in both tabular and form-based formats. However, the precision score of relevant information extracted in natural language text is average with a percentage of 42.86% due to the limitation of the linguistic approach for processing Malay texts.

Original languageEnglish
Pages (from-to)57-67
Number of pages11
JournalJurnal Teknologi
Volume78
Issue number9-3
DOIs
Publication statusPublished - 2016

Fingerprint

Contractors
Ontology
Text processing
Linguistics
Decision making
Industry

Keywords

  • Contractor selection
  • Decision-making
  • Information extraction
  • Ontology population

ASJC Scopus subject areas

  • Engineering(all)

Cite this

The architecture of information extraction for ontology population in contractor selection. / Mohemad, Rosmayati; Hamdan, Abdul Razak; Othamn, Zulaiha Ali; Mohamad Noor, Noor Maizura.

In: Jurnal Teknologi, Vol. 78, No. 9-3, 2016, p. 57-67.

Research output: Contribution to journalArticle

Mohemad, Rosmayati ; Hamdan, Abdul Razak ; Othamn, Zulaiha Ali ; Mohamad Noor, Noor Maizura. / The architecture of information extraction for ontology population in contractor selection. In: Jurnal Teknologi. 2016 ; Vol. 78, No. 9-3. pp. 57-67.
@article{519ed426481c4d32a13a873e72b0aeff,
title = "The architecture of information extraction for ontology population in contractor selection",
abstract = "The enormous amount of unstructured data presents the biggest challenge to decision makers in eliciting meaningful information to support business decision-making. This study explores the potential use of ontologies in extracting and populating the information from various combinations of unstructured and semi-structured data formats such as tabular, form-based and natural language-based text. The main objective of this study is to propose an architecture of information extraction for ontology population. Contractor selection is chosen as the domain of interest. Thus, this research focuses on the extraction of contractor profiles from tender documents in order to enrich ontological contractor profile by populating the relevant extracted information. The findings are significantly good in precision and recall, in which the performance measures have reached an accuracy of 100{\%} precision and recall for extracting information in both tabular and form-based formats. However, the precision score of relevant information extracted in natural language text is average with a percentage of 42.86{\%} due to the limitation of the linguistic approach for processing Malay texts.",
keywords = "Contractor selection, Decision-making, Information extraction, Ontology population",
author = "Rosmayati Mohemad and Hamdan, {Abdul Razak} and Othamn, {Zulaiha Ali} and {Mohamad Noor}, {Noor Maizura}",
year = "2016",
doi = "10.11113/jt.v78.9719",
language = "English",
volume = "78",
pages = "57--67",
journal = "Jurnal Teknologi",
issn = "0127-9696",
publisher = "Penerbit Universiti Teknologi Malaysia",
number = "9-3",

}

TY - JOUR

T1 - The architecture of information extraction for ontology population in contractor selection

AU - Mohemad, Rosmayati

AU - Hamdan, Abdul Razak

AU - Othamn, Zulaiha Ali

AU - Mohamad Noor, Noor Maizura

PY - 2016

Y1 - 2016

N2 - The enormous amount of unstructured data presents the biggest challenge to decision makers in eliciting meaningful information to support business decision-making. This study explores the potential use of ontologies in extracting and populating the information from various combinations of unstructured and semi-structured data formats such as tabular, form-based and natural language-based text. The main objective of this study is to propose an architecture of information extraction for ontology population. Contractor selection is chosen as the domain of interest. Thus, this research focuses on the extraction of contractor profiles from tender documents in order to enrich ontological contractor profile by populating the relevant extracted information. The findings are significantly good in precision and recall, in which the performance measures have reached an accuracy of 100% precision and recall for extracting information in both tabular and form-based formats. However, the precision score of relevant information extracted in natural language text is average with a percentage of 42.86% due to the limitation of the linguistic approach for processing Malay texts.

AB - The enormous amount of unstructured data presents the biggest challenge to decision makers in eliciting meaningful information to support business decision-making. This study explores the potential use of ontologies in extracting and populating the information from various combinations of unstructured and semi-structured data formats such as tabular, form-based and natural language-based text. The main objective of this study is to propose an architecture of information extraction for ontology population. Contractor selection is chosen as the domain of interest. Thus, this research focuses on the extraction of contractor profiles from tender documents in order to enrich ontological contractor profile by populating the relevant extracted information. The findings are significantly good in precision and recall, in which the performance measures have reached an accuracy of 100% precision and recall for extracting information in both tabular and form-based formats. However, the precision score of relevant information extracted in natural language text is average with a percentage of 42.86% due to the limitation of the linguistic approach for processing Malay texts.

KW - Contractor selection

KW - Decision-making

KW - Information extraction

KW - Ontology population

UR - http://www.scopus.com/inward/record.url?scp=84991720084&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84991720084&partnerID=8YFLogxK

U2 - 10.11113/jt.v78.9719

DO - 10.11113/jt.v78.9719

M3 - Article

VL - 78

SP - 57

EP - 67

JO - Jurnal Teknologi

JF - Jurnal Teknologi

SN - 0127-9696

IS - 9-3

ER -