Skolem preprocessing using WordNet and lexicon in building effective knowledge representation

Kasturi Dewi Varathan, Tengku Mohd Tengku Sembok, Abdul Kadir Rabiah, Nazlia Omar

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Citation (Scopus)

Abstract

We are in the information intensive environment in which various forms of digital contents have been growing exponentially. In this era of digital data, knowledge representation has been considered as a crucial component of any information retrieval system. It is also considered as a major problem especially in representing the content of unstructured text in an effective way. Although the mission remains impossible to achieve 100% accuracy, many researchers are indulging themselves in documenting these data in many different techniques so that it can be communicated effectively and easily. Indexing is an important element that determines the success of retrieval. Since we are dealing with multiple documents, preprocessing of data is needed before the data gets indexed. Thus, this paper presents an approach on the preprocessing technique. The semantic data which have been represented in skolem clauses will be preprocessed with the help of automatic lexicon generator output and WordNet. This preprocessing plays an important role in getting rid of redundant data before it gets indexed into the semantic matrix. Besides redundancy, it also helps in dealing with common problem that exists in indexing multiple documents in which similar sentences with more or less the same meaning but have been constructed by using different sets of words. As a conclusion, the integration of WordNet and lexicon leads to better result in terms of building effective knowledge representation.

Original languageEnglish
Title of host publicationProceedings of the 2011 International Conference on Electrical Engineering and Informatics, ICEEI 2011
DOIs
Publication statusPublished - 2011
Event2011 International Conference on Electrical Engineering and Informatics, ICEEI 2011 - Bandung
Duration: 17 Jul 201119 Jul 2011

Other

Other2011 International Conference on Electrical Engineering and Informatics, ICEEI 2011
CityBandung
Period17/7/1119/7/11

Fingerprint

Knowledge representation
Semantics
Information retrieval systems
Redundancy

Keywords

  • indexing
  • lexicon
  • preprocessing
  • skolem

ASJC Scopus subject areas

  • Information Systems
  • Electrical and Electronic Engineering

Cite this

Varathan, K. D., Sembok, T. M. T., Rabiah, A. K., & Omar, N. (2011). Skolem preprocessing using WordNet and lexicon in building effective knowledge representation. In Proceedings of the 2011 International Conference on Electrical Engineering and Informatics, ICEEI 2011 [6021760] https://doi.org/10.1109/ICEEI.2011.6021760

Skolem preprocessing using WordNet and lexicon in building effective knowledge representation. / Varathan, Kasturi Dewi; Sembok, Tengku Mohd Tengku; Rabiah, Abdul Kadir; Omar, Nazlia.

Proceedings of the 2011 International Conference on Electrical Engineering and Informatics, ICEEI 2011. 2011. 6021760.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Varathan, KD, Sembok, TMT, Rabiah, AK & Omar, N 2011, Skolem preprocessing using WordNet and lexicon in building effective knowledge representation. in Proceedings of the 2011 International Conference on Electrical Engineering and Informatics, ICEEI 2011., 6021760, 2011 International Conference on Electrical Engineering and Informatics, ICEEI 2011, Bandung, 17/7/11. https://doi.org/10.1109/ICEEI.2011.6021760
Varathan KD, Sembok TMT, Rabiah AK, Omar N. Skolem preprocessing using WordNet and lexicon in building effective knowledge representation. In Proceedings of the 2011 International Conference on Electrical Engineering and Informatics, ICEEI 2011. 2011. 6021760 https://doi.org/10.1109/ICEEI.2011.6021760
Varathan, Kasturi Dewi ; Sembok, Tengku Mohd Tengku ; Rabiah, Abdul Kadir ; Omar, Nazlia. / Skolem preprocessing using WordNet and lexicon in building effective knowledge representation. Proceedings of the 2011 International Conference on Electrical Engineering and Informatics, ICEEI 2011. 2011.
@inproceedings{e47fff804b5546d891639aa5de4b264c,
title = "Skolem preprocessing using WordNet and lexicon in building effective knowledge representation",
abstract = "We are in the information intensive environment in which various forms of digital contents have been growing exponentially. In this era of digital data, knowledge representation has been considered as a crucial component of any information retrieval system. It is also considered as a major problem especially in representing the content of unstructured text in an effective way. Although the mission remains impossible to achieve 100{\%} accuracy, many researchers are indulging themselves in documenting these data in many different techniques so that it can be communicated effectively and easily. Indexing is an important element that determines the success of retrieval. Since we are dealing with multiple documents, preprocessing of data is needed before the data gets indexed. Thus, this paper presents an approach on the preprocessing technique. The semantic data which have been represented in skolem clauses will be preprocessed with the help of automatic lexicon generator output and WordNet. This preprocessing plays an important role in getting rid of redundant data before it gets indexed into the semantic matrix. Besides redundancy, it also helps in dealing with common problem that exists in indexing multiple documents in which similar sentences with more or less the same meaning but have been constructed by using different sets of words. As a conclusion, the integration of WordNet and lexicon leads to better result in terms of building effective knowledge representation.",
keywords = "indexing, lexicon, preprocessing, skolem",
author = "Varathan, {Kasturi Dewi} and Sembok, {Tengku Mohd Tengku} and Rabiah, {Abdul Kadir} and Nazlia Omar",
year = "2011",
doi = "10.1109/ICEEI.2011.6021760",
language = "English",
isbn = "9781457707520",
booktitle = "Proceedings of the 2011 International Conference on Electrical Engineering and Informatics, ICEEI 2011",

}

TY - GEN

T1 - Skolem preprocessing using WordNet and lexicon in building effective knowledge representation

AU - Varathan, Kasturi Dewi

AU - Sembok, Tengku Mohd Tengku

AU - Rabiah, Abdul Kadir

AU - Omar, Nazlia

PY - 2011

Y1 - 2011

N2 - We are in the information intensive environment in which various forms of digital contents have been growing exponentially. In this era of digital data, knowledge representation has been considered as a crucial component of any information retrieval system. It is also considered as a major problem especially in representing the content of unstructured text in an effective way. Although the mission remains impossible to achieve 100% accuracy, many researchers are indulging themselves in documenting these data in many different techniques so that it can be communicated effectively and easily. Indexing is an important element that determines the success of retrieval. Since we are dealing with multiple documents, preprocessing of data is needed before the data gets indexed. Thus, this paper presents an approach on the preprocessing technique. The semantic data which have been represented in skolem clauses will be preprocessed with the help of automatic lexicon generator output and WordNet. This preprocessing plays an important role in getting rid of redundant data before it gets indexed into the semantic matrix. Besides redundancy, it also helps in dealing with common problem that exists in indexing multiple documents in which similar sentences with more or less the same meaning but have been constructed by using different sets of words. As a conclusion, the integration of WordNet and lexicon leads to better result in terms of building effective knowledge representation.

AB - We are in the information intensive environment in which various forms of digital contents have been growing exponentially. In this era of digital data, knowledge representation has been considered as a crucial component of any information retrieval system. It is also considered as a major problem especially in representing the content of unstructured text in an effective way. Although the mission remains impossible to achieve 100% accuracy, many researchers are indulging themselves in documenting these data in many different techniques so that it can be communicated effectively and easily. Indexing is an important element that determines the success of retrieval. Since we are dealing with multiple documents, preprocessing of data is needed before the data gets indexed. Thus, this paper presents an approach on the preprocessing technique. The semantic data which have been represented in skolem clauses will be preprocessed with the help of automatic lexicon generator output and WordNet. This preprocessing plays an important role in getting rid of redundant data before it gets indexed into the semantic matrix. Besides redundancy, it also helps in dealing with common problem that exists in indexing multiple documents in which similar sentences with more or less the same meaning but have been constructed by using different sets of words. As a conclusion, the integration of WordNet and lexicon leads to better result in terms of building effective knowledge representation.

KW - indexing

KW - lexicon

KW - preprocessing

KW - skolem

UR - http://www.scopus.com/inward/record.url?scp=80054051889&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=80054051889&partnerID=8YFLogxK

U2 - 10.1109/ICEEI.2011.6021760

DO - 10.1109/ICEEI.2011.6021760

M3 - Conference contribution

AN - SCOPUS:80054051889

SN - 9781457707520

BT - Proceedings of the 2011 International Conference on Electrical Engineering and Informatics, ICEEI 2011

ER -