Query translation architecture for Malay-English cross-language information retrieval system

Nurjannaton Hidayah Rais, Muhamad Taufik Abdullah, Abdul Kadir Rabiah

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

5 Citations (Scopus)

Abstract

This paper discusses research on query translation in a Malay-English Cross-Language Information Retrieval (CLIR) system. We assume that improving query translation accuracy improves information retrieval performance. A dictionary-based CLIR system faces three main problems: translation ambiguity, compound and phrase handling, and proper name translation. Natural language processing (NLP) techniques such as stemming and Part-of-Speech (POS) tagging are useful in the query translation process. In addition, the n-gram matching technique has been successfully applied in information retrieval (IR) systems for phrase and proper name translation. The proposed query translation architecture, which consists of stemming, POS tagging and n-gram matching, is useful in CLIR systems as well as in search engine applications.
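The abstract names the three stages of the architecture but gives no implementation detail. The following is a minimal sketch of how such a pipeline could fit together, not the authors' implementation. The toy dictionary, the affix lists, the English term list, the use of character bigrams with the Dice coefficient, and the 0.5 threshold are all assumptions chosen for illustration. POS tags are assumed to be supplied by an external tagger.

```python
from typing import Dict, List, Set, Tuple

# Hypothetical toy resources (assumptions, not from the paper).
# Each Malay stem maps POS tags to an English translation.
DICTIONARY: Dict[str, Dict[str, str]] = {
    "baca": {"VERB": "read"},
    "buku": {"NOUN": "book"},
    "jalan": {"NOUN": "road", "VERB": "walk"},  # ambiguous without a POS tag
}
ENGLISH_TERMS: List[str] = ["malaysia", "singapore", "indonesia"]
PREFIXES: Tuple[str, ...] = ("mem", "men", "me", "ber", "di")
SUFFIXES: Tuple[str, ...] = ("kan", "an", "i")


def stem(word: str) -> str:
    """Crude affix stripping: accept a stripped form only if the dictionary knows it."""
    if word in DICTIONARY:
        return word
    for prefix in ("",) + PREFIXES:
        for suffix in ("",) + SUFFIXES:
            if word.startswith(prefix) and word.endswith(suffix):
                core = word[len(prefix):len(word) - len(suffix)]
                if core in DICTIONARY:
                    return core
    return word  # leave unknown words unchanged


def char_ngrams(text: str, n: int = 2) -> Set[str]:
    """Set of character n-grams of a string."""
    return {text[i:i + n] for i in range(len(text) - n + 1)}


def dice(a: str, b: str, n: int = 2) -> float:
    """Dice coefficient over character n-gram sets."""
    grams_a, grams_b = char_ngrams(a, n), char_ngrams(b, n)
    if not grams_a or not grams_b:
        return 0.0
    return 2 * len(grams_a & grams_b) / (len(grams_a) + len(grams_b))


def ngram_match(word: str, threshold: float = 0.5) -> str:
    """Map an out-of-dictionary word (e.g. a proper name) to its closest English term."""
    best = max(ENGLISH_TERMS, key=lambda term: dice(word, term))
    return best if dice(word, best) >= threshold else word


def translate_query(tagged_query: List[Tuple[str, str]]) -> List[str]:
    """Translate (word, POS tag) pairs: stem, look up by POS, fall back to n-gram matching."""
    translated = []
    for word, tag in tagged_query:
        root = stem(word.lower())
        senses = DICTIONARY.get(root)
        if senses is None:
            translated.append(ngram_match(root))
        else:
            # Use the POS tag to pick a sense; otherwise take the first listed sense.
            translated.append(senses.get(tag, next(iter(senses.values()))))
    return translated


print(translate_query([("membaca", "VERB"), ("buku", "NOUN"), ("Singapura", "NOUN")]))
```

In this sketch the POS tag resolves the ambiguity of "jalan" (road or walk), while "Singapura", which is absent from the dictionary, is matched to "singapore" through shared character bigrams.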

Original language: English
Title of host publication: Proceedings 2010 International Symposium on Information Technology - Engineering Technology, ITSim'10
Pages: 990-993
Number of pages: 4
Volume: 2
DOIs: https://doi.org/10.1109/ITSIM.2010.5561589
Publication status: Published - 2010
Externally published: Yes
Event: 2010 International Symposium on Information Technology, ITSim'10 - Kuala Lumpur
Duration: 15 Jun 2010 to 17 Jun 2010

Other

Other: 2010 International Symposium on Information Technology, ITSim'10
City: Kuala Lumpur
Period: 15/6/10 to 17/6/10

Fingerprint

Information retrieval systems
Glossaries
Search engines
Information retrieval
Processing

Keywords

  • Component
  • Cross-Language Information Retrieval
  • N-gram matching
  • Part-of-Speech (POS) tagging
  • Query ambiguity
  • Query translation

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Information Systems

Cite this

Rais, N. H., Abdullah, M. T., & Rabiah, A. K. (2010). Query translation architecture for Malay-English cross-language information retrieval system. In Proceedings 2010 International Symposium on Information Technology - Engineering Technology, ITSim'10 (Vol. 2, pp. 990-993). [5561589] https://doi.org/10.1109/ITSIM.2010.5561589

@inproceedings{4f0ed641b71f4a5281d3e0f9f0db8885,
title = "Query translation architecture for Malay-English cross-language information retrieval system",
abstract = "This paper discusses research on query translation in a Malay-English Cross-Language Information Retrieval (CLIR) system. We assume that improving query translation accuracy improves information retrieval performance. A dictionary-based CLIR system faces three main problems: translation ambiguity, compound and phrase handling, and proper name translation. Natural language processing (NLP) techniques such as stemming and Part-of-Speech (POS) tagging are useful in the query translation process. In addition, the n-gram matching technique has been successfully applied in information retrieval (IR) systems for phrase and proper name translation. The proposed query translation architecture, which consists of stemming, POS tagging and n-gram matching, is useful in CLIR systems as well as in search engine applications.",
keywords = "Component, Cross-Language Information Retrieval, N-gram matching, Part-of-Speech (POS) tagging, Query ambiguity, Query translation",
author = "Rais, {Nurjannaton Hidayah} and Abdullah, {Muhamad Taufik} and Rabiah, {Abdul Kadir}",
year = "2010",
doi = "10.1109/ITSIM.2010.5561589",
language = "English",
isbn = "9781424467181",
volume = "2",
pages = "990--993",
booktitle = "Proceedings 2010 International Symposium on Information Technology - Engineering Technology, ITSim'10",

}

TY - GEN

T1 - Query translation architecture for Malay-English cross-language information retrieval system

AU - Rais, Nurjannaton Hidayah

AU - Abdullah, Muhamad Taufik

AU - Rabiah, Abdul Kadir

PY - 2010

Y1 - 2010

N2 - This paper discusses research on query translation in a Malay-English Cross-Language Information Retrieval (CLIR) system. We assume that improving query translation accuracy improves information retrieval performance. A dictionary-based CLIR system faces three main problems: translation ambiguity, compound and phrase handling, and proper name translation. Natural language processing (NLP) techniques such as stemming and Part-of-Speech (POS) tagging are useful in the query translation process. In addition, the n-gram matching technique has been successfully applied in information retrieval (IR) systems for phrase and proper name translation. The proposed query translation architecture, which consists of stemming, POS tagging and n-gram matching, is useful in CLIR systems as well as in search engine applications.

KW - Component

KW - Cross-Language Information Retrieval

KW - N-gram matching

KW - Part-of-Speech (POS) tagging

KW - Query ambiguity

KW - Query translation

UR - http://www.scopus.com/inward/record.url?scp=78049356127&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=78049356127&partnerID=8YFLogxK

U2 - 10.1109/ITSIM.2010.5561589

DO - 10.1109/ITSIM.2010.5561589

M3 - Conference contribution

SN - 9781424467181

VL - 2

SP - 990

EP - 993

BT - Proceedings 2010 International Symposium on Information Technology - Engineering Technology, ITSim'10

ER -