Phonetic coding methods for Malay names retrieval

Nor Syahidah Abdul Mutalib, Shahrul Azman Mohd Noah

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Abstract

Searching for person names is very common among users of information systems and search engines, so the effectiveness, accuracy and appropriateness of search results are strongly emphasized. Information retrieval (IR) methods strongly influence search results. Efforts have been made to improve IR methods because names are not unique and have many spelling variants, which causes errors when retrieving the intended names. Phonetic searching is said to be a suitable way to address this problem because names follow few spelling standards. A phonetic method recognizes and retrieves words that have the same pronunciation. The main aim of this paper is to test the effectiveness of phonetic coding methods for Malay name retrieval using Soundex and modified-Asoundex (Asoundex is an Arabic Soundex). The experimental approach consists of two stages: program development and testing on Malay name data sets. The programs were developed from the existing algorithms to generate name codes. The codes generated by the programs are compared with the data in the test sets. Effectiveness is determined by comparing the output with the results obtained from both phonetic approaches. Evaluation is based on the precision and recall measures. The contribution of the research is a comparison of the accuracy of Malay name retrieval using the Soundex and modified-Asoundex coding methods. Results show an average improvement of 38.38% in precision.
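To make the idea of phonetic coding and its evaluation concrete, the following is a minimal sketch in Python. It implements the commonly documented American Soundex rules, not the modified-Asoundex variant studied in the paper, whose rules are not given in this abstract. The name list, the query and the relevance judgements are hypothetical and serve only to illustrate how precision and recall are computed.

```python
# Letter-to-digit table of standard American Soundex (an assumption:
# the paper's own programs and its modified-Asoundex table are not shown here).
CODES = {}
for letters, digit in (("BFPV", "1"), ("CGJKQSXZ", "2"), ("DT", "3"),
                       ("L", "4"), ("MN", "5"), ("R", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(name: str) -> str:
    """Return the 4-character Soundex code of a name ('' if it has no letters)."""
    letters = [c for c in name.upper() if "A" <= c <= "Z"]
    if not letters:
        return ""
    code = letters[0]                 # the first letter is kept as-is
    prev = CODES.get(letters[0], "")  # its digit still suppresses a repeat
    for ch in letters[1:]:
        if ch in "HW":
            continue                  # H and W do not separate equal codes
        digit = CODES.get(ch, "")     # vowels and Y map to ""
        if digit and digit != prev:
            code += digit
        prev = digit                  # a vowel resets the previous digit
    return (code + "000")[:4]         # pad with zeros, truncate to 4


def precision_recall(retrieved, relevant):
    """Precision and recall of a retrieved set against a relevant set."""
    retrieved, relevant = set(retrieved), set(relevant)
    hits = len(retrieved & relevant)
    precision = hits / len(retrieved) if retrieved else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall


# Hypothetical data: a tiny name list and hand-made relevance judgements.
names = ["Mohamad", "Muhammad", "Mohd", "Mahmud", "Aminah"]
query = "Muhamad"
relevant = {"Mohamad", "Muhammad", "Mohd"}

# Retrieve every name whose code equals the code of the query.
retrieved = [n for n in names if soundex(n) == soundex(query)]
precision, recall = precision_recall(retrieved, relevant)
print(retrieved, round(precision, 2), round(recall, 2))
```

In this toy run the spelling variants "Mohamad" and "Muhammad" share the query's code, the abbreviation "Mohd" is missed, and the distinct name "Mahmud" is retrieved by mistake. That is the kind of precision loss that motivates language-specific modifications of the coding table.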

Original language: English
Title of host publication: 2011 International Conference on Semantic Technology and Information Retrieval, STAIR 2011
Pages: 125-129
Number of pages: 5
DOI: https://doi.org/10.1109/STAIR.2011.5995776
Publication status: Published - 2011
Event: 2011 International Conference on Semantic Technology and Information Retrieval, STAIR 2011 - Putrajaya
Duration: 28 Jun 2011 to 29 Jun 2011

Fingerprint

Speech analysis
Information retrieval
Search engines
Information systems
Testing

Keywords

  • Asoundex
  • name retrieval
  • phonetic
  • Soundex

ASJC Scopus subject areas

  • Computational Theory and Mathematics
  • Information Systems

Cite this

Mutalib, N. S. A., & Mohd Noah, S. A. (2011). Phonetic coding methods for Malay names retrieval. In 2011 International Conference on Semantic Technology and Information Retrieval, STAIR 2011 (pp. 125-129). [5995776] https://doi.org/10.1109/STAIR.2011.5995776

@inproceedings{d089be76d24b41ae9960db026c40b06c,
title = "Phonetic coding methods for Malay names retrieval",
keywords = "Asoundex, name retrieval, phonetic, Soundex",
author = "Mutalib, {Nor Syahidah Abdul} and {Mohd Noah}, {Shahrul Azman}",
year = "2011",
doi = "10.1109/STAIR.2011.5995776",
language = "English",
isbn = "9781612843537",
pages = "125--129",
booktitle = "2011 International Conference on Semantic Technology and Information Retrieval, STAIR 2011",

}
