Part-of-Speech for Old Malay Manuscript Corpus: A Review

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2 Citations (Scopus)

Abstract

Research in Malay Part-of-Speech (POS) has increased considerably in the past few years. From the literature, POS are known as the first stage in automated text analysis and the development of language technologies can scarcely begun without this initial phase. Malay language can be written in Roman or Jawi. Three different spelling between Roman and Jawi make this study essential. In this paper, we highlighted the problem and issues related to Malay language, POS general framework, POS approaches and techniques. POS at basis was introduced to get information from Old Malay Manuscripts that contain important information in various spheres of knowledge. Promising result for the auto-tagging of Malay written in Jawi is expected.

Original languageEnglish
Title of host publicationCommunications in Computer and Information Science
PublisherSpringer Verlag
Pages53-66
Number of pages14
Volume378 CCIS
ISBN (Print)9783642405662
DOIs
Publication statusPublished - 2013
Event2nd International Multi-Conference on Artificial Intelligence Technology, M-CAIT 2013 - Shah Alam
Duration: 28 Aug 201329 Aug 2013

Publication series

NameCommunications in Computer and Information Science
Volume378 CCIS
ISSN (Print)18650929

Other

Other2nd International Multi-Conference on Artificial Intelligence Technology, M-CAIT 2013
CityShah Alam
Period28/8/1329/8/13

Keywords

  • Jawi
  • malay language
  • Part-of-speech tagging
  • tagging framework

ASJC Scopus subject areas

  • Computer Science(all)

Cite this

Abu Bakar, J., Omar, K., Nasrudin, M. F., & Murah, M. Z. (2013). Part-of-Speech for Old Malay Manuscript Corpus: A Review. In Communications in Computer and Information Science (Vol. 378 CCIS, pp. 53-66). (Communications in Computer and Information Science; Vol. 378 CCIS). Springer Verlag. https://doi.org/10.1007/978-3-642-40567-9_5

Part-of-Speech for Old Malay Manuscript Corpus : A Review. / Abu Bakar, Juhaida; Omar, Khairuddin; Nasrudin, Mohammad Faidzul; Murah, Mohd. Zamri.

Communications in Computer and Information Science. Vol. 378 CCIS Springer Verlag, 2013. p. 53-66 (Communications in Computer and Information Science; Vol. 378 CCIS).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abu Bakar, J, Omar, K, Nasrudin, MF & Murah, MZ 2013, Part-of-Speech for Old Malay Manuscript Corpus: A Review. in Communications in Computer and Information Science. vol. 378 CCIS, Communications in Computer and Information Science, vol. 378 CCIS, Springer Verlag, pp. 53-66, 2nd International Multi-Conference on Artificial Intelligence Technology, M-CAIT 2013, Shah Alam, 28/8/13. https://doi.org/10.1007/978-3-642-40567-9_5
Abu Bakar J, Omar K, Nasrudin MF, Murah MZ. Part-of-Speech for Old Malay Manuscript Corpus: A Review. In Communications in Computer and Information Science. Vol. 378 CCIS. Springer Verlag. 2013. p. 53-66. (Communications in Computer and Information Science). https://doi.org/10.1007/978-3-642-40567-9_5
Abu Bakar, Juhaida ; Omar, Khairuddin ; Nasrudin, Mohammad Faidzul ; Murah, Mohd. Zamri. / Part-of-Speech for Old Malay Manuscript Corpus : A Review. Communications in Computer and Information Science. Vol. 378 CCIS Springer Verlag, 2013. pp. 53-66 (Communications in Computer and Information Science).
@inproceedings{00333e1aa5b84c6ba69daddb173416c5,
title = "Part-of-Speech for Old Malay Manuscript Corpus: A Review",
abstract = "Research in Malay Part-of-Speech (POS) has increased considerably in the past few years. From the literature, POS are known as the first stage in automated text analysis and the development of language technologies can scarcely begun without this initial phase. Malay language can be written in Roman or Jawi. Three different spelling between Roman and Jawi make this study essential. In this paper, we highlighted the problem and issues related to Malay language, POS general framework, POS approaches and techniques. POS at basis was introduced to get information from Old Malay Manuscripts that contain important information in various spheres of knowledge. Promising result for the auto-tagging of Malay written in Jawi is expected.",
keywords = "Jawi, malay language, Part-of-speech tagging, tagging framework",
author = "{Abu Bakar}, Juhaida and Khairuddin Omar and Nasrudin, {Mohammad Faidzul} and Murah, {Mohd. Zamri}",
year = "2013",
doi = "10.1007/978-3-642-40567-9_5",
language = "English",
isbn = "9783642405662",
volume = "378 CCIS",
series = "Communications in Computer and Information Science",
publisher = "Springer Verlag",
pages = "53--66",
booktitle = "Communications in Computer and Information Science",

}

TY - GEN

T1 - Part-of-Speech for Old Malay Manuscript Corpus

T2 - A Review

AU - Abu Bakar, Juhaida

AU - Omar, Khairuddin

AU - Nasrudin, Mohammad Faidzul

AU - Murah, Mohd. Zamri

PY - 2013

Y1 - 2013

N2 - Research in Malay Part-of-Speech (POS) has increased considerably in the past few years. From the literature, POS are known as the first stage in automated text analysis and the development of language technologies can scarcely begun without this initial phase. Malay language can be written in Roman or Jawi. Three different spelling between Roman and Jawi make this study essential. In this paper, we highlighted the problem and issues related to Malay language, POS general framework, POS approaches and techniques. POS at basis was introduced to get information from Old Malay Manuscripts that contain important information in various spheres of knowledge. Promising result for the auto-tagging of Malay written in Jawi is expected.

AB - Research in Malay Part-of-Speech (POS) has increased considerably in the past few years. From the literature, POS are known as the first stage in automated text analysis and the development of language technologies can scarcely begun without this initial phase. Malay language can be written in Roman or Jawi. Three different spelling between Roman and Jawi make this study essential. In this paper, we highlighted the problem and issues related to Malay language, POS general framework, POS approaches and techniques. POS at basis was introduced to get information from Old Malay Manuscripts that contain important information in various spheres of knowledge. Promising result for the auto-tagging of Malay written in Jawi is expected.

KW - Jawi

KW - malay language

KW - Part-of-speech tagging

KW - tagging framework

UR - http://www.scopus.com/inward/record.url?scp=84904687289&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84904687289&partnerID=8YFLogxK

U2 - 10.1007/978-3-642-40567-9_5

DO - 10.1007/978-3-642-40567-9_5

M3 - Conference contribution

AN - SCOPUS:84904687289

SN - 9783642405662

VL - 378 CCIS

T3 - Communications in Computer and Information Science

SP - 53

EP - 66

BT - Communications in Computer and Information Science

PB - Springer Verlag

ER -