Experiments with a stemming algorithm for Malay words

Fatimah Ahmad, Mohammed Yusoff, Tengku M T Sembok

    Research output: Contribution to journalArticle

    45 Citations (Scopus)

    Abstract

    Stemming is used in information retrieval systems to reduce variant word forms to common roots in order to improve retrieval effectiveness. As in other languages, there is a need for an effective stemming algorithm for the indexing and retrieval of Malay documents. The Malay stemming algorithm developed by Othman is studied and new versions proposed to enhance its performance. The improvements relate to the order in which the dictionary is looked-up, the order in which the morphological rules are applied, and the number of rules.

    Original languageEnglish
    Pages (from-to)909-918
    Number of pages10
    JournalJournal of the American Society for Information Science
    Volume47
    Issue number12
    Publication statusPublished - Dec 1996

    Fingerprint

    Information retrieval systems
    experiment
    Glossaries
    indexing
    information retrieval
    dictionary
    Experiments
    language
    performance
    Experiment
    Indexing
    Information retrieval
    Language

    ASJC Scopus subject areas

    • Engineering(all)

    Cite this

    Experiments with a stemming algorithm for Malay words. / Ahmad, Fatimah; Yusoff, Mohammed; Sembok, Tengku M T.

    In: Journal of the American Society for Information Science, Vol. 47, No. 12, 12.1996, p. 909-918.

    Research output: Contribution to journalArticle

    Ahmad, F, Yusoff, M & Sembok, TMT 1996, 'Experiments with a stemming algorithm for Malay words', Journal of the American Society for Information Science, vol. 47, no. 12, pp. 909-918.
    Ahmad, Fatimah ; Yusoff, Mohammed ; Sembok, Tengku M T. / Experiments with a stemming algorithm for Malay words. In: Journal of the American Society for Information Science. 1996 ; Vol. 47, No. 12. pp. 909-918.
    @article{da5f03344bce486294eaf5da551c1b63,
    title = "Experiments with a stemming algorithm for Malay words",
    abstract = "Stemming is used in information retrieval systems to reduce variant word forms to common roots in order to improve retrieval effectiveness. As in other languages, there is a need for an effective stemming algorithm for the indexing and retrieval of Malay documents. The Malay stemming algorithm developed by Othman is studied and new versions proposed to enhance its performance. The improvements relate to the order in which the dictionary is looked-up, the order in which the morphological rules are applied, and the number of rules.",
    author = "Fatimah Ahmad and Mohammed Yusoff and Sembok, {Tengku M T}",
    year = "1996",
    month = "12",
    language = "English",
    volume = "47",
    pages = "909--918",
    journal = "Journal of the Association for Information Science and Technology",
    issn = "2330-1635",
    publisher = "John Wiley and Sons Ltd",
    number = "12",

    }

    TY - JOUR

    T1 - Experiments with a stemming algorithm for Malay words

    AU - Ahmad, Fatimah

    AU - Yusoff, Mohammed

    AU - Sembok, Tengku M T

    PY - 1996/12

    Y1 - 1996/12

    N2 - Stemming is used in information retrieval systems to reduce variant word forms to common roots in order to improve retrieval effectiveness. As in other languages, there is a need for an effective stemming algorithm for the indexing and retrieval of Malay documents. The Malay stemming algorithm developed by Othman is studied and new versions proposed to enhance its performance. The improvements relate to the order in which the dictionary is looked-up, the order in which the morphological rules are applied, and the number of rules.

    AB - Stemming is used in information retrieval systems to reduce variant word forms to common roots in order to improve retrieval effectiveness. As in other languages, there is a need for an effective stemming algorithm for the indexing and retrieval of Malay documents. The Malay stemming algorithm developed by Othman is studied and new versions proposed to enhance its performance. The improvements relate to the order in which the dictionary is looked-up, the order in which the morphological rules are applied, and the number of rules.

    UR - http://www.scopus.com/inward/record.url?scp=0030378256&partnerID=8YFLogxK

    UR - http://www.scopus.com/inward/citedby.url?scp=0030378256&partnerID=8YFLogxK

    M3 - Article

    AN - SCOPUS:0030378256

    VL - 47

    SP - 909

    EP - 918

    JO - Journal of the Association for Information Science and Technology

    JF - Journal of the Association for Information Science and Technology

    SN - 2330-1635

    IS - 12

    ER -