A rule-based Arabic stemming algorithm

Tengku Mohd T Sembok, Belal Mustafa Abu Ata, Zainab Abu Bakar

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    6 Citations (Scopus)

    Abstract

    Stemming is used in information retrieval systems to reduce variant word forms to common roots in order to improve retrieval effectiveness. As in other languages, there is a need for an effective stemming algorithm for the indexing and retrieval of Arabic documents. The Arabic stemming algorithm developed by Al-Omari is studied and new versions proposed to enhance its performance. The improvements relate to the order in which the dictionary is looked-up and the order in which the morphological rules are applied.

    Original languageEnglish
    Title of host publicationProceedings of the European Computing Conference, ECC '11
    Pages392-397
    Number of pages6
    Publication statusPublished - 2011
    EventEuropean Computing Conference, ECC '11 - Paris
    Duration: 28 Apr 201130 Apr 2011

    Other

    OtherEuropean Computing Conference, ECC '11
    CityParis
    Period28/4/1130/4/11

    Fingerprint

    Retrieval
    Common root
    Information retrieval systems
    Glossaries
    Indexing
    Information Retrieval
    Language
    Dictionary
    Form

    Keywords

    • Arabic morphology
    • Information retrieval
    • Stemming

    ASJC Scopus subject areas

    • Computational Theory and Mathematics
    • Theoretical Computer Science

    Cite this

    Sembok, T. M. T., Abu Ata, B. M., & Bakar, Z. A. (2011). A rule-based Arabic stemming algorithm. In Proceedings of the European Computing Conference, ECC '11 (pp. 392-397)

    A rule-based Arabic stemming algorithm. / Sembok, Tengku Mohd T; Abu Ata, Belal Mustafa; Bakar, Zainab Abu.

    Proceedings of the European Computing Conference, ECC '11. 2011. p. 392-397.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    Sembok, TMT, Abu Ata, BM & Bakar, ZA 2011, A rule-based Arabic stemming algorithm. in Proceedings of the European Computing Conference, ECC '11. pp. 392-397, European Computing Conference, ECC '11, Paris, 28/4/11.
    Sembok TMT, Abu Ata BM, Bakar ZA. A rule-based Arabic stemming algorithm. In Proceedings of the European Computing Conference, ECC '11. 2011. p. 392-397
    Sembok, Tengku Mohd T ; Abu Ata, Belal Mustafa ; Bakar, Zainab Abu. / A rule-based Arabic stemming algorithm. Proceedings of the European Computing Conference, ECC '11. 2011. pp. 392-397
    @inproceedings{7ab1ff78b11f40249274e03cb1a4de6d,
    title = "A rule-based Arabic stemming algorithm",
    abstract = "Stemming is used in information retrieval systems to reduce variant word forms to common roots in order to improve retrieval effectiveness. As in other languages, there is a need for an effective stemming algorithm for the indexing and retrieval of Arabic documents. The Arabic stemming algorithm developed by Al-Omari is studied and new versions proposed to enhance its performance. The improvements relate to the order in which the dictionary is looked-up and the order in which the morphological rules are applied.",
    keywords = "Arabic morphology, Information retrieval, Stemming",
    author = "Sembok, {Tengku Mohd T} and {Abu Ata}, {Belal Mustafa} and Bakar, {Zainab Abu}",
    year = "2011",
    language = "English",
    isbn = "9789604742974",
    pages = "392--397",
    booktitle = "Proceedings of the European Computing Conference, ECC '11",

    }

    TY - GEN

    T1 - A rule-based Arabic stemming algorithm

    AU - Sembok, Tengku Mohd T

    AU - Abu Ata, Belal Mustafa

    AU - Bakar, Zainab Abu

    PY - 2011

    Y1 - 2011

    N2 - Stemming is used in information retrieval systems to reduce variant word forms to common roots in order to improve retrieval effectiveness. As in other languages, there is a need for an effective stemming algorithm for the indexing and retrieval of Arabic documents. The Arabic stemming algorithm developed by Al-Omari is studied and new versions proposed to enhance its performance. The improvements relate to the order in which the dictionary is looked-up and the order in which the morphological rules are applied.

    AB - Stemming is used in information retrieval systems to reduce variant word forms to common roots in order to improve retrieval effectiveness. As in other languages, there is a need for an effective stemming algorithm for the indexing and retrieval of Arabic documents. The Arabic stemming algorithm developed by Al-Omari is studied and new versions proposed to enhance its performance. The improvements relate to the order in which the dictionary is looked-up and the order in which the morphological rules are applied.

    KW - Arabic morphology

    KW - Information retrieval

    KW - Stemming

    UR - http://www.scopus.com/inward/record.url?scp=80053164019&partnerID=8YFLogxK

    UR - http://www.scopus.com/inward/citedby.url?scp=80053164019&partnerID=8YFLogxK

    M3 - Conference contribution

    SN - 9789604742974

    SP - 392

    EP - 397

    BT - Proceedings of the European Computing Conference, ECC '11

    ER -