Character strings to natural language processing in information retrieval

Tengku Mohd Tengku Sembok

    Research output: Contribution to journalArticle

    Abstract

    The levels-of-processing theory proposes that there are many ways to process and code information. The level of processing adopted will determine the quality of the representation used to store the information in the computer memory or storage. The levels-of-processing applied in information retrieval can be classified as follows: string processing, morphological processing, syntactic processing and semantic processing. These level-of-processing are imbedded into various models of information retrieval. Conventional information retrieval models, such as Boolean and vector space models rely on an extensive use of keywords, as independent strings, and their frequencies in storing and retrieving information. Thus string processing and morphological processing are mainly adopted in these models. It is believed that such an approach has reached its upper limit of retrieval effectiveness, and therefore, new approaches should be investigated for the development of future systems that will be more effective. With current advances in programming languages and techniques, natural language processing and understanding, and generally in the fields of artificial intelligence and cognitive science, there are now attempts made to include knowledge representation and linguistic processing into information retrieval systems. We also focus our research on the application of certain techniques on specific languages. Besides English, we focus the application of certain techniques especially on Malay. In this paper we will highlight some of the research done in the area of information retrieval at the various levels of processing, and also expound the current research we are doing and the future direction that we would like to undertake.

    Original languageEnglish
    Pages (from-to)26-33
    Number of pages8
    JournalLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
    Volume2911
    Publication statusPublished - 2003

    Fingerprint

    Natural Language Processing
    Information Storage and Retrieval
    Information retrieval
    Information Retrieval
    Natural Language
    Strings
    Processing
    Research
    Programming Languages
    Space Simulation
    Cognitive Science
    Artificial Intelligence
    Linguistics
    Semantics
    Information Systems
    Language
    Character
    Information retrieval systems
    Vector Space Model
    Knowledge representation

    ASJC Scopus subject areas

    • Computer Science(all)
    • Biochemistry, Genetics and Molecular Biology(all)
    • Theoretical Computer Science
    • Engineering(all)

    Cite this

    @article{5373368744ec4640be47bf3099bfe44e,
    title = "Character strings to natural language processing in information retrieval",
    abstract = "The levels-of-processing theory proposes that there are many ways to process and code information. The level of processing adopted will determine the quality of the representation used to store the information in the computer memory or storage. The levels-of-processing applied in information retrieval can be classified as follows: string processing, morphological processing, syntactic processing and semantic processing. These level-of-processing are imbedded into various models of information retrieval. Conventional information retrieval models, such as Boolean and vector space models rely on an extensive use of keywords, as independent strings, and their frequencies in storing and retrieving information. Thus string processing and morphological processing are mainly adopted in these models. It is believed that such an approach has reached its upper limit of retrieval effectiveness, and therefore, new approaches should be investigated for the development of future systems that will be more effective. With current advances in programming languages and techniques, natural language processing and understanding, and generally in the fields of artificial intelligence and cognitive science, there are now attempts made to include knowledge representation and linguistic processing into information retrieval systems. We also focus our research on the application of certain techniques on specific languages. Besides English, we focus the application of certain techniques especially on Malay. In this paper we will highlight some of the research done in the area of information retrieval at the various levels of processing, and also expound the current research we are doing and the future direction that we would like to undertake.",
    author = "Sembok, {Tengku Mohd Tengku}",
    year = "2003",
    language = "English",
    volume = "2911",
    pages = "26--33",
    journal = "Lecture Notes in Computer Science",
    issn = "0302-9743",
    publisher = "Springer Verlag",

    }

    TY - JOUR

    T1 - Character strings to natural language processing in information retrieval

    AU - Sembok, Tengku Mohd Tengku

    PY - 2003

    Y1 - 2003

    N2 - The levels-of-processing theory proposes that there are many ways to process and code information. The level of processing adopted will determine the quality of the representation used to store the information in the computer memory or storage. The levels-of-processing applied in information retrieval can be classified as follows: string processing, morphological processing, syntactic processing and semantic processing. These level-of-processing are imbedded into various models of information retrieval. Conventional information retrieval models, such as Boolean and vector space models rely on an extensive use of keywords, as independent strings, and their frequencies in storing and retrieving information. Thus string processing and morphological processing are mainly adopted in these models. It is believed that such an approach has reached its upper limit of retrieval effectiveness, and therefore, new approaches should be investigated for the development of future systems that will be more effective. With current advances in programming languages and techniques, natural language processing and understanding, and generally in the fields of artificial intelligence and cognitive science, there are now attempts made to include knowledge representation and linguistic processing into information retrieval systems. We also focus our research on the application of certain techniques on specific languages. Besides English, we focus the application of certain techniques especially on Malay. In this paper we will highlight some of the research done in the area of information retrieval at the various levels of processing, and also expound the current research we are doing and the future direction that we would like to undertake.

    AB - The levels-of-processing theory proposes that there are many ways to process and code information. The level of processing adopted will determine the quality of the representation used to store the information in the computer memory or storage. The levels-of-processing applied in information retrieval can be classified as follows: string processing, morphological processing, syntactic processing and semantic processing. These level-of-processing are imbedded into various models of information retrieval. Conventional information retrieval models, such as Boolean and vector space models rely on an extensive use of keywords, as independent strings, and their frequencies in storing and retrieving information. Thus string processing and morphological processing are mainly adopted in these models. It is believed that such an approach has reached its upper limit of retrieval effectiveness, and therefore, new approaches should be investigated for the development of future systems that will be more effective. With current advances in programming languages and techniques, natural language processing and understanding, and generally in the fields of artificial intelligence and cognitive science, there are now attempts made to include knowledge representation and linguistic processing into information retrieval systems. We also focus our research on the application of certain techniques on specific languages. Besides English, we focus the application of certain techniques especially on Malay. In this paper we will highlight some of the research done in the area of information retrieval at the various levels of processing, and also expound the current research we are doing and the future direction that we would like to undertake.

    UR - http://www.scopus.com/inward/record.url?scp=0348216529&partnerID=8YFLogxK

    UR - http://www.scopus.com/inward/citedby.url?scp=0348216529&partnerID=8YFLogxK

    M3 - Article

    AN - SCOPUS:0348216529

    VL - 2911

    SP - 26

    EP - 33

    JO - Lecture Notes in Computer Science

    JF - Lecture Notes in Computer Science

    SN - 0302-9743

    ER -