Going beyond the surrounding text to semantically annotate and search digital images

Shahrul Azman Mohd Noah, Datul Aida Ali, Arifah Che Alhadi, Junaidah Mohamed Kassim

Research output: Chapter in Book/Report/Conference proceedingConference contribution

10 Citations (Scopus)

Abstract

Digital objects such as images and videos are fundamental resources in digital library. To assist in retrieving such objects usually they are being tagged by some keywords or sentences. The popular approach to tag digital objects is based on associated text. However, relying on associated text alone such as the surrounding text unable to semantically describe such objects. This paper discusses the use of WordNet and ConceptNet to tag digital images beyond terms available in the surrounding text. WordNet is used to disambiguate concepts or terms from the associated text and ConceptNet is meant to infer topics or common-sense knowledge from summarizing the text that describe the images. However, relying on WordNet alone is not sufficed particularly when it comes to disambiguate specific or domain dependent concepts. As such the Name Entity Recognition (NER) technique is required to annotate important entities such as name of a person, location and organization. Our work focused on on-lines news images that are richly described with textual description.

Original languageEnglish
Title of host publicationLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Pages169-179
Number of pages11
Volume5990 LNAI
EditionPART 1
DOIs
Publication statusPublished - 2010
Event2010 Asian Conference on Intelligent Information and Database Systems, ACIIDS 2010 - Hue City
Duration: 24 Mar 201026 Mar 2010

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
NumberPART 1
Volume5990 LNAI
ISSN (Print)03029743
ISSN (Electronic)16113349

Other

Other2010 Asian Conference on Intelligent Information and Database Systems, ACIIDS 2010
CityHue City
Period24/3/1026/3/10

Fingerprint

Digital libraries
Digital Image
WordNet
Digital Libraries
Term
Text
Person
Resources
Object
Dependent
Line

Keywords

  • information retrieval
  • natural language processing
  • ontology
  • semantic annotation

ASJC Scopus subject areas

  • Computer Science(all)
  • Theoretical Computer Science

Cite this

Mohd Noah, S. A., Ali, D. A., Alhadi, A. C., & Mohamed Kassim, J. (2010). Going beyond the surrounding text to semantically annotate and search digital images. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (PART 1 ed., Vol. 5990 LNAI, pp. 169-179). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 5990 LNAI, No. PART 1). https://doi.org/10.1007/978-3-642-12145-6_18

Going beyond the surrounding text to semantically annotate and search digital images. / Mohd Noah, Shahrul Azman; Ali, Datul Aida; Alhadi, Arifah Che; Mohamed Kassim, Junaidah.

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 5990 LNAI PART 1. ed. 2010. p. 169-179 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 5990 LNAI, No. PART 1).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Mohd Noah, SA, Ali, DA, Alhadi, AC & Mohamed Kassim, J 2010, Going beyond the surrounding text to semantically annotate and search digital images. in Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). PART 1 edn, vol. 5990 LNAI, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), no. PART 1, vol. 5990 LNAI, pp. 169-179, 2010 Asian Conference on Intelligent Information and Database Systems, ACIIDS 2010, Hue City, 24/3/10. https://doi.org/10.1007/978-3-642-12145-6_18
Mohd Noah SA, Ali DA, Alhadi AC, Mohamed Kassim J. Going beyond the surrounding text to semantically annotate and search digital images. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). PART 1 ed. Vol. 5990 LNAI. 2010. p. 169-179. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); PART 1). https://doi.org/10.1007/978-3-642-12145-6_18
Mohd Noah, Shahrul Azman ; Ali, Datul Aida ; Alhadi, Arifah Che ; Mohamed Kassim, Junaidah. / Going beyond the surrounding text to semantically annotate and search digital images. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 5990 LNAI PART 1. ed. 2010. pp. 169-179 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); PART 1).
@inproceedings{ed2fb0b3b0a7425f96fa4d386ee3e47e,
title = "Going beyond the surrounding text to semantically annotate and search digital images",
abstract = "Digital objects such as images and videos are fundamental resources in digital library. To assist in retrieving such objects usually they are being tagged by some keywords or sentences. The popular approach to tag digital objects is based on associated text. However, relying on associated text alone such as the surrounding text unable to semantically describe such objects. This paper discusses the use of WordNet and ConceptNet to tag digital images beyond terms available in the surrounding text. WordNet is used to disambiguate concepts or terms from the associated text and ConceptNet is meant to infer topics or common-sense knowledge from summarizing the text that describe the images. However, relying on WordNet alone is not sufficed particularly when it comes to disambiguate specific or domain dependent concepts. As such the Name Entity Recognition (NER) technique is required to annotate important entities such as name of a person, location and organization. Our work focused on on-lines news images that are richly described with textual description.",
keywords = "information retrieval, natural language processing, ontology, semantic annotation",
author = "{Mohd Noah}, {Shahrul Azman} and Ali, {Datul Aida} and Alhadi, {Arifah Che} and {Mohamed Kassim}, Junaidah",
year = "2010",
doi = "10.1007/978-3-642-12145-6_18",
language = "English",
isbn = "3642121446",
volume = "5990 LNAI",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
number = "PART 1",
pages = "169--179",
booktitle = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
edition = "PART 1",

}

TY - GEN

T1 - Going beyond the surrounding text to semantically annotate and search digital images

AU - Mohd Noah, Shahrul Azman

AU - Ali, Datul Aida

AU - Alhadi, Arifah Che

AU - Mohamed Kassim, Junaidah

PY - 2010

Y1 - 2010

N2 - Digital objects such as images and videos are fundamental resources in digital library. To assist in retrieving such objects usually they are being tagged by some keywords or sentences. The popular approach to tag digital objects is based on associated text. However, relying on associated text alone such as the surrounding text unable to semantically describe such objects. This paper discusses the use of WordNet and ConceptNet to tag digital images beyond terms available in the surrounding text. WordNet is used to disambiguate concepts or terms from the associated text and ConceptNet is meant to infer topics or common-sense knowledge from summarizing the text that describe the images. However, relying on WordNet alone is not sufficed particularly when it comes to disambiguate specific or domain dependent concepts. As such the Name Entity Recognition (NER) technique is required to annotate important entities such as name of a person, location and organization. Our work focused on on-lines news images that are richly described with textual description.

AB - Digital objects such as images and videos are fundamental resources in digital library. To assist in retrieving such objects usually they are being tagged by some keywords or sentences. The popular approach to tag digital objects is based on associated text. However, relying on associated text alone such as the surrounding text unable to semantically describe such objects. This paper discusses the use of WordNet and ConceptNet to tag digital images beyond terms available in the surrounding text. WordNet is used to disambiguate concepts or terms from the associated text and ConceptNet is meant to infer topics or common-sense knowledge from summarizing the text that describe the images. However, relying on WordNet alone is not sufficed particularly when it comes to disambiguate specific or domain dependent concepts. As such the Name Entity Recognition (NER) technique is required to annotate important entities such as name of a person, location and organization. Our work focused on on-lines news images that are richly described with textual description.

KW - information retrieval

KW - natural language processing

KW - ontology

KW - semantic annotation

UR - http://www.scopus.com/inward/record.url?scp=77957009444&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=77957009444&partnerID=8YFLogxK

U2 - 10.1007/978-3-642-12145-6_18

DO - 10.1007/978-3-642-12145-6_18

M3 - Conference contribution

AN - SCOPUS:77957009444

SN - 3642121446

SN - 9783642121449

VL - 5990 LNAI

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 169

EP - 179

BT - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

ER -