Webpage segmentation for extracting images and their surrounding contextual information

Wan Fariza Paizi@Fauzi, Jer Lang Hong, Mohammed Belkhatir

Research output: Chapter in Book/Report/Conference proceedingConference contribution

20 Citations (Scopus)

Abstract

Web images come in hand with valuable contextual information. Although this information has long been mined for various uses such as image annotation, clustering of images, inference of image semantic content, etc., insufficient attention has been given to address issues in mining this contextual information. In this paper, we propose a webpage segmentation algorithm targeting the extraction of web images and their contextual information based on their characteristics as they appear on webpages. We conducted a user study to obtain a human-labeled dataset to validate the effectiveness of our method and experiments demonstrated that our method can achieve better results compared to an existing segmentation algorithm.

Original languageEnglish
Title of host publicationMM'09 - Proceedings of the 2009 ACM Multimedia Conference, with Co-located Workshops and Symposiums
Pages649-652
Number of pages4
DOIs
Publication statusPublished - 28 Dec 2009
Externally publishedYes
Event17th ACM International Conference on Multimedia, MM'09, with Co-located Workshops and Symposiums - Beijing, China
Duration: 19 Oct 200924 Oct 2009

Other

Other17th ACM International Conference on Multimedia, MM'09, with Co-located Workshops and Symposiums
CountryChina
CityBeijing
Period19/10/0924/10/09

Fingerprint

Semantics
Experiments

Keywords

  • Dom tree-based algorithm
  • Surrounding information
  • Webpage segmentation
  • WWW images

ASJC Scopus subject areas

  • Computational Theory and Mathematics
  • Computer Science Applications
  • Computer Vision and Pattern Recognition
  • Software

Cite this

Paizi@Fauzi, W. F., Hong, J. L., & Belkhatir, M. (2009). Webpage segmentation for extracting images and their surrounding contextual information. In MM'09 - Proceedings of the 2009 ACM Multimedia Conference, with Co-located Workshops and Symposiums (pp. 649-652) https://doi.org/10.1145/1631272.1631379

Webpage segmentation for extracting images and their surrounding contextual information. / Paizi@Fauzi, Wan Fariza; Hong, Jer Lang; Belkhatir, Mohammed.

MM'09 - Proceedings of the 2009 ACM Multimedia Conference, with Co-located Workshops and Symposiums. 2009. p. 649-652.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Paizi@Fauzi, WF, Hong, JL & Belkhatir, M 2009, Webpage segmentation for extracting images and their surrounding contextual information. in MM'09 - Proceedings of the 2009 ACM Multimedia Conference, with Co-located Workshops and Symposiums. pp. 649-652, 17th ACM International Conference on Multimedia, MM'09, with Co-located Workshops and Symposiums, Beijing, China, 19/10/09. https://doi.org/10.1145/1631272.1631379
Paizi@Fauzi WF, Hong JL, Belkhatir M. Webpage segmentation for extracting images and their surrounding contextual information. In MM'09 - Proceedings of the 2009 ACM Multimedia Conference, with Co-located Workshops and Symposiums. 2009. p. 649-652 https://doi.org/10.1145/1631272.1631379
Paizi@Fauzi, Wan Fariza ; Hong, Jer Lang ; Belkhatir, Mohammed. / Webpage segmentation for extracting images and their surrounding contextual information. MM'09 - Proceedings of the 2009 ACM Multimedia Conference, with Co-located Workshops and Symposiums. 2009. pp. 649-652
@inproceedings{0206ae4888234516b15030c877e478cb,
title = "Webpage segmentation for extracting images and their surrounding contextual information",
abstract = "Web images come in hand with valuable contextual information. Although this information has long been mined for various uses such as image annotation, clustering of images, inference of image semantic content, etc., insufficient attention has been given to address issues in mining this contextual information. In this paper, we propose a webpage segmentation algorithm targeting the extraction of web images and their contextual information based on their characteristics as they appear on webpages. We conducted a user study to obtain a human-labeled dataset to validate the effectiveness of our method and experiments demonstrated that our method can achieve better results compared to an existing segmentation algorithm.",
keywords = "Dom tree-based algorithm, Surrounding information, Webpage segmentation, WWW images",
author = "Paizi@Fauzi, {Wan Fariza} and Hong, {Jer Lang} and Mohammed Belkhatir",
year = "2009",
month = "12",
day = "28",
doi = "10.1145/1631272.1631379",
language = "English",
isbn = "9781605586083",
pages = "649--652",
booktitle = "MM'09 - Proceedings of the 2009 ACM Multimedia Conference, with Co-located Workshops and Symposiums",

}

TY - GEN

T1 - Webpage segmentation for extracting images and their surrounding contextual information

AU - Paizi@Fauzi, Wan Fariza

AU - Hong, Jer Lang

AU - Belkhatir, Mohammed

PY - 2009/12/28

Y1 - 2009/12/28

N2 - Web images come in hand with valuable contextual information. Although this information has long been mined for various uses such as image annotation, clustering of images, inference of image semantic content, etc., insufficient attention has been given to address issues in mining this contextual information. In this paper, we propose a webpage segmentation algorithm targeting the extraction of web images and their contextual information based on their characteristics as they appear on webpages. We conducted a user study to obtain a human-labeled dataset to validate the effectiveness of our method and experiments demonstrated that our method can achieve better results compared to an existing segmentation algorithm.

AB - Web images come in hand with valuable contextual information. Although this information has long been mined for various uses such as image annotation, clustering of images, inference of image semantic content, etc., insufficient attention has been given to address issues in mining this contextual information. In this paper, we propose a webpage segmentation algorithm targeting the extraction of web images and their contextual information based on their characteristics as they appear on webpages. We conducted a user study to obtain a human-labeled dataset to validate the effectiveness of our method and experiments demonstrated that our method can achieve better results compared to an existing segmentation algorithm.

KW - Dom tree-based algorithm

KW - Surrounding information

KW - Webpage segmentation

KW - WWW images

UR - http://www.scopus.com/inward/record.url?scp=72449126143&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=72449126143&partnerID=8YFLogxK

U2 - 10.1145/1631272.1631379

DO - 10.1145/1631272.1631379

M3 - Conference contribution

SN - 9781605586083

SP - 649

EP - 652

BT - MM'09 - Proceedings of the 2009 ACM Multimedia Conference, with Co-located Workshops and Symposiums

ER -