Rough K-means outlier factor based on entropy computation

Djoko Budiyanto Setyohadi, Azuraliza Abu Bakar, Zulaiha Ali Othman

Research output: Contribution to journalArticle

Abstract

Many studies of outlier detection have been developed based on the cluster-based outlier detection approach, since it does not need any prior knowledge of the dataset. However, the previous studies only regard the outlier factor computation with respect to a single point or a small cluster, which reflects its deviates from a common cluster. Furthermore, all objects within outlier cluster are assumed to be similar. The outlier objects intuitively can be grouped into the outlier clusters and the outlier factors of each object within the outlier cluster should be different gradually. It is not natural if the outlierness of each object within outlier cluster is similar. This study proposes the new outlier detection method based on the hybrid of the Rough K-Means clustering algorithm and the entropy computation. We introduce the outlier degree measure namely the entropy outlier factor for the cluster based outlier detection. The proposed algorithm sequentially finds the outlier cluster and calculates the outlier factor degree of the objects within outlier cluster. Each object within outlier cluster is evaluated using entropy cluster-based to a whole cluster. The performance of the algorithm has been tested on four UCI benchmark data sets and show outperform especially in detection rate.

Original languageEnglish
Pages (from-to)398-409
Number of pages12
JournalResearch Journal of Applied Sciences, Engineering and Technology
Volume8
Issue number3
Publication statusPublished - 2014

Fingerprint

Entropy
Clustering algorithms

Keywords

  • Entropy outlier
  • Outlier detection
  • Rough k-means

ASJC Scopus subject areas

  • Engineering(all)
  • Computer Science(all)

Cite this

Rough K-means outlier factor based on entropy computation. / Setyohadi, Djoko Budiyanto; Abu Bakar, Azuraliza; Ali Othman, Zulaiha.

In: Research Journal of Applied Sciences, Engineering and Technology, Vol. 8, No. 3, 2014, p. 398-409.

Research output: Contribution to journalArticle

@article{855bb6e9bce44bfbaa58d605a273500c,
title = "Rough K-means outlier factor based on entropy computation",
abstract = "Many studies of outlier detection have been developed based on the cluster-based outlier detection approach, since it does not need any prior knowledge of the dataset. However, the previous studies only regard the outlier factor computation with respect to a single point or a small cluster, which reflects its deviates from a common cluster. Furthermore, all objects within outlier cluster are assumed to be similar. The outlier objects intuitively can be grouped into the outlier clusters and the outlier factors of each object within the outlier cluster should be different gradually. It is not natural if the outlierness of each object within outlier cluster is similar. This study proposes the new outlier detection method based on the hybrid of the Rough K-Means clustering algorithm and the entropy computation. We introduce the outlier degree measure namely the entropy outlier factor for the cluster based outlier detection. The proposed algorithm sequentially finds the outlier cluster and calculates the outlier factor degree of the objects within outlier cluster. Each object within outlier cluster is evaluated using entropy cluster-based to a whole cluster. The performance of the algorithm has been tested on four UCI benchmark data sets and show outperform especially in detection rate.",
keywords = "Entropy outlier, Outlier detection, Rough k-means",
author = "Setyohadi, {Djoko Budiyanto} and {Abu Bakar}, Azuraliza and {Ali Othman}, Zulaiha",
year = "2014",
language = "English",
volume = "8",
pages = "398--409",
journal = "Research Journal of Applied Sciences, Engineering and Technology",
issn = "2040-7459",
publisher = "Maxwell Scientific Publications",
number = "3",

}

TY - JOUR

T1 - Rough K-means outlier factor based on entropy computation

AU - Setyohadi, Djoko Budiyanto

AU - Abu Bakar, Azuraliza

AU - Ali Othman, Zulaiha

PY - 2014

Y1 - 2014

N2 - Many studies of outlier detection have been developed based on the cluster-based outlier detection approach, since it does not need any prior knowledge of the dataset. However, the previous studies only regard the outlier factor computation with respect to a single point or a small cluster, which reflects its deviates from a common cluster. Furthermore, all objects within outlier cluster are assumed to be similar. The outlier objects intuitively can be grouped into the outlier clusters and the outlier factors of each object within the outlier cluster should be different gradually. It is not natural if the outlierness of each object within outlier cluster is similar. This study proposes the new outlier detection method based on the hybrid of the Rough K-Means clustering algorithm and the entropy computation. We introduce the outlier degree measure namely the entropy outlier factor for the cluster based outlier detection. The proposed algorithm sequentially finds the outlier cluster and calculates the outlier factor degree of the objects within outlier cluster. Each object within outlier cluster is evaluated using entropy cluster-based to a whole cluster. The performance of the algorithm has been tested on four UCI benchmark data sets and show outperform especially in detection rate.

AB - Many studies of outlier detection have been developed based on the cluster-based outlier detection approach, since it does not need any prior knowledge of the dataset. However, the previous studies only regard the outlier factor computation with respect to a single point or a small cluster, which reflects its deviates from a common cluster. Furthermore, all objects within outlier cluster are assumed to be similar. The outlier objects intuitively can be grouped into the outlier clusters and the outlier factors of each object within the outlier cluster should be different gradually. It is not natural if the outlierness of each object within outlier cluster is similar. This study proposes the new outlier detection method based on the hybrid of the Rough K-Means clustering algorithm and the entropy computation. We introduce the outlier degree measure namely the entropy outlier factor for the cluster based outlier detection. The proposed algorithm sequentially finds the outlier cluster and calculates the outlier factor degree of the objects within outlier cluster. Each object within outlier cluster is evaluated using entropy cluster-based to a whole cluster. The performance of the algorithm has been tested on four UCI benchmark data sets and show outperform especially in detection rate.

KW - Entropy outlier

KW - Outlier detection

KW - Rough k-means

UR - http://www.scopus.com/inward/record.url?scp=84908575251&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84908575251&partnerID=8YFLogxK

M3 - Article

VL - 8

SP - 398

EP - 409

JO - Research Journal of Applied Sciences, Engineering and Technology

JF - Research Journal of Applied Sciences, Engineering and Technology

SN - 2040-7459

IS - 3

ER -