On machine learning technique selection for classification

Rahmad Kurniawan, Mohd Zakree Ahmad Nazri, M. Irsyad, Rado Yendra, Anis Aklima

Research output: Chapter in Book/Report/Conference proceedingConference contribution

3 Citations (Scopus)

Abstract

Extracting meaningful pattern from data can be challenging. Irrelevant, redundant, noisy and unreliable data, misinterpretation of results and incompatibility of a technique to extract unknown patterns from data may lead analyst to develop an erroneous classifier. This research is encouraged by 'No Free Lunch' theorem that can be simplified as no classification technique that works best for every problem. This study tries to make a comparison amongst three main approaches in data mining, i.e. Decision Tree (DT), Artificial Neural Network (ANN), and Rough Set Theory (RST). A comparative analysis of the above techniques has been conducted by using open source's software ROSETTA and WEKA on five different datasets. The sample sizes are categorized in relation to the number of attributes and number of instances available in the dataset. Assessments on the classification model are based on accuracy, amount and length of the generated rules, error rate and standard deviation. Based on nine experiments, results show that Artificial Neural Network provides better accuracy than Decision Tree and Rough Set approach while Rough Set creates more rules and Decision Tree generate rules faster than the compared techniques. The results show the trade off of using different approaches for other researchers in finding the best model for a particular problem.

Original languageEnglish
Title of host publicationProceedings - 5th International Conference on Electrical Engineering and Informatics: Bridging the Knowledge between Academic, Industry, and Community, ICEEI 2015
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages540-545
Number of pages6
ISBN (Print)9781467373197
DOIs
Publication statusPublished - 10 Dec 2015
Event5th International Conference on Electrical Engineering and Informatics, ICEEI 2015 - Legian-Bali, Indonesia
Duration: 10 Aug 201511 Aug 2015

Other

Other5th International Conference on Electrical Engineering and Informatics, ICEEI 2015
CountryIndonesia
CityLegian-Bali
Period10/8/1511/8/15

Fingerprint

Decision trees
Learning systems
Neural networks
Rough set theory
Data mining
Classifiers
Experiments

Keywords

  • artificial neural network
  • decision tree
  • Rough set theory

ASJC Scopus subject areas

  • Computer Science Applications
  • Information Systems
  • Signal Processing
  • Electrical and Electronic Engineering

Cite this

Kurniawan, R., Ahmad Nazri, M. Z., Irsyad, M., Yendra, R., & Aklima, A. (2015). On machine learning technique selection for classification. In Proceedings - 5th International Conference on Electrical Engineering and Informatics: Bridging the Knowledge between Academic, Industry, and Community, ICEEI 2015 (pp. 540-545). [7352559] Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/ICEEI.2015.7352559

On machine learning technique selection for classification. / Kurniawan, Rahmad; Ahmad Nazri, Mohd Zakree; Irsyad, M.; Yendra, Rado; Aklima, Anis.

Proceedings - 5th International Conference on Electrical Engineering and Informatics: Bridging the Knowledge between Academic, Industry, and Community, ICEEI 2015. Institute of Electrical and Electronics Engineers Inc., 2015. p. 540-545 7352559.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Kurniawan, R, Ahmad Nazri, MZ, Irsyad, M, Yendra, R & Aklima, A 2015, On machine learning technique selection for classification. in Proceedings - 5th International Conference on Electrical Engineering and Informatics: Bridging the Knowledge between Academic, Industry, and Community, ICEEI 2015., 7352559, Institute of Electrical and Electronics Engineers Inc., pp. 540-545, 5th International Conference on Electrical Engineering and Informatics, ICEEI 2015, Legian-Bali, Indonesia, 10/8/15. https://doi.org/10.1109/ICEEI.2015.7352559
Kurniawan R, Ahmad Nazri MZ, Irsyad M, Yendra R, Aklima A. On machine learning technique selection for classification. In Proceedings - 5th International Conference on Electrical Engineering and Informatics: Bridging the Knowledge between Academic, Industry, and Community, ICEEI 2015. Institute of Electrical and Electronics Engineers Inc. 2015. p. 540-545. 7352559 https://doi.org/10.1109/ICEEI.2015.7352559
Kurniawan, Rahmad ; Ahmad Nazri, Mohd Zakree ; Irsyad, M. ; Yendra, Rado ; Aklima, Anis. / On machine learning technique selection for classification. Proceedings - 5th International Conference on Electrical Engineering and Informatics: Bridging the Knowledge between Academic, Industry, and Community, ICEEI 2015. Institute of Electrical and Electronics Engineers Inc., 2015. pp. 540-545
@inproceedings{b9ae18337db34ae79532a2e1c3abf936,
title = "On machine learning technique selection for classification",
abstract = "Extracting meaningful pattern from data can be challenging. Irrelevant, redundant, noisy and unreliable data, misinterpretation of results and incompatibility of a technique to extract unknown patterns from data may lead analyst to develop an erroneous classifier. This research is encouraged by 'No Free Lunch' theorem that can be simplified as no classification technique that works best for every problem. This study tries to make a comparison amongst three main approaches in data mining, i.e. Decision Tree (DT), Artificial Neural Network (ANN), and Rough Set Theory (RST). A comparative analysis of the above techniques has been conducted by using open source's software ROSETTA and WEKA on five different datasets. The sample sizes are categorized in relation to the number of attributes and number of instances available in the dataset. Assessments on the classification model are based on accuracy, amount and length of the generated rules, error rate and standard deviation. Based on nine experiments, results show that Artificial Neural Network provides better accuracy than Decision Tree and Rough Set approach while Rough Set creates more rules and Decision Tree generate rules faster than the compared techniques. The results show the trade off of using different approaches for other researchers in finding the best model for a particular problem.",
keywords = "artificial neural network, decision tree, Rough set theory",
author = "Rahmad Kurniawan and {Ahmad Nazri}, {Mohd Zakree} and M. Irsyad and Rado Yendra and Anis Aklima",
year = "2015",
month = "12",
day = "10",
doi = "10.1109/ICEEI.2015.7352559",
language = "English",
isbn = "9781467373197",
pages = "540--545",
booktitle = "Proceedings - 5th International Conference on Electrical Engineering and Informatics: Bridging the Knowledge between Academic, Industry, and Community, ICEEI 2015",
publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

TY - GEN

T1 - On machine learning technique selection for classification

AU - Kurniawan, Rahmad

AU - Ahmad Nazri, Mohd Zakree

AU - Irsyad, M.

AU - Yendra, Rado

AU - Aklima, Anis

PY - 2015/12/10

Y1 - 2015/12/10

N2 - Extracting meaningful pattern from data can be challenging. Irrelevant, redundant, noisy and unreliable data, misinterpretation of results and incompatibility of a technique to extract unknown patterns from data may lead analyst to develop an erroneous classifier. This research is encouraged by 'No Free Lunch' theorem that can be simplified as no classification technique that works best for every problem. This study tries to make a comparison amongst three main approaches in data mining, i.e. Decision Tree (DT), Artificial Neural Network (ANN), and Rough Set Theory (RST). A comparative analysis of the above techniques has been conducted by using open source's software ROSETTA and WEKA on five different datasets. The sample sizes are categorized in relation to the number of attributes and number of instances available in the dataset. Assessments on the classification model are based on accuracy, amount and length of the generated rules, error rate and standard deviation. Based on nine experiments, results show that Artificial Neural Network provides better accuracy than Decision Tree and Rough Set approach while Rough Set creates more rules and Decision Tree generate rules faster than the compared techniques. The results show the trade off of using different approaches for other researchers in finding the best model for a particular problem.

AB - Extracting meaningful pattern from data can be challenging. Irrelevant, redundant, noisy and unreliable data, misinterpretation of results and incompatibility of a technique to extract unknown patterns from data may lead analyst to develop an erroneous classifier. This research is encouraged by 'No Free Lunch' theorem that can be simplified as no classification technique that works best for every problem. This study tries to make a comparison amongst three main approaches in data mining, i.e. Decision Tree (DT), Artificial Neural Network (ANN), and Rough Set Theory (RST). A comparative analysis of the above techniques has been conducted by using open source's software ROSETTA and WEKA on five different datasets. The sample sizes are categorized in relation to the number of attributes and number of instances available in the dataset. Assessments on the classification model are based on accuracy, amount and length of the generated rules, error rate and standard deviation. Based on nine experiments, results show that Artificial Neural Network provides better accuracy than Decision Tree and Rough Set approach while Rough Set creates more rules and Decision Tree generate rules faster than the compared techniques. The results show the trade off of using different approaches for other researchers in finding the best model for a particular problem.

KW - artificial neural network

KW - decision tree

KW - Rough set theory

UR - http://www.scopus.com/inward/record.url?scp=84961710625&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84961710625&partnerID=8YFLogxK

U2 - 10.1109/ICEEI.2015.7352559

DO - 10.1109/ICEEI.2015.7352559

M3 - Conference contribution

AN - SCOPUS:84961710625

SN - 9781467373197

SP - 540

EP - 545

BT - Proceedings - 5th International Conference on Electrical Engineering and Informatics: Bridging the Knowledge between Academic, Industry, and Community, ICEEI 2015

PB - Institute of Electrical and Electronics Engineers Inc.

ER -