Performance Comparison between Bootstrap and Multiscale Bootstrap for Assessing Phylogenetic Tree for RNA polymerase

Safinah Sharuddin, Nora Muda

Research output: Contribution to journalArticle

Abstract

Phylogenetic inference refers to the reconstruction of evolutionary relationships among various species that is usually presented in the form of a tree. This study constructs the phylogenetic tree by using a novel distance-based method known as Modified one step M-estimator (MOM) method. The branches of the phylogenetic tree constructed were then evaluated to see their reliability. The performance of the reliability was then compared between the p-value of multiscale bootstrap (AU value) and bootstrap p-value (BP value). The aim of this study was to compare the performance between the AU value and BP value for assessing phylogenetic tree of RNA polymerase. The results have shown that multiscale bootstrap analysis can detect high sampling errors but not in bootstrap analysis. To overcome this problem, the multiscale bootstrap analysis has reduced the sampling error by increasing the number of replications. The clusters were indicated as significant if AU values or BP values were 95% or higher. From the analysis, the results showed that the BP and AU values differ at 11th and 15th branch of the phylogenetic tree. The BP values at both branches were 72 and 85%, respectively, thereby making the cluster not significant but by looking at the AU values, the two branches were more than 95% and the clusters were significant. This was due to the biasness in calculation of the probability of bootstrap analysis, therefore, the multiscale bootstrap analysis has improved the calculation of the probability value compared to the bootstrap analysis.

Original languageEnglish
Pages (from-to)1643-1651
Number of pages9
JournalSains Malaysiana
Volume44
Issue number11
Publication statusPublished - 1 Nov 2015

Fingerprint

DNA-directed RNA polymerase
phylogeny
probability analysis
sampling
methodology

Keywords

  • Distance-based method
  • Median absolute deviation (MADn)
  • Modified one-step M-estimator (MOM)
  • Phylogenetic inference

ASJC Scopus subject areas

  • General

Cite this

Performance Comparison between Bootstrap and Multiscale Bootstrap for Assessing Phylogenetic Tree for RNA polymerase. / Sharuddin, Safinah; Muda, Nora.

In: Sains Malaysiana, Vol. 44, No. 11, 01.11.2015, p. 1643-1651.

Research output: Contribution to journalArticle

@article{3d414b26e40e441f899f8f51c0759c5b,
title = "Performance Comparison between Bootstrap and Multiscale Bootstrap for Assessing Phylogenetic Tree for RNA polymerase",
abstract = "Phylogenetic inference refers to the reconstruction of evolutionary relationships among various species that is usually presented in the form of a tree. This study constructs the phylogenetic tree by using a novel distance-based method known as Modified one step M-estimator (MOM) method. The branches of the phylogenetic tree constructed were then evaluated to see their reliability. The performance of the reliability was then compared between the p-value of multiscale bootstrap (AU value) and bootstrap p-value (BP value). The aim of this study was to compare the performance between the AU value and BP value for assessing phylogenetic tree of RNA polymerase. The results have shown that multiscale bootstrap analysis can detect high sampling errors but not in bootstrap analysis. To overcome this problem, the multiscale bootstrap analysis has reduced the sampling error by increasing the number of replications. The clusters were indicated as significant if AU values or BP values were 95{\%} or higher. From the analysis, the results showed that the BP and AU values differ at 11th and 15th branch of the phylogenetic tree. The BP values at both branches were 72 and 85{\%}, respectively, thereby making the cluster not significant but by looking at the AU values, the two branches were more than 95{\%} and the clusters were significant. This was due to the biasness in calculation of the probability of bootstrap analysis, therefore, the multiscale bootstrap analysis has improved the calculation of the probability value compared to the bootstrap analysis.",
keywords = "Distance-based method, Median absolute deviation (MADn), Modified one-step M-estimator (MOM), Phylogenetic inference",
author = "Safinah Sharuddin and Nora Muda",
year = "2015",
month = "11",
day = "1",
language = "English",
volume = "44",
pages = "1643--1651",
journal = "Sains Malaysiana",
issn = "0126-6039",
publisher = "Penerbit Universiti Kebangsaan Malaysia",
number = "11",

}

TY - JOUR

T1 - Performance Comparison between Bootstrap and Multiscale Bootstrap for Assessing Phylogenetic Tree for RNA polymerase

AU - Sharuddin, Safinah

AU - Muda, Nora

PY - 2015/11/1

Y1 - 2015/11/1

N2 - Phylogenetic inference refers to the reconstruction of evolutionary relationships among various species that is usually presented in the form of a tree. This study constructs the phylogenetic tree by using a novel distance-based method known as Modified one step M-estimator (MOM) method. The branches of the phylogenetic tree constructed were then evaluated to see their reliability. The performance of the reliability was then compared between the p-value of multiscale bootstrap (AU value) and bootstrap p-value (BP value). The aim of this study was to compare the performance between the AU value and BP value for assessing phylogenetic tree of RNA polymerase. The results have shown that multiscale bootstrap analysis can detect high sampling errors but not in bootstrap analysis. To overcome this problem, the multiscale bootstrap analysis has reduced the sampling error by increasing the number of replications. The clusters were indicated as significant if AU values or BP values were 95% or higher. From the analysis, the results showed that the BP and AU values differ at 11th and 15th branch of the phylogenetic tree. The BP values at both branches were 72 and 85%, respectively, thereby making the cluster not significant but by looking at the AU values, the two branches were more than 95% and the clusters were significant. This was due to the biasness in calculation of the probability of bootstrap analysis, therefore, the multiscale bootstrap analysis has improved the calculation of the probability value compared to the bootstrap analysis.

AB - Phylogenetic inference refers to the reconstruction of evolutionary relationships among various species that is usually presented in the form of a tree. This study constructs the phylogenetic tree by using a novel distance-based method known as Modified one step M-estimator (MOM) method. The branches of the phylogenetic tree constructed were then evaluated to see their reliability. The performance of the reliability was then compared between the p-value of multiscale bootstrap (AU value) and bootstrap p-value (BP value). The aim of this study was to compare the performance between the AU value and BP value for assessing phylogenetic tree of RNA polymerase. The results have shown that multiscale bootstrap analysis can detect high sampling errors but not in bootstrap analysis. To overcome this problem, the multiscale bootstrap analysis has reduced the sampling error by increasing the number of replications. The clusters were indicated as significant if AU values or BP values were 95% or higher. From the analysis, the results showed that the BP and AU values differ at 11th and 15th branch of the phylogenetic tree. The BP values at both branches were 72 and 85%, respectively, thereby making the cluster not significant but by looking at the AU values, the two branches were more than 95% and the clusters were significant. This was due to the biasness in calculation of the probability of bootstrap analysis, therefore, the multiscale bootstrap analysis has improved the calculation of the probability value compared to the bootstrap analysis.

KW - Distance-based method

KW - Median absolute deviation (MADn)

KW - Modified one-step M-estimator (MOM)

KW - Phylogenetic inference

UR - http://www.scopus.com/inward/record.url?scp=84951838930&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84951838930&partnerID=8YFLogxK

M3 - Article

AN - SCOPUS:84951838930

VL - 44

SP - 1643

EP - 1651

JO - Sains Malaysiana

JF - Sains Malaysiana

SN - 0126-6039

IS - 11

ER -