Performances of speech signal biometric systems based on signal to noise ratio degradation

Dzati Athiar Ramli, Salina Abdul Samad, Aini Hussain

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

In this study the performances of speech based biometric systems at different levels of signal to noise ratio i.e. clean, 30dB, 20dB and 10dB are experimented. This study also suggests the integration of visual information to the speech based biometric systems in order to enhance the audio only systems performances. The weighting factor for combination of audio and visual scores is optimized by performing the validation data set evaluation and the min-max normalization technique is then used for fusion scheme. Incorporating visual information to the systems increases the decision accuracy compared to the audio only system. The EER performance of the integration system in clean, 30dB, 20dB and 10dB SNRs are observed as 0.0019%, 0.0084%, 0.9356% and 5.0160%, respectively compared to the EER performances of 1.1599%, 2.5113%, 19.3423% and 39.8649% for audio only system. In this study, Support Vector Machine (SVM) classifier is used for pattern matching and Mel Frequency Cepstral Coefficient (MFCC) are extracted as audio features.

Original languageEnglish
Title of host publicationAdvances in Intelligent and Soft Computing
Pages73-80
Number of pages8
Volume85
DOIs
Publication statusPublished - 2010

Publication series

NameAdvances in Intelligent and Soft Computing
Volume85
ISSN (Print)18675662

Fingerprint

Biometrics
Signal to noise ratio
Degradation
Pattern matching
Support vector machines
Classifiers
Fusion reactions

ASJC Scopus subject areas

  • Computer Science(all)

Cite this

Ramli, D. A., Abdul Samad, S., & Hussain, A. (2010). Performances of speech signal biometric systems based on signal to noise ratio degradation. In Advances in Intelligent and Soft Computing (Vol. 85, pp. 73-80). (Advances in Intelligent and Soft Computing; Vol. 85). https://doi.org/10.1007/978-3-642-16626-6_8

Performances of speech signal biometric systems based on signal to noise ratio degradation. / Ramli, Dzati Athiar; Abdul Samad, Salina; Hussain, Aini.

Advances in Intelligent and Soft Computing. Vol. 85 2010. p. 73-80 (Advances in Intelligent and Soft Computing; Vol. 85).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Ramli, DA, Abdul Samad, S & Hussain, A 2010, Performances of speech signal biometric systems based on signal to noise ratio degradation. in Advances in Intelligent and Soft Computing. vol. 85, Advances in Intelligent and Soft Computing, vol. 85, pp. 73-80. https://doi.org/10.1007/978-3-642-16626-6_8
Ramli DA, Abdul Samad S, Hussain A. Performances of speech signal biometric systems based on signal to noise ratio degradation. In Advances in Intelligent and Soft Computing. Vol. 85. 2010. p. 73-80. (Advances in Intelligent and Soft Computing). https://doi.org/10.1007/978-3-642-16626-6_8
Ramli, Dzati Athiar ; Abdul Samad, Salina ; Hussain, Aini. / Performances of speech signal biometric systems based on signal to noise ratio degradation. Advances in Intelligent and Soft Computing. Vol. 85 2010. pp. 73-80 (Advances in Intelligent and Soft Computing).
@inproceedings{89af5fa4cef24b858090e813a77c5337,
title = "Performances of speech signal biometric systems based on signal to noise ratio degradation",
abstract = "In this study the performances of speech based biometric systems at different levels of signal to noise ratio i.e. clean, 30dB, 20dB and 10dB are experimented. This study also suggests the integration of visual information to the speech based biometric systems in order to enhance the audio only systems performances. The weighting factor for combination of audio and visual scores is optimized by performing the validation data set evaluation and the min-max normalization technique is then used for fusion scheme. Incorporating visual information to the systems increases the decision accuracy compared to the audio only system. The EER performance of the integration system in clean, 30dB, 20dB and 10dB SNRs are observed as 0.0019{\%}, 0.0084{\%}, 0.9356{\%} and 5.0160{\%}, respectively compared to the EER performances of 1.1599{\%}, 2.5113{\%}, 19.3423{\%} and 39.8649{\%} for audio only system. In this study, Support Vector Machine (SVM) classifier is used for pattern matching and Mel Frequency Cepstral Coefficient (MFCC) are extracted as audio features.",
author = "Ramli, {Dzati Athiar} and {Abdul Samad}, Salina and Aini Hussain",
year = "2010",
doi = "10.1007/978-3-642-16626-6_8",
language = "English",
isbn = "9783642166259",
volume = "85",
series = "Advances in Intelligent and Soft Computing",
pages = "73--80",
booktitle = "Advances in Intelligent and Soft Computing",

}

TY - GEN

T1 - Performances of speech signal biometric systems based on signal to noise ratio degradation

AU - Ramli, Dzati Athiar

AU - Abdul Samad, Salina

AU - Hussain, Aini

PY - 2010

Y1 - 2010

N2 - In this study the performances of speech based biometric systems at different levels of signal to noise ratio i.e. clean, 30dB, 20dB and 10dB are experimented. This study also suggests the integration of visual information to the speech based biometric systems in order to enhance the audio only systems performances. The weighting factor for combination of audio and visual scores is optimized by performing the validation data set evaluation and the min-max normalization technique is then used for fusion scheme. Incorporating visual information to the systems increases the decision accuracy compared to the audio only system. The EER performance of the integration system in clean, 30dB, 20dB and 10dB SNRs are observed as 0.0019%, 0.0084%, 0.9356% and 5.0160%, respectively compared to the EER performances of 1.1599%, 2.5113%, 19.3423% and 39.8649% for audio only system. In this study, Support Vector Machine (SVM) classifier is used for pattern matching and Mel Frequency Cepstral Coefficient (MFCC) are extracted as audio features.

AB - In this study the performances of speech based biometric systems at different levels of signal to noise ratio i.e. clean, 30dB, 20dB and 10dB are experimented. This study also suggests the integration of visual information to the speech based biometric systems in order to enhance the audio only systems performances. The weighting factor for combination of audio and visual scores is optimized by performing the validation data set evaluation and the min-max normalization technique is then used for fusion scheme. Incorporating visual information to the systems increases the decision accuracy compared to the audio only system. The EER performance of the integration system in clean, 30dB, 20dB and 10dB SNRs are observed as 0.0019%, 0.0084%, 0.9356% and 5.0160%, respectively compared to the EER performances of 1.1599%, 2.5113%, 19.3423% and 39.8649% for audio only system. In this study, Support Vector Machine (SVM) classifier is used for pattern matching and Mel Frequency Cepstral Coefficient (MFCC) are extracted as audio features.

UR - http://www.scopus.com/inward/record.url?scp=80053024347&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=80053024347&partnerID=8YFLogxK

U2 - 10.1007/978-3-642-16626-6_8

DO - 10.1007/978-3-642-16626-6_8

M3 - Conference contribution

SN - 9783642166259

VL - 85

T3 - Advances in Intelligent and Soft Computing

SP - 73

EP - 80

BT - Advances in Intelligent and Soft Computing

ER -