Robust digit recognition with dynamic time warping and recursive least squares

Syed Abdul Rahman Al-Haddad, Khairul Anuar Ishak, Salina Abdul Samad, Ali O. Abid, Aini Hussain Noor

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

1 Citation (Scopus)

Abstract

Robustness is a key issue in speech recognition. This paper proposes a speech recognition algorithm for the Malay digits 0 to 9, together with a noise cancellation algorithm based on recursive least squares (RLS). The system comprises speech processing, including detection of the digit boundaries using zero-crossing and energy calculations, followed by recognition. Mel-frequency cepstral coefficient (MFCC) vectors provide an estimate of the vocal tract filter, and dynamic time warping (DTW) with an appropriate global constraint finds the nearest recorded voice. The global constraint defines a valid search region: because the speaker's speech rate is assumed to vary only within a reasonable range, the constraint prunes the unreasonable parts of the search space. The algorithm is tested on speech samples recorded as part of a Malay corpus. The results show that the algorithm recognizes about 80.5% of the Malay digits across all recorded words. Adding an RLS noise canceller in the preprocessing stage increases the accuracy to 92.3%.
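
The matching step described in the abstract can be illustrated with a small sketch. This record does not say which global constraint the paper uses, so the code below assumes a Sakoe-Chiba band, a common choice that only visits cells (i, j) with |i - j| within a fixed half-width. The function names, the `band` parameter, the toy one-dimensional features standing in for MFCC vectors, and the template labels are all hypothetical and are not taken from the paper.

```python
import math


def dtw_distance(a, b, band):
    """DTW distance between two sequences of feature vectors.

    `band` is an assumed Sakoe-Chiba half-width: cell (i, j) is visited
    only when |i - j| <= band, which prunes warping paths that would
    imply an unreasonable change in speech rate.
    """
    n, m = len(a), len(b)
    # Widen the band if needed so the end cell (n, m) stays reachable.
    w = max(band, abs(n - m))
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(max(1, i - w), min(m, i + w) + 1):
            d = math.dist(a[i - 1], b[j - 1])  # Euclidean frame distance
            cost[i][j] = d + min(cost[i - 1][j],      # step in a only
                                 cost[i][j - 1],      # step in b only
                                 cost[i - 1][j - 1])  # step in both
    return cost[n][m]


def recognize(query, templates, band=2):
    """Return the label of the stored template nearest to `query`."""
    return min(templates,
               key=lambda label: dtw_distance(query, templates[label], band))


# Toy one-dimensional "features"; real input would be MFCC vectors per frame.
templates = {
    "satu": [(0.0,), (1.0,), (2.0,), (1.0,), (0.0,)],
    "dua": [(2.0,), (2.0,), (0.0,), (0.0,), (2.0,)],
}
# A slower utterance of the first template (one frame repeated).
query = [(0.0,), (1.0,), (1.0,), (2.0,), (1.0,), (0.0,)]
print(recognize(query, templates))  # prints: satu
```

A narrower band makes the search cheaper but tolerates less variation in speaking rate; with `band=0` and equal-length inputs the comparison reduces to a frame-by-frame distance along the diagonal.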

Original language: English
Title of host publication: Proceedings - International Symposium on Information Technology 2008, ITSim
Volume: 2
DOIs: https://doi.org/10.1109/ITSIM.2008.4631680
Publication status: Published - 2008
Event: International Symposium on Information Technology 2008, ITSim - Kuala Lumpur
Duration: 26 Aug 2008 - 29 Aug 2008

Other

Other: International Symposium on Information Technology 2008, ITSim
City: Kuala Lumpur
Period: 26/8/08 - 29/8/08

Fingerprint

Speech recognition
Speech processing

ASJC Scopus subject areas

  • Artificial Intelligence
  • Information Systems
  • Control and Systems Engineering
  • Electrical and Electronic Engineering

Cite this

Al-Haddad, S. A. R., Ishak, K. A., Abdul Samad, S., Abid, A. O., & Noor, A. H. (2008). Robust digit recognition with dynamic time warping and recursive least squares. In Proceedings - International Symposium on Information Technology 2008, ITSim (Vol. 2). [4631680] https://doi.org/10.1109/ITSIM.2008.4631680

@inproceedings{96e471edff7f4023a7acda415f1d12cf,
title = "Robust digit recognition with dynamic time warping and recursive least squares",
abstract = "Robustness is a key issue in speech recognition. This paper proposes a speech recognition algorithm for Malay digits from 0 to 9. This paper also proposes an algorithm for noise cancellation by using recursive least squares (RLS). This system consists of speech processing inclusive of digit margin and recognition which uses zero crossing and energy calculations. Mel-Frequency Cepstral Coefficient (MFCC) vectors are used to provide an estimate of the vocal tract filter. Meanwhile dynamic time warping (DTW) is used to detect the nearest recorded voice with appropriate global constraint. The global constraint is used to set a valid search region because the variation of the speech rate of the speaker is considered to be limited in a reasonable range, which means that it can prune the unreasonable search space. The algorithm is tested on speech samples that are recorded as a part of a Malay corpus. The results show that the algorithm can recognize almost 80.5{\%} of the Malay digits for all recorded words. By adding RLS noise canceller in the preprocessing stage it increases the accuracy to 92.3{\%}.",
author = "Al-Haddad, {Syed Abdul Rahman} and Ishak, {Khairul Anuar} and {Abdul Samad}, Salina and Abid, {Ali O.} and Noor, {Aini Hussain}",
year = "2008",
doi = "10.1109/ITSIM.2008.4631680",
language = "English",
isbn = "9781424423286",
volume = "2",
booktitle = "Proceedings - International Symposium on Information Technology 2008, ITSim",

}

TY - GEN

T1 - Robust digit recognition with dynamic time warping and recursive least squares

AU - Al-Haddad, Syed Abdul Rahman

AU - Ishak, Khairul Anuar

AU - Abdul Samad, Salina

AU - Abid, Ali O.

AU - Noor, Aini Hussain

PY - 2008

Y1 - 2008

N2 - Robustness is a key issue in speech recognition. This paper proposes a speech recognition algorithm for Malay digits from 0 to 9. This paper also proposes an algorithm for noise cancellation by using recursive least squares (RLS). This system consists of speech processing inclusive of digit margin and recognition which uses zero crossing and energy calculations. Mel-Frequency Cepstral Coefficient (MFCC) vectors are used to provide an estimate of the vocal tract filter. Meanwhile dynamic time warping (DTW) is used to detect the nearest recorded voice with appropriate global constraint. The global constraint is used to set a valid search region because the variation of the speech rate of the speaker is considered to be limited in a reasonable range, which means that it can prune the unreasonable search space. The algorithm is tested on speech samples that are recorded as a part of a Malay corpus. The results show that the algorithm can recognize almost 80.5% of the Malay digits for all recorded words. By adding RLS noise canceller in the preprocessing stage it increases the accuracy to 92.3%.

UR - http://www.scopus.com/inward/record.url?scp=57349133054&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=57349133054&partnerID=8YFLogxK

U2 - 10.1109/ITSIM.2008.4631680

DO - 10.1109/ITSIM.2008.4631680

M3 - Conference contribution

AN - SCOPUS:57349133054

SN - 9781424423286

VL - 2

BT - Proceedings - International Symposium on Information Technology 2008, ITSim

ER -