Automatic detection of Shadda in modern standard Arabic continuous speech

Ammar Al-Sabri, Afzan Adam, Fadhilah Rosdi

Research output: Contribution to journalArticle

Abstract

The presence of diacritics Shadda in Arabic continuous speech may lead to the reduction of the accuracy of automatic Word Boundary Detection (WBD), which caused one word will be wrongly detected as two words. Therefore, this will affect the accuracy of Automatic Speech Recognition (ASR), if it is based on WBD. Shadda is one of the essential characteristics of the Arabic language which represents a consonant doubling. In this paper, a proposed method of automatic detection of Shadda in Modern Standard Arabic (MSA) continuous speech was introduced to improve the accuracy of WBD in MSA continuous speech. The prosodic features namely Short Time Energy (STE), Fundamental Frequency and Intensity were investigated for its ability as Shadda pattern detection in continuous MSA speech. We have analyzed the proposed features by implementing a separated algorithm for each feature to detect Shadda pattern automatically. In addition, a new proposed method which is a combination of STE and Intensity were introduced. The dataset in this work is a collection of 1-hour TV broadcast news from Aljazeera Arabic TV channel for 2018-broadcasters. We found that the Shadda pattern is very similar to unvoiced regions of speech, and this represents a big challenge for the improvement of WDB using Shadda. Results showed that the detection of Shadda using Short Time Energy and Intensity outperforms the Fundamental frequency with 55% of accuracy. Intensity achieved 71.5% in accuracy. In addition, a combination between Intensity & STE features was performed and achieved good results with 67.15% in accuracy. The number of false positive too has been reduced compared to Intensity alone.

Original languageEnglish
Pages (from-to)1810-1819
Number of pages10
JournalInternational Journal on Advanced Science, Engineering and Information Technology
Volume8
Issue number4-2
Publication statusPublished - 1 Jan 2018

Fingerprint

automatic detection
energy
broadcasters
Aptitude
Speech recognition
Language
methodology

Keywords

  • Fundamental frequency
  • Gemination
  • Intensity
  • Modern standard Arabic
  • Shadda
  • Short time energy
  • Word boundary

ASJC Scopus subject areas

  • Computer Science(all)
  • Agricultural and Biological Sciences(all)
  • Engineering(all)

Cite this

@article{efeb418646434b87b39b58016fd5613b,
title = "Automatic detection of Shadda in modern standard Arabic continuous speech",
abstract = "The presence of diacritics Shadda in Arabic continuous speech may lead to the reduction of the accuracy of automatic Word Boundary Detection (WBD), which caused one word will be wrongly detected as two words. Therefore, this will affect the accuracy of Automatic Speech Recognition (ASR), if it is based on WBD. Shadda is one of the essential characteristics of the Arabic language which represents a consonant doubling. In this paper, a proposed method of automatic detection of Shadda in Modern Standard Arabic (MSA) continuous speech was introduced to improve the accuracy of WBD in MSA continuous speech. The prosodic features namely Short Time Energy (STE), Fundamental Frequency and Intensity were investigated for its ability as Shadda pattern detection in continuous MSA speech. We have analyzed the proposed features by implementing a separated algorithm for each feature to detect Shadda pattern automatically. In addition, a new proposed method which is a combination of STE and Intensity were introduced. The dataset in this work is a collection of 1-hour TV broadcast news from Aljazeera Arabic TV channel for 2018-broadcasters. We found that the Shadda pattern is very similar to unvoiced regions of speech, and this represents a big challenge for the improvement of WDB using Shadda. Results showed that the detection of Shadda using Short Time Energy and Intensity outperforms the Fundamental frequency with 55{\%} of accuracy. Intensity achieved 71.5{\%} in accuracy. In addition, a combination between Intensity & STE features was performed and achieved good results with 67.15{\%} in accuracy. The number of false positive too has been reduced compared to Intensity alone.",
keywords = "Fundamental frequency, Gemination, Intensity, Modern standard Arabic, Shadda, Short time energy, Word boundary",
author = "Ammar Al-Sabri and Afzan Adam and Fadhilah Rosdi",
year = "2018",
month = "1",
day = "1",
language = "English",
volume = "8",
pages = "1810--1819",
journal = "International Journal on Advanced Science, Engineering and Information Technology",
issn = "2088-5334",
publisher = "INSIGHT - Indonesian Society for Knowledge and Human Development",
number = "4-2",

}

TY - JOUR

T1 - Automatic detection of Shadda in modern standard Arabic continuous speech

AU - Al-Sabri, Ammar

AU - Adam, Afzan

AU - Rosdi, Fadhilah

PY - 2018/1/1

Y1 - 2018/1/1

N2 - The presence of diacritics Shadda in Arabic continuous speech may lead to the reduction of the accuracy of automatic Word Boundary Detection (WBD), which caused one word will be wrongly detected as two words. Therefore, this will affect the accuracy of Automatic Speech Recognition (ASR), if it is based on WBD. Shadda is one of the essential characteristics of the Arabic language which represents a consonant doubling. In this paper, a proposed method of automatic detection of Shadda in Modern Standard Arabic (MSA) continuous speech was introduced to improve the accuracy of WBD in MSA continuous speech. The prosodic features namely Short Time Energy (STE), Fundamental Frequency and Intensity were investigated for its ability as Shadda pattern detection in continuous MSA speech. We have analyzed the proposed features by implementing a separated algorithm for each feature to detect Shadda pattern automatically. In addition, a new proposed method which is a combination of STE and Intensity were introduced. The dataset in this work is a collection of 1-hour TV broadcast news from Aljazeera Arabic TV channel for 2018-broadcasters. We found that the Shadda pattern is very similar to unvoiced regions of speech, and this represents a big challenge for the improvement of WDB using Shadda. Results showed that the detection of Shadda using Short Time Energy and Intensity outperforms the Fundamental frequency with 55% of accuracy. Intensity achieved 71.5% in accuracy. In addition, a combination between Intensity & STE features was performed and achieved good results with 67.15% in accuracy. The number of false positive too has been reduced compared to Intensity alone.

AB - The presence of diacritics Shadda in Arabic continuous speech may lead to the reduction of the accuracy of automatic Word Boundary Detection (WBD), which caused one word will be wrongly detected as two words. Therefore, this will affect the accuracy of Automatic Speech Recognition (ASR), if it is based on WBD. Shadda is one of the essential characteristics of the Arabic language which represents a consonant doubling. In this paper, a proposed method of automatic detection of Shadda in Modern Standard Arabic (MSA) continuous speech was introduced to improve the accuracy of WBD in MSA continuous speech. The prosodic features namely Short Time Energy (STE), Fundamental Frequency and Intensity were investigated for its ability as Shadda pattern detection in continuous MSA speech. We have analyzed the proposed features by implementing a separated algorithm for each feature to detect Shadda pattern automatically. In addition, a new proposed method which is a combination of STE and Intensity were introduced. The dataset in this work is a collection of 1-hour TV broadcast news from Aljazeera Arabic TV channel for 2018-broadcasters. We found that the Shadda pattern is very similar to unvoiced regions of speech, and this represents a big challenge for the improvement of WDB using Shadda. Results showed that the detection of Shadda using Short Time Energy and Intensity outperforms the Fundamental frequency with 55% of accuracy. Intensity achieved 71.5% in accuracy. In addition, a combination between Intensity & STE features was performed and achieved good results with 67.15% in accuracy. The number of false positive too has been reduced compared to Intensity alone.

KW - Fundamental frequency

KW - Gemination

KW - Intensity

KW - Modern standard Arabic

KW - Shadda

KW - Short time energy

KW - Word boundary

UR - http://www.scopus.com/inward/record.url?scp=85055335907&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85055335907&partnerID=8YFLogxK

M3 - Article

VL - 8

SP - 1810

EP - 1819

JO - International Journal on Advanced Science, Engineering and Information Technology

JF - International Journal on Advanced Science, Engineering and Information Technology

SN - 2088-5334

IS - 4-2

ER -