Bangla Speech Recognition: Power Spectral Analysis, LPC & MFCC as Feature Extraction Techniques in Deep Learning

dc.contributor.authorMd. Shafiul Alam Chowdhury
dc.contributor.authorMd. Farukuzzaman Khan
dc.contributor.authorMohammed Sowket Al
dc.contributor.authorShahriar Ahmed
dc.contributor.authorMd. Abdul Mannan
dc.contributor.author, Md. Amanat Ullah
dc.date.accessioned2026-03-24T04:18:32Z
dc.date.issued2025-05-31
dc.description.abstractSpeech recognition technology has already become a part of our everyday lives, and many works have been done mostly in the English language because it is an international language, but there is still more that researchers could do. Speech recognition technology has already become a part of the daily life. As can be seen, AI robots can converse with people, particularly in English. The topic of this study is speech recognition in Bangla (Bengali). To determine the highest feasible speech recognition accuracy in the Bangla (Bengali) language, several methods have been employed for pattern recognition and deep learning. Native speakers of Bangla provided the core dataset. It includes extensive experiments with Bangla phonemes, isolated words, commands, and sentences. Speech samples are subjected to feature extraction using MFCC. Simultaneously, LPC and FFT are employed. Using the maximum-likelihood approach, a multilayer feedforward deep neural network model has been utilized. A random dataset has been used to assess the model’s accuracy in speech recognition. Deep learning using a neural network model and feature extraction using MFCC outperform Power spectral testing and linear predictor coefficient tests regarding recognition outcomes. The investigation found that increasing the number of speech samples affected the recognition accuracy rate, as did the speech samples from the opposing gender
dc.identifier.citationChowdhury, Md Shafiul Alam, et al. "Bangla Speech Recognition: Power Spectral Analysis, LPC & MFCC as Feature Extraction Techniques in Deep Learning."
dc.identifier.issn23490918
dc.identifier.urihttp://dspace.uttarauniversity.edu.bd:4000/handle/123456789/1362
dc.language.isoen_US
dc.publisherInternational Journal of Engineering Trends and Technology
dc.subjectBangla Speech Recognition
dc.subjectBengali Automatic Speech Recognition (ASR)
dc.subjectSpeech Signal Processing
dc.subjectDeep Learning for Speech Recognition
dc.titleBangla Speech Recognition: Power Spectral Analysis, LPC & MFCC as Feature Extraction Techniques in Deep Learning
dc.typeArticle

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Bangla_Speech_Recognition_Power_Spectral_Analysis_.pdf
Size:
1.48 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed to upon submission
Description:

Collections