Bangla Speech Processing: An Analytical Study of Feature Extraction and Recognition Methods

dc.contributor.author: Md. Shafiul Alam Chowdhury
dc.contributor.author: Md. Farukuzzaman Khan
dc.contributor.author: Mohammed Sowket Ali
dc.contributor.author: Md. Zahidul Islam
dc.contributor.author: Md. Abdul Mannan
dc.contributor.author: Md. Amanat Ullah
dc.date.accessioned: 2026-04-09T05:03:55Z
dc.date.issued: 2025-07-31
dc.description.abstract: Speech recognition has always been an interesting yet challenging task for researchers, especially for Bangla, whose linguistic structure makes it complex. This research is extensive in scale: the experiments cover Bangla phonemes, isolated Bangla words, commands, and sentences. It is a large-scale comparative analysis of Bangla speech recognition that focuses on feature extraction techniques, recognition tools, window frame techniques, and the other methods applied. The system is implemented in MATLAB. Mel Frequency Cepstral Coefficient (MFCC), Power Spectral Analysis (FFT), and Linear Predictor Coefficient Analysis (LPC) methods are used as feature extraction techniques. A time-delay neural network (TDNN, time series) and a two-layer feed-forward neural network (FFNN) are used as speech recognition tools. The maximum likelihood method is also incorporated to enhance recognition accuracy. Blackman, Hamming, and Hanning window frame techniques are applied in parallel during feature extraction to observe their influence on recognition accuracy. The datasets were gathered from native speakers. MFCC as a feature extraction technique, combined with the two-layer FFNN or the TDNN as the recognition tool, outperforms FFT and LPC combined with the same neural network tools. The study found that the quantity of speech samples, voices from speakers of the opposite gender, and the choice of windowing technique all had an impact on the recognition accuracy rate. This study will encourage researchers to conduct further research to advance Bangla speech recognition.
dc.identifier.citation: Chowdhury, Md Shafiul Alam, et al. "Bangla Speech Processing: An Analytical Study of Feature Extraction and Recognition Methods." Mathematical Modelling of Engineering Problems 12.7 (2025).
dc.identifier.issn: 2369-0739
dc.identifier.uri: http://dspace.uttarauniversity.edu.bd:4000/handle/123456789/1398
dc.language.iso: en_US
dc.publisher: Mathematical Modelling of Engineering Problems
dc.subject: Bangla Speech Processing
dc.subject: Automatic Speech Recognition (ASR)
dc.subject: Speech Recognition
dc.subject: Feature Extraction Techniques
dc.title: Bangla Speech Processing: An Analytical Study of Feature Extraction and Recognition Methods
dc.type: Article

Files

Original bundle

Name: mmep.pdf
Size: 2.13 MB
Format: Adobe Portable Document Format

License bundle

Name: license.txt
Size: 1.71 KB
Format: Item-specific license agreed to upon submission