Статья

Adaptive Pronunciation Assessment Based on Acoustic Feature Profiling and Wav2vec 2.0 For English Language Learning

Abdumalikov Akmaljon Abduxoliq o'g'liComputer science and programming, Jizzakh branch of national university of Uzbekistan, 130100, UzbekistanRohmonqulov Muhammadyusuf Egamberdi o'g'liComputer science and programming, Jizzakh branch of national university of Uzbekistan, 130100, Uzbekistan

American Journal Of Applied Science And Technologyjournal2026

ABI

Аннотация

This paper presents a speaker-adaptive approach to automatic pronunciation assessment for English language learning, with a focus on Uzbek learners. The proposed methodology integrates acoustic signal processing, feature extraction, and deep learning-based modeling within a unified framework. A key contribution of the study is the introduction of dynamic speaker profiling based on fundamental frequency and energy, enabling adaptive dataset selection according to speaker characteristics such as age and gender. Mel-Frequency Cepstral Coefficients are employed for acoustic feature representation, while Wav2Vec 2.0 is utilized for deep contextual embedding and pronunciation evaluation. Experimental results demonstrate improved accuracy and efficiency compared to conventional approaches.

Перевод пока недоступен

Темы

Speech Recognition and Synthesis Emotion and Mood Recognition Voice and Speech Disorders

Идентификаторы

DOI: 10.37547/ajast/volume06issue05-38

Цитирования и источники

Цитирований: 0Использованных источников: 0

Показатели — AkademScholar