Перейти к основному содержанию
AkademIndex

Продукты

Для разработчиков

AkademBaseОткрытый API экосистемы
Статья

Adaptive Pronunciation Assessment Based on Acoustic Feature Profiling and Wav2vec 2.0 For English Language Learning

Abdumalikov Akmaljon Abduxoliq o'g'liComputer science and programming, Jizzakh branch of national university of Uzbekistan, 130100, UzbekistanRohmonqulov Muhammadyusuf Egamberdi o'g'liComputer science and programming, Jizzakh branch of national university of Uzbekistan, 130100, Uzbekistan
ABI

Аннотация

This paper presents a speaker-adaptive approach to automatic pronunciation assessment for English language learning, with a focus on Uzbek learners. The proposed methodology integrates acoustic signal processing, feature extraction, and deep learning-based modeling within a unified framework. A key contribution of the study is the introduction of dynamic speaker profiling based on fundamental frequency and energy, enabling adaptive dataset selection according to speaker characteristics such as age and gender. Mel-Frequency Cepstral Coefficients are employed for acoustic feature representation, while Wav2Vec 2.0 is utilized for deep contextual embedding and pronunciation evaluation. Experimental results demonstrate improved accuracy and efficiency compared to conventional approaches.

Перевод пока недоступен

Темы

Идентификаторы

Цитирования и источники

Цитирований: 0Использованных источников: 0