Перейти к основному содержанию
AkademIndex

Продукты

Для разработчиков

AkademBaseскороОткрытый API экосистемы
Латиница
Русский
Статья

Speaker Separation: Use Neural Networks

Mekhriddin RakhimovTashkent University of Information Technology named after Muhammad al-Khwarizmi, TUIT, Tashkent, UzbekistanBoburkhon TuraevTashkent University of Information Technology named after Muhammad al-Khwarizmi, TUIT, Tashkent, UzbekistanTuraev KhurshidTashkent University of Information Technology named after Muhammad al-Khwarizmi, TUIT, Tashkent, Uzbekistan
ABI

Аннотация

Speaker separation is the problem of separating speakers in an audio. There could be any number of speakers and final result should state when speaker starts and ends. In this project, we analyze given audio file with 2 channels and 2 speakers (on separate channel). We train Neural Network for learning when a person is speaking. We use different type of Neural Networks specifically, Single Layer Perceptron (SLP), Multi Layer Perceptron (MLP), Recurrent Neural Network (RNN) and Convolution Neural Network (CNN) we achieve uzbek speech commands ~88% of accuracy with RNN.

Темы

Идентификаторы

Цитирования и источники

Показатели — AkademScholar · Скоро