Stacking Ensemble Framework for Hate Speech Detection in Bangla Videos
Аннотация
The proliferation of internet usage and social media platforms has significantly enhanced the ability of individuals to express their opinions on various topics. However, this freedom of expression sometimes morphs into a vehicle for disseminating hate speech, leading to increased incidents of cyberbullying, violations, and conflicts. Particularly on video-sharing websites, which have become a prominent stage for such activities due to their widespread use and the dynamic nature of video content. This study aims to address the issue of hate speech in video content by developing a robust method for detecting hate speech in Bangla language videos. The focus is on the spoken content within these videos, which is a primary vector for the transmission of harmful messages. We constructed a comprehensive dataset by extracting and converting audio from a collection of videos into text. Utilizing this dataset, we applied machine learning techniques and deep learning models to analyze and classify the content. Specifically, our approach involves a stacking ensemble model that combines the strengths of Convolutional Neural Networks (CNN) and Bidirectional Gated Recurrent Units (BiGRU) with the analytical capabilities of customized feature extraction applied to Multinomial Naive Bayes and Random Forest classifiers serving as a meta-model. The proposed stacking ensemble model demonstrates a high level of efficacy, achieving an accuracy rate of 96% in detecting hate speech within the tested video content. This performance indicates a significant advancement over existing methods, underlining the effectiveness of our hybrid, multi-model approach.
Перевод пока недоступен