Статья

Stacking Ensemble Framework for Hate Speech Detection in Bangla Videos

Shamrin Jahan MimRangamati Science and Technology University,Department of Computer Science and Engineering,Rangamati,Bangladesh,4500Tanjim MahmudRangamati Science and Technology University,Dept. of Computer Science and Engineering,Rangamati,Bangladesh,4500Md Hasan AliRangamati Science and Technology University,Department of Computer Science and Engineering,Rangamati,BangladeshMohammad Tarek AzizChittagong University of Engineering & Technology,Department of Computer Science and Engineering,Chittagong,Bangladesh

2024en

ABI

Аннотация

The proliferation of internet usage and social media platforms has significantly enhanced the ability of individuals to express their opinions on various topics. However, this freedom of expression sometimes morphs into a vehicle for disseminating hate speech, leading to increased incidents of cyberbullying, violations, and conflicts. Particularly on video-sharing websites, which have become a prominent stage for such activities due to their widespread use and the dynamic nature of video content. This study aims to address the issue of hate speech in video content by developing a robust method for detecting hate speech in Bangla language videos. The focus is on the spoken content within these videos, which is a primary vector for the transmission of harmful messages. We constructed a comprehensive dataset by extracting and converting audio from a collection of videos into text. Utilizing this dataset, we applied machine learning techniques and deep learning models to analyze and classify the content. Specifically, our approach involves a stacking ensemble model that combines the strengths of Convolutional Neural Networks (CNN) and Bidirectional Gated Recurrent Units (BiGRU) with the analytical capabilities of customized feature extraction applied to Multinomial Naive Bayes and Random Forest classifiers serving as a meta-model. The proposed stacking ensemble model demonstrates a high level of efficacy, achieving an accuracy rate of 96% in detecting hate speech within the tested video content. This performance indicates a significant advancement over existing methods, underlining the effectiveness of our hybrid, multi-model approach.

Перевод пока недоступен

Идентификаторы

DOI: 10.1109/compas60761.2024.10796759

Цитирования и источники

Цитирований: 3Использованных источников: 0

Показатели — AkademScholar