Article

Deep Learning for Low-Resource Language: Sentiment Analysis of Karakalpak Texts in Energy Sector by Fine-Tuning mBERT

Davlatyor Mengliev, Cyber University, Nurafshon, Uzbekistan
Diloro Nabiyeva, Andijan State University, Andijan, Uzbekistan
Aydin Sultanova, Nukus State Pedagogical Institute named after Ajiniyaz, Nukus, Uzbekistan
Barno Toirkulovna Turdikulova, Gulistan State Pedagogical Institute, Gulistan, Uzbekistan
Nigora Tursunova, National University of Uzbekistan, Tashkent, Uzbekistan
Shakhribon Musurmankulova, Gulistan State Pedagogical Institute, Gulistan, Uzbekistan

2025

ABI

Abstract

This article addresses sentiment analysis of Karakalpak-language texts in the energy sector. Most existing solutions target widely spoken languages, and adapting them to low-resource languages such as Karakalpak and Uzbek remains difficult, and sometimes impossible, because the languages differ in nature. No existing solution targets Karakalpak, particularly for user reviews in the energy sector. The authors propose a neural network model trained on an annotated corpus of Karakalpak texts, consisting of complaints, suggestions, and general comments from 2019 to 2024. Experimental evaluation supports the reliability of the approach, with accuracy, recall, and F1 score of 93%, 94%, and 93.5%, respectively. The results are applicable to sentiment analysis of consumer feedback in the electric power industry and in related sectors of the economy.
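The three reported figures can be sanity-checked against the standard definition of F1 as the harmonic mean of precision and recall. The sketch below assumes, purely for illustration, that the 93% figure is read as precision; the abstract labels it accuracy, so this reading is an assumption and not something the record states. The function name `f1_score` is a hypothetical helper, not taken from the paper.

```python
def f1_score(precision: float, recall: float) -> float:
    """Return the harmonic mean of precision and recall."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Assumption: 0.93 is treated as precision (the abstract calls it accuracy).
assumed_precision = 0.93
reported_recall = 0.94
reported_f1 = 0.935

computed_f1 = f1_score(assumed_precision, reported_recall)
print(f"computed F1 = {computed_f1:.4f}, reported F1 = {reported_f1}")
```

Under that assumption the computed value agrees with the reported 93.5% to within rounding, which would be consistent with the first figure being a precision-style metric.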

Topics

Advanced Computational Techniques in Science and Engineering; Sentiment Analysis and Opinion Mining; Language Acquisition and Education

Identifiers

DOI: 10.1109/icec2nt65402.2025.11380009

Citations and references

Citations: 0
References used: 16
