Article

UzSentiment Sentiment Analysis Methods for Uzbek Texts

Elov Botir BoltayevichAlisher Navoi Tashkent State University of Uzbek Language and Literature,Department of Computer Linguistics and Digital Technology,Tashkent,UzbekistanYakubov Islamjon AhmedjanovichAlisher Navoi Tashkent State University of Uzbek Language and Literature,Translation Theory and Practice department,Tashkent,UzbekistanMalika Suyunova Odil QiziState University of Uzbek Language and Literature,Alisher Navoi Tashkent,Department of Computational Linguistics and Digital Technology,Tashkent,UzbekistanSafarova Maftuna Zoir QizBukhara state university,Department of Uzbek linguistics and journalism,Bukhara,UzbekistanAbdullayev Abdulla QuranbayevichInnovation and Training of Scientific and Pedagogical Personnel of the Urganch Innovative University,the Department of Scientific Research,Urganch,UzbekistanMukimova Gulandom AkhadovnaBukhara State University,Department of Russian Language and Literature,Bukhara,Uzbekistan

2026

ABI

Abstract

This article is devoted to modern methods of sentiment analysis in Uzbek texts and the practical experience of applying them. The study compares two main approaches Naive Bayes and Support Vector Machine (SVM) - theoretically and practically, based on a corpus called UzSentiment, which consists of 100,000 texts labelled with positive and negative tags. The article begins with a literature review of existing international and Uzbek-language research in the field of sentiment analysis, highlighting the key challenges faced by lowresource and morphologically rich languages. Then, the selected models are described in detail with their mathematical formulations (Bayesian probability, margin optimisation). All models are trained on the UzSentiment corpus and evaluated using Accuracy, Precision, Recall, and F1-score, with results compared through tables. The findings show that SVM achieves higher accuracy compared to Naive Bayes, while both approaches offer advantages in terms of computational efficiency. Additionally, the strengths and weaknesses of each method, optimisation opportunities for the Uzbek language, and practical application scenarios are examined. This study focuses on classical machine learning approaches, while deep learning models are considered beyond the scope of this work.

Topics

Sentiment Analysis and Opinion Mining Emotion and Mood Recognition Text and Document Classification Technologies

Identifiers

DOI: 10.1109/icnlp69856.2026.11527652

Citations and references

Cited by 08 references

Metrics — AkademScholar