Maqola

Multi-Task Learning for Uzbek News Text Classification

Bahor EshmirzayevaTashkent State University of Economics,Department of Economics,Tashkent,UzbekistanShirali KadyrovNew Uzbekistan University,Department of General Education,Tashkent,Uzbekistan

2026

ABI

Annotatsiya

We explore multi-task learning for Uzbek news text classification in a low-resource setting. We introduce a new dataset of more than sixteen thousand Uzbek news articles collected from Kun.uz, annotated for three supervised tasks: topic classification, author gender identification, and publication year prediction. Using a shared-encoder, multi-head architecture based on a pretrained Uzbek transformer model, we compare singletask baselines with several multi-task learning variants, including layer-wise aggregation and cross-task attention. Experimental results show that transformer-based models outperform classical baselines across all tasks. Multi-task learning produces generally stable performance, with modest gains for topic classification and consistent results for gender and temporal prediction. Our analysis using confusion matrices and shared-representation visualizations shows that, in Uzbek news classification under lowresource conditions, task difficulty is mainly determined by how well the labels align with the underlying semantic representations, rather than by architectural complexity. Our study provides the first systematic evaluation of multi-task learning for Uzbek news classification and contributes an annotated dataset and empirical insights for future low-resource natural language processing research.

Mavzular

Text and Document Classification Technologies Advanced Computational Techniques in Science and Engineering Data Mining Algorithms and Applications

Identifikatorlar

DOI: 10.1109/icecco67619.2026.11488794

Iqtiboslar va manbalar

0 ta iqtibos15 ta foydalanilgan manba

Koʻrsatkichlar — AkademScholar · Tez orada