Article

Machine Learning for Uzbek Language Syntactic Analysis: A Review and Comparative Experiment

Rano Sayfullayeva, National University of Uzbekistan Named After Mirzo Ulugbek, Tashkent, Uzbekistan
Kasimova Ziyoda, National University of Uzbekistan Named After Mirzo Ulugbek, Tashkent, Uzbekistan
Aziza Furkatovna Shamahmudova, Samarkand State Institute of Foreign Languages, Samarkand, Uzbekistan
Umidjon Kuziyev, Namangan State University, Namangan, Uzbekistan
Matluba Yakubova
Adina D. Egamberganova, Cyber University, Nurafshon, Uzbekistan

2025

Abstract

This article discusses approaches to the automatic syntactic analysis of Uzbek texts using statistical machine learning methods: Naive Bayes, support vector machines, and linear regression. The emphasis is on the specifics of Uzbek as an agglutinative language, which demands close attention to morphological analysis and to flexible word order. The proposed methods demonstrate stable results with limited volumes of labeled data and relatively low computational cost. The study includes an analysis of the accuracy of syntactic dependency identification, a description of corpus preparation and data labeling, and recommendations for further improving the algorithms and expanding the experimental base. The results can be used to develop full-featured machine translation systems, automatic grammatical error correction, and other applications that process written Uzbek. In addition, the authors conducted a comparative analysis of existing solutions, which helps establish the relevance of the work.

Topics

Advanced Computational Techniques in Science and Engineering; Economic and Industrial Development; Education, Innovation and Language Studies

Identifiers

DOI: 10.1109/apeie66761.2025.11289373

Citations and references

Cited by 0 · 19 references