Article

Corpus-Based Error Analysis of Uzbek EFL Learners’ Academic Writing

Barno KutlimuratovaUrgench State University,Department of Theory of Translation and Practice,Urgench,UzbekistanElmurod KuriyozovUrgench State University,Department of Computer Sciences and Artificial Intelligence Technologies,Urgench,UzbekistanAbdulla UrazbaevUrgench Ranch University of Technology,Urgench,UzbekistanG. RakhimovaUrgench State University,Department of Theory of Translation and Practice,Urgench,Uzbekistan

2025

ABI

Abstract

English academic writing is a frequent source of difficulty for Uzbek learners of English as Foreign Language, particularly regarding learning grammar, vocabulary, and punctuation. For empirical evidence on these difficulties, we approached this issue in a computational way by constructing a small learner corpus consisting of 40 IELTS Academic Writing essays (about 9,000 words) from undergraduate students at one of the Uzbek higher educational institutions. The corpus was converted to digital form and incorporated into Sketch Engine corpus analysis platform, and all texts were manually annotated using a 13-category error-tagging system covering spelling, punctuation, articles, word choice, morphology, and syntax. Our quantitative assessments found most frequent errors and a comparison of error frequencies between first- and last-year undergraduate learners. A gender-based assessment found female learners to have averaged fewer errors, although this was controlled by male learners’ preference for more challenging topics for their writings. In this work, we propose a novel dataset of written learner corpus of Uzbek essays and present experimental results of our computational approach into this dataset analysis.

Topics

Second Language Acquisition and Learning Text Readability and Simplification Natural Language Processing Techniques

Identifiers

DOI: 10.1109/apeie66761.2025.11289237

Citations and references

Cited by 08 references

Metrics — AkademScholar · Coming soon