Building a Comprehensive Uzbek Lexicon: Bridging Dialects for Text Standardization
Abstract
In this study, the authors developed a dictionary of formal Uzbek and its dialects that can be used to standardize texts mixing several Uzbek dialects into a single formal format. The dictionary was developed jointly with linguists and dialectology experts and contains more than 210,000 words and affixes (70,000 for each dialect) to support full analysis of word forms. The work focuses on the three main dialects of Uzbek: Karluk, Oguz, and Kipchak. The article also describes the morphological analysis of word forms, the stages of processing and transliteration (translation) from a dialectal form to the formal one, and related technical issues. Finally, the authors compare their work with existing alternatives, give an objective assessment of each, and explain how their own work differs.