Skip to main content
Article

Advancing Karakalpak Linguistics with Dictionary-Based Morphological Analysis: Implications for Text Correction Systems

Davlatyor MenglievNovosibirsk State University,Novosibirsk,Russia Urgench branch of Tashkent University of Information Technologies named after Muhammad al-Khwarizmi Urgench, UzbekistanVladimir BarakhninUrgench branch of Tashkent University of Information Technologies named after Muhammad al-Khwarizmi,Urgench,UzbekistanNodirbek BoltayevUrgench branch of Tashkent University of Information Technologies named after Muhammad al-Khwarizmi Urgench, UzbekistanS. PolatovaUrgench branch of Tashkent University of Information Technologies named after Muhammad al-Khwarizmi Urgench, UzbekistanMukhriddin EshkulovJizzakh polytechnic institute,Jizzakh,UzbekistanBahodir IbragimovUrgench State University,Urgench,Uzbekistan
2024en
ABI

Abstract

The article presents an original method of morphological analysis of the Karakalpak language, based on the dictionary approach, focusing on its application in text correction systems. The algorithm analyzes words, identifying their roots and affixes using an extensive dictionary of roots of more than ten-thousand-word forms and affixes, as well as a dictionary of exceptions for words that do not applicable for general grammatical rules. Proposed approach allows for high accuracy in determining the morphological structure of words and offers the user correction options for potentially misspelled words. The work contributes to the development of linguistic tools for the Karakalpak language and highlights the importance of developing technologies to support linguistic diversity and digital inclusion. In addition, as part of this work, the authors analyzed a number of existing scientific studies closely related to the topic under study in order to develop the most relevant and effective solution for automatic text correction of texts in Karakalpak language.

Topics

Identifiers

Citations and references

Metrics — AkademScholar · Coming soon