Перейти к основному содержанию
AkademIndex

Продукты

Для разработчиков

AkademBaseскороОткрытый API экосистемы
Латиница
Статья

The Problem of Pos Tagging and Stemming for Agglutinative Languages (Turkish, Uyghur, Uzbek Languages)

Elov Botir BoltayevichTashkent State University of Uzbek Language and Literature named Alisher Navo'i,Dept. of Computational Linguistics and Digital Technologies,Tashkent,UzbekistanEşref AdalıIstanbul Technical University,Computer Engineering and Informatics Faculty,Istanbul,TürkiyeKhamroeva Shahlo MirdjonovnaTashkent State University of Uzbek Language and Literature named Alisher Navo'i,Dept. of Computational Linguistics and Digital Technologies,Tashkent,UzbekistanAbdullayeva Oqila Xolmo‘minovnaTashkent State University of Uzbek Language and Literature named Alisher Navo'i,Dept. of Computational Linguistics and Digital Technologies,Tashkent,UzbekistanXusainova Zilola YuldashevnaTashkent State University of Uzbek Language and Literature named Alisher Navo'i,Dept. of Computational Linguistics and Digital Technologies,Tashkent,UzbekistanXudayberganov Nizomaddin Uktamboy O'g'liTashkent State University of Uzbek Language and Literature named Alisher Navo'i,Dept. of Computational Linguistics and Digital Technologies,Tashkent,Uzbekistan
2023en
ABI

Аннотация

The number of possible word forms in agglutinative languages is theoretically unlimited. This, in turn, creates the problem of POS tagging (part-of-speech) of out-of-vocabulary (OOV) words in agglutinative languages. In agglutinative languages, words are formed by adding suffixes to the stem. Due to the occurrence of phonetic harmony and disharmony while adding suffixes to the stem, it is necessary to analyze both phonetic and morphological changes. When solving many NLP tasks, it is necessary to reduce word forms to the stem (stemming). Removing all inflectional affixes from a word and lemmatizing the rest of the word is considered one of the important tasks of natural language processing (NLP), and this process is called stemming. The stemming process is important in information retrieval (IR) systems.

Темы

Идентификаторы

Цитирования и источники

Показатели — AkademScholar · Скоро