Maqola

A Technique for Automatic Extraction of Basis Words: A Case Study on “Uzbek Primary School Corpus”

Khabibulla MadatovUrgench State University,Computer Science department,Urgench,UzbekistanShukurla BekchanovUrgench State University,The departments of Computer science,Urgench,Khorezm,UzbekistanSurayyo KhajibaevaUrgench State University,The departments of Computer science,Urgench,Khorezm,Uzbekistan

2024en

ABI

Annotatsiya

Extracting the basis words from Uzbek language texts is one of the most important tasks that facilitate the school student's learning process—this study, mainly selected such words from among the words in the Uzbek language texts, which can be used to express almost all words. Namely, the process has reduced the set of words to such an extent that it is possible to construct other words using these words. A high-frequency detection method was used to detect these basis words. For the investigation, we have collected 35 primary school textbooks for grades 1–4 approved by the Ministry of Preschool and School Education of the Republic of Uzbekistan and named the “Uzbek Primary School Corpus” (UPSC) by the authors. As a result, it was determined that a first-grade student should know 366 basis words, a second-grade student 462, a third-grade student 486, and a fourth-grade student 512 basis words.

Mavzular

Lexicography and Language Studies Natural Language Processing Techniques Second Language Acquisition and Learning

Identifikatorlar

DOI: 10.1109/ubmk63289.2024.10773460

Iqtiboslar va manbalar

3 ta iqtibos 11 ta foydalanilgan manba

Koʻrsatkichlar — AkademScholar · Tez orada