Uzbek 65 million web corpus
Surayyo Khajibaeva (Urgench State University), Khabibulla Madatov (Urgench State University), Jernej Vičič (Research Centre of the Slovenian Academy of Sciences and Arts)
Zenodo (CERN European Organization for Nuclear Research), repository, 2026
Abstract
A contemporary web corpus gathered in 2026. The corpus contains approximately 60 million tokens.

Name of the corpus    Number of tokens    Number of sentences
Wikipedia resource    27711575            2481232
Web resources         33169827            2389912
School Corpus         1408830             154239