Uzbek 65 million web corpus
Surayyo KhajibaevaUrgench State UniversityKhabibulla MadatovUrgench State UniversityJernej VičičResearch Centre of the Slovenian Academy of Sciences and Arts
Zenodo (CERN European Organization for Nuclear Research)repository2026
ABI
Аннотация
A contemporary web corpus gathered in 2026. The corpus contains approximately 60 million tokens. Name of the corpus Number of tokens Number of sentences Wikipedia resource 27711575 2481232 Web resources 33169827 2389912 School Corpus 1408830 154239
Перевод пока недоступен
Идентификаторы
Цитирования и источники
Цитирований: 0Использованных источников: 0