Skip to main content
Other

Uzbek 65 million web corpus

Surayyo KhajibaevaUrgench State UniversityKhabibulla MadatovUrgench State UniversityJernej VičičResearch Centre of the Slovenian Academy of Sciences and Arts
ABI

Abstract

A contemporary web corpus gathered in 2026. The corpus contains approximately 60 million tokens. Name of the corpus Number of tokens Number of sentences Wikipedia resource 27711575 2481232 Web resources 33169827 2389912 School Corpus 1408830 154239

Identifiers

Citations and references

Cited by 00 references