Asosiy kontentga oʻtish
AkademIndex

Mahsulotlar

Ishlab chiquvchilar uchun

AkademBasetez oradaEkotizim uchun ochiq API
Lotin
Oʻzbek
Maqola

A Corpus-Based Model of the Middle Turkic Language using Weighted Averaging

B.I. BoborahimovDigital Technologies and Artificial Intelligence Development Research InstituteDiana AkhmedjanovaUzbekistan State World Languages UniversityKh.D. SharipovDigital Technologies and Artificial Intelligence Development Research Institute
ABI

Annotatsiya

This paper formalizes and extends the classic concept of the "Middle Turkic"language (Karimov–Mutalov, 1992) by applying tools from corpus and computational linguistics. The proposed model frames the selection of an "optimal"form for a linguistic unit as a multi-criteria optimization problem over a set of factors, including frequency, prevalence, simplicity, cultural compatibility, and a matrix of interlingual mutual intelligibility. To ensure stability and interpretability, the model introduces soft (softmax) and "saturating" weights, domain and diachronic modifiers, and inter-language "fairness"constraints. The model achieves up to 89% accuracy in linguistic form selection, a 78% intelligibility level, high grammatical consistency (0.92), and a high assimilation rate (0.85). The paper concludes by discussing the model’s potential applications in machine translation, its limitations, and directions for future validation.

Mavzular

Identifikatorlar

Iqtiboslar va manbalar

0 ta iqtibos0 ta foydalanilgan manba
Koʻrsatkichlar — AkademScholar · Tez orada