Статья

Reconstructing Proto-Languages Using Neural Machine Translation and Deep Learning

Zilola SattorovaTashkent State University of Oriental Studies,UzbekistanGulnara SeydametovaKarakalpak State University Named After Berdakh,UzbekistanKhilolakhon KhujamberdievaAndijan State Institute of Foreign Languages,UzbekistanAnorgul AshirovaYunus JumaniyozovUrgench State University,Department of Psychology and Pedagogy,Urgench,UzbekistanBakhtigul MamadaminovaNational Research University,Tashkent Institute of Irrigation and Agricultural Mechanization Engineers,Tashkent,Uzbekistan

2025en

ABI

Аннотация

Reconstructing proto-languages is a crucial task in historical linguistics, helping to trace language evolution and relationships. Recent advancements in Neural Machine Translation (NMT) and deep learning offer novel approaches to automate and enhance this process. Traditional methods rely on manual linguistic reconstruction, which is time-consuming, prone to human error, and limited by the availability of linguistic data. These approaches struggle with handling large datasets and complex phonetic or grammatical transformations across languages. To address these limitations, we propose a framework leveraging NMT models trained on linguistic corpora of modern and ancient languages. Our approach integrates sequence-to-sequence learning, attention mechanisms, and phoneme embedding techniques to predict proto-language forms with greater accuracy. The model is further refined using comparative linguistic data and unsupervised learning techniques to identify patterns in language evolution. The proposed method automates proto-language reconstruction, enhances the accuracy of linguistic predictions, and efficiently processes vast linguistic datasets. This enables researchers to uncover deeper historical connections between languages. Our findings demonstrate that deep learning models can outperform traditional methods in reconstructing linguistic ancestries. The results indicate a promising direction for computational historical linguistics, offering a scalable and data-driven approach to understanding language evolution.

Перевод пока недоступен

Темы

Natural Language Processing Techniques Language and cultural evolution

Идентификаторы

DOI: 10.1109/iccies63851.2025.11032538

Цитирования и источники

Цитирований: 0Использованных источников: 10

Показатели — AkademScholar