Article

Predicting Phonetic Changes Using Transformer-Based Phoneme Models

Mashkhura HusanovaAndijan State Institute of Foreign LanguagesBakhtiyor KholmuhamedovSamarkand State University Named After Sharof Rashidov,Samarkand,UzbekistanAnorgul AshirovaZilola SattorovaTashkent State University of Oriental Studies,UzbekistanZarnigor MirzayevaAndijan State Institute of Foreign Languages,UzbekistanLaziz BakhtiyorovNational Research University,Tashkent Institute of Irrigation and Agricultural Mechanization Engineers,Tashkent,Uzbekistan

2025en

ABI

Abstract

Predicting phonetic changes is a crucial aspect of historical linguistics, aiding in the reconstruction of language evolution. This study presents a Transformer-Based Phoneme Model (HL-TPM) to analyze and predict phonetic shifts over time with improved accuracy. Existing methods often rely on rule-based or statistical approaches, which struggle with capturing complex, long-range dependencies in phonetic transformations. These limitations lead to reduced predictive performance and inefficiencies in modeling diachronic phonological changes. To address these challenges, HL-TPM leverages Transformer-Based deep learning architectures, which excel at sequence-to-sequence learning. The model is trained on historical phonetic data, utilizing self-attention mechanisms to identify patterns and predict pronunciation shifts in different linguistic lineages. Key techniques include contextual phoneme embedding, phonetic sequence alignment, and cross-temporal feature extraction. The proposed method enhances the study of language evolution by providing a robust, data-driven approach to phoneme prediction. It facilitates the reconstruction of proto-languages, improves phonetic annotation in historical texts, and aids linguistic researchers in tracing phonological shifts. Experimental results show that HL-TPM outperforms traditional models in accuracy and generalization, effectively predicting phonetic changes across languages. These findings demonstrate the potential of Transformer models in advancing computational historical linguistics.

Topics

Speech Recognition and Synthesis

Identifiers

DOI: 10.1109/iccies63851.2025.11032634

Citations and references

Cited by 014 references

Metrics — AkademScholar