Article

Democratizing AI Linguistic Technologies for Understanding Diverse Social Voices

Isayeva DildoraTashkent Institute of Irrigation and Agriculture Mechanization Engineers, National Research University,Department of Teaching theory and Methodology,Tashkent city,UzbekistanOtaxon Khayitboyev ShokirovichMamun University,Department of Romano-Germanic Philology,UzbekistanGulora YuldashevaUrgench State University,Department of Computer sciences.,Urgench,UzbekistanAbhinav SinghalRano AlimardanovaTermez University of Economics and Service,Department of Pedagogy and Psychology,Termez,UzbekistanLazizbek AbdirahmonovUrgench State Pedagogical Institute,Department of Physical Culture,Urgench City,Uzbekistan

2025

ABI

Abstract

Current AI linguistic technologies predominantly reflect the perspectives and linguistic patterns of dominant social groups, creating barriers for equitable representation of diverse communities and marginalized voices in digital spaces. This study aims to develop and evaluate a framework for democratizing AI linguistic technologies to better understand and represent diverse social voices across different demographic, cultural, and linguistic backgrounds. We employed a mixed-methods approach combining large-scale corpus analysis of underrepresented dialects and sociolects, community-participatory design sessions with marginalized groups, and the development of inclusive natural language processing models using transfer learning and domain adaptation techniques. Our methodology included collecting linguistic data from 15 diverse communities, implementing bias detection algorithms, and creating culturally-aware language models through collaborative annotation processes involving community members. The results demonstrate that our democratized AI framework achieved 23% higher accuracy in understanding diverse linguistic patterns compared to conventional models, with significant improvements in sentiment analysis (18% increase in F1-score) and intent recognition (31% reduction in misclassification) for underrepresented groups. The democratized approach also showed enhanced cultural sensitivity scores and reduced algorithmic bias across all tested demographic categories. This research contributes to the growing field of inclusive AI by providing a replicable framework for developing linguistic technologies that authentically represent diverse social voices, with implications for improving digital equity, enhancing cross-cultural communication, and ensuring fair representation in AI-driven social platforms and decisionmaking systems.

Topics

Mental Health via Writing Computational and Text Analysis Methods Hate Speech and Cyberbullying Detection

Identifiers

DOI: 10.1109/iscon65210.2025.11341518

Citations and references

Cited by 022 references

Metrics — AkademScholar · Coming soon