Мақола

Development of Hybrid Algorithm for Named Entity Recognition in Uzbek Public Administration Texts

Nilufar AbdurakhmonovaNational University of Uzbekistan Named After Mirzo Ulugbek,Tashkent,UzbekistanNilufar Istamovna AdizovaBukhara State University,Bukhara,UzbekistanDilnoza JamolitdinovaKokand State University,Kokand,UzbekistanAbbos BegimovCyber University,Nurafshon,UzbekistanА Н СултановаNukus State Pedagogical Institute Named After Ajiniyaz,Nukus,UzbekistanGulkhayo JuraevaTurin Polytechnic University in Tashkent,Tashkent,Uzbekistan

2025

ABI

Аннотация

The accelerating digital transformation of public administration in Uzbekistan generates large volumes of unstructured legal text that needs to be indexed, verified, and searched in real time. This paper presents a hybrid named entity recognition framework adapted for text analysis in this area. The algorithm starts with two rule-based preprocessing steps — alphabet standardization (Cyrillic → Latin) and dialect normalization — that mitigate orthographic variability and regional lexical drift. In addition, Bidirectional LSTM coupled with convolutional neural networks and a conditional random fields layer were used as neural network technologies for the named entity recognition language model. The system is trained on a curated, manually annotated corpus of Uzbek government documents that covers five main entity types: date/ time, organization, location, person, and money. It is worth noting that the sentences in the corpus are marked up using the BIOES scheme, which provides comprehensive marking of the entire text. Experiments show that each preprocessing component provides measurable performance improvements and that the hybrid architecture outperforms the baseline Transformers.

Ҳали таржима қилинмаган

Мавзулар

Topic Modeling Text and Document Classification Technologies Advanced Text Analysis Techniques

Идентификаторлар

DOI: 10.1109/apeie66761.2025.11289443

Иқтибослар ва манбалар

0 та иқтибос18 та фойдаланилган манба

Кўрсаткичлар — AkademScholar