Article

Hybrid Feature Selection and Optimiser-assisted Ensemble Learning for Credit Card Fraud Detection

Virendra BoraGraphic Era Hill University,School of Management,IndiaZokir MamadiyarovTermez University of Economics and Service,Department of Finance and Tourism,TermezAshraf AbdualimovDenau Institute of Entrepreneurship and Pedagogy,Department of Digital Economics,Denau,UzbekistanZafar BerdinazarovGraduate School of Business and Entrepreneurship,Department of Business Management and Entrepreneurship (MBA),Tashkent,UzbekistanRavshan AsamovTashkent State Agrarian, University International Agriculture University,Tashkent,UzbekistanGulnoza SabirovaUrgench State University,Department of Foreign Philology,Urgench,Uzbekistan

2025

ABI

Abstract

A modular detection pipeline is presented that integrates boundary-aware resampling (SMOTE-ENN), multi-criteria feature prioritization (TOPSIS), nonlinear compression via an undercomplete autoencoder, and a PSO-stabilised Extreme Learning Machine (ELM) within a stacking ensemble (SVM, KNN, PSO-ELM; Gradient-Boosting meta-learner). The architecture is designed to balance the conflicting goals of minority-class recoverability, low false-positive rates, computational efficiency during inference, and feature interpretability. TOPSIS combines complementary statistical and information-theoretic criteria to produce a compact, interpretable subset, which is then mapped onto a latent manifold by the autoencoder to reduce residual noise and encode nonlinear dependencies. PSO stabilises the ELM hidden-layer initialisation and hyperparameters, reducing variability in ensemble contributions. Empirical testing on the standard credit-card dataset (284,807 transactions) shows superior discrimination (Accuracy ≈ 99.95%, Recall ≈ 99.97%, AUC ≈ 1.00) and an acceptable false-positive rate (FPR ≈ 0.02%); cross-domain validation on PaySim confirms robustness (Accuracy ≈ 98.84%, AUC ≈ 0.99). The study demonstrates that combining boundary-aware resampling, multi-criterion ranking, compact nonlinear representation, and optimiser-assisted ELM initialisation significantly improves minority-class detection while limiting false alarms, offering a practical path toward deployable, high-fidelity fraud detection systems.

Topics

Imbalanced Data Classification Techniques Financial Distress and Bankruptcy Prediction Artificial Intelligence in Healthcare

Identifiers

DOI: 10.1109/ic-eeta66496.2025.11548372

Citations and references

Cited by 025 references

Metrics — AkademScholar