Article

Wavelet Transform Enhanced Signal Processing for Low-Latency Audio Recognition in Edge Devices

Haider M. FahimUniversity of Hilla,Faculty of Sciences,AI Department,Babylon,Iraq,51011D KalidossKalinga University,Raipur,IndiaKhabibullayeva Sayyorakhon Makhamadali KiziTuran International University,Faculty of Humanities & Pedagogy,Namangan,UzbekistanAbdullah Ghanim JaberUniversity of Information Technology and Communication,Baghdad,Iraq

2025

ABI

Abstract

Low-latency audio recognition in edge devices is critical for real-time applications such as voice assistants, surveillance, and wearable tech. Wavelet Transform has emerged as a powerful tool in signal processing due to its ability to analyze non-stationary signals at multiple resolutions. However, existing methods often suffer from high computational costs, limited noise resilience, and suboptimal performance under constrained edge environments, making them less suitable for real-time deployment. To address these challenges, this paper introduces a novel framework named WESP-LAR (Wavelet Enhanced Signal Processing for Low-latency Audio Recognition), which leverages Discrete Wavelet Transform (DWT) for efficient time-frequency feature extraction and integrates lightweight neural models for classification. WESP-LAR is optimized for low-memory usage and real-time execution on edge processors. The proposed method enhances audio feature representation while significantly reducing latency and computational overhead, making it ideal for embedded AI scenarios. Experimental results demonstrate that WESP-LAR achieves superior accuracy and robustness across diverse audio datasets, outperforming traditional FFT-based and raw waveform approaches.

Topics

Music and Audio Processing Speech Recognition and Synthesis Speech and Audio Processing

Identifiers

DOI: 10.1109/iccr67387.2025.11291859

Citations and references

Cited by 028 references

Metrics — AkademScholar