Перейти к основному содержанию
AkademIndex

Продукты

Для разработчиков

AkademBaseОткрытый API экосистемы
Препринт

Advancing Audio Fingerprinting Accuracy with AI and ML: Addressing Background Noise and Distortion Challenges

Navin KamuniSathishkumar ChintalaFisher Investments,Department of IT,USANaveen KunchakuriJyothi Swaroop Arlagadda NarasimharajuGoogle,Santa Clara,CAVenkat KumarAI ML M.Tech BITS Pilani WILP,India
2024en
ABI

Аннотация

Audio fingerprinting, exemplified by pioneers like Shazam, has transformed digital audio recognition. However, existing systems struggle with accuracy in challenging conditions, limiting broad applicability. This research proposes an AI and ML integrated audio fingerprinting algorithm to enhance accuracy. Built on the Dejavu Project’s foundations, the study emphasizes real-world scenario simulations with diverse background noises and distortions. Signal processing, central to Dejavu’s model, includes the Fast Fourier Transform, spectrograms, and peak extraction. The “constellation” concept and fingerprint hashing enable unique song identification. Performance evaluation attests to 100% accuracy within a 5-second audio input, with a system showcasing predictable matching speed for efficiency. Storage analysis highlights the critical space-speed trade-off for practical implementation. This research advances audio fingerprinting’s adaptability, addressing challenges in varied environments and applications.

Перевод пока недоступен

Идентификаторы

Цитирования и источники

Цитирований: 2Использованных источников: 0