Article

Sound Event Detection: A tutorial

Annamaria Mesaros, Computing Sciences, Tampere University, Tampere, Finland
Toni Heittola, Computing Sciences, Tampere University, Tampere, 33720, Finland
Tuomas Virtanen, Faculty of Information Technology and Communication Sciences, Tampere University, Tampere, FI-33720, Finland
Mark D. Plumbley, Centre for Vision, Speech and Signal Processing (CVSSP), University of Surrey, Guildford, GU2 7XH, United Kingdom of Great Britain and Northern Ireland
2021, English

Abstract

Imagine standing on a street corner in the city. With your eyes closed you can hear and recognize a succession of sounds: cars passing by, people speaking, their footsteps when they walk by, and the continuous falling of rain. The recognition of all these sounds and interpretation of the perceived scene as a city street soundscape comes naturally to humans. It is, however, the result of years of "training": encountering and learning associations among the vast varieties of sounds in everyday life, the sources producing these sounds, and the names given to them.
