Detection and Localization of Audio Event for Home Surveillance Using CRNN


Title:: Detection and Localization of Audio Event for Home Surveillance Using CRNN
Authors:: Suruthhi, V. S.
Smita, V.
Rolant Gini, J.
Ramachandran, K. I.
Links:: https://bibliotekanauki.pl/articles/2055274.pdf
Publication date:: 2021
Publisher:: Polska Akademia Nauk. Czytelnia Czasopism PAN
Subjects:: convolutional recurrent neural network
CRNN
gated recurrent unit
GRU
long short-term memory
LSTM
sound event localization and detection
SELD
Source:: International Journal of Electronics and Telecommunications; 2021, 67, 4; 735--741
2300-1933
Language:: English
Rights:: CC BY: Creative Commons Attribution 4.0
Content provider:: Biblioteka Nauki
: Article


Safety and security are a high priority in people’s lives, and a home surveillance system helps keep people and their property secure. This paper proposes an audio surveillance system that both detects and localizes audio (sound) events. The combined task of detecting and localizing audio events is known as Sound Event Localization and Detection (SELD). In this work, SELD is performed with a Convolutional Recurrent Neural Network (CRNN) architecture. The CRNN is a stack of convolutional neural network (CNN), recurrent neural network (RNN) and fully connected neural network (FNN) layers. It takes multichannel audio as input, extracts features, and performs detection and localization of the input audio events in parallel. The SELD results obtained by the CRNN with gated recurrent units (GRU) and with long short-term memory (LSTM) units are compared and discussed. The CRNN with LSTM units gives a 75% F1 score and 82.8% frame recall for one overlapping sound. The proposed audio surveillance system with LSTM units therefore produces better detection and overall performance for one overlapping sound.
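
The abstract reports an F1 score and a frame recall but does not define them. The sketch below shows one common way such metrics are computed in SELD work, and it is an assumption rather than the authors' evaluation code: F1 is taken frame by frame over binary event-activity labels, and frame recall is taken as the fraction of frames in which the number of predicted active events equals the number of reference active events. The function names and the toy data are hypothetical.

```python
def f1_score(reference, predicted):
    """Frame-wise F1 over binary activity matrices (frames x event classes).

    Assumption: every (frame, class) cell is scored independently.
    """
    tp = fp = fn = 0
    for ref_frame, pred_frame in zip(reference, predicted):
        for ref, pred in zip(ref_frame, pred_frame):
            if ref and pred:
                tp += 1      # event active in both reference and prediction
            elif pred:
                fp += 1      # predicted but not present in the reference
            elif ref:
                fn += 1      # present in the reference but missed
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def frame_recall(reference, predicted):
    """Fraction of frames whose predicted event count matches the reference.

    Assumption: only the number of active events per frame is compared.
    """
    if not reference:
        return 0.0
    matches = sum(
        1 for ref_frame, pred_frame in zip(reference, predicted)
        if sum(ref_frame) == sum(pred_frame)
    )
    return matches / len(reference)


# Toy example: 4 frames, 2 event classes (1 = active, 0 = inactive).
reference = [[1, 0], [1, 0], [0, 1], [0, 0]]
predicted = [[1, 0], [0, 0], [0, 1], [0, 1]]

print(f"F1 score:     {f1_score(reference, predicted):.3f}")
print(f"Frame recall: {frame_recall(reference, predicted):.3f}")
```

In this toy case two cells are true positives, one is a false positive and one is a false negative, and two of the four frames have matching event counts. Published SELD evaluations often compute F1 over longer segments instead of single frames, so values from this sketch are not directly comparable to the figures in the abstract.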
