Lossy coding impact on speech recognition with convolutional neural networks

Szczegóły
Opis

Tytuł:: Lossy coding impact on speech recognition with convolutional neural networks
Autorzy:: Kucharski, Mateusz
Powiązania:: https://bibliotekanauki.pl/articles/24201985.pdf
Data publikacji:: 2022
Wydawca:: Politechnika Poznańska. Instytut Mechaniki Stosowanej
Tematy:: lossy coding
convolutional neural networks
speech recognition
kodowanie stratne
konwolucyjne sieci neuronowe
rozpoznawanie mowy
Źródło:: Vibrations in Physical Systems; 2022, 33, 3; art. no. 2022302
0860-6897
Język:: angielski
Prawa:: Wszystkie prawa zastrzeżone. Swoboda użytkownika ograniczona do ustawowego zakresu dozwolonego użytku
Dostawca treści:: Biblioteka Nauki
: Artykuł

Przejdź do źródła

This paper presents research of lossy coding impact on speech recognition with convolutional neural networks. For this purpose, google speech commands dataset containing utterances of 30 words was encoded using four most common all-purpose codecs: mp3, aac, wma and ogg. A convolutional neural network was taught using part of the original files and later tested with the rest of the files, as well as their counterparts encoded with different codecs and bitrates. The same network model was also taught using mp3 encoded data showing the biggest loss in effectiveness of the previous network. Results show that lossy coding does have an effect on speech recognition, especially for low bitrates.

Informacja

Powiązane pozycje