- Title:
- Frequency Selection Based Separation of Speech Signals with Reduced Computational Time Using Sparse NMF
- Authors:
-
Varshney, Y. V.
Abbasi, Z. A.
Abidi, M. R.
Farooq, O.
- Links:
- https://bibliotekanauki.pl/articles/176829.pdf
- Publication date:
- 2017
- Publisher:
- Polska Akademia Nauk. Czytelnia Czasopism PAN
- Subjects:
-
sparse NMF
non-negative matrix factorisation
mixed speech recognition
machine learning
- Description:
- This paper describes the use of wavelet decomposition to speed up the separation of mixed speech signals by non-negative matrix factorisation (NMF). It is assumed that basis vectors for the training data of the individual speakers have been recorded beforehand. The magnitude spectrogram of the mixed signal is factorised with NMF, taking the sparseness of speech signals into account. The high-frequency components of the signal contain only a very small share of its energy. Rejecting them reduces the size of the input and therefore the computational time of the matrix factorisation. The low-energy part of the signal is separated using wavelet decomposition. The work covers wideband microphone speech signals and standard audio signals from digital video equipment. Compared with an existing model, the proposed model improves separation capability in terms of the correlation between the separated and the original signals. The obtained signal-to-distortion ratio (SDR) and signal-to-interference ratio (SIR) are also higher than those of the existing model. The proposed model also reduces computational time, which results in faster operation.
- Source:
-
Archives of Acoustics; 2017, 42, 2; 287-295
0137-5075
- Appears in:
- Archives of Acoustics
- Content provider:
- Biblioteka Nauki
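
The separation idea summarised in the abstract above can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes a Euclidean cost with an L1 sparsity penalty on the activations and multiplicative updates, and it replaces the paper's wavelet decomposition with a plain truncation of the spectrogram to its lowest frequency bins. The function names, the `sparsity` weight, and the `keep_bins` parameter are hypothetical and chosen only for illustration.

```python
import numpy as np


def sparse_nmf_activations(V, W, sparsity=0.1, n_iter=200, eps=1e-9, seed=0):
    """Estimate non-negative activations H so that V is approximately W @ H.

    W (pre-trained speaker bases) is kept fixed. The update is the standard
    multiplicative rule for a Euclidean cost plus an L1 penalty on H
    (an assumption; the paper may use a different cost function).
    """
    rng = np.random.default_rng(seed)
    H = rng.random((W.shape[1], V.shape[1])) + eps
    WtV = W.T @ V  # constant across iterations
    WtW = W.T @ W
    for _ in range(n_iter):
        H *= WtV / (WtW @ H + sparsity + eps)
    return H


def separate(V_mix, W_a, W_b, keep_bins, sparsity=0.1):
    """Split a mixed magnitude spectrogram into two speaker estimates.

    Only the lowest `keep_bins` frequency rows are factorised, which shrinks
    the matrices and hence the cost of each update (a simple stand-in for
    the frequency selection described in the abstract).
    """
    V_low = V_mix[:keep_bins]
    W = np.hstack([W_a[:keep_bins], W_b[:keep_bins]])
    H = sparse_nmf_activations(V_low, W, sparsity=sparsity)

    k = W_a.shape[1]
    est_a = W[:, :k] @ H[:k]  # speaker A reconstruction
    est_b = W[:, k:] @ H[k:]  # speaker B reconstruction

    # Soft masks distribute the observed low-band magnitude between speakers.
    total = est_a + est_b + 1e-9
    return V_low * est_a / total, V_low * est_b / total
```

Given bases `W_a` and `W_b` of shape (frequency bins, components) and a mixture spectrogram `V_mix` of shape (frequency bins, frames), `separate(V_mix, W_a, W_b, keep_bins=32)` returns two low-band magnitude estimates. Turning them back into waveforms would additionally need the mixture phase and an inverse transform, which this sketch leaves out.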