Speech Enhancement Based on Constrained Low-rank Sparse Matrix Decomposition Integrated with Temporal Continuity Regularisation


Title: Speech Enhancement Based on Constrained Low-rank Sparse Matrix Decomposition Integrated with Temporal Continuity Regularisation
Authors: Sun, Chengli
Yuan, Conglin
Link: https://bibliotekanauki.pl/articles/178075.pdf
Publication date: 2019
Publisher: Polska Akademia Nauk. Czasopisma i Monografie PAN
Subjects: speech enhancement
temporal continuity
low-rank decomposition
sparse decomposition
Source: Archives of Acoustics; 2019, 44, 4; 681-692
0137-5075
Language: English
Rights: CC BY-SA: Creative Commons Attribution-ShareAlike 4.0
Content provider: Biblioteka Nauki
Type: Article


Speech enhancement under strong noise conditions is a challenging problem. Low-rank and sparse matrix decomposition (LSMD) theory has recently been applied to speech enhancement with good results. Existing LSMD algorithms treat each frame as an individual observation. However, real-world speech usually has a temporal structure, and its acoustic characteristics vary slowly as a function of time. In this paper, we propose a speech enhancement method based on temporal continuity constrained low-rank sparse matrix decomposition (TCCLSMD). In this method, speech separation is formulated as a TCCLSMD problem, and temporal continuity constraints are imposed in the LSMD process. We develop an alternating optimisation algorithm for decomposing the noisy spectrogram. With TCCLSMD, the recovered speech spectrogram is more consistent with the structure of the clean speech spectrogram, which leads to more stable and reasonable results than the existing LSMD algorithm. Experiments with various types of noise show that the proposed algorithm outperforms traditional speech enhancement algorithms, yielding less residual noise and lower speech distortion.
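The idea in the abstract can be illustrated with a minimal sketch. It is not the authors' TCCLSMD algorithm, whose exact objective and update rules are not given in this record. The sketch assumes, as is common in the LSMD literature, that the noise part of a magnitude spectrogram Y is low-rank (L) and the speech part is sparse (S), and it adds a simple quadratic penalty on differences between adjacent frames of S as a stand-in for the temporal continuity constraint. The function names, the parameters `rank`, `lam` and `mu`, and the penalty itself are all illustrative assumptions.

```python
import numpy as np


def truncated_svd(M, rank):
    """Best rank-`rank` approximation of M (Eckart-Young)."""
    U, s, Vt = np.linalg.svd(M, full_matrices=False)
    return (U[:, :rank] * s[:rank]) @ Vt[:rank]


def soft_threshold(X, tau):
    """Proximal operator of tau * ||X||_1."""
    return np.sign(X) * np.maximum(np.abs(X) - tau, 0.0)


def temporal_grad(S):
    """Gradient of 0.5 * sum_t ||S[:, t] - S[:, t-1]||^2 with respect to S."""
    D = S[:, 1:] - S[:, :-1]
    G = np.zeros_like(S)
    G[:, 1:] += D
    G[:, :-1] -= D
    return G


def objective(Y, L, S, lam, mu):
    """Assumed cost: data fit + sparsity + temporal smoothness of S."""
    fit = 0.5 * np.sum((Y - L - S) ** 2)
    sparsity = lam * np.sum(np.abs(S))
    smooth = 0.5 * mu * np.sum((S[:, 1:] - S[:, :-1]) ** 2)
    return fit + sparsity + smooth


def tc_lowrank_sparse(Y, rank=2, lam=0.1, mu=0.5, n_iter=50):
    """Alternate between a rank-constrained L update and a proximal
    gradient step on S (hypothetical sketch, not the published method)."""
    L = np.zeros_like(Y)
    S = np.zeros_like(Y)
    # 1 + 4*mu bounds the Lipschitz constant of the smooth part in S.
    step = 1.0 / (1.0 + 4.0 * mu)
    history = []
    for _ in range(n_iter):
        # Low-rank (assumed noise) update: exact minimiser for fixed S.
        L = truncated_svd(Y - S, rank)
        # Sparse (assumed speech) update: one proximal gradient step.
        grad = (L + S - Y) + mu * temporal_grad(S)
        S = soft_threshold(S - step * grad, step * lam)
        history.append(objective(Y, L, S, lam, mu))
    return L, S, history
```

In use, Y would be the magnitude of a short-time Fourier transform of the noisy signal (frequency bins by frames), and the enhanced signal would be resynthesised from S with the noisy phase. Setting `mu=0` drops the temporal term, so each column of S is then penalised independently of its neighbours, which corresponds to the frame-by-frame treatment the abstract attributes to existing LSMD algorithms.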
