Temat: mowy - Katalog OPAC zbiorów

Skocz do pozycji: 1.

Tytuł:: Coding effects on changes in formant frequencies in Japanese speech signals
Autorzy:: Kucharski, Mateusz
Brachmański, Stefan
Powiązania:: https://bibliotekanauki.pl/articles/128083.pdf
Data publikacji:: 2019
Wydawca:: Politechnika Poznańska. Instytut Mechaniki Stosowanej
Tematy:: speech
speech coding
formants
mowa
kodowanie mowy
formanty
Opis:: This paper presents results of research on effects of lossy coding on formant frequencies for japanese speech signals. Additionally changes in pitch of the voice were inspected. For this research four most popular lossy coding standards were chosen, MP3, WMA, AAC and OGG, and compared to original WAVE files. Audio files were created by the author based on ITU-T P.501 recommendation in two sampling frequencies, 16 kHz and 48 kHz, and converted into chosen codecs. To extract the data from audio files, open license software Praat was used. Due to discovered differences in time duration between original and encoded files, that also differed between individual codecs, only OGG and WMA standards were compared directly. MP3 and AAC standards were divided into Japanese syllables, averaged and then compared into also averaged WAVE files. Results were additionally compared to FLAC lossless codec.
Źródło:: Vibrations in Physical Systems; 2019, 30, 1; 1-8
0860-6897
Pojawia się w:: Vibrations in Physical Systems
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 2.

Tytuł:: Recognition of speaker’s age group and gender for a large database of telephone-recorded voices
Autorzy:: Staroniewicz, Piotr
Powiązania:: https://bibliotekanauki.pl/articles/2202432.pdf
Data publikacji:: 2022
Wydawca:: Politechnika Poznańska. Instytut Mechaniki Stosowanej
Tematy:: speech processing
automatic age recognition
przetwarzanie mowy
automatyczne rozpoznawanie wieku
Opis:: The paper presents the results of the automatic recognition of age group and gender of speakers performed for the large SpeechDAT(E) acoustic database for the Polish language, containing recordings of 1000 speakers (486 males/514 females) aged 12 to 73, recorded in telephone conditions. Three age groups were recognised for each gender. Mel Frequency Cepstral Coefficients (MFCC) were used to describe the recognized signals parametrically. Among the classification methods tested in this study, the best results were obtained for the SVM (Support Vector Machines) method.
Źródło:: Vibrations in Physical Systems; 2022, 33, 2; art. no. 2022203
0860-6897
Pojawia się w:: Vibrations in Physical Systems
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 3.

Tytuł:: Effect of changing body position on selected voice parameters
Autorzy:: Węglarz, Karolina
Wszołek, Wiesław
Hemmerling, Daria
Powiązania:: https://bibliotekanauki.pl/articles/24201974.pdf
Data publikacji:: 2022
Wydawca:: Politechnika Poznańska. Instytut Mechaniki Stosowanej
Tematy:: speech acoustics
signal processing
medical diagnostics
akustyka mowy
przetwarzanie sygnałów
diagnostyka medyczna
Opis:: Correct posture is a key element in the proper functioning of the entire body. Both defects and postural disorders lead to overload syndromes and degenerative changes in the musculoskeletal system. Different body positions correlate with respiratory parameters, which form the basis in modifying loudness and accentuation when speaking or singing Body posture can affect the quality of the voice signal and its fatigue. As movement and duration intensify, vocal effort increases. What is still open, however, is the problem of speech signal evaluation, especially in order to obtain assessments useful in the context of supporting medical diagnosis, optimizing therapy and monitoring rehabilitation. Meanwhile, such evaluations are what we need in medicine, rehabilitation and sports. This paper presents excerpts from a study of the effects of changes in posture and fatigue in healthy subjects, and those with phonation disorders, on changes in the acoustic parameters of the speech signal.
Źródło:: Vibrations in Physical Systems; 2022, 33, 3; art. no. 2022320
0860-6897
Pojawia się w:: Vibrations in Physical Systems
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 4.

Tytuł:: Impact of the level of noise and echo on the reaction time of listeners in the perception of logatoms
Autorzy:: Brachmański, Stefan
Dobrucki, Andrzej
Powiązania:: https://bibliotekanauki.pl/articles/2146651.pdf
Data publikacji:: 2021
Wydawca:: Politechnika Poznańska. Instytut Mechaniki Stosowanej
Tematy:: perception
listener’s time reaction
speech quality
percepcja
czas reakcji słuchacza
jakość mowy
Opis:: The article presents the results of research regarding the impact of the degree of distortion and noise of the logatom (nonsense word) on the listener's reaction time. The study aimed to determine the maximum reaction time of listeners, which will allow determining the time after which the logatom will be exposed in the speech quality assessment method with an alternative choice. The research was carried out with the participation of a group of ten students. A strong relationship between the results obtained and the concentration of the listeners was found, as well as the effect of fatigue, training, and the gender of the listener. The obtained results indicate that in the method with an alternative choice before the logatom emission should appear 1.1 s initial sequence, which will eliminate the situation when the listeners did not recognize the initial phoneme transmitted from the logatom.
Źródło:: Vibrations in Physical Systems; 2021, 32, 2; art. no. 2021215
0860-6897
Pojawia się w:: Vibrations in Physical Systems
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 5.

Tytuł:: Lossy coding impact on speech recognition with convolutional neural networks
Autorzy:: Kucharski, Mateusz
Powiązania:: https://bibliotekanauki.pl/articles/24201985.pdf
Data publikacji:: 2022
Wydawca:: Politechnika Poznańska. Instytut Mechaniki Stosowanej
Tematy:: lossy coding
convolutional neural networks
speech recognition
kodowanie stratne
konwolucyjne sieci neuronowe
rozpoznawanie mowy
Opis:: This paper presents research of lossy coding impact on speech recognition with convolutional neural networks. For this purpose, google speech commands dataset containing utterances of 30 words was encoded using four most common all-purpose codecs: mp3, aac, wma and ogg. A convolutional neural network was taught using part of the original files and later tested with the rest of the files, as well as their counterparts encoded with different codecs and bitrates. The same network model was also taught using mp3 encoded data showing the biggest loss in effectiveness of the previous network. Results show that lossy coding does have an effect on speech recognition, especially for low bitrates.
Źródło:: Vibrations in Physical Systems; 2022, 33, 3; art. no. 2022302
0860-6897
Pojawia się w:: Vibrations in Physical Systems
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 6.

Tytuł:: Effect of highpass filtering on the speech transmission index
Autorzy:: Dziechciński, Paweł
Powiązania:: https://bibliotekanauki.pl/articles/24201998.pdf
Data publikacji:: 2022
Wydawca:: Politechnika Poznańska. Instytut Mechaniki Stosowanej
Tematy:: speech transmission index
STIPA
public address system
highpass filter
wskaźnik transmisji mowy
system nagłośnieniowy
filtry górnoprzepustowe
Opis:: Highpass filters are commonly used in the signal chain of public address systems. One of the reasons for using a highpass filter is to protect the loudspeaker from unwanted low-frequency signals. In addition, it can increase the intelligibility of speech. In this paper, the effect of the cutoff frequency and order of a highpass filter on the speech transmission index, the crest factor, and the sound level are presented. Analyses were performed for an ideal transmission channel, taking into account reverberation time, interfering noise, and high levels of sound. A computer model of the public address system developed by the author, based on the direct STIPA method, was used. This model enables analyses in the nonlinear range of power amplifier operation, which is often used in public address systems but is not considered in commercially available simulation programs.
Źródło:: Vibrations in Physical Systems; 2022, 33, 3; art. no. 2022306
0860-6897
Pojawia się w:: Vibrations in Physical Systems
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 7.

Tytuł:: A computer model for calculating the speech transmission index using the direct STIPA method
Autorzy:: Dziechciński, Paweł
Powiązania:: https://bibliotekanauki.pl/articles/128226.pdf
Data publikacji:: 2019
Wydawca:: Politechnika Poznańska. Instytut Mechaniki Stosowanej
Tematy:: speech transmission index
STIPA
attenuation of sound by the atmosphere
wskaźnik transmisji mowy
pochłanianie dźwięku przez atmosferę
Opis:: Computer models currently used for the simulation of the speech transmission index (STI) calculate the STI using the statistical method or are based on numerically determined impulse response of the transmission channel. The limitation of both these computational methods is that they do not allow to take into account the nonlinear properties of the transmission channel and fluctuating background noise. This paper presents a proposition of a model based on the direct STIPA method. This model allows computer simulations of STIPA for distributed sound systems, and enables analysis to include both changes in signal dynamics and fluctuating background noise. The paper presents the idea of the model and validation of its basic elements - the generator and the analyser. The possibilities of using the model for computer simulation of outdoor public address systems were also discussed.
Źródło:: Vibrations in Physical Systems; 2019, 30, 1; 1-8
0860-6897
Pojawia się w:: Vibrations in Physical Systems
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 8.

Tytuł:: Enhancing speech signals based on an mems microphone array and temporal differences in the incoming signal
Autorzy:: Felcyn, Jan
Raszewski, Michał
Powiązania:: https://bibliotekanauki.pl/articles/2202430.pdf
Data publikacji:: 2022
Wydawca:: Politechnika Poznańska. Instytut Mechaniki Stosowanej
Tematy:: microphone array
speech enhancement
direction of arrival
signal processing
macierz mikrofonów
uzdatnianie mowy
kierunek nadejścia sygnału
przetwarzanie sygnałów
Opis:: The development of the Internet of things and automatisation in everyday life also influences our houses. There are more and more devices on the market which can be controlled remotely. One kind of such control involves the use of voice signals. This method tends to use microphone arrays and dedicated algorithms to enhance the speech signal and recognize the words in it. In this project, a small 5-microphone array was developed. To enhance the quality of the signal, dedicated software was written. It consists of several modules, including the direction of arrival estimation, denoising, and differentiation between adults and children. The results showed that the custom algorithm can increase the signal to noise ratio by up to 6 dB.
Źródło:: Vibrations in Physical Systems; 2022, 33, 2; art. no. 2022202
0860-6897
Pojawia się w:: Vibrations in Physical Systems
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Informacja

Wyszukujesz frazę "mowy" wg kryterium: Temat

Źródło danych

Dostawca treści

Kolekcja

Rok wydania

Wydawca

Temat

Autor

Typ dokumentu

Język