Informacja

Drogi użytkowniku, aplikacja do prawidłowego działania wymaga obsługi JavaScript. Proszę włącz obsługę JavaScript w Twojej przeglądarce.

Wyszukujesz frazę "43.72.-p" wg kryterium: Temat


Wyświetlanie 1-4 z 4
Tytuł:
Development of Large Vocabulary Continuous Speech Recognition for Polish
Autorzy:
Demenko, G.
Szymański, M.
Cecko, R.
Kuśmierek, E.
Lange, M.
Wegner, K.
Klessa, K.
Owsianny, M.
Powiązania:
https://bibliotekanauki.pl/articles/1490468.pdf
Data publikacji:
2012-01
Wydawca:
Polska Akademia Nauk. Instytut Fizyki PAN
Tematy:
43.72.-p
43.72.+q
Opis:
In this study, the results of acoustic modeling used in a large vocabulary continuous speech recognition system are presented. The acoustic models have been developed with the use of a phonetically controlled large corpus of contemporary spoken Polish. Evaluation experiments showed that relatively good speech recognition results may be obtained with adequate training material, taking into account: (a) the presence of lexical stress; (b) speech styles (a variety of segmental and prosodic structures, various degrees of spontaneity of speech (spontaneous vs. read speech), pronunciation variants and dialects); (c) the influence of the sound level and background noises. The present large vocabulary continuous speech recognition evaluation results were obtained with Sclite assessment software. Moreover, the article delivers information about the speech corpus structure and contents and also a brief outline of the design and architecture of the automatic speech recognition system.
Źródło:
Acta Physica Polonica A; 2012, 121, 1A; A-086-A-091
0587-4246
1898-794X
Pojawia się w:
Acta Physica Polonica A
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Analysis of Natural Speech under Stress
Autorzy:
Demenko, G.
Jastrzębska, M.
Powiązania:
https://bibliotekanauki.pl/articles/1490489.pdf
Data publikacji:
2012-01
Wydawca:
Polska Akademia Nauk. Instytut Fizyki PAN
Tematy:
43.72.-p
43.72.Uv
Opis:
This paper presents how voice stress is manifested in the acoustic and phonetic structure of the speech signal. Out of 60000 authentic Police 997 emergency phone calls, 22000 were automatically selected, a few hundred of which were chosen for acoustic evaluation, the basis for selection being a perceptual assessment. In highly stressful conditions (e.g. panic) a systematic dynamic over-one-octave shift in pitch and significant increase in speech tempo was observed. In states of depression a systematic down shift in pitch and significant decrease in speech tempo was observed. Basic statistical measurements for stressed and neutral speech run over the database showed the relevance of the arousal and potency dimension in stress processing. In speech produced under fear an upward shift in pitch register was significant (in comparison to neutral speech), while speech recorded during experiencing anger was characterized by an increase in $F_0$ range.
Źródło:
Acta Physica Polonica A; 2012, 121, 1A; A-092-A-095
0587-4246
1898-794X
Pojawia się w:
Acta Physica Polonica A
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Analysis of Voice Modifications for Persons After Tonsillectomy
Autorzy:
Potępa, Ł.
Szaleniec, J.
Wszołek, W.
Steczko, A.
Składzień, J.
Powiązania:
https://bibliotekanauki.pl/articles/1197520.pdf
Data publikacji:
2014-04
Wydawca:
Polska Akademia Nauk. Instytut Fizyki PAN
Tematy:
43.72.-p
43.70.Gr
Opis:
The goal of the research described in the present paper was the determination of modification range for voice acoustic parameters resulting from tonsillectomy. Within the scope of the described research program, an attempt has been made to determine the changes of selected voice parameters for persons after such a treatment and also to elaborate work out some premises for prediction of potential voice modifications for persons who have not yet decided to undergo such a treatment. In order to achieve the goal, analyses have been carried out for voice utterances of persons before the tonsillectomy surgery and after the treatment. The first voice recordings took place between one and three days before the surgery. The post-treatment recordings have been carried out about 6 weeks after the surgery, as a procedure accompanying the follow-up examinations. In the present paper, an analysis has been carried out concerning phonemes /a/, /e/, /i/, and /u/ with prolonged phonation. The completed research shows that for evaluation of voice modification in the aspect of changes resulting from tonsillectomy, the most useful parameters are some of the mel-cepstral coefficients, the formant frequencies, and also the relative power coefficients.
Źródło:
Acta Physica Polonica A; 2014, 125, 4A; A-49-A-56
0587-4246
1898-794X
Pojawia się w:
Acta Physica Polonica A
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Convolutive Blind Signal Separation Spatial Effectiveness in Speech Intelligibility Improvement
Autorzy:
Kociński, J.
Sęk, A.
Libiszewski, P.
Powiązania:
https://bibliotekanauki.pl/articles/1504381.pdf
Data publikacji:
2011-06
Wydawca:
Polska Akademia Nauk. Instytut Fizyki PAN
Tematy:
43.72.-p
43.60.-c
43.60.+d
Opis:
Blind signal separation is one of the latest methods to improve the signal to noise ratio. The main objective of blind source separation is the transformation of mixtures of recorded signals to obtain each source signal at the output of the procedure, assuming that they are statistically independent. For acoustic signals it can be concluded that the correct separation is possible only if the source signals are spatially separated. That finding suggests analogies with the classical spatial filtering (beamforming). In this study we analyzed an effect of the angular separation of two source signals (i.e. speech and babble noise) to improve speech intelligibility. For this purpose, we chose the blind source separation algorithm based on the convolutive separation, based on second order statistics only. As a system of sensors a dummy head was used (one microphone inside each ear canal), which simulated two hearing aids of a hearing impaired person. The speech reception threshold, before and after the blind source separation was determined. The results have shown significant improvement in speech intelligibility after applying blind source separation (speach reception threshold fell even more than a dozen dB) in cases where the source signals were angularly separated. However, in cases where the source signals were coming from the same directions, the improvement was not observed. Moreover, the effectiveness of the blind source separation, to a large extent, depended on the relative positions of signal sources in space.
Źródło:
Acta Physica Polonica A; 2011, 119, 6A; 996-999
0587-4246
1898-794X
Pojawia się w:
Acta Physica Polonica A
Dostawca treści:
Biblioteka Nauki
Artykuł
    Wyświetlanie 1-4 z 4

    Ta witryna wykorzystuje pliki cookies do przechowywania informacji na Twoim komputerze. Pliki cookies stosujemy w celu świadczenia usług na najwyższym poziomie, w tym w sposób dostosowany do indywidualnych potrzeb. Korzystanie z witryny bez zmiany ustawień dotyczących cookies oznacza, że będą one zamieszczane w Twoim komputerze. W każdym momencie możesz dokonać zmiany ustawień dotyczących cookies