Temat: LiP - Katalog OPAC zbiorów

Skocz do pozycji: 1.

Tytuł:: Characteristics of the use of coupled hidden Markov models for audio-visual Polish speech recognition
Autorzy:: Kubanek, M.
Bobulski, J.
Adrjanowicz, L.
Powiązania:: https://bibliotekanauki.pl/articles/201266.pdf
Data publikacji:: 2012
Wydawca:: Polska Akademia Nauk. Czytelnia Czasopism PAN
Tematy:: coupled hidden Markov models
audiovisual speech recognition
lip reading
Opis:: This paper focuses on combining audio-visual signals for Polish speech recognition in conditions of the highly disturbed audio speech signal. Recognition of audio-visual speech was based on combined hidden Markov models (CHMM). The described methods were developed for a single isolated command, nevertheless their effectiveness indicated that they would also work similarly in continuous audiovisual speech recognition. The problem of a visual speech analysis is very difficult and computationally demanding, mostly because of an extreme amount of data that needs to be processed. Therefore, the method of audio-video speech recognition is used only while the audiospeech signal is exposed to a considerable level of distortion. There are proposed the authors’ own methods of the lip edges detection and a visual characteristic extraction in this paper. Moreover, the method of fusing speech characteristics for an audio-video signal was proposed and tested. A significant increase of recognition effectiveness and processing speed were noted during tests – for properly selected CHMM parameters and an adequate codebook size, besides the use of the appropriate fusion of audio-visual characteristics. The experimental results were very promising and close to those achieved by leading scientists in the field of audio-visual speech recognition.
Źródło:: Bulletin of the Polish Academy of Sciences. Technical Sciences; 2012, 60, 2; 307-316
0239-7528
Pojawia się w:: Bulletin of the Polish Academy of Sciences. Technical Sciences
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 2.

Tytuł:: An Acoustic Study of the Emphatic Occlusive [ṭ] in School-Going Children with Cleft Palate or Cleft Lip
Autorzy:: Baazi, Khaled
Guerti, Mhania
Powiązania:: https://bibliotekanauki.pl/articles/2141635.pdf
Data publikacji:: 2022
Wydawca:: Polska Akademia Nauk. Czytelnia Czasopism PAN
Tematy:: cleft palate
cleft lip and cleft palate
acoustic analysis
school children
Opis:: The aim of this acoustic study is to analyse the phoneme [ṭ] produced by school children surgically operated on for the cleft palate or cleft lip, in order to examine their vocal characteristics, to provide speech therapists with numerous concrete analyses of voice and speech, to effectively support them and to prevent some serious outcomes on their psychological and academic development. The motivation for this study was mainly stemming from the difficulties that Algerian schoolchildren with clefts encounter in the pronunciation of this phoneme. To carry out the study, several acoustic parameters were investigated in terms of the fundamental frequency F0, the first three formants F1, F2, and F3, the energy E0, the Voice Onset Time (VOT), the durations [CV] and [V] of the subsequent vowel [a]. For the analysis, further important parameters in the field of pathological speech were deployed, namely the degree of disturbance of F0 (jitter), the degree of disturbance of intensity (shimmer) and the HNR (Harmonics to Noise Ratio). Results revealed disturbance in the values of F1, F2, and F3 and stability in the values of F0. Another important reported aspect is the increase in the value of the VOT due to the difficulties in controlling the plosives’ successive closure and release.
Źródło:: Archives of Acoustics; 2022, 47, 2; 141-149
0137-5075
Pojawia się w:: Archives of Acoustics
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 3.

Tytuł:: Mimicking speaker’s lip movement on a 3D head model using cosine function fitting
Autorzy:: Lüsi, I.
Anbarjafari, G.
Powiązania:: https://bibliotekanauki.pl/articles/199798.pdf
Data publikacji:: 2017
Wydawca:: Polska Akademia Nauk. Czytelnia Czasopism PAN
Tematy:: 3D lip movement modelling
mathematical modelling
depth information analysis
cosine function fitting
human-computer interaction
modelowanie ruchu warg 3D
modelowanie matematyczne
funkcja cosinus
interakcja człowiek-komputer
Opis:: Real-time mimicking of human facial movement on a 3D head model is a challenge which has attracted attention of many researchers. In this research work we propose a new method for enhancing the capturing of the shape of lips. We present an automatic lip movement tracking method which employs a cosine function to interpolate between extracted lip features in order to make the detection more accurate. In order to test the proposed method, mimicking lip movements of a speaker on a 3D head model is studied. Microsoft Kinect II is used in order to capture videos and both RGB and depth information are used to locate the mouth of a speaker followed by fitting a cosine function in order to track the changes of the features extracted from the lips.
Źródło:: Bulletin of the Polish Academy of Sciences. Technical Sciences; 2017, 65, 5; 733-739
0239-7528
Pojawia się w:: Bulletin of the Polish Academy of Sciences. Technical Sciences
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Informacja

Wyszukujesz frazę "LiP" wg kryterium: Temat

Źródło danych

Dostawca treści

Kolekcja

Rok wydania

Wydawca

Temat

Autor

Typ dokumentu

Język