Informacja

Drogi użytkowniku, aplikacja do prawidłowego działania wymaga obsługi JavaScript. Proszę włącz obsługę JavaScript w Twojej przeglądarce.

Wyszukujesz frazę "voice analysis" wg kryterium: Temat


Tytuł:
Acoustic Parameters in the Evaluation of Voice Quality of Choral Singers. Prototype of Mobile Application for Voice Quality Evaluation
Autorzy:
Szklanny, Krzysztof
Powiązania:
https://bibliotekanauki.pl/articles/178097.pdf
Data publikacji:
2019
Wydawca:
Polska Akademia Nauk. Czasopisma i Monografie PAN
Tematy:
web application
voice analysis
voice quality
acoustic analysis
COVAREP
Opis:
Choral singers are among intensive voice users whose excessive vocal effort puts them at risk of developing voice disorders. The aim of the work was to assess voice quality for choral singers in the choir at the Polish-Japanese Academy of Information Technology. This evaluation was carried out using the acoustic parameters from the COVAREP (A Collaborative Voice Analysis Repository For Speech Technologies) repository. A prototype of a mobile application was also prepared to allow the calculation of these parameters. The study group comprised 6 male and 19 female choir singers. The control group consisted of health non-singing individuals, 50 men and 39 women. Auditory perceptual assessment (using the RBH scale) as well as acoustic analysis were used to test the voice quality of all the participants. The voice quality of the female choir singers proved to be normal in comparison with the control group. The male choir singers were found to have tense voice in comparison with the controls. The parameters which proved most effective for voice evaluation were Peak Slope and Normalized Amplitude Quotient.
Źródło:
Archives of Acoustics; 2019, 44, 3; 439-446
0137-5075
Pojawia się w:
Archives of Acoustics
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Speech Emotion Recognition Based on Voice Fundamental Frequency
Autorzy:
Dimitrova-Grekow, Teodora
Klis, Aneta
Igras-Cybulska, Magdalena
Powiązania:
https://bibliotekanauki.pl/articles/177227.pdf
Data publikacji:
2019
Wydawca:
Polska Akademia Nauk. Czasopisma i Monografie PAN
Tematy:
emotion recognition
speech signal analysis
voice analysis
fundamental frequency
speech corpora
Opis:
The human voice is one of the basic means of communication, thanks to which one also can easily convey the emotional state. This paper presents experiments on emotion recognition in human speech based on the fundamental frequency. AGH Emotional Speech Corpus was used. This database consists of audio samples of seven emotions acted by 12 different speakers (6 female and 6 male). We explored phrases of all the emotions – all together and in various combinations. Fast Fourier Transformation and magnitude spectrum analysis were applied to extract the fundamental tone out of the speech audio samples. After extraction of several statistical features of the fundamental frequency, we studied if they carry information on the emotional state of the speaker applying different AI methods. Analysis of the outcome data was conducted with classifiers: K-Nearest Neighbours with local induction, Random Forest, Bagging, JRip, and Random Subspace Method from algorithms collection for data mining WEKA. The results prove that the fundamental frequency is a prospective choice for further experiments.
Źródło:
Archives of Acoustics; 2019, 44, 2; 277-286
0137-5075
Pojawia się w:
Archives of Acoustics
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
The application of High-Speed camera (HS), acoustic analysis and Voice Handicap Index (VHI) questionnaire in diagnosis of voice disorders in elderly men
Autorzy:
Kosztyła-Hojna, Bożena
Zdrojkowski, Maciej
Duchnowska, Emilia
Powiązania:
https://bibliotekanauki.pl/articles/1397805.pdf
Data publikacji:
2019
Wydawca:
Index Copernicus International
Tematy:
acoustic voice analysis
HSDI
hypofunctional dysphonia
presbyphonia
vocal fold atrophy
Opis:
Objective: The process of ageing begins after 60 years of age and is referred to as presbyphonia (Vox senium). The causes include functional or organic voice disorders, often coexisting with dry upper respiratory tract infection. Introduction: The aim of the study is the use of high-speed camera and acoustic voice analysis in diagnostics of the clinical form of presbyphonia. M aterials and methods: The study included a group of 50 men, non-smokers, age from 51 to 72, who do not use their voice professionally. High-Speed Digital Imaging and HS camera have been used, allowing evaluation of real vibrations of vocal folds, along with acoustic voice analysis using a software by DiagNova Technologies. Results: VHI questionnaire has been used for self-assessment of voice disability. Visualizations of the larynx enabled recognition of hypofunctional dysphonia or atrophy of vocal folds that cause voice disorders. This was confirmed by parameters of voice acoustic evaluation: F0, NHR, narrowband spectrography. The pathological value of NHR and the presence of nonharmonic components in the range of high frequency levels indicated glottal insufficiency, recorded with the visualization technique of the larynx by HS camera. A significant shortening of maximum phonation time in relation to the control group has also been recorded. Discussion: The objective examination of voice pathology is crucial in diagnosis and rehabilitation, however, subjective assessment of the patient is important in the scope of the procedure used. The patient’s subjective self-rating assessment (VHI) confirmed the sense of voice disorders in elderly men, indicating the need for rapid and accurate clinical diagnosis.
Źródło:
Polish Journal of Otolaryngology; 2019, 73, 5; 25-30
0030-6657
2300-8423
Pojawia się w:
Polish Journal of Otolaryngology
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Bezpieczeństwo połączeń w telefonii PSTN
Safety calls PSTN telephony
Autorzy:
Piotrowski, Z.
Różanowski, K.
Gajewski, P.
Powiązania:
https://bibliotekanauki.pl/articles/91441.pdf
Data publikacji:
2012
Wydawca:
Warszawska Wyższa Szkoła Informatyki
Tematy:
voice spoofing
telefonia
impersonalizacja
bezpieczeństwo połączeń
analiza głosu
telephony
impersonalisation
security calls
voice analysis
Opis:
Odpowiednio wczesne zabezpieczenie krytycznych systemów infrastruktury na potencjalnie groźne ataki typu voice spoofing jest warunkowane opracowaniem skutecznych metod i istnieniem dedykowanych rozwiązań technicznych. Metody ataków i obrony przed impersonizacją skupiają się zasadniczo na dwóch obszarach: zmianie głosu abonenta na inny głos (wirtualny lub innej osoby) oraz nieautoryzowanej edycji komunikatów głosowych. W nowych generacjach ataków na łącza telefoniczne, w których następuje zmiana głosu mówcy w czasie rzeczywistym lub odtwarzany jest uprzednio spreparowany komunikat, stosuje się metody obrony polegające na m.in. weryfikacji wspólnie posiadanej wiedzy lub posiadanego klucza.
In order to protect critical infrastructure systems early enough against potentially dangerous attacks called spoofing voice it is required to develop effective methods and implement dedicated solutions. Methods of attack and defence against impersonalisation focus basically on two areas: changing of original voice to the voice of other subscriber (virtual simulation or voice of different person) or unauthorized editing of voice messages. The new generations of attacks on telephone lines, in which the speaker’s voice is being changed in real time or prepared message is being played, require other methods of defence involving verification of common knowledge or of the authorisation key.
Źródło:
Zeszyty Naukowe Warszawskiej Wyższej Szkoły Informatyki; 2012, 6, 8; 99-104
1896-396X
2082-8349
Pojawia się w:
Zeszyty Naukowe Warszawskiej Wyższej Szkoły Informatyki
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Perceptual and acoustic voice analysis in patients with glottis cancer after endoscopic laser cordectomy
Autorzy:
Kosztyła-Hojna, Bożena
Łuczaj, Jarosław
Berger, Greta
Duchnowska, Emilia
Zdrojkowski, Maciej
Łobaczuk-Sitnik, Anna
Biszewska, Jolanta
Powiązania:
https://bibliotekanauki.pl/articles/1397318.pdf
Data publikacji:
2020
Wydawca:
Index Copernicus International
Tematy:
laser cordectomy
glottis cancer
voice quality
voice acoustic analysis
Opis:
Introduction: Treatment of glottis cancer, despite oncological safety, should consider postoperative voice quality. CO2 laser endoscopic cordectomy allows radical removal of the tumor while maintaining respiratory, defensive and phonatory functions. The aim: The aim of the study is perceptual and acoustic evaluation of voice in patients after endoscopic CO2 III–Va laser cordectomy due to glottis cancer. Material and method: The study included 30 men after CO2 cordectomy. 13 (43%) patients underwent type III cordectomy, 6 (20%) – type IV; 11 (37%) – type Va. Voice quality has been assessed 6 months after the surgery. Control group included 30 healthy men of the same age. GRBAS scale has been used in perceptual evaluation of voice. Acoustic analysis has been performed using DiagnoScope Specjalista software. Narrowband spectrography and Maximum Phonation Time (MPT) measure has been performed. Results: In study group, voice has been classified as G1R1B0A0S0 after type III cordectomy; as G1R1B1A1S2 in type IV and as G2R1B1A0S3 in type Va. Acoustic evaluation revealed the highest values of F0, Jitter, Shimmer and NHR after Va cordectomy as well as non-harmonic components in narrowband spectrography and reduction of MPT. Conclusions: Postoperative voice quality depends on the type of cordectomy. Perceptual assessment indicates that type IV and Va cordectomy cause intensification of voice disorders. Parameters of acoustic evaluation increase with the extent of the procedure. The presence of non-harmonic components in narrowband spectrography increases with the extent of cordectomy, such as the reduction of MPT. Preservation of anterior commissure influences good voice quality in perceptual and acoustic assessment.
Źródło:
Polish Journal of Otolaryngology; 2020, 74, 3; 23-28
0030-6657
2300-8423
Pojawia się w:
Polish Journal of Otolaryngology
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Tests of basic voice stress detection techniques
Autorzy:
Staroniewicz, Piotr
Powiązania:
https://bibliotekanauki.pl/articles/128166.pdf
Data publikacji:
2019
Wydawca:
Politechnika Poznańska. Instytut Mechaniki Stosowanej
Tematy:
Voice Stress Analysis
Empirical Mode Decomposition
analiza napięcia głosowego
VSA
empiryczna dekompozycja sygnału
EMD
Opis:
The modern speech processing techniques enable new possibilities of potential applications. Besides speech and speaker recognition, also the information about speakers’ physical condition, emotional state or stress can be detected in speech signal. Since emotional stress can occur during deception, its detection in speech could be used for law or security services. The paper presents the comparative tests of two voice stress detection techniques: one based on trials of microtremors detection relying on an iterative EMD method (Empirical Mode Decomposition) and the second one based on the statistical analysis of fundamental frequency and MFCC parameters. The preliminary tests were carried on the group of 12 speakers (6 males and 6 females) answering yes/no to the list of a few dozen personal questions. The presented research revealed the speakers’ very high personal influence on the obtained results.
Źródło:
Vibrations in Physical Systems; 2019, 30, 1; 1-6
0860-6897
Pojawia się w:
Vibrations in Physical Systems
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Classification of Parkinson’s disease and other neurological disorders using voice features extraction and reduction techniques
Klasyfikacja choroby Parkinsona i innych zaburzeń neurologicznych z wykorzystaniem ekstrakcji cech głosowych i technik redukcji
Autorzy:
Majdoubi, Oumaima
Benba, Achraf
Hammouch, Ahmed
Powiązania:
https://bibliotekanauki.pl/articles/27315435.pdf
Data publikacji:
2023
Wydawca:
Politechnika Lubelska. Wydawnictwo Politechniki Lubelskiej
Tematy:
voice analysis
Parkinson’s disease
MFCC
PCA
naive Bayes kernel
machine learning
analiza głosu
choroba Parkinsona
naiwne jądro bayesowskie
uczenie maszynowe
Opis:
This study aimed to differentiate individuals with Parkinson's disease (PD) from those with other neurological disorders (ND) by analyzing voice samples, considering the association between voice disorders and PD. Voice samples were collected from 76 participants using different recording devices and conditions, with participants instructed to sustain the vowel /a/ comfortably. PRAAT software was employed to extract features including autocorrelation (AC), cross-correlation (CC), and Mel frequency cepstral coefficients (MFCC) from the voice samples. Principal component analysis (PCA) was utilized to reduce the dimensionality of the features. Classification Tree (CT), Logistic Regression, Naive Bayes (NB), Support Vector Machines (SVM), and Ensemble methods were employed as supervised machine learning techniques for classification. Each method provided distinct strengths and characteristics, facilitating a comprehensive evaluation of their effectiveness in distinguishing PD patients from individuals with other neurological disorders. The Naive Bayes kernel, using seven PCA-derived components, achieved the highest accuracy rate of 86.84% among the tested classification methods. It is worth noting that classifier performance may vary based on the dataset and specific characteristics of the voice samples. In conclusion, this study demonstrated the potential of voice analysis as a diagnostic tool for distinguishing PD patients from individuals with other neurological disorders. By employing a variety of voice analysis techniques and utilizing different machine learning algorithms, including Classification Tree, Logistic Regression, Naive Bayes, Support Vector Machines, and Ensemble methods, a notable accuracy rate was attained. However, further research and validation using larger datasets are required to consolidate and generalize these findings for future clinical applications.
Przedstawione badanie miało na celu różnicowanie osób z chorobą Parkinsona (PD) od osób z innymi zaburzeniami neurologicznymi poprzez analizę próbek głosowych, biorąc pod uwagę związek między zaburzeniami głosu a PD. Próbki głosowe zostały zebrane od 76 uczestników przy użyciu różnych urządzeń i warunków nagrywania, a uczestnicy byli instruowani, aby wydłużyć samogłoskę /a/ w wygodnym tempie. Oprogramowanie PRAAT zostało zastosowane do ekstrakcji cech, takich jak autokorelacja (AC), krzyżowa korelacja (CC) i współczynniki cepstralne Mel (MFCC) z próbek głosowych. Analiza składowych głównych (PCA) została wykorzystana w celu zmniejszenia wymiarowości cech. Jako techniki nadzorowanego uczenia maszynowego wykorzystano drzewa decyzyjne (CT), regresję logistyczną, naiwny klasyfikator Bayesa (NB), maszyny wektorów nośnych (SVM) oraz metody zespołowe. Każda z tych metod posiadała swoje unikalne mocne strony i charakterystyki, umożliwiając kompleksową ocenę ich skuteczności w rozróżnianiu pacjentów z PD od osób z innymi zaburzeniami neurologicznymi. Naiwny klasyfikator Bayesa, wykorzystujący siedem składowych PCA, osiągnął najwyższy wskaźnik dokładności na poziomie 86,84% wśród przetestowanych metod klasyfikacji. Należy jednak zauważyć, że wydajność klasyfikatora może się różnić w zależności od zbioru danych i konkretnych cech próbek głosowych. Podsumowując, to badanie wykazało potencjał analizy głosu jako narzędzia diagnostycznego do rozróżniania pacjentów z PD od osób z innymi zaburzeniami neurologicznymi. Poprzez zastosowanie różnych technik analizy głosu i wykorzystanie różnych algorytmów uczenia maszynowego, takich jak drzewa decyzyjne, regresja logistyczna, naiwny klasyfikator Bayesa, maszyny wektorów nośnych i metody zespołowe, osiągnięto znaczący poziom dokładności. Niemniej jednak, konieczne są dalsze badania i walidacja na większych zbiorach danych w celu skonsolidowania i uogólnienia tych wyników dla przyszłych zastosowań klinicznych.
Źródło:
Informatyka, Automatyka, Pomiary w Gospodarce i Ochronie Środowiska; 2023, 13, 3; 16--22
2083-0157
2391-6761
Pojawia się w:
Informatyka, Automatyka, Pomiary w Gospodarce i Ochronie Środowiska
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Acoustic and capacity analysis of the vocal organ in patients with functional and organic larynx disorders using the DiagnoScope Specialist software
Autorzy:
Owczarek, Kalina
Niewiadomski, Piotr
Olszewski, Jurek
Powiązania:
https://bibliotekanauki.pl/articles/1397511.pdf
Data publikacji:
2019
Wydawca:
Index Copernicus International
Tematy:
acoustic and capacity analysis
singing voice
Opis:
Aim: The aim of the study was to assess the acoustic and capacity analysis of singing voice using DiagnoScope Specialist software. Material and methods: The study was conducted in 131 adult subjects, including 74 women and 46 men aged 21–51, divided into 3 groups: I – 40 subjects (treatment group) – professional vocalists, II – 40 subjects (treatment group) – semiprofessional vocalists, III – 40 subjects (control group) – students of The Military Medical Faculty at the Medical University of Lodz – nonsingers. The research methodology included: primary medical history, physical examination (otolaryngological), videolaryngoscopic examination, the GRBAS scale for subjective voice evaluation, diagnostic voice acoustic and capacity analysis using DiagnoScope Specialist software, survey on lifestyle patterns which may affect voice quality. R esults: Average value of the fundamental frequency F0 was the highest in professional vocalists group; it was 316.46 Hz in women and 165.09 Hz in men. In semiprofessional vocalists group it was accordingly 260.50 Hz and 149.26 Hz, in nonsingers group it was accordingly 261.23 Hz and 159.27 Hz. The mean value of Jitter parameter in professional vocalists group was 0.30% in women and 0.54% in men, in semiprofessional vocalists group it was accordingly 0.31% and 0.57%, in nonsingers group it was 0.31% and 0.56%. The mean value of Shimmer parameter in professional vocalists group was 3.27% in women and 3.75% in men, in semiprofessional vocalists group it was accordingly 3.46% and 3.77%, in nonsingers group it was 4.33% and 4.39%. The mean value of the NHR index in professional vocalists group was 3.28% in women and 6.00% in men, in semiprofessional vocalists group it was accordingly 3.23% and 6.72%, in nonsingers group it was 3.89% and 6.13%. Conclusions: Values of the parameters which measure the character of the voice, relative period-to-period fundamental frequency perturbations, relative period-to-period amplitude perturbation and level of buzzing together with other methods have diagnostic and predictive value in early detection of voice disorders. Capacity analysis in singing voice showed very low values of the following parameters: phonation time, true phonation time, no phonation coefficient, voice efficiency coefficient and voice capacity.
Źródło:
Polish Journal of Otolaryngology; 2019, 73, 4; 21-28
0030-6657
2300-8423
Pojawia się w:
Polish Journal of Otolaryngology
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
The usefulness of the acoustic and the capacity analysis of singing voice
Autorzy:
Nowosielska-Grygiel, Joanna
Olszewski, Jurek
Powiązania:
https://bibliotekanauki.pl/articles/1397792.pdf
Data publikacji:
2019
Wydawca:
Index Copernicus International
Tematy:
acoustic and capacity analysis
singing voice
Opis:
Abstract Introduction: The aim of the study was to assess the acoustic and capacity analysis of singing voice using DiagnoScope Specialist software. Material and methods: The study was conducted in 120 adults subjects, including 74 women and 46 men aged 21-5, were divided into 3 groups: I -40 subjects (treatment group) – professional vocalists, II- 40 subjects (treatment group) – semiprofessional vocalists, III- 40 subjects (control group) – students of The Military Medical Faculty at Medical University of Lodz – nonsingers. The research methodology included: primary medical history, physical examination (otolaryngological), vdeolaryngoscopic examination, the GRBAS scale for subjective voice evaluation, diagnostic voice acoustic and capacity analysis using DiagnoScope Specialist software, survey on lifestyle patterns which may affect voice quality. Results: Average value of the fundamental frequency F0 was the highest in professional vocalists’ group was 316,46 Hz in women and 165,09 Hz in men, in semiprofessional vocalists’ group was accordingly 260,50 Hz and 149,26 Hz, in nonsingers’ group was accordingly 261,23 Hz and 159, 27 Hz. Average value of Jitter parameter in professional vocalists’ group was 0,30% in women and 0,54% in men, in semiprofessional vocalists’ group was accordingly 0,31% and 0,57%, in nonsingers’ group was 0,31% and 0,56%. Average value of Shimmer parameter in professional vocalists’ group was 3,27% in women and 3,75% in men, in semiprofessional vocalists’ group was accordingly 3,46% and 3,77%, in nonsingers’ group was 4,33% and 4,39%. Average value of NHR index in professional vocalists’ group was 3,28% in women and 6,00% in men, in semiprofessional vocalists’ group was accordingly 3,23% and 6,72%, in nonsingers’ group was 3,89% and 6,13%. Conclusions: Values of the parameters which are measuring the character of the voice, relative period-to-period fundamental frequency perturbations, relative period-to-period amplitude perturbation and level of buzzing together with other methods have diagnostic and predictive value in early detection of voice disorders. Capacity analysis in singing voice showed very low values of the following parameters: phonation time, true phonation time, no phonation coefficient, voice efficiency coefficient and voice capacity. Key words: The acoustic and capacity analysis, singing voice
Źródło:
Polish Journal of Otolaryngology; 2019, 73, 3; 16-25
0030-6657
2300-8423
Pojawia się w:
Polish Journal of Otolaryngology
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Subjective and objective assessment of voice quality in pregnancy
Autorzy:
Kosztyła-Hojna, Bożena
Łobaczuk-Sitnik, Anna
Biszewska, Jolanta
Moskal-Jasińska, Diana
Kraszewska, Anna
Zdrojkowski, Maciej
Duchnowska, Emilia
Powiązania:
https://bibliotekanauki.pl/articles/1397798.pdf
Data publikacji:
2019
Wydawca:
Index Copernicus International
Tematy:
HSDI
DKG
acoustics analysis
voice disorders in pregnancy
Opis:
During pregnancy, voice quality disorders may occur in form of: edema, dryness, nervousness. The aim of the study is subjective and objective evaluation of voice quality in pregnant women. The study included 20 women in the third trimester of pregnancy, age of 20-31 diagnosed at the Department of Clinical Phonoaudiology and Logopedics, Medical University of Bialystok. Subjective assessment has been based on the GRBAS scale. Objective assessment of the vocal organ used the HSDI technique (High Speed Digital Imaging). In the laryngeal visualization, high-speed camera (HS) using rigid endoscope with 90 ° optics has been used. Vibration of vocal folds has been recorded during phonation of vowel "e" at 4000 frames / sec. The glottal closure (GTs), symmetry, regularity and synchronization of vocal folds vibration have been assessed. In estimating the degree of glottal insufficiency, kymography of the larynx has been performed by analyzing the value of Open Quotient (OQ). Objective acoustic evaluation of voice has been also conducted using DiagnoScope Specjalista Program. Hoarseness has been observed in 15 pregnant women, whereas voice fatigability in 20 patients. Using HSDI, the edema of vocal folds in part of the group has been observed. Decreased MPT has been found in all examined women in the third trimester of pregnancy. Hoarseness and fatigability of voice are the most frequent subjective symptoms of voice organ in the third trimester of pregnancy. Decreased MPT is recorded objectively, as well as edema and insufficiency of vocal folds using HSDI technique.
Źródło:
Polish Journal of Otolaryngology; 2019, 73, 2; 1-5
0030-6657
2300-8423
Pojawia się w:
Polish Journal of Otolaryngology
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Application of High Speed Digital Imaging (HSDI) technique and voice acoustic analysis in the diagnosis of the clinical form of Presbyphonia in women
Autorzy:
Kosztyła-Hojna, Bożena
Duchnowska, Emilia
Zdrojkowski, Maciej
Łobaczuk-Sitnik, Anna
Biszewska, Jolanta
Powiązania:
https://bibliotekanauki.pl/articles/1397305.pdf
Data publikacji:
2020
Wydawca:
Index Copernicus International
Tematy:
acoustic analysis
clinical voice assessment
High-Speed Digital Imaging
Presbyphonia
vocal folds vibration
voice changes
Opis:
Introduction: The aging process of voice begins after the age of 60 and has an individually variable course. Voice quality disorders at this age are called senile voice (Presbyphonia or Vox Senium). Voice pathology is particularly severe in women. The aim of the study was to diagnose the clinical form of Presbyphonia in elderly women using High Speed Digital Imaging (HSDI) and acoustic voice analysis. Material and methods: Study included 50 elderly women (average age 69) with dysphonia (Group I). Control group (Group II) included 30 women (average age 71) without voice quality disorders. Visualization assessment has been conducted with High Speed Digital Imaging (HSDI) with High Speed camera (HS). Acoustic evaluation of voice included analysis isolated vowel “a” and continuous linguistic text with Diagnoscope Specialista software. Maximum Phonation Time (MPT) has been determined. Results: In Group I, 78% of women revealed vocal folds vibrations asymmetry, vibration amplitude increase, Mucousal Wave (MW) limitation and Type D glottal insufficiency (GTs). Acoustic voice analysis proved decrease in F0, increase in Jitter, Shimmer, NHR. In 22% of women, next to vibrations asymmetry, vibration amplitude reduction and MW limitation, Type E glottal insufficiency (GTs) have been found. Acoustic voice analysis revealed slight decrease in F0 and the presence of numerous non-harmonic components in the glottis region. Conclusions: Vocal folds visualization with HSDI showed edema, less often atrophy in elderly women. Both forms of dysphonia were caused abnormal values of F0, Jitter, Shimmer, NHR in the acoustic voice evaluation and significant reduction of MPT.
Źródło:
Polish Journal of Otolaryngology; 2020, 74, 5; 24-30
0030-6657
2300-8423
Pojawia się w:
Polish Journal of Otolaryngology
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Speech Analysis as a Tool for Detection and Monitoring of Medical Conditions : A review
Autorzy:
Igras-Cybulska, Magdalena
Hemmerling, Daria
Ziółko, Mariusz
Datka, Wojciech
Stogowska, Ewa
Kucharski, Michał
Rzepka, Rafał
Ziółko, Bartosz
Powiązania:
https://bibliotekanauki.pl/articles/31339837.pdf
Data publikacji:
2023
Wydawca:
Polska Akademia Nauk. Czasopisma i Monografie PAN
Tematy:
speech analysis
speech features
acoustic parameters
linguistic analysis
voice biomarkers
screening test
Opis:
The goal of this article is to present and compare recent approaches which use speech and voice analysis as biomarkers for screening tests and monitoring of some diseases. The article takes into account metabolic, respiratory, cardiovascular, endocrine, and nervous system disorders. A selection of articles was performed to identify studies that assess voice features quantitatively in selected disorders by acoustic and linguistic voice analysis. Information was extracted from each paper in order to compare various aspects of datasets, speech parameters, methods of applied analysis and obtained results. 110 research papers were reviewed and 47 databases were summarized. Speech analysis is a promising method for early diagnosis of certain disorders. Advanced computer voice analysis with machine learning algorithms combined with the widespread availability of smartphones allows diagnostic analysis to be conducted during the patient’s visit to the doctor or at the patient’s home during a telephone conversation. Speech analysis is a simple, low-cost, non-invasive and easy-toprovide method of medical diagnosis. These are remarkable advantages, but there are also disadvantages. The effectiveness of disease diagnoses varies from 65% up to 99%. For that reason it should be treated as a medical screening test and should be an indication of the need for classic medical tests.
Źródło:
Archives of Acoustics; 2023, 48, 3; 289-315
0137-5075
Pojawia się w:
Archives of Acoustics
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Analysis of voice quality parameters in patients with vestibular voice
Autorzy:
Kosztyła-Hojna, Bożena
Łobaczuk-Sitnik, Anna
Zdrojkowski, Maciej
Duchnowska, Emilia
Moskal-Jasińska, Diana
Kraszewska, Anna
Twarowska, Anna
Biszewska, Jolanta
Powiązania:
https://bibliotekanauki.pl/articles/1397791.pdf
Data publikacji:
2019
Wydawca:
Index Copernicus International
Tematy:
vestibular voice
videolaryngostroboscopy
High Speed Digital Imaging
acoustic analysis
Opis:
Abstract Vestibular voice includes participation of larynx structures which are absent in physiological process. Vestibular phonation may be desired when vocal folds are damaged as in paralytic dysphonia, or undesired in marginal hyperfunction. Vestibular voice may result from psychogenic dysphonia – phononeurosis. The aim of the study is perceptive evaluation of vestibular voice, objective larynx visualization, acoustic and aerodynamic examination. The study included 40 patients: 20 with vestibular voice, 20 with euphonic voice. Voice quality has been evaluated using perceptual GRBAS scale. Endoscopic and stroboscopic larynx examination used Endo-STROB-EL-Xion GmbH with visual tract. High-Speed Digital Imaging (HSDI) and High Speed (HS) camera registered true vocal folds vibrations. Acoustic evaluation of voice with DiagnoScope Specjalista, DiagNova Technologies included analysis of F0, Jitter, Shimmer, NHR, nonharmonic components. MPT has been analyzed. In examined group, hoarseness (95%), roughness (75%) and voice strain (55%) have been recorded. Endoscopy revealed edema of vestibular folds with dilation of vessels covering glottis. Stroboscopy and HSDI confirmed coexistence of hyperfunctional (95%) or paralytic (5%) dysphonia. Acoustic assessment revealed increase in Jitter, Shimmer, NHR and decrease in F0 and MPT. The vestibular voice is observed most frequently in women with hyperfunctional dysphonia (phononeuroses) or in paralytic dysphonia. Visualization techniques confirm the coexistence of vestibular folds hypertrophy and edema with vibration disorders. In the perceptual assessment, vestibular voice was hoarse, rough and strained. Acoustic examination showed increase of Jitter, Shimmer, NHR, presence of nonharmonic components and decrease of F0 and MPT.
Źródło:
Polish Journal of Otolaryngology; 2019, 73, 3; 11-15
0030-6657
2300-8423
Pojawia się w:
Polish Journal of Otolaryngology
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Approximate performance analysis of slotted downlink channel in a wireless CDMA system supporting integrated voice and data services
Autorzy:
Świderski, J.
Powiązania:
https://bibliotekanauki.pl/articles/308218.pdf
Data publikacji:
2004
Wydawca:
Instytut Łączności - Państwowy Instytut Badawczy
Tematy:
wireless CDMA system
synchronous downlink
voice and data integration
queueing analysis
Opis:
This paper is concerned with the performance analysis of a slotted downlink channel in a wireless CDMA communication system with integrated packet voice and data transmission. The system model consists of mobile terminals (MT) and a single base station (BS). It is assumed that the voice (data) packet error rate (PER) does not exceed lO(-2) (10(-5)). With this requirement the number of simultaneous transmissions over the downlink channel is limited. Therefore, the objective of the call admission control is to restrict the maximum number of CDMA codes available to voice and E data traffic. Packets of accepted voice calls are transmitted immediately while accepted data packets are initially buffered at the BS. This station distinguishes between silence and talk-spurt periods of voice sources, so that data packets can use their own codes for transmission during silent time slots. Data packets are buffered in queues created separately for each destination. Discrete-time Markov processes are used to model the system operation. Statistical dependence between queues is the main difficulty which arises during the analysis. This dependence leads to serious computational complexity. The aim of this paper is to present an approximate analytical method based on the restricted occupancy urn model which enables to evaluate system performance despite the dependence. Numerical calculations compared with simulation results show excellent agreement for the average system throughput and the blocking probability of data packets for higher system loads. On the other hand, when the average data packet delay is considered, analytical results underestimate simulation and therefore only approximate system performance evaluation is possible.
Źródło:
Journal of Telecommunications and Information Technology; 2004, 2; 47-53
1509-4553
1899-8852
Pojawia się w:
Journal of Telecommunications and Information Technology
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Isolated word descriptors as control parameters of the computer applications
Autorzy:
Porwik, P.
Powiązania:
https://bibliotekanauki.pl/articles/333588.pdf
Data publikacji:
2006
Wydawca:
Uniwersytet Śląski. Wydział Informatyki i Nauki o Materiałach. Instytut Informatyki. Zakład Systemów Komputerowych
Tematy:
analiza mowy
sterowanie głosem
osoby niepełnosprawne
speech analysis
voice control
persons disability
MSAA technology
Opis:
This paper is an extended version of the MIT'06 conference contribution. During the conference, many inquiries about the used techniques were performed. Hence, in the paper some parts of investigations were explained and discussed, with greater accuracy. It is shown that the computer applications can be controlled by a human voice. The computer controlling processes are available by means of utterance of isolated words, where application events with the aid of user's voice can be serviced. The voice usage can be convenient for blind or partially sighted users or for persons with limb paresis. The Microsoft application events, by means of the practicable Microsoft Windows firmware MSAA® technology can be analysed. Such technology, together with isolated word descriptors, as voice recognition system, has been presented.
Źródło:
Journal of Medical Informatics & Technologies; 2006, 10; 35-46
1642-6037
Pojawia się w:
Journal of Medical Informatics & Technologies
Dostawca treści:
Biblioteka Nauki
Artykuł

Ta witryna wykorzystuje pliki cookies do przechowywania informacji na Twoim komputerze. Pliki cookies stosujemy w celu świadczenia usług na najwyższym poziomie, w tym w sposób dostosowany do indywidualnych potrzeb. Korzystanie z witryny bez zmiany ustawień dotyczących cookies oznacza, że będą one zamieszczane w Twoim komputerze. W każdym momencie możesz dokonać zmiany ustawień dotyczących cookies