Informacja

Drogi użytkowniku, aplikacja do prawidłowego działania wymaga obsługi JavaScript. Proszę włącz obsługę JavaScript w Twojej przeglądarce.

Wyszukujesz frazę "mowy" wg kryterium: Temat


Wyświetlanie 1-13 z 13
Tytuł:
Visualization of stages of determining cepstral factors in speech recognition systems
Autorzy:
Proksa, R.
Powiązania:
https://bibliotekanauki.pl/articles/333103.pdf
Data publikacji:
2009
Wydawca:
Uniwersytet Śląski. Wydział Informatyki i Nauki o Materiałach. Instytut Informatyki. Zakład Systemów Komputerowych
Tematy:
rozpoznawanie mowy
LPCC
MFCC
wyizolowane słowo
sygnały mowy
speech recognition
cepstral coefficients
isolated word
Opis:
The article presents two methods of determination of cepstral parameters commonly applied in digital signal processing, in particular in speech recognition systems. The solutions presented are part of a project aimed at developing applications allowing to control the Windows operating system with voice and the use of MSAA (Microsoft Active Accessibility). The analysed voice signal has been visually presented at each of the crucial stages of developing cepstral coefficients.
Źródło:
Journal of Medical Informatics & Technologies; 2009, 13; 121-128
1642-6037
Pojawia się w:
Journal of Medical Informatics & Technologies
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Estimation of possibilities connected with usage of electroglotography method in speech signal analysis
Autorzy:
Zielińska, J.
Powiązania:
https://bibliotekanauki.pl/articles/333383.pdf
Data publikacji:
2008
Wydawca:
Uniwersytet Śląski. Wydział Informatyki i Nauki o Materiałach. Instytut Informatyki. Zakład Systemów Komputerowych
Tematy:
wizualizacja komputerowa sygnału mowy
laryngograf
computer visualization of speech signal
electroglotography
laryngograph
Opis:
The research presented in this paper deals with the speech signal with use of elecroglotography method analysis issue. This is an instrumental analysis, so the device called Laryngograph is presented, as a practical application. In this paper capabilities of this device are estimated. The very interesting fact is that the visualization of the speech signal obtained using Laryngograph allows to detect its acoustically and phonetically most important features, and presenting them in a graphical form. The analysis process performed using a computer and the specified computer attachment is easier, faster and ensures higher quality than other methods. Computer voice recording enables not only visualization but also objective assessment and its repetitiveness. In the context of presented questions, practical capabilities of integrated system for speech examination - Speech Studio are discussed.
Źródło:
Journal of Medical Informatics & Technologies; 2008, 12; 217-222
1642-6037
Pojawia się w:
Journal of Medical Informatics & Technologies
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Word extraction method in human speech processing
Autorzy:
Porwik, P.
Proksa, R.
Powiązania:
https://bibliotekanauki.pl/articles/333377.pdf
Data publikacji:
2008
Wydawca:
Uniwersytet Śląski. Wydział Informatyki i Nauki o Materiałach. Instytut Informatyki. Zakład Systemów Komputerowych
Tematy:
rozpoznawanie mowy
wykrywanie słów
słowa wyodrębnione
voice recognition
word detection
isolated words
Opis:
A major problem in isolated-word speech recognition systems is detection of the beginning and ending boundaries of the word. It is an essential of speech recognition algorithms, where signal speech segments should be reliably separated. During speech recognition background noise is also recorded, hence the word isolation is difficult. The parametric representation of the speech must provide enough information to characterize the words and to differentiate between acoustically similar words. In this paper the method of words extraction from human speech will be considered.
Źródło:
Journal of Medical Informatics & Technologies; 2008, 12; 209-216
1642-6037
Pojawia się w:
Journal of Medical Informatics & Technologies
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Automatic prolongation recognition in disordered speech using CWT and Kohonen network
Autorzy:
Codello, I.
Kuniszyk-Jóźkowiak, W.
Smołka, E.
Kobus, A.
Powiązania:
https://bibliotekanauki.pl/articles/332965.pdf
Data publikacji:
2012
Wydawca:
Uniwersytet Śląski. Wydział Informatyki i Nauki o Materiałach. Instytut Informatyki. Zakład Systemów Komputerowych
Tematy:
sieć Kohonena
zaburzenia automatycznego rozpoznawania mowy
ciągła transformata falkowa
skala Barka
wydłużenie mowy
Kohonen network
automatic disorders speech recognition
waveblaster
CWT
continuous wavelet transform (CWT)
Bark scale
speech prolongations
Opis:
Automatic disorder recognition in speech can be very helpful for the therapist while monitoring therapy progress of the patients with disordered speech. In this article we focus on prolongations. We analyze the signal using Continuous Wavelet Transform with 18 bark scales, we divide the result into vectors (using windowing) and then we pass such vectors into Kohonen network. Quite large search analysis was performed (5 variables were checked) during which, recognition above 90% was achieved. All the analysis was performed and the results were obtained using the authors' program - "WaveBlaster". It is very important that the recognition ratio above 90% was obtained by a fully automatic algorithm (without a teacher) from the continuous speech. The presented problem is part of our research aimed at creating an automatic prolongation recognition system.
Źródło:
Journal of Medical Informatics & Technologies; 2012, 20; 137-144
1642-6037
Pojawia się w:
Journal of Medical Informatics & Technologies
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
The voice synthetiser of polish text for blind persons
Autorzy:
Porwik, P.
Szczepankiewicz, M.
Powiązania:
https://bibliotekanauki.pl/articles/333665.pdf
Data publikacji:
2002
Wydawca:
Uniwersytet Śląski. Wydział Informatyki i Nauki o Materiałach. Instytut Informatyki. Zakład Systemów Komputerowych
Tematy:
mowa ludzka
sztuczny generator mowy
osoba niewidoma
human speech
artificial speech generator
blind person
Opis:
In this paper we present new method of computer text analyser and computer Polish speech (words) generator. In the described computer program the grammatical characteristics of Polish speech and accents in some words have been taken into consideration. All users' actions are commented by artificial, computer voice. The group of blind students of University of Silesia have examined and tested the presented final program for over one year. Described software tool has in a lot of cases better parameters than others, commercial products.
Źródło:
Journal of Medical Informatics & Technologies; 2002, 4; MT101-109
1642-6037
Pojawia się w:
Journal of Medical Informatics & Technologies
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Optimal acoustic model complexity selection in polish medical speech recognition
Autorzy:
Sas, J.
Poreba, T.
Powiązania:
https://bibliotekanauki.pl/articles/333361.pdf
Data publikacji:
2011
Wydawca:
Uniwersytet Śląski. Wydział Informatyki i Nauki o Materiałach. Instytut Informatyki. Zakład Systemów Komputerowych
Tematy:
rozpoznawanie mowy
modele języka
medyczne systemy informacji
speech recognition
language models
medical information systems
Opis:
In the paper, the method of acoustic model complexity level selection for automatic speech recognition is proposed. Selection of the appropriate model complexity affects significantly the accuracy of speech recognition. For this reason the selection of the appropriate complexity level is crucial for practical speech recognition applications, where end user effort related to the implementation of speech recognition system is important. We investigated the correlation between speech recognition accuracy and two popular information criteria used in statistical model evaluation: Bayesian Information Criterion and Akaike Information Criterion computed for applied acoustic models. Experiments carried out for language models related to general medicine texts and radiology diagnostic reporting in CT and MR showed strong correlation of speech recognition accuracy and BIC criterion. Using this dependency, the procedure of Gaussian mixture count selection for acoustic model was proposed. Application of this procedure makes it possible to create the acoustic model maximizing the speech recognition accuracy without additional computational costs related to alternative cross-validation approach and without reduction of training set size, which is unavoidable in the case of cross-validation approach.
Źródło:
Journal of Medical Informatics & Technologies; 2011, 17; 115-122
1642-6037
Pojawia się w:
Journal of Medical Informatics & Technologies
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Isolated word descriptors as control parameters of the computer applications
Autorzy:
Porwik, P.
Powiązania:
https://bibliotekanauki.pl/articles/333588.pdf
Data publikacji:
2006
Wydawca:
Uniwersytet Śląski. Wydział Informatyki i Nauki o Materiałach. Instytut Informatyki. Zakład Systemów Komputerowych
Tematy:
analiza mowy
sterowanie głosem
osoby niepełnosprawne
speech analysis
voice control
persons disability
MSAA technology
Opis:
This paper is an extended version of the MIT'06 conference contribution. During the conference, many inquiries about the used techniques were performed. Hence, in the paper some parts of investigations were explained and discussed, with greater accuracy. It is shown that the computer applications can be controlled by a human voice. The computer controlling processes are available by means of utterance of isolated words, where application events with the aid of user's voice can be serviced. The voice usage can be convenient for blind or partially sighted users or for persons with limb paresis. The Microsoft application events, by means of the practicable Microsoft Windows firmware MSAA® technology can be analysed. Such technology, together with isolated word descriptors, as voice recognition system, has been presented.
Źródło:
Journal of Medical Informatics & Technologies; 2006, 10; 35-46
1642-6037
Pojawia się w:
Journal of Medical Informatics & Technologies
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Application of local bidirectional language model to error correction in polish medical speech recognition
Autorzy:
Sas, J.
Powiązania:
https://bibliotekanauki.pl/articles/333597.pdf
Data publikacji:
2010
Wydawca:
Uniwersytet Śląski. Wydział Informatyki i Nauki o Materiałach. Instytut Informatyki. Zakład Systemów Komputerowych
Tematy:
rozpoznawanie mowy
modele języka
medyczne systemy informacji
speech recognition
language models
medical information systems
Opis:
In the paper, the method of short word deletion errors correction in automatic speech recognition is described. Short word deletion errors appear to be a frequent error type in Polish speech recognition. The proposed speech recognition process consists of two stages. At the first stage the utterance is recognized by a typical speech recognizer based on forward bigram language model. At the second stage the word sequence recognized by the first stage recognizer is analyzed and such pairs of adjacent words in the recognized sequence are localized, which are likely to be separated by a short word like conjunction or preposition. The probability of short word appearance in context of found words is evaluated using centered trigrams and backward bigram language model for short words prone to deletion. The set of probabilistic language properties used to correct deletions is called here Local Bidirectional Language Model (in contrast to purely forward or backward model used typically in speech recognition). The decision of short word insertion is based on comparison of deletion error probability of the first stage recognizer and the error probability of the decision based only on centered trigrams and backward model. Despite its simplicity, the method proved to be effective in correcting deletion errors of most frequently appearing Polish prepositions. The method was tested in application to medical spoken reports recognition, where the overall short word deletion error rate was reduced by almost 45%.
Źródło:
Journal of Medical Informatics & Technologies; 2010, 15; 127-134
1642-6037
Pojawia się w:
Journal of Medical Informatics & Technologies
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Gender recognition using neural networks and ASR techniques
Autorzy:
Sas, J.
Sas, A.
Powiązania:
https://bibliotekanauki.pl/articles/333972.pdf
Data publikacji:
2013
Wydawca:
Uniwersytet Śląski. Wydział Informatyki i Nauki o Materiałach. Instytut Informatyki. Zakład Systemów Komputerowych
Tematy:
artificial neural networks
speech recognition
gender recognition
sztuczne sieci neuronowe
rozpoznawanie mowy
rozpoznawanie płci
Opis:
The paper presents the simple technique of speaker gender recognition that uses MFCC features typically applied in automatic speech recognition. Artificial neural network is used as a classifier. The speech signal is first divided into 20 ms frames. For each frame, Mel-Frequency Cepstral Coefficients are extracted and the created feature vector is provided into a neural network classifier, which individually classifies each frame as male or female sample. Finally, the whole utterance is classified by selecting the class, for which the sum of corresponding neural network outputs is greater. The advantage of the method is that it can be easily combined with speech recognition, because both processes (gender recognition and speech recognition) are based on the same features. This way, no additional logic and no extra computational power is needed to extract features necessary for gender recognition. The method was experimentally evaluated using speech samples in English and in Polish. The comparison with other methods described in literature based on other feature extraction methods shows the superiority of the proposed approach, especially in cases where the recognition is carried out in noisy environment or using poor audio equipment.
Źródło:
Journal of Medical Informatics & Technologies; 2013, 22; 179-187
1642-6037
Pojawia się w:
Journal of Medical Informatics & Technologies
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Specialised Linux software aiding work of blind persons
Autorzy:
Porwik, P.
Będkowski, K.
Żelechowski, Ł.
Lisowska, A.
Powiązania:
https://bibliotekanauki.pl/articles/333108.pdf
Data publikacji:
2003
Wydawca:
Uniwersytet Śląski. Wydział Informatyki i Nauki o Materiałach. Instytut Informatyki. Zakład Systemów Komputerowych
Tematy:
syntezatory mowy dla Linux
aplikacje Linux
niewidomi użytkownicy
speech synthesisers for Linux
Linux applications
blind users
Opis:
For many years Linux environment has been evaluated from the only server system to desktop computer system. Thanks to this Linux has became serious competition for systems from the very popular Microsoft Windows family. Unfortunately sightless users of computer do not have wide choice of software speech synthesisers that may help them during working on computer in the Linux environment. The "Speakup" application is the one of the better ones designed for Linux. But, nevertheless, it does not support the maintenance in the same degree as popular applications for the Microsoft Windows. So in this paper the significant improvement of the "Speakup" is presented. This improvement allows blind users to quite comfortable work with the Linux system in the console text mode with the maintenance of Polish fonts, too. The application has become the user-friendly one with wide spectrum of possibilities, what many blind users confirm. The one of them is the co-author of this paper.
Źródło:
Journal of Medical Informatics & Technologies; 2003, 5; MI147-155
1642-6037
Pojawia się w:
Journal of Medical Informatics & Technologies
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Building compact language models for medical speech recognition in mobile devices with limited amount of memory
Autorzy:
Sas, J.
Powiązania:
https://bibliotekanauki.pl/articles/332971.pdf
Data publikacji:
2012
Wydawca:
Uniwersytet Śląski. Wydział Informatyki i Nauki o Materiałach. Instytut Informatyki. Zakład Systemów Komputerowych
Tematy:
automatyczne rozpoznawanie mowy
medyczne systemy informacyjne
modelowanie języka
automatic speech recognition
medical information systems
language modeling
Opis:
The article presents the method of building compact language model for speech recognition in devices with limited amount of memory. Most popularly used bigram word-based language models allow for highly accurate speech recognition but need large amount of memory to store, mainly due to the big number of word bigrams. The method proposed here ranks bigrams according to their importance in speech recognition and replaces explicit estimation of less important bigrams probabilities by probabilities derived from the class-based model. The class-based model is created by assigning words appearing in the corpus to classes corresponding to syntactic properties of words. The classes represent various combinations of part of speech inflectional features like number, case, tense, person etc. In order to maximally reduce the amount of memory necessary to store class-based model, a method that reduces the number of part-of-speech classes has been applied, that merges the classes appearing in stochastically similar contexts in the corpus. The experiments carried out with selected domains of medical speech show that the method allows for 75% reduction of model size without significant loss of speech recognition accuracy.
Źródło:
Journal of Medical Informatics & Technologies; 2012, 20; 111-119
1642-6037
Pojawia się w:
Journal of Medical Informatics & Technologies
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Optimal spoken dialog control in hands-free medical information systems
Autorzy:
Sas, J.
Powiązania:
https://bibliotekanauki.pl/articles/333081.pdf
Data publikacji:
2009
Wydawca:
Uniwersytet Śląski. Wydział Informatyki i Nauki o Materiałach. Instytut Informatyki. Zakład Systemów Komputerowych
Tematy:
rozpoznawanie mowy automatyczne
optymalizacja genetyczna
systemy informacji medycznej
automatic speech recognition
genetic optimization
medical information systems
Opis:
In the paper a method of optimal selection of utterances used as command entry-words for voice controlled application is presented. Voice controlled programs seem to be particularly useful in the area of medical informatics, where a physician interacts with a program by voice while operating the medical device or being involved in examinations requiring manual activities. The proposed method selects command words from sets of proposals defined for each command so as to minimize the overall probability of incorrect command recognition. First the entry-word dissimilarity matrix is calculated. The word dissimilarities are evaluated using HMM models consisting of appropriately trained acoustic models of the phonemes constituting words. The trained HMM is used as the sample utterance generator for the word. The artificially created utterance samples are then recognized by speech recognizers created for pairs of words. The estimation of correct recognition probability is used as the word dissimilarity measure. The word dissimilarities are then used to determine the average assessment of words selections that can be used as commands. Selection is created by choosing single word from sets of candidates defined for each command. Finally, suboptimal selection is found by using genetic algorithm. Experiments carried out prove that suboptimal selection of command entry-words can observably increase the accuracy of spoken commands recognition in many cases.
Źródło:
Journal of Medical Informatics & Technologies; 2009, 13; 113-120
1642-6037
Pojawia się w:
Journal of Medical Informatics & Technologies
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Disordered sound repetition recognition in continuous speech using CWT and Kohonen network
Autorzy:
Codello, I.
Kuniszyk-Jóźkowiak, W.
Smołka, E.
Kobus, A.
Powiązania:
https://bibliotekanauki.pl/articles/333359.pdf
Data publikacji:
2011
Wydawca:
Uniwersytet Śląski. Wydział Informatyki i Nauki o Materiałach. Instytut Informatyki. Zakład Systemów Komputerowych
Tematy:
sieć Kohonena
zaburzenia automatycznego rozpoznawania mowy
ciągła transformata falkowa
skala Barka
powtarzanie dźwięku
Kohonen network
automatic disorders speech recognition
waveblaster
CWT
continuous wavelet transform (CWT)
Bark scale
sound repetition
Opis:
Automatic disorders recognition in speech can be very helpful for therapist while monitoring therapy progress of patients with disordered speech. This article is focused on sound repetitions. The signal is analyzed using Continuous Wavelet Transform with 16 bark scales, the result is divided into vectors and passed into Kohonen network. Finally, the Kohonen winning neuron result is put on the 3-layer perceptron. The recognition ratio was increased by about 20% by adding a modification into the Kohonen network training process as well as into CWT computation algorithm. All the analysis was performed and the results were obtained using the authors' program ”WaveBlaster“, The problem presented in this article is a part of our research work aimed at creating an automatic disordered speech recognition system.
Źródło:
Journal of Medical Informatics & Technologies; 2011, 17; 123-130
1642-6037
Pojawia się w:
Journal of Medical Informatics & Technologies
Dostawca treści:
Biblioteka Nauki
Artykuł
    Wyświetlanie 1-13 z 13

    Ta witryna wykorzystuje pliki cookies do przechowywania informacji na Twoim komputerze. Pliki cookies stosujemy w celu świadczenia usług na najwyższym poziomie, w tym w sposób dostosowany do indywidualnych potrzeb. Korzystanie z witryny bez zmiany ustawień dotyczących cookies oznacza, że będą one zamieszczane w Twoim komputerze. W każdym momencie możesz dokonać zmiany ustawień dotyczących cookies