Subject: Recognition - OPAC collection catalogue


Title:: Gender recognition using neural networks and ASR techniques
Authors:: Sas, J.
Sas, A.
Links:: https://bibliotekanauki.pl/articles/333972.pdf
Publication date:: 2013
Publisher:: Uniwersytet Śląski. Wydział Informatyki i Nauki o Materiałach. Instytut Informatyki. Zakład Systemów Komputerowych
Subjects:: artificial neural networks
speech recognition
gender recognition
Description:: The paper presents a simple technique of speaker gender recognition that uses the MFCC features typically applied in automatic speech recognition. An artificial neural network is used as the classifier. The speech signal is first divided into 20 ms frames. For each frame, Mel-Frequency Cepstral Coefficients are extracted and the resulting feature vector is fed into a neural network classifier, which classifies each frame individually as a male or female sample. Finally, the whole utterance is classified by selecting the class for which the sum of the corresponding neural network outputs is greater. The advantage of the method is that it can be easily combined with speech recognition, because both processes (gender recognition and speech recognition) are based on the same features. This way, no additional logic and no extra computational power are needed to extract the features necessary for gender recognition. The method was experimentally evaluated using speech samples in English and in Polish. A comparison with methods described in the literature that rely on other feature extraction techniques shows the superiority of the proposed approach, especially when recognition is carried out in a noisy environment or with poor audio equipment.
Source:: Journal of Medical Informatics & Technologies; 2013, 22; 179-187
1642-6037
Appears in:: Journal of Medical Informatics & Technologies
Content provider:: Biblioteka Nauki

Article
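
The utterance-level decision in the description above (summing the per-frame network outputs and picking the larger sum) can be sketched as follows. This is only an illustration: the function name, the `(male, female)` output layout, the tie-breaking rule, and the sample values are assumptions, and the MFCC extraction and the neural network itself are not shown.

```python
from typing import Sequence, Tuple


def classify_utterance(frame_outputs: Sequence[Tuple[float, float]]) -> str:
    """Pick the class whose summed per-frame network output is greater.

    Each element is a hypothetical (male_output, female_output) pair
    produced by the frame classifier for one 20 ms frame.
    """
    if not frame_outputs:
        raise ValueError("at least one frame is required")
    male_sum = sum(male for male, _ in frame_outputs)
    female_sum = sum(female for _, female in frame_outputs)
    # Breaking ties in favour of "male" is an arbitrary choice of this sketch.
    return "male" if male_sum >= female_sum else "female"


# Made-up outputs for four frames: two frames lean male, two lean female,
# but the summed outputs (2.4 vs 1.6) favour "male".
frames = [(0.9, 0.1), (0.4, 0.6), (0.3, 0.7), (0.8, 0.2)]
print(classify_utterance(frames))  # prints "male"
```

Summing the outputs lets confident frames outweigh uncertain ones, which a plain majority vote over frames would not do (here the vote would be tied).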


Title:: Handwritten laboratory test order form recognition module for distributed clinic
Authors:: Sas, J.
Links:: https://bibliotekanauki.pl/articles/333334.pdf
Publication date:: 2004
Publisher:: Uniwersytet Śląski. Wydział Informatyki i Nauki o Materiałach. Instytut Informatyki. Zakład Systemów Komputerowych
Subjects:: intelligent character recognition
pattern recognition
hospital information systems
Description:: The work describes methods used in the laboratory order form recognition module of a hospital information system. A three-level form analysis architecture is proposed. The lower, alphabetical level is responsible for recognising separate characters. On the intermediate level, recognised strings are verified against lexicons of items specific to a particular form field. A probabilistic model is used to select the set of most probable items. On the upper level, the dependencies between the form data items are taken into account to further improve recognition performance. The presented approach was implemented in a medical information system supporting clinic laboratory operation. Laboratory test orders, prepared by hand by physicians on paper forms in a network of distributed outpatient clinics, are processed in the central hospital laboratory. There the paper forms are scanned, recognised and entered into the information system. The performance test results are discussed and some further improvements of the applied recognition method are also suggested in the paper.
Source:: Journal of Medical Informatics & Technologies; 2004, 8; MM59-68
1642-6037
Appears in:: Journal of Medical Informatics & Technologies
Content provider:: Biblioteka Nauki

Article
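
The intermediate level described above (verifying recognised strings against a field lexicon and keeping the most probable items) might look roughly like this. The abstract does not specify the probabilistic model, so this sketch assumes the simplest one: independent per-character probabilities multiplied along each lexicon entry. The function name, the probability floor, and the sample lexicon of test abbreviations are all hypothetical.

```python
import math
from typing import Dict, List, Sequence, Tuple


def rank_lexicon(
    char_probs: Sequence[Dict[str, float]],
    lexicon: Sequence[str],
    top_n: int = 2,
    floor: float = 1e-6,
) -> List[Tuple[str, float]]:
    """Return the top_n lexicon items with their log-probability scores.

    char_probs[i] maps candidate characters at position i to the
    probability assigned by the character-level recogniser.
    """
    scored = []
    for word in lexicon:
        if len(word) != len(char_probs):
            continue  # this sketch only compares items of matching length
        log_score = sum(
            math.log(max(probs.get(ch, floor), floor))
            for ch, probs in zip(word, char_probs)
        )
        scored.append((word, log_score))
    scored.sort(key=lambda item: item[1], reverse=True)
    return scored[:top_n]


# Made-up recogniser output for a three-character form field.
field = [
    {"A": 0.6, "H": 0.4},
    {"L": 0.5, "C": 0.3, "I": 0.2},
    {"T": 0.7, "G": 0.3},
]
best = rank_lexicon(field, ["ALT", "HCT", "HGB", "AST"])
print([word for word, _ in best])
```

Working in log space avoids underflow for long fields; the floor keeps characters the recogniser never proposed from producing `log(0)`.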


Title:: Application of automatic speech recognition to medical reports spoken in Polish
Authors:: Hnatkowska, B.
Sas, J.
Links:: https://bibliotekanauki.pl/articles/333379.pdf
Publication date:: 2008
Publisher:: Uniwersytet Śląski. Wydział Informatyki i Nauki o Materiałach. Instytut Informatyki. Zakład Systemów Komputerowych
Subjects:: medical information systems
automatic speech recognition
hospital information systems
language models
Description:: The paper presents an attempt at automatic speech recognition of medical texts spoken in Polish. The attempt resulted in an experimental system that can be used as a tool for practical applications. The system uses a typical recognition method based on a Hidden Markov Model and a domain-specific language model. The implemented software made it possible to conduct many experiments aimed at evaluating the usefulness of the assumed approach. The experimental results obtained are presented and analyzed. The system architecture and the way in which it can be integrated with hospital information systems are also presented.
Source:: Journal of Medical Informatics & Technologies; 2008, 12; 223-229
1642-6037
Appears in:: Journal of Medical Informatics & Technologies
Content provider:: Biblioteka Nauki

Article


Title:: Optimal acoustic model complexity selection in Polish medical speech recognition
Authors:: Sas, J.
Poreba, T.
Links:: https://bibliotekanauki.pl/articles/333361.pdf
Publication date:: 2011
Publisher:: Uniwersytet Śląski. Wydział Informatyki i Nauki o Materiałach. Instytut Informatyki. Zakład Systemów Komputerowych
Subjects:: speech recognition
language models
medical information systems
Description:: In the paper, a method of selecting the acoustic model complexity level for automatic speech recognition is proposed. The choice of model complexity significantly affects the accuracy of speech recognition. For this reason, selecting the appropriate complexity level is crucial for practical speech recognition applications, where the end user effort related to the implementation of a speech recognition system is important. We investigated the correlation between speech recognition accuracy and two popular information criteria used in statistical model evaluation: the Bayesian Information Criterion (BIC) and the Akaike Information Criterion (AIC), computed for the applied acoustic models. Experiments carried out for language models related to general medicine texts and radiology diagnostic reporting in CT and MR showed a strong correlation between speech recognition accuracy and the BIC. Using this dependency, a procedure for selecting the Gaussian mixture count of the acoustic model is proposed. Applying this procedure makes it possible to create an acoustic model that maximizes speech recognition accuracy without the additional computational cost of the alternative cross-validation approach and without the reduction of the training set size that cross-validation makes unavoidable.
Source:: Journal of Medical Informatics & Technologies; 2011, 17; 115-122
1642-6037
Appears in:: Journal of Medical Informatics & Technologies
Content provider:: Biblioteka Nauki

Article
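
The two information criteria named in the description above have standard definitions, AIC = 2k - 2 ln L and BIC = k ln n - 2 ln L, where k is the number of free parameters, n the number of training samples, and L the maximised likelihood. A minimal sketch of choosing a Gaussian mixture count by lowest BIC follows. The abstract does not spell out the paper's actual procedure, so picking the minimum-BIC candidate is an assumption, and the log-likelihoods and parameter counts are invented for illustration.

```python
import math
from typing import Dict, Tuple


def aic(log_likelihood: float, n_params: int) -> float:
    """Akaike Information Criterion (lower is better)."""
    return 2 * n_params - 2 * log_likelihood


def bic(log_likelihood: float, n_params: int, n_samples: int) -> float:
    """Bayesian Information Criterion (lower is better)."""
    return n_params * math.log(n_samples) - 2 * log_likelihood


def select_mixture_count(
    candidates: Dict[int, Tuple[float, int]], n_samples: int
) -> int:
    """Return the mixture count with the lowest BIC.

    candidates maps mixture count -> (log-likelihood, free parameters).
    """
    return min(
        candidates,
        key=lambda m: bic(candidates[m][0], candidates[m][1], n_samples),
    )


# Hypothetical training results for three acoustic model sizes.
models = {2: (-52000.0, 157), 4: (-50500.0, 315), 8: (-50100.0, 631)}
chosen = select_mixture_count(models, n_samples=10000)
print(chosen)
```

Both criteria are computed on the full training set, which is why no data needs to be held out as it would be for cross-validation. With these invented numbers the stronger BIC penalty favours the 4-mixture model, while AIC would prefer the 8-mixture one.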


Title:: Application of local bidirectional language model to error correction in Polish medical speech recognition
Authors:: Sas, J.
Links:: https://bibliotekanauki.pl/articles/333597.pdf
Publication date:: 2010
Publisher:: Uniwersytet Śląski. Wydział Informatyki i Nauki o Materiałach. Instytut Informatyki. Zakład Systemów Komputerowych
Subjects:: speech recognition
language models
medical information systems
Description:: In the paper, a method of correcting short word deletion errors in automatic speech recognition is described. Short word deletion errors appear to be a frequent error type in Polish speech recognition. The proposed speech recognition process consists of two stages. At the first stage, the utterance is recognized by a typical speech recognizer based on a forward bigram language model. At the second stage, the word sequence recognized at the first stage is analyzed to locate pairs of adjacent words that are likely to be separated by a short word such as a conjunction or preposition. The probability of a short word appearing in the context of the found words is evaluated using centered trigrams and a backward bigram language model for short words prone to deletion. The set of probabilistic language properties used to correct deletions is called here the Local Bidirectional Language Model (in contrast to the purely forward or backward models typically used in speech recognition). The decision to insert a short word is based on a comparison of the deletion error probability of the first-stage recognizer with the error probability of the decision based only on centered trigrams and the backward model. Despite its simplicity, the method proved to be effective in correcting deletion errors of the most frequently appearing Polish prepositions. The method was tested on the recognition of spoken medical reports, where the overall short word deletion error rate was reduced by almost 45%.
Source:: Journal of Medical Informatics & Technologies; 2010, 15; 127-134
1642-6037
Appears in:: Journal of Medical Informatics & Technologies
Content provider:: Biblioteka Nauki

Article
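
The second stage described above can be illustrated with a heavily simplified sketch that uses only centered trigram counts. It does not reproduce the paper's model: the backward bigram component is omitted, and a fixed `threshold` stands in for the comparison with the first-stage recognizer's deletion error probability. All names and the toy English corpus are assumptions.

```python
from collections import Counter
from typing import List, Optional, Sequence, Set, Tuple


def train_counts(
    sentences: Sequence[Sequence[str]], short_words: Set[str]
) -> Tuple[Counter, Counter]:
    """Count centered trigrams (left, short, right) and adjacent pairs."""
    centered, adjacent = Counter(), Counter()
    for words in sentences:
        for a, b in zip(words, words[1:]):
            adjacent[(a, b)] += 1
        for a, s, b in zip(words, words[1:], words[2:]):
            if s in short_words:
                centered[(a, s, b)] += 1
    return centered, adjacent


def propose_insertion(left, right, centered, adjacent, short_words,
                      threshold=0.5) -> Optional[str]:
    """Return the short word most likely missing between left and right."""
    total = adjacent[(left, right)]
    best_word, best_count = None, 0
    for s in sorted(short_words):
        count = centered[(left, s, right)]
        total += count
        if count > best_count:
            best_word, best_count = s, count
    if total and best_count / total >= threshold:
        return best_word
    return None


def correct_deletions(words, centered, adjacent, short_words,
                      threshold=0.5) -> List[str]:
    """Re-insert short words between adjacent recognized words."""
    corrected = []
    for i, word in enumerate(words):
        corrected.append(word)
        if i + 1 < len(words):
            s = propose_insertion(word, words[i + 1], centered, adjacent,
                                  short_words, threshold)
            if s is not None:
                corrected.append(s)
    return corrected


short = {"in", "of"}
corpus = [["lesion", "in", "lung"]] * 3 + [["lesion", "of", "lung"],
                                            ["lung", "visible"]]
centered, adjacent = train_counts(corpus, short)
fixed = correct_deletions(["lesion", "lung", "visible"], centered, adjacent, short)
print(fixed)
```

The context on both sides of the gap is what makes the model "centered": a forward bigram alone would see only the left neighbour.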


Title:: Building compact language models for medical speech recognition in mobile devices with limited amount of memory
Authors:: Sas, J.
Links:: https://bibliotekanauki.pl/articles/332971.pdf
Publication date:: 2012
Publisher:: Uniwersytet Śląski. Wydział Informatyki i Nauki o Materiałach. Instytut Informatyki. Zakład Systemów Komputerowych
Subjects:: automatic speech recognition
medical information systems
language modeling
Description:: The article presents a method of building a compact language model for speech recognition in devices with a limited amount of memory. The most popular word-based bigram language models allow for highly accurate speech recognition but need a large amount of memory, mainly due to the large number of word bigrams. The method proposed here ranks bigrams according to their importance in speech recognition and replaces the explicit estimates of the probabilities of less important bigrams with probabilities derived from a class-based model. The class-based model is created by assigning words appearing in the corpus to classes corresponding to the syntactic properties of words. The classes represent various combinations of part-of-speech inflectional features such as number, case, tense, person, etc. To minimize the amount of memory needed to store the class-based model, a method that reduces the number of part-of-speech classes has been applied; it merges the classes appearing in stochastically similar contexts in the corpus. The experiments carried out with selected domains of medical speech show that the method allows for a 75% reduction of model size without significant loss of speech recognition accuracy.
Source:: Journal of Medical Informatics & Technologies; 2012, 20; 111-119
1642-6037
Appears in:: Journal of Medical Informatics & Technologies
Content provider:: Biblioteka Nauki

Article
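
The idea in the description above (store only the most important word bigrams explicitly and derive the rest from a class-based model) can be sketched as follows. Several simplifications are assumed: bigram "importance" is approximated by raw frequency, the class-based estimate is the usual P(class2 | class1) * P(word2 | class2), the class-merging step is not shown, and the combined probabilities are not renormalised. The class name, the toy corpus, and the two-class tagging are hypothetical.

```python
from collections import Counter
from typing import Dict, Sequence


class CompactBigramModel:
    def __init__(self, sentences: Sequence[Sequence[str]],
                 word_class: Dict[str, str], keep: int) -> None:
        self.word_class = word_class
        bigrams, history = Counter(), Counter()
        self.unigrams, self.class_unigrams = Counter(), Counter()
        self.class_bigrams, self.class_history = Counter(), Counter()
        for words in sentences:
            for w in words:
                self.unigrams[w] += 1
                self.class_unigrams[word_class[w]] += 1
            for a, b in zip(words, words[1:]):
                bigrams[(a, b)] += 1
                history[a] += 1
                self.class_bigrams[(word_class[a], word_class[b])] += 1
                self.class_history[word_class[a]] += 1
        # Only the `keep` most frequent word bigrams are stored explicitly.
        self.explicit = {
            pair: count / history[pair[0]]
            for pair, count in bigrams.most_common(keep)
        }

    def prob(self, a: str, b: str) -> float:
        """P(b | a): explicit estimate if stored, else class-based."""
        if (a, b) in self.explicit:
            return self.explicit[(a, b)]
        ca, cb = self.word_class[a], self.word_class[b]
        if not self.class_history[ca] or not self.class_unigrams[cb]:
            return 0.0
        class_transition = self.class_bigrams[(ca, cb)] / self.class_history[ca]
        word_given_class = self.unigrams[b] / self.class_unigrams[cb]
        return class_transition * word_given_class


corpus = [["patient", "has", "fever"],
          ["patient", "has", "cough"],
          ["doctor", "sees", "patient"]]
classes = {"patient": "N", "doctor": "N", "fever": "N", "cough": "N",
           "has": "V", "sees": "V"}
model = CompactBigramModel(corpus, classes, keep=1)
print(model.prob("patient", "has"), model.prob("doctor", "has"))
```

Memory is saved because the class tables grow with the number of classes rather than with the number of distinct word pairs; as a side effect, word pairs never seen in the corpus (such as "doctor has" here) still receive a non-zero estimate.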


Title:: Optimal spoken dialog control in hands-free medical information systems
Authors:: Sas, J.
Links:: https://bibliotekanauki.pl/articles/333081.pdf
Publication date:: 2009
Publisher:: Uniwersytet Śląski. Wydział Informatyki i Nauki o Materiałach. Instytut Informatyki. Zakład Systemów Komputerowych
Subjects:: automatic speech recognition
genetic optimization
medical information systems
Description:: In the paper, a method of optimal selection of utterances used as command entry-words for a voice-controlled application is presented. Voice-controlled programs seem to be particularly useful in the area of medical informatics, where a physician interacts with a program by voice while operating a medical device or being involved in examinations requiring manual activities. The proposed method selects command words from sets of proposals defined for each command so as to minimize the overall probability of incorrect command recognition. First, the entry-word dissimilarity matrix is calculated. The word dissimilarities are evaluated using HMMs consisting of appropriately trained acoustic models of the phonemes constituting the words. The trained HMM is used as a sample utterance generator for the word. The artificially created utterance samples are then recognized by speech recognizers created for pairs of words. The estimated probability of correct recognition is used as the word dissimilarity measure. The word dissimilarities are then used to determine the average assessment of the word selections that can be used as commands. A selection is created by choosing a single word from the set of candidates defined for each command. Finally, a suboptimal selection is found using a genetic algorithm. The experiments carried out show that suboptimal selection of command entry-words can noticeably increase the accuracy of spoken command recognition in many cases.
Source:: Journal of Medical Informatics & Technologies; 2009, 13; 113-120
1642-6037
Appears in:: Journal of Medical Informatics & Technologies
Content provider:: Biblioteka Nauki

Article
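
The final step of the description above (searching for a good selection of one entry-word per command with a genetic algorithm) can be sketched as follows. This is a generic, minimal genetic algorithm, not the paper's implementation: the fitness is assumed to be the average pairwise dissimilarity of the chosen words, the dissimilarity values are invented rather than estimated with HMM-generated samples, and all parameter names and defaults are hypothetical. It expects at least two commands, distinct candidate words, and a population of at least four.

```python
import random
from typing import Dict, FrozenSet, List, Sequence


def selection_score(selection: Sequence[str],
                    dissimilarity: Dict[FrozenSet[str], float]) -> float:
    """Average pairwise dissimilarity of the chosen words (higher is better)."""
    pairs = [(a, b) for i, a in enumerate(selection) for b in selection[i + 1:]]
    return sum(dissimilarity[frozenset(p)] for p in pairs) / len(pairs)


def genetic_select(candidates: Sequence[Sequence[str]],
                   dissimilarity: Dict[FrozenSet[str], float],
                   generations: int = 40, population_size: int = 12,
                   mutation_rate: float = 0.2, seed: int = 0) -> List[str]:
    rng = random.Random(seed)

    def score(individual: List[str]) -> float:
        return selection_score(individual, dissimilarity)

    # Start from the first candidate of every command plus random selections.
    population = [[options[0] for options in candidates]]
    population += [[rng.choice(options) for options in candidates]
                   for _ in range(population_size - 1)]
    for _ in range(generations):
        population.sort(key=score, reverse=True)
        parents = population[:population_size // 2]
        children = [parents[0]]  # elitism: the best selection always survives
        while len(children) < population_size:
            mother, father = rng.sample(parents, 2)
            cut = rng.randrange(1, len(candidates))  # one-point crossover
            child = mother[:cut] + father[cut:]
            for i, options in enumerate(candidates):
                if rng.random() < mutation_rate:
                    child[i] = rng.choice(options)
            children.append(child)
        population = children
    return max(population, key=score)


commands = [["next", "forward"], ["back", "previous"], ["stop", "halt"]]
raw = {
    ("next", "back"): 0.70, ("next", "previous"): 0.95,
    ("forward", "back"): 0.90, ("forward", "previous"): 0.85,
    ("next", "stop"): 0.80, ("next", "halt"): 0.90,
    ("forward", "stop"): 0.92, ("forward", "halt"): 0.88,
    ("back", "stop"): 0.75, ("back", "halt"): 0.85,
    ("previous", "stop"): 0.93, ("previous", "halt"): 0.91,
}
matrix = {frozenset(pair): value for pair, value in raw.items()}
chosen = genetic_select(commands, matrix)
print(chosen, round(selection_score(chosen, matrix), 3))
```

For three commands an exhaustive search would be trivial; a heuristic such as this becomes useful when the number of commands and candidates makes enumerating every combination impractical, at the price of finding a good rather than provably optimal selection.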


Title:: Pipelined language model construction for Polish speech recognition
Authors:: Sas, J.
Żołnierek, A.
Links:: https://bibliotekanauki.pl/articles/329841.pdf
Publication date:: 2013
Publisher:: Uniwersytet Zielonogórski. Oficyna Wydawnicza
Subjects:: automatic speech recognition
hidden Markov model
adaptive language model
Description:: The aim of the work described in this article is to elaborate and experimentally evaluate a consistent method of Language Model (LM) construction for Polish speech recognition. In the proposed method we tried to take into account the features and specific problems experienced in practical applications of speech recognition in the Polish language: rich inflection, a loose word order and the tendency for short word deletion. The LM is created in five stages. Each successive stage takes the model prepared at the previous stage and modifies or extends it so as to improve its properties. At the first stage, typical methods of LM smoothing are used to create the initial model. The four most frequently used methods of LM construction are considered here. At the second stage, the model is extended in order to take into account words indirectly co-occurring in the corpus. At the next stage, LM modifications are aimed at reducing short word deletion errors, which occur frequently in Polish speech recognition. The fourth stage extends the model by inserting words that were not observed in the corpus. Finally, the model is modified so as to assure highly accurate recognition of very important utterances. The performance of the methods applied is tested in four language domains.
Source:: International Journal of Applied Mathematics and Computer Science; 2013, 23, 3; 649-668
1641-876X
2083-8492
Appears in:: International Journal of Applied Mathematics and Computer Science
Content provider:: Biblioteka Nauki

Article

