- Tytuł:
- Application of local bidirectional language model to error correction in polish medical speech recognition
- Autorzy:
- Sas, J.
- Powiązania:
- https://bibliotekanauki.pl/articles/333597.pdf
- Data publikacji:
- 2010
- Wydawca:
- Uniwersytet Śląski. Wydział Informatyki i Nauki o Materiałach. Instytut Informatyki. Zakład Systemów Komputerowych
- Tematy:
-
rozpoznawanie mowy
modele języka
medyczne systemy informacji
speech recognition
language models
medical information systems - Opis:
- In the paper, the method of short word deletion errors correction in automatic speech recognition is described. Short word deletion errors appear to be a frequent error type in Polish speech recognition. The proposed speech recognition process consists of two stages. At the first stage the utterance is recognized by a typical speech recognizer based on forward bigram language model. At the second stage the word sequence recognized by the first stage recognizer is analyzed and such pairs of adjacent words in the recognized sequence are localized, which are likely to be separated by a short word like conjunction or preposition. The probability of short word appearance in context of found words is evaluated using centered trigrams and backward bigram language model for short words prone to deletion. The set of probabilistic language properties used to correct deletions is called here Local Bidirectional Language Model (in contrast to purely forward or backward model used typically in speech recognition). The decision of short word insertion is based on comparison of deletion error probability of the first stage recognizer and the error probability of the decision based only on centered trigrams and backward model. Despite its simplicity, the method proved to be effective in correcting deletion errors of most frequently appearing Polish prepositions. The method was tested in application to medical spoken reports recognition, where the overall short word deletion error rate was reduced by almost 45%.
- Źródło:
-
Journal of Medical Informatics & Technologies; 2010, 15; 127-134
1642-6037 - Pojawia się w:
- Journal of Medical Informatics & Technologies
- Dostawca treści:
- Biblioteka Nauki