Polish tagger TaKIPI: rule based construction and optimization

Szczegóły
Opis

Tytuł:: Polish tagger TaKIPI: rule based construction and optimization
Autorzy:: Piasecki, M.
Powiązania:: https://bibliotekanauki.pl/articles/1943261.pdf
Data publikacji:: 2007
Wydawca:: Politechnika Gdańska
Tematy:: morphosyntactic tagging
Polish
rule based tagging
decizion trees
Źródło:: TASK Quarterly. Scientific Bulletin of Academic Computer Centre in Gdansk; 2007, 11, 1-2; 151-167
1428-6394
Język:: angielski
Prawa:: CC BY: Creative Commons Uznanie autorstwa 4.0
Dostawca treści:: Biblioteka Nauki
: Artykuł

Przejdź do źródła

A large number of different tags, limited corpora and the free word order are the main causes of low accuracy of tagging in Polish (automatic disambiguation of morphological descriptions) by applying commonly used techniques based on stochastic modeling. In the paper the rule-based architecture of the TaKIPI Polish tagger combining handwritten and automatically extracted rules is presented. The possibilities of optimization of its parameters and component are discussed, including the possibility of using different methods of rules extraction, than C4.5 Decision Trees applied initially. The main goal of this paper is to explore a range of promising rule-based classifiers and investigate their impact on the accuracy of tagging. Simple techniques of combing classifiers are also tested. The performed experiments have shown that even a simple combination of different classifiers can increase the tagger's accuracy by almost one percent.

Informacja

Powiązane pozycje