Temat: electronic text - Katalog OPAC zbiorów

Skocz do pozycji: 1.

Tytuł:: O projekcie Korpusu Polszczyzny do 1500 roku
Autorzy:: Deptuchowa, Ewa
Jasińska, Katarzyna
Klapper, Magdalena
Kołodziej, Dorota
Powiązania:: https://bibliotekanauki.pl/articles/1630446.pdf
Data publikacji:: 2020-10-30
Wydawca:: Towarzystwo Kultury Języka
Tematy:: electronic text corpus
Old Polish
Mediaeval Polish
inflectional description
Opis:: This paper presents the assumptions of the Corpus of Polish until 1500, which is being developed as part of the project titled Baza leksykalna średniowiecznej polszczyzny (do 1500 roku). Fleksja (Lexical Database of Medieval Polish (until 1500). Infl ection). It introduces the fundamental objectives of the project, namely preparing an infl ectional description of all (infl ected) words from the time until 1500 and building a morphosyntactically annotated collection of texts from the same period. Afterwards, the authors discuss the present digital collections of Old Polish texts. In the main part of the paper, they present the criteria for selecting sources for the Corpus under creation and their elaboration methods, which refer to the solutions developed in the Electronic Corpus of Polish Texts from the 17th and 18th centuries (until 1772). Their major modifi cations aimed to adapt the structural and morphosyntactic annotations for the purpose of describing Mediaeval Polish are discussed on selected examples.
Źródło:: Poradnik Językowy; 2020, 777, 8; 7-16
0551-5343
Pojawia się w:: Poradnik Językowy
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 2.

Tytuł:: Analiza fleksyjna tekstów historycznych i zmienność fleksji polskiej z perspektywy danych korpusowych
Autorzy:: Woliński, Marcin
Kieraś, Witold
Powiązania:: https://bibliotekanauki.pl/articles/1630443.pdf
Data publikacji:: 2020-10-30
Wydawca:: Towarzystwo Kultury Języka
Tematy:: electronic text corpus
natural language processing
inflection of Polish
history of language
Opis:: The subject matter of this paper is Chronofleks, a computer system (http://chronofleks.nlp.ipipan.waw.pl/) modelling Polish inflection based on a corpus material. The system visualises changes of inflectional paradigms of individual lexemes over time and enables examination of the variability of the frequency of inflected form groups distinguished based on various criteria. Feeding Chronofleks with corpus data required development of IT tools to ensure an inflectional processing sequence of texts analogous to the ones used for modern language; they comprise a transcriber, a morphological analyser, and a tagger. The work was performed on data from three historical periods (1601–1772, 1830–1918, and modern ones) elaborated in independent projects. Therefore, finding a common manner of describing data from the individual periods was a significant element of the work.
Źródło:: Poradnik Językowy; 2020, 777, 8; 66-80
0551-5343
Pojawia się w:: Poradnik Językowy
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 3.

Tytuł:: Elektroniczny Korpus Tekstów Polskich z XVII i XVIII w. – problemy teoretyczne i warsztatowe
Autorzy:: Gruszczyński, Włodzimierz
Adamiec, Dorota
Bronikowska, Renata
Wieczorek, Aleksandra
Powiązania:: https://bibliotekanauki.pl/articles/1630441.pdf
Data publikacji:: 2020
Wydawca:: Towarzystwo Kultury Języka
Tematy:: electronic text corpus
historical corpus
17th-18th-century Polish
natural language processing
Opis:: This paper presents the Electronic Corpus of 17th- and 18th-century Polish Texts (KorBa) – a large (13.5-million), annotated historical corpus available online. Its creation was modelled on the assumptions of the National Corpus of Polish (NKJP), yet the specifi c nature of the historical material enforced certain modifi cations of the solutions applied in NKJP, e.g. two forms of text representation (transliteration and transcription) were introduced, the principle of designating foreign-language fragments was adopted, and the tagset was adapted to the description of the grammatical structure of the Middle Polish language. The texts collected in KorBa are diversified in chronological, geographical, stylistic, and thematic terms although, due to e.g. limited access to the material, the postulate of representativeness and sustainability of the corpus was not fully implemented. The work on the corpus was to a large extent automated as a result of using natural language processing tools.
Źródło:: Poradnik Językowy; 2020, 777, 8; 32-51
0551-5343
Pojawia się w:: Poradnik Językowy
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Informacja

Wyszukujesz frazę "electronic text" wg kryterium: Temat

Źródło danych

Dostawca treści

Kolekcja

Rok wydania

Wydawca

Temat

Autor

Typ dokumentu

Język