Subject: corpora - OPAC collections catalog

Title:: Polsko-bułgarskie korpusy IS PAN i CLARIN-PL
Polish-Bulgarian corpora ISS PAS (IS PAN) and CLARIN-PL
Authors:: Roszko, Danuta
Roszko, Roman
Sosnowski, Wojciech
Links:: https://bibliotekanauki.pl/articles/694545.pdf
Publication date:: 2018
Publisher:: Uniwersytet Łódzki. Wydawnictwo Uniwersytetu Łódzkiego
Subjects:: Polish-Bulgarian Corpora
Parallel Corpora
CLARIN-PL
Description:: Multilingual corpora have found many applications in the arts and humanities and the social sciences, as well as in translation, and they can be used in a number of ways. Translators and CAT users would predominantly use translation memories (TM). Other users can choose between two ways of accessing the resources produced by the Institute of Slavic Studies. In the first method, the user downloads the open-source TMX translation memories from the CLARIN-PL DSpace repository (https://clarin-pl.eu/dspace) and loads them into their preferred computer application. Both free and proprietary applications that facilitate querying multilingual corpora are available; CLARIN-PL also offers free tools. The second method does not require any advanced computer skills: the CLARIN-PL webpage includes the KonText search engine, which also contains Polish-Bulgarian resources (https://kontext.clarin-pl.eu/). The Polish-Bulgarian corpus contains the following types of resources: (1) fiction, (2) specialist literature (literature that reflects the latest technological and cultural developments), and (3) film dialogues, which are the most similar to spoken language.
Source:: Slavica Lodziensia; 2018, 2; 59-70
2544-1795
Appears in:: Slavica Lodziensia
Content provider:: Biblioteka Nauki

Article

Title:: Experimental Polish-Lithuanian Corpus with the Semantic Annotation Elements
Authors:: Roszko, Danuta
Roszko, Roman
Links:: https://bibliotekanauki.pl/articles/677259.pdf
Publication date:: 2013
Publisher:: Polska Akademia Nauk. Instytut Slawistyki PAN
Subjects:: corpora
parallel and comparable corpora
annotation
Polish
Lithuanian
Description:: In the article, the authors present the experimental Polish-Lithuanian corpus (ECorpPL-LT), created to support Polish-Lithuanian theoretical contrastive studies and a Polish-Lithuanian electronic dictionary, and to assist sworn translators. The semantic annotation being introduced into ECorpPL-LT is extremely useful in Polish-Lithuanian contrastive studies and also proves helpful in translation work.
Source:: Cognitive Studies; 2013, 13
2392-2397
Appears in:: Cognitive Studies
Content provider:: Biblioteka Nauki

Article

Title:: Application of multilingual corpus in contrastive studies (on the example of the Bulgarian-Polish-Lithuanian parallel corpus)
Authors:: Dimitrova, Ludmila
Koseska-Toszewa, Violetta
Roszko, Danuta
Roszko, Roman
Links:: https://bibliotekanauki.pl/articles/677184.pdf
Publication date:: 2010
Publisher:: Polska Akademia Nauk. Instytut Slawistyki PAN
Subjects:: multilingual electronic corpora
parallel and comparable corpora
corpus annotation
lexical databases
multilingual electronic dictionaries
Description:: In this paper we present applications of a trilingual corpus in language research. Comparative and contrastive studies of Polish and Bulgarian, as well as of Polish and Lithuanian, have already been conducted, but to the best of our knowledge no such studies exist for Bulgarian and Lithuanian. On the one hand, it is interesting to note that two Slavic languages are compared to a Baltic language (Lithuanian). On the other hand, the three languages are marginally present in the EU because of the later accession of the three countries to the EU. The paper briefly describes the first electronic Bulgarian-Polish-Lithuanian experimental corpus, currently under development for research purposes only. We also focus on the morphosyntactic annotation of the parallel trilingual corpus according to the Corpus Encoding Standard: we review the part-of-speech (POS) classification of the participle in the three languages (Bulgarian, Polish, and Lithuanian) in comparison to another POS, the adjective. Using some examples, we briefly discuss tagsets for corpus annotation from the point of view of possible future unification.
Source:: Cognitive Studies; 2010, 10
2392-2397
Appears in:: Cognitive Studies
Content provider:: Biblioteka Nauki

Article

Title:: The Use of the Lexical Exponents of Hypothetical Modality in Polish and Lithuanian
Authors:: Roszko, Danuta
Links:: https://bibliotekanauki.pl/articles/677340.pdf
Publication date:: 2016
Publisher:: Polska Akademia Nauk. Instytut Slawistyki PAN
Subjects:: Polish
Lithuanian
hypothetical modality
parallel corpora
Description:: In this article the author focuses on the issue of hypothetical modality [1] in Polish and Lithuanian. A list of the basic exponents of hypothetical modality in both languages is presented; however, the focus is mainly on the lexical exponents. On the basis of one of the six groups, which describes a high degree of probability (H5), the differences between the use of the lexical exponents in the two languages are examined. The study draws on multilingual corpus resources, including the Polish-Lithuanian parallel corpus CLARIN-PL. The abstract is also given in Polish under the title "O użyciu wykładników leksykalnych modalności hipotetycznej w językach polskim i litewskim".
[1] In the academic literature, the term epistemic modality is also used for the notion described here. Nevertheless, in this paper I will continue to use the term hypotheticality, which I borrowed from the studies on modality conducted in Polish-Bulgarian cooperation (Institute of Slavic Studies of the Polish Academy of Sciences and Institute for Bulgarian Language of the Bulgarian Academy of Sciences).
Source:: Cognitive Studies; 2016, 16
2392-2397
Appears in:: Cognitive Studies
Content provider:: Biblioteka Nauki

Article

Title:: Experimental Corpus of the Lithuanian Local Dialect of Punsk in Poland. Examples of the Lexical and Semantic Annotation
Authors:: Roszko, Danuta
Links:: https://bibliotekanauki.pl/articles/677261.pdf
Publication date:: 2013
Publisher:: Polska Akademia Nauk. Instytut Slawistyki PAN
Subjects:: corpora
annotation
Lithuanian local dialect of Punsk in Poland
experimental dialectal corpus
Description:: In the article the author describes the experimental corpus of the Lithuanian local dialect of Puńsk in Poland (ECorp-of-Punsk). It is the first corpus of this type for the Lithuanian local dialect. The corpus consists of three subcorpora: the first (referred to as fundamental) contains utterances given by Lithuanians in the local dialect, the second contains utterances given by Lithuanians in Polish, and the third contains aligned Polish-dialectal texts. ECorp-of-Punsk includes texts recorded in the years 1986–2012.
Source:: Cognitive Studies; 2013, 13
2392-2397
Appears in:: Cognitive Studies
Content provider:: Biblioteka Nauki

Article

Information

Search phrase: "corpora", by criterion: Subject
