Informacja

Drogi użytkowniku, aplikacja do prawidłowego działania wymaga obsługi JavaScript. Proszę włącz obsługę JavaScript w Twojej przeglądarce.

Wyszukujesz frazę "parallel corpora" wg kryterium: Temat


Tytuł:
Application of multilingual corpus in contrastive studies (on the example of the Bulgarian-Polish-Lithuanian parallel corpus)
Autorzy:
Dimitrova, Ludmila
Koseska-Toszewa, Violetta
Roszko, Danuta
Roszko, Roman
Powiązania:
https://bibliotekanauki.pl/articles/677184.pdf
Data publikacji:
2010
Wydawca:
Polska Akademia Nauk. Instytut Slawistyki PAN
Tematy:
multilingual electronic corpora
parallel and comparable corpora
corpus annotation
lexical databases
multilingual electronic dictionaries
Opis:
Application of multilingual corpus in contrastive studies (on the example of the Bulgarian-Polish-Lithuanian parallel corpus)In this paper we present applications of a trilingual corpus in language research. Comparative and contrastive studies of Polish and Bulgarian as well as Polish and Lithuanian have been already conducted, but up to the best of our knowledge no such studies exist for Bulgarian and Lithuanian. On the one hand, it is interesting to note that two Slavic languages are compared to a Baltic language (Lithuanian). On the other hand, the three languages are marginally present in the EU because of the later ascension of the three countries to the EU. The paper shortly describes the first electronic Bulgarian–Polish–Lithuanian experimental corpus, currently under development only for research. We also focus our attention on the morphosyntactic annotation of the parallel trilingual corpus according to the Corpus Encoding Standard: we present a review of the Part-of-Speech (POS) classification of the participle in the three languages – Bulgarian, Polish, and Lithuanian in comparison to another POS, the adjective. We briefly discuss tagsets for corpus annotation from the point of view of possible unification in the future with some examples.
Źródło:
Cognitive Studies; 2010, 10
2392-2397
Pojawia się w:
Cognitive Studies
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Multilingual digital resources with Bulgarian language
Autorzy:
Dimitrova, Ludmila
Powiązania:
https://bibliotekanauki.pl/articles/677179.pdf
Data publikacji:
2010
Wydawca:
Polska Akademia Nauk. Instytut Slawistyki PAN
Tematy:
corpora (parallel
comparable
aligned)
corpus annotation
digital dictionaries
lexical databases
morpho-syntactic specifications
Opis:
Multilingual digital resources with Bulgarian languageThe paper presents in brief Bulgarian language resources as a part of multilingual digital resources developed in the frame of some international projects, among them parallel annotated and aligned corpora, comparable corpora, morpho-syntactic specifications for corpora annotation and dictionaries encoding, lexicons, lexical databases, and electronic dictionaries.
Źródło:
Cognitive Studies; 2010, 10
2392-2397
Pojawia się w:
Cognitive Studies
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
The Bulgarian-Polish-Russian parallel corpus
Autorzy:
Duškin, Maksim
Satoła-Staśkowiak, Joanna
Powiązania:
https://bibliotekanauki.pl/articles/677061.pdf
Data publikacji:
2011
Wydawca:
Polska Akademia Nauk. Instytut Slawistyki PAN
Tematy:
parallel corpora
the Bulgarian-Polish-Russian parallel corpus
Opis:
The Bulgarian-Polish-Russian parallel corpusThe Semantics Laboratory Team of Institute of Slavic Studies of Polish Academy of Sciences is planning to begin work on the creation of a Bulgarian-Polish-Russian parallel corpus. The three selected languages are representatives of the main groups of Slavic languages: Bulgarian represents the southern group of Slavic languages, Polish – the western group of Slavic languages, Russian – the eastern group of Slavic languages. Our project will be the first parallel corpus of these three languages. The planned corpus will be based on material, dating from one period (the 20th century) and will have a synchronous nature. The project will not constitute the sum of the separate corpora of selected languages.One of the problems with creating multilingual parallel corpora are different proportions of translated texts between the selected languages, for example, Polish literature is often translated into Bulgarian, but not vice versa.Bulgarian, Russian and Polish differ typologically – Bulgarian is an analytic language, Polish and Russian are synthetic. The parallel corpus should have compatible annotation, while taking into account the characteristic features of the selected languages.We hope that the Bulgarian-Polish-Russian parallel corpus will serve as a source of linguistic material of contrastive language studies and may prove to be a big help for linguists, translators, terminologists and students of linguistics. The results of our work will be available on the Internet.
Źródło:
Cognitive Studies; 2011, 11
2392-2397
Pojawia się w:
Cognitive Studies
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Web presentation of bilingual corpora (Slovak-Bulgarian and Bulgarian-Polish)
Autorzy:
Garabík, Radovan
Dimitrova, Ludmila
Koseska-Toszewa, Violetta
Powiązania:
https://bibliotekanauki.pl/articles/677063.pdf
Data publikacji:
2011
Wydawca:
Polska Akademia Nauk. Instytut Slawistyki PAN
Tematy:
Bulgarian
Polish
Slovak
digital language resources
parallel and aligned corpora
web presentation
Opis:
Web presentation of bilingual corpora (Slovak-Bulgarian and Bulgarian-Polish)In this paper we focus on the web-presentation of bilingual corpora in three Slavic languages and their possible applications. Slovak-Bulgarian and Bulgarian-Polish corpora are collected and developed as results of the collaboration in the frameworks of two joint research projects between Institute of Mathematics and Informatics, Bulgarian Academy of Sciences, from one side, and from the other side: Ľ. Štúr Institute of Linguistics, Slovak Academy of Sciences and Institute of Slavic Studies, Polish Academy of Sciences, coordinate by authors of this paper.
Źródło:
Cognitive Studies; 2011, 11
2392-2397
Pojawia się w:
Cognitive Studies
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Bulgarian-Polish parallel digital corpus and quantification of time
Autorzy:
Dimitrova, Ludmila
Koseska-Toszewa, Violetta
Powiązania:
https://bibliotekanauki.pl/articles/677292.pdf
Data publikacji:
2012
Wydawca:
Polska Akademia Nauk. Instytut Slawistyki PAN
Tematy:
Bulgarian
Polish
digital language resources
parallel and aligned corpora
quantification of time
Opis:
Bulgarian-Polish parallel digital corpus and quantification of timeThe paper presents the current state of the first Bulgarian-Polish parallel and aligned corpus, prepared in the frame of the joint research project “Semantics and Contrastive linguistics with a focus on a bilingual electronic dictionary” between the Institute of Mathematics and Informatics, Bulgarian Academy of Sciences and the Institute of Slavic Studies, Polish Academy of Sciences, coordinated by L. Dimitrova and V. Koseska-Toszewa. In particular, problems related to tense quantification are also discussed
Źródło:
Cognitive Studies; 2012, 12
2392-2397
Pojawia się w:
Cognitive Studies
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
About Certain Semantic Annotation in Parallel Corpora
Autorzy:
Koseska-Toszewa, Violetta
Powiązania:
https://bibliotekanauki.pl/articles/677255.pdf
Data publikacji:
2013
Wydawca:
Polska Akademia Nauk. Instytut Slawistyki PAN
Tematy:
direct approach to semantics
semantic annotation
perfective aspect
inperfective aspect
event
state
Petri nets
parallel corpora
contrastive linguistics
Opis:
About Certain Semantic Annotation in Parallel CorporaThe semantic notation analyzed in this works is contained in the second stream of semantic theories presented here – in the direct approach semantics. We used this stream in our work on the Bulgarian-Polish Contrastive Grammar. Our semantic notation distinguishes quantificational meanings of names and predicates, and indicates aspectual and temporal meanings of verbs. It relies on logical scope-based quantification and on the contemporary theory of processes, known as “Petri nets”. Thanks to it, we can distinguish precisely between a language form and its contents, e.g. a perfective verb form has two meanings: an event or a sequence of events and states, finally ended with an event. An imperfective verb form also has two meanings: a state or a sequence of states and events, finally ended with a state. In turn, names are quantified universally or existentially when they are “undefined”, and uniquely (using the iota operator) when they are “defined”. A fact worth emphasizing is the possibility of quantifying not only names, but also the predicate, and then quantification concerns time and aspect.  This is a novum in elaborating sentence-level semantics in parallel corpora. For this reason, our semantic notation is manual. We are hoping that it will raise the interest of computer scientists working on automatic methods for processing the given natural languages. Semantic annotation defined like in this work will facilitate contrastive studies of natural languages, and this in turn will verify the results of those studies, and will certainly facilitate human and machine translations.
Źródło:
Cognitive Studies; 2013, 13
2392-2397
Pojawia się w:
Cognitive Studies
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Contemporary Contrastive Studies of Polish, Bulgarian and Russian Neologisms versus Language Corpora
Autorzy:
Satoła-Staśkowiak, Joanna
Powiązania:
https://bibliotekanauki.pl/articles/677236.pdf
Data publikacji:
2013
Wydawca:
Polska Akademia Nauk. Instytut Slawistyki PAN
Tematy:
neologisms
neosemantisms
internationalisms
language corpora
parallel corpora
contrastive studies
Polish language
Bulgarian language
Russian language
Opis:
Contemporary Contrastive Studies of Polish, Bulgarian and Russian Neologisms versus Language CorporaIn the field of Slavonic linguistics contrastive studies of neologisms occupy little place, the newest words are insufficiently described and classified. The aim of this article is to draw attention to the need for contrastive description of the newest lexis and checking exclusively one of many possibilities of obtaining Polish, Bulgarian and Russian neologisms. Language corpora, as this possibility is in question, are not the only source from which the author obtains her research material, yet a growing interest in corpora has inspired her to also use this method. The author wants to show the reader to what degree language corpora can help in building the thesaurus of Polish, Bulgarian and Russian neologisms. Making an attempt to confront a collection of neologisms of contemporary Polish, Bulgarian and Russian language, the author points out the need to standardize the description (identical for each of the analysed languages), which she intends to propose in another publications on neologisms in Polish, Bulgarian and Russian language. The application of contrastive method to three different but related languages from the Slavonic group will help, in her opinion, to discover more mechanisms of new words coming into existence and examine the newest derivative processes and their productivity.
Źródło:
Cognitive Studies; 2013, 13
2392-2397
Pojawia się w:
Cognitive Studies
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Dialog between a Lexicographer and a Translator
Autorzy:
Kit, Mark
Koseska-Toszewa, Violetta
Powiązania:
https://bibliotekanauki.pl/articles/677253.pdf
Data publikacji:
2013
Wydawca:
Polska Akademia Nauk. Instytut Slawistyki PAN
Tematy:
reversibility of dictionaries
lexical structures
multilingual dictionaries
perfective aspect
semantic tags
parallel corpora
corpus linguistics
computational linguistics
contrastive linguistics
Opis:
Dialog between a Lexicographer and a TranslatorThe discussion between the authors of the paper concerns the most pressing issues encountered in natural language semantics, as well as in corpus linguistics and computational linguistics. A broad range of knowledge, allowing linguists and information scientists to work together, is required in these areas. The paper describes some primary problems of human and machine translation caused by gaps between different fields of knowledge. The authors suggest that interdisciplinary approach is required when it comes to contrastive studies in linguistics.
Źródło:
Cognitive Studies; 2013, 13
2392-2397
Pojawia się w:
Cognitive Studies
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Experimental Polish-Lithuanian Corpus with the Semantic Annotation Elements
Autorzy:
Roszko, Danuta
Roszko, Roman
Powiązania:
https://bibliotekanauki.pl/articles/677259.pdf
Data publikacji:
2013
Wydawca:
Polska Akademia Nauk. Instytut Slawistyki PAN
Tematy:
corpora
parallel and comparable corpora
annotation
Polish
Lithuanian
Opis:
Experimental Polish-Lithuanian Corpus with the Semantic Annotation ElementsIn the article the authors present the experimental Polish-Lithuanian corpus (ECorpPL-LT) formed for the idea of Polish-Lithuanian theoretical contrastive studies, a Polish-Lithuanian electronic dictionary, and as help for a sworn translator. The semantic annotation being brought into ECorpPL-LT is extremely useful in Polish-Lithuanian contrastive studies, and also proves helpful in translation work.
Źródło:
Cognitive Studies; 2013, 13
2392-2397
Pojawia się w:
Cognitive Studies
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Forms of Address and their Meaning in Contrast in Polish and Russian Languages
Autorzy:
Sosnowski, Wojciech
Powiązania:
https://bibliotekanauki.pl/articles/677225.pdf
Data publikacji:
2013
Wydawca:
Polska Akademia Nauk. Instytut Slawistyki PAN
Tematy:
forms of address
language politeness
language rudeness
act of speech
bilingual dictionary
parallel corpora
classifiers of politeness
print dictionary
computer dictionary
Opis:
Forms of Address and their Meaning in Contrast in Polish and Russian LanguagesMany studies in contemporary linguistics focus on investigating politeness and rudeness in language. This paper, however, has not been intended as a contrastive study of the phenomena in question. Language politeness and rudeness are conveyed by means of expressions of politeness and rudeness which are perceived as entrenched and recurring in specific situations. These expressions convey the expected meaning of politeness and rudeness accepted in the model of social behaviour. If one uses the explicative method such expressions could be reduced to the following formula ‘I inform you that I follow a verbal conduct defined as polite’. Owing to the emergence of parallel corpora of particular languages, it is nowadays easier to collect data for research on forms of address as well as on expressions of politeness in the first half of the 21st century. Investigating the meaning of forms of address, which are part of linguistic repertoire used to express politeness and rudeness should be regarded as an interesting area of research. It is the consequence of the increasing importance of intercultural communication, expansion of international cooperation, and formation of new standards of interpersonal communication aimed at achieving mutual understanding without resorting to violence. It is worth mentioning that currently there are no bilingual dictionaries which would include practical rules for using forms of address. Moreover, dictionaries (especially bilingual ones) also do not list classifiers of politeness, which becomes a shortcoming as regards the purposes of translation and teaching foreign languages. The aforementioned problems apply to print as well as computer dictionaries. A reliable list of forms of address and their meaning may become helpful in intercultural communication. It would be also important to create a Contemporary Dictionary of Expressions of Politeness and Rudeness in a paper as well as a computer version.
Źródło:
Cognitive Studies; 2013, 13
2392-2397
Pojawia się w:
Cognitive Studies
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
From the Problems of Dictionaries and Multi-lingual Corpora
Autorzy:
Koseska-Toszewa, Violetta
Satoła-Staśkowiak, Joanna
Sosnowski, Wojciech
Powiązania:
https://bibliotekanauki.pl/articles/677234.pdf
Data publikacji:
2013
Wydawca:
Polska Akademia Nauk. Instytut Slawistyki PAN
Tematy:
Bilingual dictionary
trilingual dictionary
online dictionary
traditional classifier
syntactic classifier
semantic classifier
bilingual parallel corpora
trilingual parallel corpora
linguistic quantification
“incomplete quantification”
Opis:
From the Problems of Dictionaries and Multi-lingual CorporaThe article describes the work on a number of dictionaries being developed by the Corpus Linguistics and Semantics Group of the Institute of Slavic PAS. They include “Contemporary Bulgarian-Polish Dictionary”, “Bulgarian-Polish Online Dictionary” and “Russian-Bulgarian-Polish Dictionary”. The dictionaries differ in the numbers of entries, as well as in the different degrees of their connection with parallel corpora being elaborated under the “Clarin” project. All the discussed dictionaries are similar with respect to their use of traditional, syntactic classifiers and of semantic classifiers, introduced for the first time in the existing lexicographical practice. Thanks to the “Polish-Bulgarian-Russian Corpus”, the Group has managed to verify the results of contrasting Polish and Bulgarian in the light of scope-based logical quantification. Thanks to the Russian material added to the trilingual corpus, the researchers have managed to confirm the fact that from the viewpoint of “incomplete quantification” Russian and Polish (synthetic languages) behave similarly, and are opposed to the analytic Bulgarian.
Źródło:
Cognitive Studies; 2013, 13
2392-2397
Pojawia się w:
Cognitive Studies
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Semantics, contrastive linguistics and parallel corpora
Autorzy:
Koseska, Violetta
Powiązania:
https://bibliotekanauki.pl/articles/967225.pdf
Data publikacji:
2014
Wydawca:
Polska Akademia Nauk. Instytut Slawistyki PAN
Tematy:
contrastive studies
online dictionary
parallel corpora
direct approach to semantics
semantic interlanguage
Petri nets
semantic annotation
Opis:
Semantics, contrastive linguistics and parallel corporaIn view of the ambiguity of the term “semantics”, the author shows the differences between the traditional lexical semantics and the contemporary semantics in the light of various semantic schools. She examines semantics differently in connection with contrastive studies where the description must necessary go from the meaning towards the linguistic form, whereas in traditional contrastive studies the description proceeded from the form towards the meaning. This requirement regarding theoretical contrastive studies necessitates construction of a semantic interlanguage, rather than only singling out universal semantic categories expressed with various language means. Such studies can be strongly supported by parallel corpora. However, in order to make them useful for linguists in manual and computer translations, as well as in the development of dictionaries, including online ones, we need not only formal, often automatic, annotation of texts, but also semantic annotation - which is unfortunately manual. In the article we focus on semantic annotation concerning time, aspect and quantification of names and predicates in the whole semantic structure of the sentence on the example of the “Polish-Bulgarian-Russian parallel corpus”.
Źródło:
Cognitive Studies; 2014, 14
2392-2397
Pojawia się w:
Cognitive Studies
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Translation equivalence of demonstrative pronouns in Bulgarian-Slovak parallel texts
Autorzy:
Dimitrova, Ludmila
Garabík, Radovan
Powiązania:
https://bibliotekanauki.pl/articles/967224.pdf
Data publikacji:
2014
Wydawca:
Polska Akademia Nauk. Instytut Slawistyki PAN
Tematy:
translation equivalents
demonstrative pronouns
parallel corpora
aligned text
Slovak
Bulgarian
Opis:
Translation equivalence of demonstrative pronouns in Bulgarian-Slovak parallel textsIn this paper we describe our automatic analysis of several parallel Bulgarian-Slovak texts with the goal to obtain useful information about Slovak translation equivalents of (definite) articles and demonstrative pronouns in Bulgarian. Rather than focusing on individual translation equivalents, we present a method for automatic extraction and visualization of the translations. This can serve as a guide for pinpointing interesting features in specific translated documents and could be extended for other parts of speech or otherwise identifiable textual units.
Źródło:
Cognitive Studies; 2014, 14
2392-2397
Pojawia się w:
Cognitive Studies
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Extraction and Presentation of Bilingual Correspondences from Slovak-Bulgarian Parallel Corpus
Autorzy:
Garabík, Radovan
Dimitrova, Ludmila
Powiązania:
https://bibliotekanauki.pl/articles/677001.pdf
Data publikacji:
2015
Wydawca:
Polska Akademia Nauk. Instytut Slawistyki PAN
Tematy:
translation equivalents
GIZA++
parallel corpora
aligned text
Slovak
Bulgarian
Opis:
Extraction and Presentation of Bilingual Correspondences from Slovak-Bulgarian Parallel CorpusIn this paper the results of the automatic extraction and presentation of bilingual correspondences from Slovak-Bulgarian Parallel corpus are described. The equivalent phrases are extracted from sentence and word level automatically aligned corpus, filtered, indexed and presented in a dictionary-like interface. The bilingual dictionary database contains 80 thousand phrase pairs consisting of approximately 350 thousand words (per each language). Counting unique word forms, the size is 31 thousand in the Slovak part of the dictionary, 26 thousand in the Bulgarian part.
Źródło:
Cognitive Studies; 2015, 15
2392-2397
Pojawia się w:
Cognitive Studies
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Les verbes garantir et assurer et leurs équivalents polonais dans le texte constitutionnel contemporain d’un état et dans Gaudium et spes (1965)
Verbs Garantir and Assurer and their Polish Equivalents in the Text of the Polish National Constitution and in Gaudium rt Spes (1965)
Czasowniki garantir i assurer oraz ich polskie ekwiwalenty w tekście konstytucji państwowej i w konstytucji Gaudium et spes (1965)
Autorzy:
Śliwa, Dorota
Powiązania:
https://bibliotekanauki.pl/articles/1883428.pdf
Data publikacji:
2015
Wydawca:
Katolicki Uniwersytet Lubelski Jana Pawła II. Towarzystwo Naukowe KUL
Tematy:
juryslingwistyka
terminologia
czasowniki modalne
tłumaczenia specjalistyczne
korpusy porównywalne
korpusy równoległe
jurislinguistics
terminology
modal verbs
specialized translation
comparable corpora
parallel corpora
Opis:
Artykuł zawiera szczegółową analizę czasowników garantir i assurer oraz polskich ekwiwalentów gwarantować, zapewniać i zabezpieczyć, wyodrębnionych z tekstów normatywnych, jakimi są konstytucje państwowe i Konstytucja duszpasterska o Kościele w świecie współczesnym Gaudium et spes Kościoła katolickiego. Jako czasowniki modalne deontyczne zapewnienia są używane w określonych typach tekstów specjalistycznych. Badanie zależności między typem tekstu konstytucyjnego a doborem czasowników zapewnienia w języku oryginału wykazuje, że zarówno konstrukcja składniowa, jak i znaczenie tych czasowników są charakterystyczne dla danego typu tekstu. Refleksja nad doborem tych czasowników w tekstach tłumaczonych prowadzi do wniosku, że warto ponownie przemyśleć dobór ekwiwalentów w tekście tłumaczonym zgodnie ze stylem redakcji właściwym dla danego tekstu normatywnego. Szczegółowe ilustracje dwujęzycznych kontekstów tekstów porównywalnych, jak i paralelnych mają także na celu wskazanie specjalistom z lingwistyki komputerowej problemów, jakie pojawiają się przy kodowaniu, a które wynikają z transformacji struktur z systemu danego języka w ich realizacjach dyskursywnych oraz w przeformułowaniu całych fraz z czasownikiem zapewnienia w procesie tłumaczenia wraz z konsekwencjami wynikającymi z wyborem modalności de re lub de dicto.
The article contains a detailed analysis of the verbs garantir and assurer and their evivalents in Polish gwarantować, zapewniać i zabezpieczyć extracted from the normative texts, which are national constitutions and the constitution of the Catholic Church. As modal deontological verbs of assurance they are used in specialized texts. The study of the relationship between the type of a constitutional text and the choice of verbs of assurance in the language of the original indicates that both the syntactic structure and the meaning of those verbs are characteristic of a given type of text. The reflection over the choice of those verbs in translated texts leads to the conclusion that it is worthwhile to re-think the use of equivalents in the translated text according to the redaction style appropriate for a given normative text. Detailed illustrations of bilingual contexts of comparable texts as well as of parallel ones also aim at pointing out to computer linguistics specialists certain issues appearing during encoding and which arise from the transformation of whole phrases containing a verb of assurance during the translation process along with the consequences resulting from the choice of the de re or de dicto modality.
Źródło:
Roczniki Humanistyczne; 2015, 63, 8; 147-186
0035-7707
Pojawia się w:
Roczniki Humanistyczne
Dostawca treści:
Biblioteka Nauki
Artykuł

Ta witryna wykorzystuje pliki cookies do przechowywania informacji na Twoim komputerze. Pliki cookies stosujemy w celu świadczenia usług na najwyższym poziomie, w tym w sposób dostosowany do indywidualnych potrzeb. Korzystanie z witryny bez zmiany ustawień dotyczących cookies oznacza, że będą one zamieszczane w Twoim komputerze. W każdym momencie możesz dokonać zmiany ustawień dotyczących cookies