
You are searching for the phrase "national corpus" by criterion: Subject


Title:
The Bulgarian National Corpus: Theory and Practice in Corpus Design
Authors:
Koeva, S.
Stoyanova, I.
Leseva, S.
Dimitrova, T.
Dekova, R.
Tarpomanova, E.
Links:
https://bibliotekanauki.pl/articles/103907.pdf
Publication date:
2012
Publisher:
Polska Akademia Nauk. Instytut Podstaw Informatyki PAN
Subjects:
corpus design
Bulgarian National Corpus
computational linguistics
Description:
The paper discusses several key concepts related to the development of corpora and reconsiders them in light of recent developments in NLP. On the basis of an overview of present-day corpora, we conclude that the dominant practices of corpus design do not utilise the technologies adequately and, as a result, fail to meet the demands of corpus linguistics, computational lexicology and computational linguistics alike. We proceed to lay out a data-driven approach to corpus design, which integrates the best practices of traditional corpus linguistics with the potential of the latest technologies allowing fast collection, automatic metadata description and annotation of large amounts of data. Thus, the gist of the approach we propose is that corpus design should be centred on amassing large amounts of mono- and multilingual texts and on providing them with a detailed metadata description and high-quality multi-level annotation. We go on to illustrate this concept with a description of the compilation, structuring, documentation, and annotation of the Bulgarian National Corpus (BulNC). At present it consists of a Bulgarian part of 979.6 million words, constituting the corpus kernel, and 33 Bulgarian-X language corpora, totalling 972.3 million words, for 1.95 billion words (1.95×10⁹) altogether. The BulNC is supplied with a comprehensive metadata description, which allows us to organise the texts according to different principles. The Bulgarian part of the BulNC is automatically processed (tokenised and sentence split) and annotated at several levels: morphosyntactic tagging, lemmatisation, word-sense annotation, annotation of noun phrases and named entities. Some levels of annotation are also applied to the Bulgarian-English parallel corpus with the prospect of expanding multilingual annotation both in terms of linguistic levels and the number of languages for which it is available.
We conclude with a brief evaluation of the quality of the corpus and an outline of its applications in NLP and linguistic research.
Source:
Journal of Language Modelling; 2012, 0, 1; 65-110
2299-856X
2299-8470
Appears in:
Journal of Language Modelling
Content provider:
Biblioteka Nauki
Article
Title:
Po Narodowym Korpusie Języka Polskiego – zmiany w słownictwie polskim ostatniej dekady na przykładzie słów kluczowych
Evolution of Polish Lexis in the Decade Following the Introduction of the National Corpus of Polish: A Keyword Analysis
Authors:
Zawadzka-Paluektau, Natalia
Tomaszewska, Aleksandra
Wołoszyn, Joanna
Links:
https://bibliotekanauki.pl/articles/9259670.pdf
Publication date:
2023
Publisher:
Towarzystwo Kultury Języka
Subjects:
corpus linguistics
National Corpus of Polish
language change
diachronic research
keyword analysis
Description:
The paper reports on the results of an examination of changes in Polish lexis over the past decade. Two different, multi-million corpora spanning the years 2011–2022 were contrasted with a subset of the balanced National Corpus of Polish, which covers the period until 2010. To this end, keyword analysis was employed, and words that are particularly characteristic of the more recent set of texts, compared to the older corpus, were automatically extracted. This allowed us to identify the most salient lexical trends which differentiate the language of the last decade from the one recorded in the National Corpus of Polish, and which point to significant extralinguistic socio-cultural, economic, and political shifts across time.
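The keyword extraction summarised above can be sketched roughly as follows. The abstract does not say which keyness statistic the authors used; this sketch assumes the log-likelihood (G2) measure that is common in corpus linguistics, and the function names and toy tokens are hypothetical.

```python
import math
from collections import Counter


def log_likelihood(a, b, c, d):
    """G2 keyness of a word seen a times in a study corpus of c tokens
    and b times in a reference corpus of d tokens (assumed statistic)."""
    expected_study = c * (a + b) / (c + d)
    expected_ref = d * (a + b) / (c + d)
    total = 0.0
    if a > 0:
        total += a * math.log(a / expected_study)
    if b > 0:
        total += b * math.log(b / expected_ref)
    return 2 * total


def keywords(study_tokens, reference_tokens, top_n=10):
    """Rank words that are relatively more frequent in the study corpus."""
    study = Counter(study_tokens)
    reference = Counter(reference_tokens)
    c, d = len(study_tokens), len(reference_tokens)
    scored = []
    for word, a in study.items():
        b = reference.get(word, 0)
        # Keep only positive keywords (overused in the newer corpus).
        if a / c > b / d:
            scored.append((word, log_likelihood(a, b, c, d)))
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_n]


# Toy data, invented for illustration only.
newer = ["pandemia"] * 5 + ["dom"] * 5
older = ["dom"] * 10
print(keywords(newer, older))
```

In practice the counts would come from lemmatised corpora of many millions of tokens, and a frequency threshold or significance cut-off would usually be applied before ranking.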
Source:
Poradnik Językowy; 2023, 803, 4; 28-45
0551-5343
Appears in:
Poradnik Językowy
Content provider:
Biblioteka Nauki
Article
Title:
Skojarzenia werbalne w Narodowym Korpusie Języka Polskiego: przyczynek do badań nad werbalnym stereotypem "inwalidy"
Verbal Associations in the National Corpus of Polish: A Contribution to the Study of the Verbal Stereotype of inwalida ‘an invalid’
Authors:
Mikołajczak-Matyja, Nawoja
Links:
https://bibliotekanauki.pl/articles/38695725.pdf
Publication date:
2021
Publisher:
Polska Akademia Nauk. Instytut Slawistyki PAN
Subjects:
corpus linguistics
National Corpus of Polish
semantic prosody
verbal associations
verbal stereotype
invalid
Description:
The role of corpus research in linguistics and in related fields of study has increased in recent decades. Searching for and analysis of collocations and concordances of a lexical unit, which makes it possible to determine its semantic preferences and semantic prosody, can be a tool for studying stereotypes, understood as overly generalized and simplified evaluative and affective images of a fragment of reality named by the lexical unit. The aim of this article is to verify the validity of supplementing studies based on the analysis of corpus resources with data obtained in free association tests. The study focuses on the lexical unit inwalida ‘an invalid’ as the name of a concept which may be subject to strong stereotyping. The resources of the balanced sub-corpus of the National Corpus of Polish, consisting of about 250 million words, were searched for associative responses to the word inwalida given by at least 2 people from a group of 40 Polish speakers. In the corpus, the co-occurrence of the word inwalida was checked with each of the 33 obtained associations, using a search tool to identify the contexts (concordances) containing both words – inwalida and the association – with an interval of 0 and ≤5. The results of the study indicate that an association test can be a significant complement to corpus data analyses: it can provide important elements of semantic prosody which are not found in corpus analysis results, it can guide concordance search and it can indicate the elements which are the most important for the meaning of the examined word.
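The co-occurrence check described above (both words in one context, separated by an interval of 0 or of at most 5) might look like the sketch below. It is not the search tool used in the study: the function name is hypothetical, the interval is assumed to mean the number of tokens between the two words, and the token list is invented. A real query would match lemmas rather than raw word forms.

```python
def cooccurrences(tokens, node, collocate, max_gap=5):
    """Return (node_index, collocate_index) pairs where the two words
    are separated by at most max_gap intervening tokens."""
    node_positions = [i for i, tok in enumerate(tokens) if tok == node]
    collocate_positions = [j for j, tok in enumerate(tokens) if tok == collocate]
    hits = []
    for i in node_positions:
        for j in collocate_positions:
            gap = abs(i - j) - 1  # number of tokens between the two words
            if 0 <= gap <= max_gap:
                hits.append((i, j))
    return hits


# Invented, already lemmatised toy sentence.
tokens = ["inwalida", "wojenny", "otrzymać", "renta"]
adjacent = cooccurrences(tokens, "inwalida", "wojenny", max_gap=0)
within_five = cooccurrences(tokens, "inwalida", "renta", max_gap=5)
print(adjacent, within_five)
```

Running the check once with `max_gap=0` and once with `max_gap=5` mirrors the two interval settings mentioned in the abstract.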
Source:
Studia z Filologii Polskiej i Słowiańskiej; 2021, 56
0081-7090
2392-2435
Appears in:
Studia z Filologii Polskiej i Słowiańskiej
Content provider:
Biblioteka Nauki
Article
Title:
Language resources for named entity annotation in the National Corpus of Polish
Authors:
Savary, A.
Piskorski, J.
Links:
https://bibliotekanauki.pl/articles/206388.pdf
Publication date:
2011
Publisher:
Polska Akademia Nauk. Instytut Badań Systemowych PAN
Subjects:
natural language processing
proper names
named entities
corpus annotation
Polish National Corpus
SProUT
Description:
We present the named entity annotation subtask of a project aiming at creating the National Corpus of Polish. We summarize the annotation requirements defined for this corpus, and we discuss how existing lexical resources and grammars for named entity recognition for Polish have been adapted to meet those requirements. We show detailed results of the corpus annotation using the information extraction platform SProUT. We also analyze the errors committed by our knowledge-based method and suggest its further improvements.
Source:
Control and Cybernetics; 2011, 40, 2; 361-391
0324-8569
Appears in:
Control and Cybernetics
Content provider:
Biblioteka Nauki
Article
Title:
Jak se odráží zájem o téma zdraví a nemocí v užití vybraných kolokací v českých textech
Reflexion of the Attention Paid to the Themes of Health (zdraví) and Illness (nemoc) in Various Czech Texts
Authors:
KOLÁŘOVÁ, IVANA
Links:
https://bibliotekanauki.pl/articles/953438.pdf
Publication date:
2020-12-12
Publisher:
Uniwersytet Opolski
Subjects:
Czech National Corpus
collocation
phraseology
adjective
using in various types of texts
Description:
The themes of “health” and “illness” have always been very popular in spoken and written communication in Czech. In the corpora of spoken Czech, the words zdraví (health), nemoc, choroba, and onemocnění (illness) are often infrequent, depending on the small size of those corpora. In the corpus of written texts SYN (1.3 billion words and tokens), the words were found in many thousands of sentences. That is why we have decided to research the words zdraví, nemoc, choroba, onemocnění in the collocations 'adjective + zdraví', 'adjective + nemoc', 'adjective + choroba', 'adjective + onemocnění'. We presuppose that the frequency of the collocations can reflect how much attention is paid to the themes of “health” and “illness” (what kinds of illness are spoken about, what health problems are discussed, etc.). Those collocations are used in journalistic texts, and some of them have been found in scientific texts. We have tried to find out how the frequency of those collocations reflects interest in various themes of health and illness. We found that the use of the most frequent collocations reflects the illnesses that people are most interested in: for example, Alzheimer’s disease, Parkinson’s disease, Creutzfeldt-Jakob disease, mental illnesses in general, and mental and physical health. The names of diseases or the characteristics of illnesses and diseases are often used in journalistic texts dealing with medical matters that are intended for laypersons and that are characterized by persuasive components.
Source:
Stylistyka; 2012, 21; 253-27
1230-2287
2545-1669
Appears in:
Stylistyka
Content provider:
Biblioteka Nauki
Article
Title:
Концепт „олигарх”: опыт корпусного анализа
Authors:
Горбунова, Людмила
Links:
https://bibliotekanauki.pl/articles/1026530.pdf
Publication date:
2021-05-05
Publisher:
Uniwersytet im. Adama Mickiewicza w Poznaniu
Subjects:
cognitive linguistics
the concept oligarch
corpus analysis
Russian language
National corpus of the Russian language
Description:
The structure, content and communicative significance of the oligarch concept are explicated using corpus analysis. The structure of the oligarch concept is asymmetric: the logical and figurative components are presented much more narrowly than the axiological one. The axiological component dominates the structure of the concept. Various assessments from the negative zone of the axiological scale are included in the axiological component of the concept. The logical components of the concept are ‘the presence of very significant financial resources’, ‘the source of finance – most often oil or other extractive industry’, ‘power’, ‘participation in the actual government of the country’, ‘belonging to a certain country (more often Russia or Ukraine)’, and ‘opposition to official authorities and government’. The status of “oligarch” is associated with a small number of the same persons. All logical components are assessed negatively, disparagingly or ironically. The concept of an oligarch takes on significant features of the concept of wealth, the main part of its logical component and axiology. The article compares the axiological components of the concepts oligarch and wealth. As a result, the attitude of the “ordinary Russian people” to wealth and the oligarchs is revealed as one of envy and distrust: a rich person is seen as having extremely negative qualities and as having accumulated wealth in an illegal or immoral way.
Source:
Studia Rossica Posnaniensia; 2021, 46, 1; 71-83
0081-6884
Appears in:
Studia Rossica Posnaniensia
Content provider:
Biblioteka Nauki
Article
Title:
Problematika stylového hodnocení frekventovaných univerbizátů v textech různých stylových oblastí
Authors:
Kolarova, Ivana
Links:
https://bibliotekanauki.pl/articles/1203358.pdf
Publication date:
2011
Publisher:
Uniwersytet Opolski
Subjects:
Czech National Corpus
multiverbal lexeme
univerbizat
frequency
frequented univerbizates
style
stylish sphere
Description:
The article focuses on “univerbizates”, lexical units formed by the fusion of multiverbal lexemes into one-word lexemes, whose frequency in the Czech National Corpus SYN is more than 300 and only 2 to 4 times lower than the frequency of their underlying multiverbal lexemes. High-frequency univerbizates are examined in texts from various stylistic spheres. The texts show that univerbizates with a frequency comparable to that of their underlying multiverbal lexemes are often stylistically neutral or used terminologically, but some frequent univerbizates are substandard or colloquial (conversational), for example průmyslovka, občanka, řidičák, or expressive, for example papiňák, spacák. Some univerbizates that used to be regarded as expressive and slangy are now regarded as terminological, for example kopírka.
Source:
Stylistyka; 2011, 20; 363-379
1230-2287
2545-1669
Appears in:
Stylistyka
Content provider:
Biblioteka Nauki
Article
Title:
Czasowniki maksymalnej i nadmiernej efektywności w Narodowym Fotokorpusie Języka Polskiego – rekonesans
Verbs of the maximum and excessive effectiveness in the National Photocorpus of the Polish Language (NFJP) – reconnaissance
Authors:
Dzienisiewicz, Daniel
Links:
https://bibliotekanauki.pl/articles/3201077.pdf
Publication date:
2023
Publisher:
Towarzystwo Kultury Języka
Subjects:
National Photocorpus of the Polish Language (NFJP)
National Corpus of Polish (NKJP)
language corpora
corpus linguistics
lexicography
verbs effectiveness
Description:
The purpose of this article is to analyse the resources of the National Photocorpus of the Polish Language (NFJP) for the presence of verbs of maximum and excessive effectiveness. The author endeavours to answer the following questions: 1) which verbs of maximum and excessive effectiveness are recorded by the NFJP, 2) to what extent the verbs recorded in the NFJP are present in the networks of entries in the 19th-century dictionaries of Polish, 3) whether words not recorded in dictionaries of Polish are present in the resources of the National Corpus of Polish (NKJP). The examination showed that 279 verbs of maximum and excessive effectiveness, including lexemes with the prefixes do-, na-, nad(e)-, o-||ob(e)-, prze-, roz(e)-, u-, wy- and za- in their morphological structures and aspectual derivatives, can be found in the NFJP. The analysis shows that over 15% of the verbs have not been recorded in the examined dictionaries of Polish. However, out of 42 verbs that are unique to the NFJP, 13 have been found in the NKJP resources. The findings of the study lead to the conclusion that the NFJP could serve as a valuable source for research on the 20th-century lexis of Polish, e.g. by complementing the knowledge of the vocabulary that is “distributed” in various types of texts and has not been covered by research to date.
Source:
Poradnik Językowy; 2023, 800, 1; 12-28
0551-5343
Appears in:
Poradnik Językowy
Content provider:
Biblioteka Nauki
Article
Title:
Korpus ogólny jako model danego języka naturalnego: korpusy języków fonicznych a korpus polskiego języka migowego
A general corpus as a model of a natural language: corpora of phonic languages and the Corpus of Polish Sign Language
Authors:
Świdziński, Marek
Rutkowski, Paweł
Links:
https://bibliotekanauki.pl/articles/2147056.pdf
Publication date:
2022-03
Publisher:
Towarzystwo Kultury Języka
Subjects:
PJM (Polish Sign Language)
Corpus of Polish Sign Language (KPJM)
National Corpus of Polish (NKJP)
general corpus
sign linguistics
corpus linguistics
Description:
The aim of this paper is to discuss the major differences and similarities between the Corpus of Polish Sign Language (KPJM), which has been developed for a decade by the team of the Section for Sign Linguistics, Faculty of Polish Studies, University of Warsaw, and corpora of phonic languages (and in particular the National Corpus of Polish (NKJP)). The KPJM is a general corpus with an ambition to represent the whole language, used by the Polish Deaf. Unlike the corpora of phonic languages, which are collections of existing texts, the material of the KPJM was generated purposefully by recording and annotating an extensive set of videos. The paper shows that the sign language corpus should be viewed as analogous to spoken language corpora rather than to written language corpora. The KPJM can be perceived as a model of Polish Sign Language.
Source:
Poradnik Językowy; 2022, 792, 3; 7-22
0551-5343
Appears in:
Poradnik Językowy
Content provider:
Biblioteka Nauki
Article
Title:
Kombinatorika temporálních konektorů: jejich sémanticko-pragmatická a stylistická profilace (získaná interpretací korpusových dat)
Authors:
HOFFMANNOVA, JANA
KOLAROVA, IVANA
Links:
https://bibliotekanauki.pl/articles/957582.pdf
Publication date:
2007
Publisher:
Uniwersytet Opolski
Subjects:
compound temporal connectives
temporal vs. conditional etc. meanings
functional stylistics
pragmatics
Czech National Corpus
Description:
This article is part of a large project whose aim is a new, corpus-based grammar of the contemporary Czech language. It deals with connectives that express meanings from the sphere of time and tense, particularly with their combinations (compound connectives), and with their stylistic and pragmatic properties. We consider the connective když as the centre of this group of compound connectives and try to differentiate the functions of combinations like až když; právě když; tehdy, když; když potom; když ještě; vždycky když; etc. It is interesting to study the interference between temporal semantics and causal, conditional, concessive, as well as other meanings. As we now have a large amount of data at our disposal (thanks to the Czech National Corpus), we are able to study the different distribution of temporal connectives (and their combinations) in the patterns of functional styles and in various text types and genres (scientific texts, professional instructions, economic and sport reports and commentaries, interviews, fairy tales or novels, etc.).
Source:
Stylistyka; 2007, 16; 81-93
1230-2287
2545-1669
Appears in:
Stylistyka
Content provider:
Biblioteka Nauki
Article
Title:
Univerbizované názvy jevů prezentovaných v médiích a v publicistických textech
Univerbizates in Media and Journalistic Texts
Authors:
KOLÁŘOVÁ, IVANA
Links:
https://bibliotekanauki.pl/articles/953976.pdf
Publication date:
2010-12-30
Publisher:
Uniwersytet Opolski
Subjects:
univerbizates
one-word lexeme
word-forming
suffix
media texts
journalistic texts
Czech national corpus
Description:
Univerbization (or condensing), the fusing of multiverbal lexemes into one-word lexemes, is a word-forming process that is frequently used not only in contemporary spoken Czech texts, but also in written texts. Linguistic research in the last 40-50 years has shown that “univerbizates” are used very often in journalistic and media texts. The aim of the paper is to present univerbizates naming phenomena of the media and phenomena that are described in media texts: names of persons (professions, performers), of actions and operations, and names of films, plays or songs and compositions. The Czech national corpus has been used for this purpose. The results of the research are as follows: more than 90% of univerbizates are formed by suffixation, of which nearly 75% use the -ák or -ka suffixes. They occur in various literary and colloquial texts.
Source:
Stylistyka; 2010, 19; 189-208
1230-2287
2545-1669
Appears in:
Stylistyka
Content provider:
Biblioteka Nauki
Article
Title:
The collocations of modern Polish: the case of kolaborować [to collaborate] and ambasador [an ambassador]
Authors:
Kłosek, Gabriela
Wąsińska, Kinga
Links:
https://bibliotekanauki.pl/articles/1830334.pdf
Publication date:
2020-01-29
Publisher:
Akademia Techniczno-Humanistyczna w Bielsku-Białej
Subjects:
collocations
modern Polish
The National Corpus of Polish
kolokacje
współczesna polszczyzna
Narodowy Korpus Języka Polskiego
Description:
The aim of the article was to check whether changes in a word's meaning are associated with its collocations. Based on an analysis of the collocations of the verb kolaborować [to collaborate] and the noun ambasador [an ambassador], we confirmed this correlation. The language material for the research was collected using the PELCRA search engine for Narodowy Korpus Języka Polskiego [The National Corpus of Polish] and the Monco PL search tool for websites. The analysis of authentic examples of word combinations reveals the relation between node terms and their collocates. The article lists the word pairs with a high co-occurrence frequency. In the case of the verb kolaborować, the excerpted collocations refer either to negative contexts associated with cooperation with the enemy or to neutral contexts concerning the cooperation of people and companies. The contexts of the noun ambasador are related to the work of a diplomatic representative to a foreign country, a campaign spokesperson or a brand ambassador. Our research indicates that collocations are not accidental or arbitrary. It has also been noted that the study of collocations is a valuable starting point for linguistic analyses.
Source:
Świat i Słowo; 2019, 33, 2; 171-192
1731-3317
Appears in:
Świat i Słowo
Content provider:
Biblioteka Nauki
Article
Title:
От социологии к оценке
Authors:
Фролова, Ольга
Links:
https://bibliotekanauki.pl/articles/1023155.pdf
Publication date:
2019-09-05
Publisher:
Uniwersytet im. Adama Mickiewicza w Poznaniu
Subjects:
social stratification
social stratum
figurative meaning
metaphor
evaluation
ambivalence
connotation
Russian national corpus
democratization
image perception
Description:
The article deals with figurative meanings of nominations of the social hierarchy relating to the upper, middle and lower strata of Russian society of the 18th and 19th centuries, and their reflection in the explanatory dictionary of the beginning of the 21st century. The use of adjectives, formed from personal nouns in the Russian National Corpus in the period 2000–2017, is analyzed.
Source:
Studia Rossica Posnaniensia; 2019, 44, 2; 261-270
0081-6884
Appears in:
Studia Rossica Posnaniensia
Content provider:
Biblioteka Nauki
Article
Title:
Slovesa s metaforickým významem jako prostředek pro vyjádření mezilidských vztahů, psychických stavů a atmosféry prostředí
The metaphoric use of Czech verbs that indicate people's relationship, mental condition and atmosphere in the environment
Authors:
Kolarova, I.
Links:
https://bibliotekanauki.pl/articles/1009004.pdf
Publication date:
2006
Publisher:
Uniwersytet Opolski
Subjects:
CZECH NATIONAL CORPUS SYN2000
CZECH VERBS
LANGUAGE INDICATION OF PEOPLE'S RELATIONSHIP & MENTAL CONDITION
METAPHORIC USE
Description:
We deal with the Czech verbs 'jiskřit', 'zajiskřit', 'skřípat', 'zaskřípat', 'vřít', 'praskat', which are used metaphorically in non-subjective syntactic constructions with the expletive 'to'. These verbs can have various meanings and functions. The verbs 'jiskřit', 'zajiskřit' indicate positive or negative relationships between people (Mezi Zielencem a Rumlem to jiskřilo.), a person's mental condition (V očích jí to vesele zajiskřilo.) or the atmosphere in a group of people (S jejich příchodem to ve vzduchu okamžitě začalo jiskřit). The verbs 'skřípat', 'zaskřípat' indicate negative relationships between people (Mezi Victorií a její matkou to hrozně skřípe.) or the atmosphere in a group of people (Na začátku přípravy to v týmu skřípalo kvůli oddílové příslušnosti.). The verb 'vřít' mostly indicates a person's mental condition and strong emotion (Kdybys jen věděla, Kitty, jak to ve mně při tom nadávání a pomlouvání vře). We try to replace the sentences containing the verbs 'jiskřit', 'skřípat', 'vřít', 'praskat' and the expletive 'to' with sentences without 'to' and compare both types of sentences. We examine the use of these verbs in texts from various stylistic spheres and in texts whose language means have various stylistic features (stylistically neutral features, expressive features, terminology...). The sentences and short texts with the verbs 'jiskřit', 'zajiskřit', 'skřípat', 'zaskřípat', 'vřít', 'praskat' are drawn from the texts of the Czech National Corpus SYN2000.
Source:
Stylistyka; 2006, 15; 237-248
1230-2287
2545-1669
Appears in:
Stylistyka
Content provider:
Biblioteka Nauki
Article
Title:
Leksem błękitny w polskiej prozie (badania korpusowe)
The lexeme błękitny ‘blue’ in the Polish prose: A corpus-based study
Authors:
Stanulewicz, Danuta
Links:
https://bibliotekanauki.pl/articles/1591636.pdf
Publication date:
2017
Publisher:
Uniwersytet Szczeciński. Wydawnictwo Naukowe Uniwersytetu Szczecińskiego
Subjects:
błękitny ‘blue’
sky ‘blue’
niebieski ‘blue’
prose
National Corpus of Polish
błękitny
niebieski
proza
Narodowy Korpus Języka Polskiego
Description:
The purpose of the article is to discuss the occurrence of the adjective błękitny in the texts collected in the category “prose” of the National Corpus of Polish (NKJP), available at www.nkjp.pl. The data have been excerpted from the balanced subcorpus with the search engine PELCRA. As the analysis demonstrates, błękitny is used in prose more frequently than in texts of other, non-literary categories, which is why this word should be regarded as sophisticated and bookish. The corpus data allow for identifying differences not only in the frequency but also in the use of the words błękitny and niebieski by prose writers. Błękitny is used more often to describe the sky, water, smoke and mist, whereas niebieski, which is the basic term for blue in Polish, is employed to describe clothes and eyes.
Source:
Studia Językoznawcze; 2017, 16; 237-253
1730-4180
2353-3161
Appears in:
Studia Językoznawcze
Content provider:
Biblioteka Nauki
Article
