Temat: stylometry - Katalog OPAC zbiorów

Skocz do pozycji: 1.

Tytuł:: On the Authenticity of Prose Writings Attributed to Śaṅkara
Autorzy:: Andrijanić, Ivan
Bąkowski, Jacek
Powiązania:: https://bibliotekanauki.pl/articles/35031971.pdf
Data publikacji:: 2023
Wydawca:: Polska Akademia Nauk. Instytut Kultur Śródziemnomorskich i Orientalnych PAN
Tematy:: authorship
stylometry
Advaita
Vedānta
Opis:: Śaṅkara is traditionally considered the author of an exceptionally large number of works. Indological scholarship has attempted to filter out some of these works within traditional philological and historical frameworks. Many were, however, taken for granted to be authentic, and no serious research into their authenticity has been conducted. This paper attempts a computational stylometric approach to establish the authenticity of prose commentaries attributed to Śaṅkara. The General Imposters (GI) framework appears to be the most suitable existing method developed for the purpose of verifying authorship. The GI calculates the statistical distance between certain texts’ features and estimates whether the disputed text is closer to the candidate author than to a set of texts that may not have been composed by him. The paper also presents a machine-based method for separating the words and resolving the sandhi in the Sanskrit text, crucial for the procedure. The success rate in verifying authors of undisputed texts appears to be acceptable enough to proceed to the next step, where 18 prose commentaries traditionally attributed to Śaṅkara are subjected to the GI verification procedure. The result conforms to the most conservative assessments of Śaṅkara’s authorship; GI verified the authenticity of the commentaries on the principal Upaniṣads (with the exception of the commentary on the Śvetāśvataropaniṣad) and on the Bhagavadgītā. Besides these, commentaries on the Nṛsiṃha-(pūrva)-tāpanīyopaniṣad and the Adhyātmapaṭala were, rather unexpectedly, also successfully verified as genuine works of Śaṅkara.
Źródło:: Acta Asiatica Varsoviensia; 2023, 36; 5-38
0860-6102
2449-8653
Pojawia się w:: Acta Asiatica Varsoviensia
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 2.

Tytuł:: Mikrokorpus Gronowy Polszczyzny 1830–1918
Autorzy:: Derwojedowa, Magdalena
Powiązania:: https://bibliotekanauki.pl/articles/1630442.pdf
Data publikacji:: 2020-10-30
Wydawca:: Towarzystwo Kultury Języka
Tematy:: corpus
diachrony
inflection
syntax
orthography
stylometry
Opis:: This paper is dedicated to the construction of a small cluster corpus of Polish texts from the period 1830–1918. The assumptions of the corpus, its micro- and macro-structure, as well as stylistic, regional and author diversity, and method of making it available are presented. Its application capabilities are illustrated on the example of orthographic, infl ectional, and syntactic studies.
Źródło:: Poradnik Językowy; 2020, 777, 8; 52-65
0551-5343
Pojawia się w:: Poradnik Językowy
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 3.

Tytuł:: Attributivity and Subjectivity in Czech Journalism and Scientific Literature
Autorzy:: Kubát, Miroslav
Čech, Radek
Powiązania:: https://bibliotekanauki.pl/articles/50709426.pdf
Data publikacji:: 2024
Wydawca:: Uniwersytet im. Adama Mickiewicza w Poznaniu
Tematy:: attributivity
subjectivity
stylometry
syntax
journalism
scientific literature
Opis:: This study examines two recently proposed stylometric indices, attributivity and subjectivity, in the context of Czech journalism and scientific literature. Index of attributivity is defined as the ratio of the frequency of attributes to the sum of frequencies of nouns, pronouns, and attributes. Index of subjectivity is defined as the ratio of the frequency of subjects to the sum of frequencies of predicates and subjects. The research is based on the corpus of contemporary written Czech language SYN2020. It appears that both indices of attributivity and subjectivity are sensitive to different genres, and could be used in stylometric analyses, especially those that focus on the interpretability of quantitative measures. In general, more formal texts tend to have higher values of both indices across different genres.
Źródło:: Bohemistyka; 2024, 24, 1; 3-14
1642-9893
Pojawia się w:: Bohemistyka
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 4.

Tytuł:: An open stylometric system based on multilevel text analysis
Autorzy:: Eder, Maciej
Piasecki, Maciej
Walkowiak, Tomasz
Powiązania:: https://bibliotekanauki.pl/articles/677202.pdf
Data publikacji:: 2017
Wydawca:: Polska Akademia Nauk. Instytut Slawistyki PAN
Tematy:: stylometry
Polish
CLARIN-PL
research infrastructure
language technology
Opis:: An open stylometric system based on multilevel text analysisStylometric techniques are usually applied to a limited number of typical tasks, such as authorship attribution, genre analysis, or gender studies. However, they could be applied to several tasks beyond this canonical set, if only stylometric tools were more accessible to users from different areas of the humanities and social sciences. This paper presents a general idea, followed by a fully functional prototype of an open stylometric system that facilitates its wide use through to two aspects: technical and research flexibility. The system relies on a server installation combined with a web-based user interface. This frees the user from the necessity of installing any additional software. At the same time, the system offers a variety of ways in which the input texts can be analysed: they include not only the usual lexical level, but also deep-level linguistic features. This enables a range of possible applications, from typical stylometric tasks to the semantic analysis of text documents. The internal architecture of the system relies on several well-known software packages: a collection of language tools (for text pre-processing), Stylo (for stylometric analysis) and Cluto (for text clustering). The paper presents: (1) The idea behind the system from the user’s perspective. (2) The architecture of the system, with a focus on data processing. (3) Features for text description. (4) The use of analytical systems such as Stylo and Cluto. The presentation is illustrated with example applications. Otwarty system stylometryczny wykorzystujący wielopoziomową analizę języka Zastosowania metod stylometrycznych na ogół ograniczają się do kilku typowych problemów badawczych, takich jak atrybucja autorska, styl gatunków literackich czy studia nad zróżnicowaniem stylistycznym kobiet i mężczyzn. Z pewnością dałoby się je z powodzeniem zastosować również do wielu innych problemów klasyfikacji tekstów, gdyby tylko owe metody oraz odpowiednie narzędzia były bardziej dostępne dla uczonych reprezentujących różne dyscypliny nauk humanistycznych i społecznych. Artykuł niniejszy omawia założenia teoretyczne oraz w pełni funkcjonalny prototyp otwartego systemu stylometrycznego, którego szerokie zastosowanie umożliwią dwie jego cechy: elastyczność techniczna oraz dostosowywalność do różnych pytań badawczych. System opiera się na instalacji serwerowej sprzęgniętej z sieciowym interfejsem użytkownika. Uwalnia to użytkownika od konieczności instalowania jakichkolwiek dodatkowych programów. Jednocześnie system oferuje wiele sposobów analizowania tekstów nie tylko na poziomie leksykalnym, lecz także poprzez cechy językowe niskiego poziomu. Daje to możliwość stosowania systemu na wiele różnych sposobów, od typowych testów stylometrycznych do analizy semantycznej dokumentów. Wewnętrzna architektura systemu składa się z wielu elementów znanych ze swej funkcjonalności, w tym z pakietu Stylo przeznaczonego do analiz stylometrycznych oraz pakietu Cluto służącego do zaawansowanej analizy skupień. Artykuł omawia: (1) Koncepcję całego systemu, postrzeganą z punktu widzenia użytkownika, (2) Architekturę systemu oraz jego elementy odpowiedzialne za przetwarzanie tekstu, (3) Cechy językowe służące do opisu dokumentów, (4) Zastosowanie modułów analizy danych, takich jak Stylo czy Cluto. W artykule zostały też przedstawione przykładowe zastosowania systemu.
Źródło:: Cognitive Studies; 2017, 17
2392-2397
Pojawia się w:: Cognitive Studies
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 5.

Tytuł:: Syntactic complexity as a stylistic feature of subtitles
Autorzy:: Stauder, Andy
Ustaszewski, Michael
Powiązania:: https://bibliotekanauki.pl/articles/1191718.pdf
Data publikacji:: 2020
Wydawca:: Uniwersytet Wrocławski. Oficyna Wydawnicza ATUT – Wrocławskie Wydawnictwo Oświatowe
Tematy:: translation style
stylometry
subtitles
syntactic complexity
corpus approaches
Opis:: In audiovisual translation, stylometry can be used to measure formal-aesthetic fidelity. We present a corpus-based measure of syntactic complexity as a feature of language style. The methodology considers hierarchical dimensions of syntactic complexity, using syllable counting and dependency parsing. The test material are dialogues of several characters from the TV show “Two and a Half Men”. The results show that characters do not differ syntactically among themselves as much as might be expected, and that, despite a general tendency to level differences even more in translation, the changes in syntactic complexity between the original and translation depend mostly on the respective character-feature combination.
Źródło:: Studia Translatorica; 2020, 11; 177-188
2084-3321
2657-4802
Pojawia się w:: Studia Translatorica
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 6.

Tytuł:: Profiling a set of personality traits of text author: what our words reveal about us
Autorzy:: Litvinova, Tatiana
Seredin, Pavel
Litvinova, Olga
Zagorovskaya, Olga
Powiązania:: https://bibliotekanauki.pl/articles/620717.pdf
Data publikacji:: 2016
Wydawca:: Uniwersytet Łódzki. Wydawnictwo Uniwersytetu Łódzkiego
Tematy:: authorship profiling
neurolinguistics
language personality
computational stylometry
discourse production
Opis:: Authorship profiling, i.e. revealing information about an unknown author by analyzing their text, is a task of growing importance. One of the most urgent problems of authorship profiling (AP) is selecting text parameters which may correlate to an author’s personality. Most researchers’ selection of these is not underpinned by any theory. This article proposes an approach to AP which applies neuroscience data. The aim of the study is to assess the probability of selfdestructive behaviour of an individual via formal parameters of their texts. Here we have used the “Personality Corpus”, which consists of Russian-language texts. A set of correlations between scores on the Freiburg Personality Inventory scales that are known to be indicative of self-destructive behaviour (“Spontaneous Aggressiveness”, “Depressiveness”, “Emotional Lability”, and “Composedness”) and text variables (average sentence length, lexical diversity etc.) has been calculated. Further, a mathematical model which predicts the probability of selfdestructive behaviour has been obtained.
Źródło:: Research in Language; 2016, 14, 4
1731-7533
Pojawia się w:: Research in Language
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 7.

Tytuł:: Słowa znaczące, słowa kluczowe, słowozbiory – o statystycznych metodach wyszukiwania wyrazów istotnych
Autorzy:: Maciej, Eder,
Powiązania:: https://bibliotekanauki.pl/articles/897248.pdf
Data publikacji:: 2016
Wydawca:: Uniwersytet Warszawski. Wydawnictwa Uniwersytetu Warszawskiego
Tematy:: quantitative linguistics
stylometry
keywords
Zeta method
topic modelling
wordlist
Opis:: This article discusses automatic extraction of relevant words from sets of texts. The author briefly presents three methods aimed to extract the words from the corpus of words with regard to their frequency, or words whose occurrence next to each other is not random. First, he focuses on the keyword analysis method, then he discusses the Zeta method developed by John Burrows and Hugh Craig, and the third method covered in the article is the topic modelling method, which is becoming very popular recently, and consists in finding clusters of words co-occurring in similar contexts. Topic modelling was intended for a quick content search in large collections of documents. On the basis of 100 Polish novels, the article presents how this method can be used for linguistic studies.
Źródło:: Przegląd Humanistyczny; 2016, 60(3 (454)); 31-44
0033-2194
Pojawia się w:: Przegląd Humanistyczny
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 8.

Tytuł:: Words of Warfare: Life of Mission Soldiers in the Perspective of Quantitative Linguistics
Autorzy:: Radková, Lucie
Místecký, Michal
Powiązania:: https://bibliotekanauki.pl/articles/909320.pdf
Data publikacji:: 2021-01-15
Wydawca:: Uniwersytet im. Adama Mickiewicza w Poznaniu
Tematy:: stylometry
keyword analysis
collocation analysis
military speech
Czech language
Opis:: The paper focuses on the analysis of a sample of military language from the stylometric perspective. The corpus is the chronicle of the 8th Czech Armed Forces Guard Company, which operated at the Bagram Air Field base (BAF). We work on the assumptions that in the corpus, there will be (A) a prominent presence of military slang; (B) a high proportion of abbreviations; (C) frequent linguistic devices expressing mutuality and collectiveness of the soldiers’ enterprise. The texts were subjected to keyword and collocation analyses; these determined several stylistic features of theirs (such as use of English-based expressions, protocol-like language, or idiosyncratic collocations), which testify to the multifaceted character of the military chronicle genre.
Źródło:: Bohemistyka; 2021, 1; 7-26
1642-9893
Pojawia się w:: Bohemistyka
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 9.

Tytuł:: Komputerowo wspomagane metody badania tekstów w polskiej perspektywie
Computer aided methods of text research in the Polish perspective
Autorzy:: Trzoss, Adrian
Powiązania:: https://bibliotekanauki.pl/articles/2048578.pdf
Data publikacji:: 2021
Wydawca:: Fundacja Popierania Nauki
Tematy:: NLP
stylometria
język
analiza sieciowa
stylometry
language
social network analysis
Opis:: W niniejszym artykule autor przedstawia stosowanie metod analizy przetwarzania języka naturalnego (NLP) w obszarze polskich badań. W analizie uwzględniono trzy pola badawcze: socjologiczne, politologiczne oraz literaturoznawcze. Omówione zostały prace takich badaczy, jak Marek Troszyński, Paweł Matuszewski oraz Maciej Eder. Efektem przeprowadzonej analizy było nakreślenie najważniejszych aspektów metodologicznych związanych z używaniem metody NLP: kontekstów, możliwości oraz zagrożeń. Finalnie wskazano dalsze perspektywy badawcze, w których stosowanie omawianych metod może przynieść potencjalnie pozytywne rezultaty.
In the following paper author discuss the natural language processing method (NLP) usage in polish academic literature. In the analysis three fields were pointed out: sociology, political science and literature science. Three groups of texts were presented from Marek Troszyński, Paweł Matuszewski and Maciej Eder. As the result of the conducted analysis author emphasized the most important methodological aspects of NLP usage: contexts, opportunities and risks. Finally, author indicated areas for the further research where NLP would be beneficial method.
Źródło:: Nauka Polska. Jej Potrzeby, Organizacja i Rozwój; 2021, XXX (LV); 213-225
1230-5480
Pojawia się w:: Nauka Polska. Jej Potrzeby, Organizacja i Rozwój
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 10.

Tytuł:: In Search of the Author of Chronica Polonorum Ascribed to Gallus Anonymus: A Stylometric Reconnaissance
Autorzy:: Eder, Maciej
Powiązania:: https://bibliotekanauki.pl/articles/601379.pdf
Data publikacji:: 2015
Wydawca:: Polska Akademia Nauk. Instytut Historii im. Tadeusza Manteuffla PAN w Warszawie
Tematy:: Gallus Anonymus
Chronica Polonorum
authorship attribution
stylometry
multidimensional methods
Monachus Littorensis
Opis:: The article deals with the question of authorship of the thirteenth-century Chronica Polonorum (or Gesta principium Polonorum [The Deeds of the Princes of the Poles]), also known as The Polish Chronicle. It seeks to verify the hypothesis, recently reproposed by Tomasz Jasiński, whereby the author was of Venetian origin. The hypothesis is namely based on the textual similarities observed between Translatio Sancti Nicolai by an author referred to as the ‘Monk of Lido’ (Monachus Littorensis) and the Chronica. The attribution attempt put forth by M. Eder is based upon stylometric methods that measure the frequencies of the most frequent words in the texts under research (mainly, conjunctions, prepositions, pronouns, and particles) which are subsequently subjected to cluster analysis, multidimensional scaling, or principal components analysis. The outcome of the experiment in question has demonstrated a strong resemblance between the Translatio Sancti Nicolai and the Polish Chronicle, which may be regarded as an substantial argument in support of the Venetian background hypothesis.
Źródło:: Acta Poloniae Historica; 2015, 112
0001-6829
Pojawia się w:: Acta Poloniae Historica
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 11.

Tytuł:: A review of the book "Reading beyond the female: the relationship between perception of author gender and literary quality"
Recenzja książki Cornelii Wilhelminy Koolen "Reading beyond the female: The relationship between perception of author gender and literary quality"
Autorzy:: Byszuk, Joanna
Powiązania:: https://bibliotekanauki.pl/articles/475507.pdf
Data publikacji:: 2019
Wydawca:: Polska Akademia Nauk. Instytut Języka Polskiego PAN
Tematy:: recenzja
gender studies
analiza dyskursu
stylometria
literaturoznawstwo
review
discourse analysis
stylometry
literary studies
Opis:: Niniejszy tekst stanowi recenzję książki "Reading beyond the female: The relationship between perception of author gender and literary quality" holenderskiej badaczki Cornelii Koolen. Prezentowana książka podejmuje tematykę relacji między płcią autora, oceną jakości literackiej jego lub jej twórczości i rzeczywistymi cechami tekstów, wpisując się w tym samym w nurt badań nad stereotypami płciowymi w języku i literaturze. Dzięki innowacyjnemu zastosowaniu ilościowych metod analizy tekstu, stanowi też istotną pozycję w zakresie metodologii stylometrycznej, nadając całości pracy interdyscyplinarny charakter.
Presented text is a review of the book "Reading beyond the female: The relationship between perception of author gender and literary quality" by Dutch researcher Cornelia Koolen. Discussed book undertakes the issues of relations between the gender of the author, evaluation of literary quality of their work and actual features of the texts, thus fitting in the larger trend of research on gender stereotypes in language and literature. The innovative use of quantitative methods also grants it an important place within literatureon stylometric, making it an interdisciplinary work.
Źródło:: Socjolingwistyka; 2019, 33; 283-287
0208-6808
Pojawia się w:: Socjolingwistyka
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 12.

Tytuł:: "Quo vadis" all’italiana: investigazione stilometrica sulle traduzioni milanesi del bestseller di Henryk Sienkiewicz
"Quo vadis" in Italian: A Stylometric Investigation of Milanese Translations of Henryk Sienkiewicz’s Bestseller
Autorzy:: Biernacka-Licznar, Katarzyna
Rybicki, Jan
Powiązania:: https://bibliotekanauki.pl/articles/446408.pdf
Data publikacji:: 2020-07-31
Wydawca:: Wydawnictwo Adam Marszałek
Tematy:: "Quo Vadis"
translation
Milanese publishers
quantitative analysis
stylometry
traduzione
editori di Milano
analisi quantitativa
stilometria
Opis:: Henryk Sienkiewicz’s novel "Quo Vadis" made its way into Italy at the end of the 19 th century through the efforts of Neapolitan translator Federigo Verdinois. The first part of this paper outlines the history of the popularity of "Quo Vadis" by focusing on the operations of Milanese publishers that made the Polish novel part of their offer in a variety of ways (as translations, adaptations, reworkings, plagiarisms, etc.). Bibliometric methods are used to establish why so many publishing houses decided to publish Henryk Sienkiewicz’s Roman romance. The analysis of the bibliometric data of the published translations helped assess and describe the extent and the character of the popularity that the novel garnered among Milanese publishers. The second part of the paper relates the findings of a multi-method quantitative study of the same material. The number of word tokens was compared between the original and the translations. The lexical richness across the texts under study was compared by means of the moving average type-token ratio (MATTR). Sentence lengths were also compared, as was sentence length distribution as time series. Two different programmes ("WCopyFind and Tracer") yielded very similar results on the degree of the similarity of five-word phrases in pairs of translations, which was determined in network analysis.
L’opera "Quo vadis" di Henryk Sienkiewicz arrivò in Italia alla fine del XIX secolo grazie al traduttore napoletano Federigo Verdinois. Lo scopo della prima parte del contributo è quello di presentare la storia della popolarità del romanzo "Quo vadis" attraverso le azioni delle case editrici milanesi, le quali hanno introdotto l’opera del polacco, in varie forme, nella sua offerta editoriale (come traduzioni, adattamenti, parafrasi, plagi). La ricostruzione della storia delle traduzioni del romanzo romano di Henryk Sienkiewicz è stata possibile grazie al metodo biblometrico che è stato ustato nella prima parte dell’articolo. L’analisi dei dati bibliografici raccolti ha permesso di valutare e descrivere la grandezza e il carattere della popolarità di "Quo vadis" tra gli editori milanesi nella prima parte del XX secolo. Nella seconda parte del contributo per meglio far luce sulle complicate sorti milanesi dell’opera di Sienkiewicz abbiamo usato alcuni metodi d’analisi quantitativa. Abbiamo paragonato il numero delle parole in originale e nelle traduzioni descritte nella prima parte. La ricchezza del vocabolario di tutti i testi esaminati è stata misurata e confrontata usando il calcolo della media mobile del rapporto del numero di parole alla lunghezza del testo (MATTR). Abbiamo confrontato anche le lunghezze delle frasi come serie temporali. I due programmi diversi ("WCopyFind e Tracer"), utili per le analisi delle reti, hanno dato risultati simili per il numero delle somiglianze delle frasi di pentagrammi verbali tra le traduzioni di "Quo vadis".
Źródło:: Italica Wratislaviensia; 2020, 11.1; 31-64
2084-4514
Pojawia się w:: Italica Wratislaviensia
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 13.

Tytuł:: Czynniki różnicujące wypowiedzi informatorów – analiza stylometryczna wywiadów biograficznych
Factors Differentiating the Statements of Narrators: A Stylometric Analysis of Biographical Interviews
Autorzy:: Kądzioła, Magdalena
Powiązania:: https://bibliotekanauki.pl/articles/634946.pdf
Data publikacji:: 2018
Wydawca:: Ośrodek Pamięć i Przyszłość
Tematy:: relacje biograficzne
stylometria
język mówiony
klasyfikacja tekstów
metadane
biographical accounts
stylometry
spoken language
classification of texts
metadata
Opis:: This article proposes ways to analyse the content and metadata of biographical interviews using statistical methods. The basis for this series of stylometric experiments was a specially created corpus exceeding 1.2 million lexical units in size and composed of texts extracted from selected biographical interviews from the Oral History Archive, the History Meeting House, and the KARTA Centre available on the website www.relacjebiograficzne.pl. Research was based on the content of biographical interviews with forty-one people assigned to three thematic categories: ‘Warsaw,’ ‘the village,’ and ‘gentry.’ The main goal of the experiments was to determine which linguistic factors differentiate speakers and which features (gender, place of origin, age, length of speech, or topic) can influence this classification. This research was carried out using quantitative linguistics methods, and the conclusions we have arrived at allow for the determination of the direction of further work in the field of the stylometry of spoken language.
W artykule zostają zaproponowane sposoby analizy treści i metadanych wywiadów biograficznych metodami statystycznymi. Podstawą do przeprowadzenia serii eksperymentów stylometrycznych był specjalnie stworzony korpus o rozmiarze przekraczającym 1,2 mln jednostek leksykalnych. Na korpus złożyły się teksty wybranych relacji biograficznych pochodzących z Archiwum Historii Mówionej, Domu Spotkań z Historią i Ośrodka KARTA, udostępnianych na portalu: www.relacjebiograficzne.pl. W badaniach wykorzystano treści wywiadów biograficznych 41 osób przyporządkowanych do trzech kategorii tematycznych: Warszawa, wieś, ziemiaństwo. Głównym celem eksperymentów było ustalenie, jakie czynniki językowe różnicują mówców i które cechy (płeć, miejsce pochodzenia, wiek, długość wypowiedzi, temat) mogą mieć wpływ na klasyfikację. Badania przeprowadzono metodami językoznawstwa kwantytatywnego, a uzyskane wnioski pozwalają na wyznaczenie kierunku dalszych prac w zakresie stylometrii języka mówionego.
Źródło:: Wrocławski Rocznik Historii Mówionej; 2018, 8; 63-80
2719-7522
2084-0578
Pojawia się w:: Wrocławski Rocznik Historii Mówionej
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 14.

Tytuł:: Kwestia tajemniczego autorstwa renesansowego poematu Sarca
The question of the mysterious authorship of the Renaissance poem Sarca
Autorzy:: Kowalczyk-Cantoro, Daria
Powiązania:: https://bibliotekanauki.pl/articles/32055859.pdf
Data publikacji:: 2022
Wydawca:: Uniwersytet im. Adama Mickiewicza w Poznaniu
Tematy:: Renaissance
New Latin literature
Pietro Bembo
stylometry
attribution
authorship
Sarca
renesans
nowa literatura łacińska
stylometria
atrybucja
autorstwo
Opis:: Dating to the 1530s, the Latin poem Sarca was published in 1842 in Rome by Angelo Mai based on a manuscript found in Vienna. Forty years after publication, the incompatibility of this edition with the Viennese manuscript was first noticed. In the second half of the last century, a polemic began challenging the theory, previously considered certain, that Sarca’s author was Pietro Bembo. Researchers named Girolamo Fracastor, Andrea Navager and Agostino Beazzan, among others, as possible authors of the text in addition to Bembo. The question of the poem’s authorship remains unresolved to this day. The aim of this article is to analyse the turbulent and hypothesis-rich debate on the attribution of the Renaissance work. The poem has been subjected to internal criticism, i.e. inferences about the author’s person based on content analysis. Historical and bibliographical indicators of the work have also been examined. The article also attempts to answer the question of whether the latest techniques used in the analysis of the linguistic and stylistic features of the text, such as the methods of computer stylometry, could prove helpful in determining the authorship of the Sarca poem.
Źródło:: Przestrzenie Teorii; 2022, 38; 291-300
2450-5765
Pojawia się w:: Przestrzenie Teorii
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 15.

Tytuł:: A Morpho-syntactic Analysis of Human-moderated Hate Speech Samples from Wykop.pl Web Service
Morfosyntaktyczna analiza przykładów mowy nienawiści zablokowanych przez moderatorów serwisu Wykop.pl
Autorzy:: Okulska, Inez
Kołos, Anna
Powiązania:: https://bibliotekanauki.pl/articles/28407654.pdf
Data publikacji:: 2023
Wydawca:: Krakowskie Towarzystwo TERTIUM
Tematy:: cyberagresja
mowa nienawiści
treści internetowe
automatyczne wykrywanie treści
stylometria
cyberbullying
hate speech
user-generated online content
automated detection
stylometry
Opis:: The dynamic increase in user-generated content on the web presents significant challenges in protecting Internet users from exposure to offensive material, such as cyberbullying and hate speech, while also minimizing the spread of wrongful conduct. However, designing automated detection models for such offensive content remains complex, particularly in languages with limited publicly available data. To address this issue, our research collaborates with the Wykop.pl web service to fine-tune a model using genuine content that has been banned by professional moderators. In this paper, we focus on the Polish language and discuss the notion of datasets and annotation frameworks, presenting our stylometric analysis of Wykop.pl content to identify morpho-syntactic structures that are commonly applied in cyberbullying and hate speech. By doing so, we contribute to the ongoing discussion on offensive language and hate speech in sociolinguistic studies, emphasizing the need to consider user-generated online content.
Dynamiczny wzrost treści generowanych przez użytkowników w sieci stanowi poważne wyzwanie w zakresie ochrony użytkowników Internetu przed narażeniem na obraźliwe materiały, takie jak cyberprzemoc i mowa nienawiści, i jednoczesnego ograniczania rozprzestrzeniania nieetycznych zachowań. Jednak projektowanie zautomatyzowanych modeli wykrywania obraźliwych treści pozostaje złożonym zadaniem, szczególnie w językach o ograniczonych publicznie dostępnych danych. W naszych badaniach współpracujemy z serwisem internetowym Wykop.pl w celu uczenia modelu przy użyciu rzeczywistych treści, które podlegały usunięciu w procesie moderacji. W niniejszym artykule skupiamy się na języku polskim i omawiamy pojęcie zbiorów danych i metod anotacji, a następnie przedstawiamy naszą analizę stylometryczną treści z serwisu Wykop.pl w celu zidentyfikowania struktur morfosyntaktycznych, które są powszechnie aplikowane w języku cyberprzemocy i mowie nienawiści. Dzięki naszym badaniom mamy nadzieję na wniesienie wkładu w toczącą się dyskusję na temat obraźliwego języka i mowy nienawiści w badaniach socjolingwistycznych, podkreślając potrzebę analizy treści generowanych przez użytkowników w sieci.
Źródło:: Półrocznik Językoznawczy Tertium; 2023, 8, 2; 54-71
2543-7844
Pojawia się w:: Półrocznik Językoznawczy Tertium
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Informacja

Wyszukujesz frazę "stylometry" wg kryterium: Temat

Źródło danych

Dostawca treści

Kolekcja

Rok wydania

Wydawca

Temat

Autor

Typ dokumentu

Język