Temat: distributional semantics - Katalog OPAC zbiorów

Skocz do pozycji: 1.

Tytuł:: A dependency-based approach to word contextualization using compositional distributional semantics
Autorzy:: Gamallo, Pablo
Powiązania:: https://bibliotekanauki.pl/articles/103863.pdf
Data publikacji:: 2019
Wydawca:: Polska Akademia Nauk. Instytut Podstaw Informatyki PAN
Tematy:: distributional semantics
compositionality
dependency-based parsing
Opis:: We propose a strategy to build the distributional meaning of sentences mainly based on two types of semantic objects: context vectors associated with content words and compositional operations driven by syntactic dependencies. The compositional operations of a syntactic dependency make use of two input vectors to build two new vectors representing the contextualized sense of the two related words. Given a sentence, the iterative application of dependencies results in as many contextualized vectors as content words the sentence contains. At the end of the contextualization process, we do not obtain a single compositional vector representing the semantic denotation of the whole sentence (or of the root word), but one contextualized vector for each constituent word of the sentence. Our method avoids the troublesome high-order tensor representations of approaches relying on category theory, by defining all words as first-order tensors (i.e. standard vectors). Some corpus-based experiments are performed to both evaluate the quality of the contextualized vectors built with our strategy, and to compare them to other approaches on distributional compositional semantics. The experiments show that our dependency-based method performs as (or even better than) the state-of-the-art.
Źródło:: Journal of Language Modelling; 2019, 7, 1; 99-138
2299-856X
2299-8470
Pojawia się w:: Journal of Language Modelling
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 2.

Tytuł:: Idiosyncratic frequency as a measure of derivation vs. inflection
Autorzy:: Copot, Maria
Mickus, Timothee
Bonami, Olivier
Powiązania:: https://bibliotekanauki.pl/articles/24201226.pdf
Data publikacji:: 2022
Wydawca:: Polska Akademia Nauk. Instytut Podstaw Informatyki PAN
Tematy:: morphology
derivation– inflection gradient
distributional semantics
Opis:: There is ongoing discussion about how to conceptualize the nature of the distinction between inflection and derivation. A common approach relies on qualitative differences in the semantic relationship between inflectionally versus derivationally related words: inflection yields ways to discuss the same concept in different syntactic contexts, while derivation gives rise to words for related concepts. This differential can be expected to manifest in the predictability of word frequency between words that are related derivationally or inflectionally: predicting the token frequency of a word based on information about its base form or about related words should be easier when the two words are in an inflectional relationship, rather than a derivational one. We compare prediction error magnitude for statistical models of token frequency based on distributional and frequency information of inflectionally or derivationally related words in French. The results conform to expectations: it is easier to predict the frequency of a word from properties of an inflectionally related word than from those of a derivationally related word. Prediction error provides a quantitative, continuous method to explore differences between individual processes and differences yielded by employing different predicting information, which in turn can be used to draw conclusions about the nature and manifestation of the inflection–derivation distinction.
Źródło:: Journal of Language Modelling; 2022, 10, 2; 193--240
2299-856X
2299-8470
Pojawia się w:: Journal of Language Modelling
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 3.

Tytuł:: Graded hyponymy for compositional distributional semantics
Autorzy:: Bankova, D.
Coecke, B.
Lewis, M.
Marsden, D.
Powiązania:: https://bibliotekanauki.pl/articles/103883.pdf
Data publikacji:: 2018
Wydawca:: Polska Akademia Nauk. Instytut Podstaw Informatyki PAN
Tematy:: categorical semantics
compositional semantics
distributional semantics
computational linguistics
entailment
density operator
Opis:: The categorical compositional distributional model of natural language provides a conceptually motivated procedure to compute the meaning of a sentence, given its grammatical structure and the meanings of its words. This approach has outperformed other models in mainstream empirical language processing tasks, but lacks an effective model of lexical entailment. We address this shortcoming by exploiting the freedom in our abstract categorical framework to change our choice of semantic model. This allows us to describe hyponymy as a graded order on meanings, using models of partial information used in quantum computation. Quantum logic embeds in this graded order.
Źródło:: Journal of Language Modelling; 2018, 6, 2; 225-260
2299-856X
2299-8470
Pojawia się w:: Journal of Language Modelling
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 4.

Tytuł:: Distinguishing between paradigmatic semantic relations across word classes : human ratings and distributional similarity
Autorzy:: Schulte im Walde, Sabine
Powiązania:: https://bibliotekanauki.pl/articles/1429743.pdf
Data publikacji:: 2020
Wydawca:: Polska Akademia Nauk. Instytut Podstaw Informatyki PAN
Tematy:: semantic relations
human ratings
distributional semantics
automatic classification
Opis:: This article explores the distinction between paradigmatic semantic relations, both from a cognitive and a computational linguistic perspective. Focusing on an existing dataset of German synonyms, antonyms and hypernyms across the word classes of nouns, verbs and adjectives, we assess human ratings and a supervised classification model using window-based and pattern-based distributional vector spaces. Both perspectives suggest differences in relation distinction across word classes, but easy vs. difficult class-relation combinations differ, exhibiting stronger ties between ease and naturalness of class-dependent relations for humans than for computational models. In addition, we demonstrate that distributional information is indeed a difficult starting point for distinguishing between paradigmatic relations but that even a simple classification model is able to manage this task. The fact that the most salient vector spaces and their success vary across word classes and paradigmatic relations suggests that combining feature types for relation distinction is better than applying them in isolation.
Źródło:: Journal of Language Modelling; 2020, 8, 1; 53-101
2299-856X
2299-8470
Pojawia się w:: Journal of Language Modelling
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 5.

Tytuł:: German particle verbs : compositionality at the syntax-semantics interface
Autorzy:: Bott, S.
Schulte im Walde, S.
Powiązania:: https://bibliotekanauki.pl/articles/103841.pdf
Data publikacji:: 2018
Wydawca:: Polska Akademia Nauk. Instytut Podstaw Informatyki PAN
Tematy:: particie verbs
multi-word expressions
compositionality
distributional semantics
Opis:: Particle verbs represent a type of multi-word expression composed of a base verb and a particle. The meaning of the particle verb is often, but not always, derived from the meaning of the base verb, sometimes in quite complex ways. In this work, we computationally assess the levels of German particle verb compositionality by applying distributional semantic models. Furthermore, we investigate properties of German particle verbs at the syntax-semantics interface that influence their degrees of compositionality: (i) regularity in semantic particle verb derivation and (ii) transfer of syntactic subcategorization from base verbs to particle verbs. Our distributional models show that both superficial window co-occurrence models as well as theoretically well-founded syntactic models are sensitive to subcategorization frame transfer and can be used to predict degrees of particle verb compositionality, with window models performing better even though they are conceptually and computationally simpler.
Źródło:: Journal of Language Modelling; 2018, 6, 1; 41-86
2299-856X
2299-8470
Pojawia się w:: Journal of Language Modelling
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 6.

Tytuł:: Testing word embeddings for Polish
Autorzy:: Mykowiecka, Agnieszka
Marciniak, Małgorzata
Rychlik, Piotr
Powiązania:: https://bibliotekanauki.pl/articles/677111.pdf
Data publikacji:: 2017
Wydawca:: Polska Akademia Nauk. Instytut Slawistyki PAN
Tematy:: distributional semantics
word embeddings
model evaluation
synonymy
analogy
Opis:: Testing word embeddings for PolishDistributional Semantics postulates the representation of word meaning in the form of numeric vectors which represent words which occur in context in large text data. This paper addresses the problem of constructing such models for the Polish language. The paper compares the effectiveness of models based on lemmas and forms created with Continuous Bag of Words (CBOW) and skip-gram approaches based on different Polish corpora. For the purposes of this comparison, the results of two typical tasks solved with the help of distributional semantics, i.e. synonymy and analogy recognition, are compared. The results show that it is not possible to identify one universal approach to vector creation applicable to various tasks. The most important feature is the quality and size of the data, but different strategy choices can also lead to significantly different results. Testowanie wektorowych reprezentacji dystrybucyjnych słów języka polskiegoSemantyka dystrybucyjna opiera się na założeniu, że znaczenie słów wyrażone jest za pomocą wektorów reprezentujących, w sposób bezpośredni bądź pośredni, konteksty, w jakich słowo to jest używane w dużym zbiorze tekstów. Niniejszy artykuł dotyczy ewaluacji wielu takich modeli skonstruowanych dla języka polskiego. W pracy porównano skuteczność modeli opartych na lematach i formach słów, utworzonych przy wykorzystaniu sieci neuronowych na danych z dwóch różnych korpusów języka polskiego. Ewaluacji dokonano na podstawie wyników dwóch typowych zadań rozwiązywanych za pomocą metod semantyki dystrybucyjnej, tzn. rozpoznania występowania synonimii i analogii między konkretnymi parami słów. Uzyskane wyniki dowodzą, że nie można wskazać jednego uniwersalnego podejścia do tworzenia modeli dystrybucyjnych, gdyż ich skuteczność jest różna w zależności od zastosowania. Najważniejszą cechą wpływającą na jakość modelu jest jakość oraz rozmiar danych, ale wybory różnych strategii uczenia sieci mogą również prowadzić do istotnie odmiennych wyników.
Źródło:: Cognitive Studies; 2017, 17
2392-2397
Pojawia się w:: Cognitive Studies
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 7.

Tytuł:: Text : now in 2D! A framework for lexical expansion with contextual similarity
Autorzy:: Biemann, C.
Riedl, M.
Powiązania:: https://bibliotekanauki.pl/articles/103919.pdf
Data publikacji:: 2013
Wydawca:: Polska Akademia Nauk. Instytut Podstaw Informatyki PAN
Tematy:: distributional semantics
lexical expansion
contextual similarity
lexical substitution
computational semantics
Opis:: A new metaphor of two-dimensional text for data-driven semantic modeling of natural language is proposed, which provides an entirely new angle on the representation of text: not only syntagmatic relations are annotated in the text, but also paradigmatic relations are made explicit by generating lexical expansions. We operationalize distributional similarity in a general framework for large corpora, and describe a new method to generate similar terms in context. Our evaluation shows that distributional similarity is able to produce high-quality lexical resources in an unsupervised and knowledge-free way, and that our highly scalable similarity measure yields better stores in a WordNet-based evaluation than previous measures for very large corpora. Evaluating on a lexical substitution task, we find that our contextualization method improves over a non-contextualized baseline across all parts of speech, and we show how the metaphor can be applied successfully to part-of-speech tagging. A number of ways to extend and improve the contextualization method within our Framework are discussed. As opposed to comparable approaches, our framework defines a model of lexical expansions in context that can generate the expansions as opposed to ranking a given list, and thus does not require existing lexical-semantic resources.
Źródło:: Journal of Language Modelling; 2013, 1, 1; 55-95
2299-856X
2299-8470
Pojawia się w:: Journal of Language Modelling
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 8.

Tytuł:: Le sens de fête en polonais, en lituanien, en français et sa (non)coïncidence collocationnelle
Znaczenie wyrazu święto w języku polskim, litewskim i francuskim oraz jego kolokacyjna (nie)ekwiwalentność
The meaning of holiday in Polish, Lithuanian, French and its collocational (non)coincidence
Autorzy:: Kazlauskienė, Vitalija
Dryjańska, Agnieszka
Powiązania:: https://bibliotekanauki.pl/articles/2173942.pdf
Data publikacji:: 2022-12-02
Wydawca:: Uniwersytet Pedagogiczny im. Komisji Edukacji Narodowej w Krakowie
Tematy:: corpus analysis
distributional semantics
collocation
holiday
French Language Teaching
analiza korpusowa
semantyka dystrybucyjna
kolokacja
święto
FLE
Opis:: The linguistic overview of the word holiday in the three languages (French, Lithuanian and Polish) is promising for the intercultural approach to teaching French as a foreign language with a view to go beyond the roughly monocultural contexts in Poland and Lithuania. The research is based on text corpora in these three languages. Its objective is to analyse the linguistic images of the word holiday and its Lithuanian and Polish equivalents and to examine their collocational (non)coincidence in order to systematize the teaching/learning of collocations to French learners. The aim would be to help students retain meaning and lexical association simultaneously, as well as to fix the structures they already partially know and to discover (inter)cultural aspects.
Źródło:: Annales Universitatis Paedagogicae Cracoviensis. Studia Linguistica; 2022, 17; 20-42
2083-1765
Pojawia się w:: Annales Universitatis Paedagogicae Cracoviensis. Studia Linguistica
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 9.

Tytuł:: Русские verba sentiendi в модели экспликативного синтаксиса
Autorzy:: Kiklewicz, Aleksander
Powiązania:: https://bibliotekanauki.pl/articles/2032593.pdf
Data publikacji:: 2018
Wydawca:: Polska Akademia Nauk. Czytelnia Czasopism PAN
Tematy:: semantics
verba sentiendi
semantic syntax
explicative syntax
valence
distributional pattern
modern Russian
Źródło:: Slavia Orientalis; 2018, LXVII, 1; 89-115
0037-6744
Pojawia się w:: Slavia Orientalis
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Informacja

Wyszukujesz frazę "distributional semantics" wg kryterium: Temat

Źródło danych

Dostawca treści

Kolekcja

Rok wydania

Wydawca

Temat

Autor

Typ dokumentu

Język