Wszystkie pola: wordnets - Katalog OPAC zbiorów

Skocz do pozycji: 1.

Tytuł:: Persistent semantic identity in WordNet
Autorzy:: Kafe, Eric
Powiązania:: https://bibliotekanauki.pl/articles/677244.pdf
Data publikacji:: 2018
Wydawca:: Polska Akademia Nauk. Instytut Slawistyki PAN
Tematy:: wordnets
semantic identifiers
sense keys
key violations
synsets
mappings
Opis:: Persistent semantic identity in WordNetAlthough rarely studied, the persistence of semantic identity in the WordNet lexical database is crucial for the interoperability of all the resources that use WordNet data. The present study investigates the stability of the two primary entities of the WordNet database (the word senses and the synonym sets), by following their respective identifiers (the sense keys and the synset offsets) across all the versions released between 1995 and 2012, while also considering "drifts" of identical definitions and semantic relations. Contrary to expectations, 94.4% of the WordNet 1.5 synsets still persisted in the latest 2012 version, compared to only 89.1% of the corresponding sense keys. Meanwhile, the splits and merges between synonym sets remained few and simple. These results are presented in tables that allow to estimate the lexicographic effort needed for updating WordNet-based resources to newer WordNet versions. We discuss the specific challenges faced by both the dominant synset-based mapping paradigm (a moderate amount of split synsets), and the recommended sense key-based approach (very few identity violations), and conclude that stable synset identifiers are viable, but need to be complemented by stable sense keys in order to adequately handle the split synonym sets. Trwała tożsamość semantyczna w WordNecieChociaż rzadko badana, trwałość tożsamości semantycznej w leksykalnej bazie danych WordNet ma kluczowe znaczenie dla interoperacyjności wszystkich zasobów korzystających z danych WordNetowych. W niniejszej pracy zbadano stabilność dwóch podstawowych elementów bazy danych WordNet (jednostek leksykalnych i synsetów – zbiorów synonimicznych jednostek leksykalnych), poprzez prześledzenie ich identyfikatorów (tj. identyfikatorów jednostek i identyfikatorów synsetów) we wszystkich wersjach wydanych w latach 1995-2012. Wzięto również pod uwagę przesunięcia identycznych definicji i relacji semantycznych. Wbrew oczekiwaniom, 94,4% synsetów WordNetu 1.5 zachowało się w najnowszej wersji z 2012 r., w porównaniu do 89,1% odpowiadających im identyfikatorów jednostek. Tymczasem podziały i połączenia pomiędzy synsetami pozostały proste i nieliczne. Wyniki te przedstawiono w tabelach, które pozwalają oszacować wysiłek leksykograficzny potrzebny do aktualizacji zasobów opartych o WordNet do nowszych wersji WordNetu. Omawiamy konkretne wyzwania, przed którymi stoi zarówno dominujący paradygmat rzutowania synsetów (umiarkowana liczba podzielonych synsetów), jak i zalecane podejście oparte na identyfikatorach jednostek (bardzo niewiele naruszeń tożsamości) i stwierdzamy, że można stworzyć stabilne identyfikatory synsetów, ale muszą one iść w parze ze stabilnymi identyfikatorami jednostek, aby odpowiednio zająć się podzielonymi synsetami.
Źródło:: Cognitive Studies; 2018, 18
2392-2397
Pojawia się w:: Cognitive Studies
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 2.

Tytuł:: The System of Register Labels in plWordNet
Autorzy:: Maziarz, Marek
Piasecki, Maciej
Szpakowicz, Stan
Powiązania:: https://bibliotekanauki.pl/articles/677096.pdf
Data publikacji:: 2015
Wydawca:: Polska Akademia Nauk. Instytut Slawistyki PAN
Tematy:: wordnets
plWordNet
lexical register
large-scale wordnet expansion
inter-annotator agreement
Opis:: The System of Register Labels in plWordNetStylistic registers influence word usage. Both traditional dictionaries and wordnets assign lexical units to registers, and there is a wide range of solutions. A system of register labels can be flat or hierarchical, with few labels or many, homogeneous or decomposed into sets of elementary features. We review the register label systems in lexicography, and then discuss our model, designed for plWordNet, a large wordnet for Polish. There follows a detailed comparative analysis of several register systems in Polish lexical resources. We also present the practical effect of the adoption of our flat, small and homogeneous system: a relatively high consistency of register assignment in plWordNet, as measured by inter-annotator agreement on a manageable sample. Large-scale conclusions for the whole plWordNet remain to be made once the annotation has been completed, but the experience half-way through this labour-intensive exercise is very encouraging.
Źródło:: Cognitive Studies; 2015, 15
2392-2397
Pojawia się w:: Cognitive Studies
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 3.

Tytuł:: Mapping wordnets from the perspective of inter-lingual equivalence
Autorzy:: Rudnicka, Ewa Katarzyna
Piasecki, Maciej Tomasz
Piotrowski, Tadeusz
Grabowski, Łukasz
Bond, Francis
Powiązania:: https://bibliotekanauki.pl/articles/677283.pdf
Data publikacji:: 2017
Wydawca:: Polska Akademia Nauk. Instytut Slawistyki PAN
Tematy:: Princeton Wordnet
Polish wordnet (Słowosieć)
wordnet mapping
equivalence
translation
bilingual lexicography
Opis:: Mapping wordnets from the perspective of inter-lingual equivalenceThis paper explores inter-lingual equivalence from the perspective of linking two large lexico-semantic databases, namely the Princeton WordNet of English and the plWordnet (pl. Słowosieć) of Polish. Wordnets are built as networks of lexico-semantic relations between words and their meanings, and constitute a type of monolingual dictionary cum thesaurus. The development of wordnets for different languages has given rise to many wordnet linking projects (e.g. EuroWordNet, Vossen, 2002). Regardless of a linking method used, these projects require defining rules for establishing equivalence links between wordnet building blocks, known as synsets (sets of synonymous lexical units, i.e., lemma-sense pairs). In this paper an analysis is carried out of a set of inter-wordnet relations used in the mapping of the plWordNet onto the Princeton WordNet, and an attempt is made to relate them to equivalence taxonomies described in specialist literature on bilingual lexicography and translation. Rzutowanie wordnetów w perspektywie ekwiwalencji międzyjęzykowejArtykuł przedstawia analizę zjawiska ekwiwalencji międzyjęzykowej z perspektywy powiązania dwóch wielkich wordnetów: polskiej Słowosieci i angielskiego WordNetu princetońskiego. Wordnety są relacyjnymi bazami danych leksykalno-semantycznych opisującymi sieć relacji leksykalno-semantycznych pomiędzy słowami i ich znaczeniami. Stanowią zatem rodzaj słownika jednojęzycznego połączonego z tezaurusem. Rozwój wordnetów dla wielu języków świata zaowocował następnie ich wzajemnymi powiązaniami. Wymagało to zdefiniowania metodologii dla ustalenia ekwiwalencji pomiędzy ich podstawowymi elementami tzn. synsetami, które są zbiorami synonimicznych jednostek leksykalnych tzn. par lemat numer znaczenia. W artykule analizujemy zbiór relacji międzywordnetowych używanych w rzutowaniu pomiędzy Słowosiecią a WordNetem princetońskim, odnosząc je do taksonomii ekwiwalencji postulowanych w literaturze leksykograficznej i translatorycznej.
Źródło:: Cognitive Studies; 2017, 17
2392-2397
Pojawia się w:: Cognitive Studies
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Informacja

Wyszukujesz frazę "wordnets" wg kryterium: Wszystkie pola

Źródło danych

Dostawca treści

Kolekcja

Rok wydania

Wydawca

Temat

Autor

Typ dokumentu

Język