Informacja

Drogi użytkowniku, aplikacja do prawidłowego działania wymaga obsługi JavaScript. Proszę włącz obsługę JavaScript w Twojej przeglądarce.

Wyszukujesz frazę "annotation" wg kryterium: Temat


Wyświetlanie 1-40 z 40
Tytuł:
A COMPARISON OF THE EFFECT OF TEXTUAL, AUDIO AND TEXTUAL-PICTORIAL AND AUDIO–PICTORIAL ANNOTATIONS ON ENHANCING READING COMPREHENSION AMONG IRANIAN EFL LEARNERS
Autorzy:
Karbalaei, Alireza
Zare, Amaneh
Powiązania:
https://bibliotekanauki.pl/articles/955805.pdf
Data publikacji:
2019
Wydawca:
Uniwersytet Marii Curie-Skłodowskiej w Lublinie. IATEFL Poland Computer Special Interest Group
Tematy:
textual annotation
audio annotation
textual-pictorial annotation
audio-pictorial annotation
reading comprehension
Opis:
This study aimed to investigate the interaction between L2 readers and the reading text equipped with four different annotations or glosses including text-only, audio-only, text-picture and audio-picture annotations. The participants in the study were selected from four intact classes consisting of 100 students studying English at intermediate level in Kish Institute of Science & technology (olom va fonon), in Iran. After they were given a reading comprehension text, the four experimental groups were given the same reading comprehension texts with different annotations. Then, they were asked to take the same reading test as posttest. The results of the study demonstrated that text-only and audio-only were more effective than other kinds of annotation. The results suggested that providing the new words whether in audio or text annotation during reading comprehension can help students to comprehend reading in an effective way. Educational implications suggest that provision of different kinds of glosses is beneficial for L2 students although they need some scaffolding for utilizing glosses in a beneficial way.
Źródło:
Teaching English with Technology; 2019, 19, 3; 40-67
1642-1027
Pojawia się w:
Teaching English with Technology
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Bulgarian sense-annotated corpus – between the tradition and novelty
Autorzy:
Koeva, Svetla
Powiązania:
https://bibliotekanauki.pl/articles/677294.pdf
Data publikacji:
2012
Wydawca:
Polska Akademia Nauk. Instytut Slawistyki PAN
Tematy:
corpus studies
corpus annotation
annotation principles
Opis:
Bulgarian sense-annotated corpus – between the tradition and noveltyThe Bulgarian Sense-annotated Corpus (BulSemCor) is compiled according to the general methodology established by the SemCor project. It is a subset of the Brown Corpus of Bulgarian semantically annotated with a corresponding synonym set (synset) in the Bulgarian wordnet. Unlike the bulk of sense-annotated corpora where only (sets of) content words are annotated, in BulSemCor each lexical unit has been assigned a sense. The main contributions achieved in the work on BulSemCor are briefly decides in the presented paper: definition of an annotation schema, compilation of an input corpus, development of a sense-annotated corpus, Bulgarian wordnet enlargement.
Źródło:
Cognitive Studies; 2012, 12
2392-2397
Pojawia się w:
Cognitive Studies
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Multi-level annotation of the specialized Corpus of Dialogs of Disabled Polish Speakers
Autorzy:
Trzebińska, Joanna
Bartoszewicz, Jakub
Powiązania:
https://bibliotekanauki.pl/articles/677159.pdf
Data publikacji:
2014
Wydawca:
Polska Akademia Nauk. Instytut Slawistyki PAN
Tematy:
speech corpus
pragmatic annotation
semantic annotation
disability
Opis:
Multi-level annotation of the specialized Corpus of Dialogs of Disabled Polish SpeakersWhile Polish language is relatively well represented in general purpose corpora such as National Polish Language Corpus still there are groups of speakers that are underrepresented in reference corpora. One of such sub-groups is the disabled people community. On the other hand there is a growing need for understanding how disability influences social and cognitive abilities, language in particular. In this paper, we present a specialized Corpus of Dialogs of Disabled Speakers. The process of compiling, transcription and annotation of pragmatic, semantic and morphosyntactic features will be described, as well as Corpus applications will be discussed.
Źródło:
Cognitive Studies; 2014, 14
2392-2397
Pojawia się w:
Cognitive Studies
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
A French corpus annotated for multiword expressions and named entities
Autorzy:
Candito, Marie
Constant, Mathieu
Ramisch, Carlos
Savary, Agata
Guillaume, Bruno
Parmentier, Yannick
Cordeiro, Silvio Ricardo
Powiązania:
https://bibliotekanauki.pl/articles/1818889.pdf
Data publikacji:
2020
Wydawca:
Polska Akademia Nauk. Instytut Podstaw Informatyki PAN
Tematy:
multiword expressions
annotation
corpus
French
Opis:
We present the enrichment of a French treebank of various genres with a new annotation layer for multiword expressions (MWEs) and named entities (NEs).1 Our contribution with respect to previous work on NE and MWE annotation is the particular care taken to use formal criteria, organized into decision flowcharts, shedding some light on the interactions between NEs and MWEs. Moreover, in order to cope with the well-known difficulty to draw a clear-cut frontier between compositional expressions and MWEs, we chose to use sufficient criteria only. As a result, annotated MWEs satisfy a varying number of sufficient criteria, accounting for the scalar nature of the MWE status. In addition to the span of the elements, annotation includes the subcategory of NEs (e.g., person, location) and one matching sufficient criterion for non-verbal MWEs (e.g., lexical substitution). The 3,099 sentences of the treebank were double-annotated and adjudicated, and we paid attention to cross-type consistency and compatibility with the syntactic layer. Overall inter-annotator agreement on non-verbal MWEs and NEs reached 71.1%. The released corpus contains 3,112 annotated NEs and 3,440 MWEs, and is distributed under an open license.
Źródło:
Journal of Language Modelling; 2020, 8, 2; 415--479
2299-856X
2299-8470
Pojawia się w:
Journal of Language Modelling
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
A French corpus annotated for multiword expressions and named entities
Autorzy:
Candito, Marie
Constant, Mathieu
Ramisch, Carlos
Savary, Agata
Guillaume, Bruno
Parmentier, Yannick
Cordeiro, Silvio Ricardo
Powiązania:
https://bibliotekanauki.pl/articles/1818891.pdf
Data publikacji:
2020
Wydawca:
Polska Akademia Nauk. Instytut Podstaw Informatyki PAN
Tematy:
multiword expressions
annotation
corpus
French
Opis:
We present the enrichment of a French treebank of various genres with a new annotation layer for multiword expressions (MWEs) and named entities (NEs).1 Our contribution with respect to previous work on NE and MWE annotation is the particular care taken to use formal criteria, organized into decision flowcharts, shedding some light on the interactions between NEs and MWEs. Moreover, in order to cope with the well-known difficulty to draw a clear-cut frontier between compositional expressions and MWEs, we chose to use sufficient criteria only. As a result, annotated MWEs satisfy a varying number of sufficient criteria, accounting for the scalar nature of the MWE status. In addition to the span of the elements, annotation includes the subcategory of NEs (e.g., person, location) and one matching sufficient criterion for non-verbal MWEs (e.g., lexical substitution). The 3,099 sentences of the treebank were double-annotated and adjudicated, and we paid attention to cross-type consistency and compatibility with the syntactic layer. Overall inter-annotator agreement on non-verbal MWEs and NEs reached 71.1%. The released corpus contains 3,112 annotated NEs and 3,440 MWEs, and is distributed under an open license.
Źródło:
Journal of Language Modelling; 2020, 8, 2; 415--479
2299-856X
2299-8470
Pojawia się w:
Journal of Language Modelling
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Design and analysis of a lean interface for Sanskrit corpus annotation
Autorzy:
Goyal, P.
Huet, G.
Powiązania:
https://bibliotekanauki.pl/articles/103855.pdf
Data publikacji:
2016
Wydawca:
Polska Akademia Nauk. Instytut Podstaw Informatyki PAN
Tematy:
Sanskrit
text segmentation
annotation
interface
Opis:
We describe an innovative computer interface designed to assist annotators in the efficient selection of segmentation solutions for proper tagging of Sanskrit corpora. The proposed solution uses a compact representation of the shared forest of all segmentations. The main idea is to represent the union of all segmentations, abstracting from the sandhi rules used, and aligning with the input sentence. We show that this representation provides an exponential saving, in both space and time. The segmentation methodology is lexicon-directed. When the lexicon does not have full coverage of the corpus vocabulary, some chunks of the input may fail to be recognized. We designed a lexiconacquisition facility, which remedies this incompleteness and makes the interface more robust. This interface has been implemented, and is currently being applied to the annotation of the Sanskrit Library corpus. Evaluation over 1,500 sentences from the Pañcatantra text shows the effectiveness of the proposed interface on real corpus data.
Źródło:
Journal of Language Modelling; 2016, 4, 2; 145-182
2299-856X
2299-8470
Pojawia się w:
Journal of Language Modelling
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Towards an event annotated corpus of Polish
Autorzy:
Marcińczuk, Michał
Oleksy, Marcin
Bernaś, Tomasz
Kocoń, Jan
Wolski, Michał
Powiązania:
https://bibliotekanauki.pl/articles/677125.pdf
Data publikacji:
2015
Wydawca:
Polska Akademia Nauk. Instytut Slawistyki PAN
Tematy:
information extraction
event recognition
corpus annotation
Opis:
Towards an event annotated corpus of PolishThe paper presents a typology of events built on the basis of TimeML specification adapted to Polish language. Some changes were introduced to the definition of the event categories and a motivation for event categorization was formulated. The event annotation task is presented on two levels – ontology level (language independent) and text mentions (language dependant). The various types of event mentions in Polish text are discussed. A procedure for annotation of event mentions in Polish texts is presented and evaluated. In the evaluation a randomly selected set of documents from the Corpus of Wrocław University of Technology (called KPWr) was annotated by two linguists and the annotator agreement was calculated. The evaluation was done in two iterations. After the first evaluation we revised and improved the annotation procedure. The second evaluation showed a significant improvement of the agreement between annotators. The current work was focused on annotation and categorisation of event mentions in text. The future work will be focused on description of event with a set of attributes, arguments and relations.
Źródło:
Cognitive Studies; 2015, 15
2392-2397
Pojawia się w:
Cognitive Studies
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Języki słowiańskie i litewski w korpusach równoległych Clarin-PL
Autorzy:
Koseska-Toszewa, Violetta
Roszko, Roman
Powiązania:
https://bibliotekanauki.pl/articles/678946.pdf
Data publikacji:
2016
Wydawca:
Polska Akademia Nauk. Instytut Slawistyki PAN
Tematy:
multilingual parallel corpora
semantic annotation
scope quantification
Opis:
Slavic languages and the Lithuanian language in the Clarin-PL parallel corporaThe Clarin Eric and Clarin-PL strategic scientific purpose is to support humanistic research in a multicultural and multilingual Europe. Polish researchers put the emphasis on building a bridge between the Polish language and Polish linguistic technologies and other European languages and their linguistic technologies. So far, the Polish scientific community has mainly focused on Polish-English connections. Clarin-PL has been developing the first and only multilingual corpora of the Polish language in conjunction with other Slavic languages and the Lithuanian language: the Polish-Bulgarian-Russian Parallel Corpus and the Polish- Lithuanian Parallel Corpus. The parallel corpora created by the ISS PAS Corpus Linguistics and Semantics Team break through the existing “canons” and allow scientists access to interlinked multilingual language resources – in the first phase limited to the languages of the three Slavic groups and the Lithuanian language. In the article, the authors present very detailed information on their original system of the semantic annotation of scope quantification in multilingual parallel corpora, hitherto unused in the subject literature. Due to the system’s originality, the semantic annotation is carried out manually. Identification of particular values of scope quantification in a sentence and the hereby presented attempts of its recording are supported by long-term research conducted by an international team of linguists and computer scientists / mathematicians developing the issue of quantification of names, time and aspect in natural languages. Języki słowiańskie i litewski w korpusach równoległych Clarin-PLStrategicznym celem naukowym Clarin ERIC i Clarin-PL jest wspieranie badań humanistycznych w wielokulturowej i wielojęzycznej Europie. Dla polskich badaczy ważna jest budowa pomostu między językiem polskim, polskimi technologiami językowymi a innymi językami europejskimi i na ich rzecz opracowanymi technologiami językowymi. Dotychczas w nauce polskiej największy nacisk był kładziony na powiązania polsko-angielskie. Clarin-PL opracowuje zatem pierwsze jak dotąd wielojęzyczne korpusy języka polskiego w zestawieniu z innymi językami słowiańskimi oraz z językiem litewskim: Korpus równoległy polsko-bułgarsko-rosyjski i Korpus równoległy polsko-litewski. Tworzone przez Zespół Lingwistyki Korpusowej i Semantyki (IS PAN) korpusy równoległe przełamują dotychczasowe „kanony” i udostępniają nauce powiązane wielojęzyczne zasoby – w pierwszym etapie ograniczone do języków trzech grup słowiańskich oraz języka litewskiego. W artykule autorzy przedstawiają bardzo szczegółową informację o zastosowanej po raz pierwszy w literaturze przedmiotu anotacji semantycznej dotyczącej kwantyfikacji zakresowej w wielojęzycznych korpusach równoległych. Z powodu swojego rozległego zakresu i nowatorstwa ta anotacja semantyczna jest nanoszona ręcznie. Identyfikacja poszczególnych wartości kwantyfikacji zakresowej w zdaniu oraz przedstawiane tu próby jej zapisu są poparte wieloletnimi badaniami międzynarodowego zespołu lingwistów i matematyków-informatyków opracowujących zagadnienie kwantyfikacji imion, czasu i aspektu w językach naturalnych.
Źródło:
Studia z Filologii Polskiej i Słowiańskiej; 2016, 51
2392-2435
0081-7090
Pojawia się w:
Studia z Filologii Polskiej i Słowiańskiej
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Construction of a medical corpus based on information extraction results
Autorzy:
Marciniak, M.
Mykowiecka, A.
Powiązania:
https://bibliotekanauki.pl/articles/206379.pdf
Data publikacji:
2011
Wydawca:
Polska Akademia Nauk. Instytut Badań Systemowych PAN
Tematy:
corpus
semantic annotation
clinical data
information extraction
Opis:
The paper presents a method of automatic construction of a semantically annotated corpus using the results of a rulebased information extraction (IE) application. Construction of the corpus is based on using existing programs for text tokenization and morphological analysis and combining their results with domain related correction rules. We reuse the specialized IE system to obtain a corpus annotated on the semantic level. The texts included within the corpus are Polish free text clinical data. We present the documents - diabetic patients' discharge records, the structure of the corpus annotation and the methods for obtaining the annotations. Initial evaluations based on the results of manual verification of selected data subset are also presented. The corpus, once manually corrected, is designed to be used for developing supervised machine learning models for IE applications.
Źródło:
Control and Cybernetics; 2011, 40, 2; 337-360
0324-8569
Pojawia się w:
Control and Cybernetics
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Information searching for an experience management platform of the EU Pellucid project
Autorzy:
Majewska, M.
Krawczyk, K.
Słota, R.
Kitowski, J.
Hluchy, L.
Lambert, S.
Powiązania:
https://bibliotekanauki.pl/articles/1964213.pdf
Data publikacji:
2004
Wydawca:
Politechnika Gdańska
Tematy:
experience management
ontologies
information retrieval
semantic annotation
Opis:
The EU Pellucid project is developing an experience management system for public organizations with staff mobility. The paper presents an activity whitin the project focused on searching for information in repositories of documents. The project's background and the process of information searching are described. Ontological methods such as semantic annotation and similarity searching, as well as ontology- and full-text-based searching are presented. Monitoring of organizational repositories is discussed.
Źródło:
TASK Quarterly. Scientific Bulletin of Academic Computer Centre in Gdansk; 2004, 8, 4; 513-523
1428-6394
Pojawia się w:
TASK Quarterly. Scientific Bulletin of Academic Computer Centre in Gdansk
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Automatyczna anotacja genomu jako narzędzie biologii systemów
Automatic genome annotation as a tool of systems biology
Autorzy:
Bizukojć, M.
Powiązania:
https://bibliotekanauki.pl/articles/2070453.pdf
Data publikacji:
2009
Wydawca:
Stowarzyszenie Inżynierów i Techników Mechaników Polskich
Tematy:
anotacja
genom
Aspergillus
KAAS
KEGG
annotation
genome
Opis:
W pracy przedstawiono metodę analizy metabolizmu organizmów polegającą na rekonstrukcji sieci metabolicznej na podstawie całkowicie lub częściowo zsekwencjonowanego genomu. Analizę tę przeprowadzono dla siedmiu gatunków grzybów nitkowych z rodzaju Aspergillus wykorzystując serwer automatycznej anotacji, a jej wyniki porównano z wybranymi danymi fizjologicznymi.
A method based upon the reconstruction of fully or partially sequenced genome to analyse metabolic networks of organisms is presented. This analysis was performed for seven fungal species of genus Aspergillus with the use of automatic annotation server. The results were compared with selected physiological data.
Źródło:
Inżynieria i Aparatura Chemiczna; 2009, 3; 25-27
0368-0827
Pojawia się w:
Inżynieria i Aparatura Chemiczna
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Badania aspektu w językach polskim, czeskim i rosyjskim za pomocą korpusów i baz danych (pierwsze podsumowanie tematu)
Autorzy:
Wiemer, Björn
Wrzesień-Kwiatkowska, Joanna
Łaziński, Marek
Powiązania:
https://bibliotekanauki.pl/articles/1036249.pdf
Data publikacji:
2020-11-20
Wydawca:
Wydawnictwo Uniwersytetu Śląskiego
Tematy:
aspect
verbal prefixes
aspect triples
electronic corpora
annotation
Opis:
The article is connected to the project “DiAsPol250” in which Polish is compared to Czech and Russian from the perspective of the evolution of their aspect systems (http://www.diaspol.uw.edu.pl/). We make use of existing synchronic and diachronic electronic corpora and are building a corpus of our own with annotated aspect pairs; we also create a database of aspect triplets whose role we consider as particularly important for the system. We want to assess which changes have occurred since the mid-18th century in prefixing and suffixing strategies of verb stems, both in general and by comparing particular prefixes and suffixes, especially so-called natural (vs. specialized) prefixes (according to Janda et al., 2013). The article supplies a sketch of the general premises of the project, and it summarizes our experience with existing large corpora and databases which we have been employing. We also present a case study in order to demonstrate a procedure designed to compare the distribution of the Czech prefix z- in triplets and in the corpus. This procedure is meant to check more general tendencies; it also illustrates why electronic corpora cannot be replaced in research on distributional properties and why their role does not consist simply in providing examples for the illustration of hypotheses.
Źródło:
Forum Lingwistyczne; 2020, 7; 45-58
2449-9587
2450-2758
Pojawia się w:
Forum Lingwistyczne
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
KIS: An automated attribute induction method for classification of DNA sequences
Autorzy:
Biedrzycki, R.
Arabas, J.
Powiązania:
https://bibliotekanauki.pl/articles/330979.pdf
Data publikacji:
2012
Wydawca:
Uniwersytet Zielonogórski. Oficyna Wydawnicza
Tematy:
klasyfikacja
optymalizacja
anotacja
wzorzec
classification
optimization
annotation
patterns
Opis:
This paper presents an application of methods from the machine learning domain to solving the task of DNA sequence recognition. We present an algorithm that learns to recognize groups of DNA sequences sharing common features such as sequence functionality. We demonstrate application of the algorithm to find splice sites, i.e., to properly detect donor and acceptor sequences. We compare the results with those of reference methods that have been designed and tuned to detect splice sites. We also show how to use the algorithm to find a human readable model of the IRE (Iron-Responsive Element) and to find IRE sequences. The method, although universal, yields results which are of quality comparable to those obtained by reference methods. In contrast to reference methods, this approach uses models that operate on sequence patterns, which facilitates interpretation of the results by humans.
Źródło:
International Journal of Applied Mathematics and Computer Science; 2012, 22, 3; 711-721
1641-876X
2083-8492
Pojawia się w:
International Journal of Applied Mathematics and Computer Science
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Enhancing grammar and valence resources for Akan and Ga
Autorzy:
Beermann, Dorothee
Hellan, Lars
Linde-Usiekniewicz, Jadwiga
Storch, Anne
Powiązania:
https://bibliotekanauki.pl/chapters/1040110.pdf
Data publikacji:
2020
Wydawca:
Uniwersytet Warszawski. Wydawnictwa Uniwersytetu Warszawskiego
Tematy:
digital resources
lexicon
valence
corpus annotation
Akan
Ga
Opis:
We present a case study in valence comparison between closely related Kwa languages, assessing frames and meanings of the verb ba (‘come’) in Akan with a homophonous corresponding item in Ga. The discussion draws on the Akan dictionary (Christaller 1881), a Ga valence dictionary based on (Dakubu 2009), and an online annotated corpus of Akan hosted in TypeCraft (Beermann & Mihaylov 2014). With a view to the possibility of making use of resources for one language in the development of resources for another, we demonstrate how digital resources and linguistic specifications can inform each other.
Źródło:
West African languages. Linguistic theory and communication; 166-185
9788323546313
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Annogene: Restful Web Service for Annotating Genomic Features
Autorzy:
Tomski, A.
Piechota, M.
Przewłocki, R.
Powiązania:
https://bibliotekanauki.pl/articles/108641.pdf
Data publikacji:
2014
Wydawca:
Społeczna Akademia Nauk w Łodzi
Tematy:
BED annotation
RESTful web service
ChIP-seq peaks
Opis:
Modern high-throughput sequencing techniques generate a constantly increasing amount of genomic data from eukaryotes. The main problem is quickly identifying the data that may provide information about the nature of intracellular processes, such as the targeting of transcription factor-binding sites. Typically, thousands of peaks or signals are found across the genome and the nearby genes must be annotated. We introduce AnnoGene - a web service for annotating genomic features. AnnoGene was implemented in a representational state transfer (REST) architectural style. The program searches for the gene nearest to the center of a genomic position. Subsequently, the location and annotationsof the gene are shown. The tool can be downloaded and run on a local computer, but it was designed to be a web service. AnnoGene is freely available through a web browser. Moreover, our paper covers examples of the REST clients written in the Python, R and Java programming languages. AnnoGene only requires genomic positions from the user. Even when annotating several thousand positions, the output is typically ready in a few seconds. Moreover, this tool supports Seqinspector – a web tool for finding regulators of the genes.
Źródło:
Journal of Applied Computer Science Methods; 2014, 6 No. 2; 101-110
1689-9636
Pojawia się w:
Journal of Applied Computer Science Methods
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Experimental Polish-Lithuanian Corpus with the Semantic Annotation Elements
Autorzy:
Roszko, Danuta
Roszko, Roman
Powiązania:
https://bibliotekanauki.pl/articles/677259.pdf
Data publikacji:
2013
Wydawca:
Polska Akademia Nauk. Instytut Slawistyki PAN
Tematy:
corpora
parallel and comparable corpora
annotation
Polish
Lithuanian
Opis:
Experimental Polish-Lithuanian Corpus with the Semantic Annotation ElementsIn the article the authors present the experimental Polish-Lithuanian corpus (ECorpPL-LT) formed for the idea of Polish-Lithuanian theoretical contrastive studies, a Polish-Lithuanian electronic dictionary, and as help for a sworn translator. The semantic annotation being brought into ECorpPL-LT is extremely useful in Polish-Lithuanian contrastive studies, and also proves helpful in translation work.
Źródło:
Cognitive Studies; 2013, 13
2392-2397
Pojawia się w:
Cognitive Studies
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
The method of automatic summarization from different sources
Autorzy:
Shakhovska, N.
Cherna, T.
Powiązania:
https://bibliotekanauki.pl/articles/411243.pdf
Data publikacji:
2016
Wydawca:
Polska Akademia Nauk. Oddział w Lublinie PAN
Tematy:
annotation
abstracting
national system of abstracting
heterogeneous data
analysis
Opis:
In this article is analyzed technology of automatic text abstracting and annotation. The role of annotation in automatic search and classification for different scientific articles is described. The algorithm of summarization of natural language documents using the concept of importance coefficients is developed. Such concept allows considering the peculiarity of subject areas and topics that could be found in different kinds of documents. Method for generating abstracts of single document based on frequency analysis is developed. The recognition elements for unstructured text analysis are given. The method of pre-processing analysis of several documents is developed. This technique simultaneously considers both statistical approaches to abstracting and the importance of terms in a particular subject domain. The quality of generated abstract is evaluated. For the developed system there was conducted experts evaluation. It was held only for texts in Ukrainian. The developed system concluding essay has higher aggregate score on all criteria. The summarization system architecture is building. To build an information system model there is used CASE-tool AllFusion ERwin Data Modeler. The database scheme for information saving was built. The system is designed to work primarily with Ukrainian texts, which gives a significant advantage, since most modern systems still oriented to English texts.
Źródło:
ECONTECHMOD : An International Quarterly Journal on Economics of Technology and Modelling Processes; 2016, 5, 1; 103-109
2084-5715
Pojawia się w:
ECONTECHMOD : An International Quarterly Journal on Economics of Technology and Modelling Processes
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Robust Audio Watermarks in Frequency Domain
Autorzy:
Dymarski, P.
Markiewicz, R.
Powiązania:
https://bibliotekanauki.pl/articles/308461.pdf
Data publikacji:
2014
Wydawca:
Instytut Łączności - Państwowy Instytut Badawczy
Tematy:
annotation watermarking
audio watermarking
digital signature
dirty paper codes
LDPC
Opis:
In this paper an audio watermarking technique is presented, using log-spectrum, dirty paper codes and LDPC for watermark embedding. This technique may be used as a digital communication channel, transmitting data at about 40 b/s. It may be also applied for hiding a digital signature, e.g., for copyright protection purposes. Robustness of the watermarks against audio signal compression, resampling and transmitting through an acoustic channel is tested.
Źródło:
Journal of Telecommunications and Information Technology; 2014, 2; 12-21
1509-4553
1899-8852
Pojawia się w:
Journal of Telecommunications and Information Technology
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Erotetic Reasoning Corpus. A data set for research on natural question processing
Autorzy:
Łupkowski, P.
Urbański, M.
Wiśniewski, A.
Błądek, W.
Juska, A.
Kostrzewa, A.
Pankow, D.
Paluszkiewicz, K.
Ignaszak, O.
Urbańska, J.
Żyluk, N.
Gajda, A.
Marciniak, B.
Powiązania:
https://bibliotekanauki.pl/articles/103809.pdf
Data publikacji:
2017
Wydawca:
Polska Akademia Nauk. Instytut Podstaw Informatyki PAN
Tematy:
question
logic of question
question processing
erotetic reasoning
corpus annotation
Opis:
The aim of this paper is to present the Erotetic Reasoning Corpus (ERC) which constitutes a data set for research on natural question processing. We describe the theoretical background, linguistic data and tags used for the annotation process. We also discuss the potential areas in which the ERC can be exploited.
Źródło:
Journal of Language Modelling; 2017, 5, 3; 607-631
2299-856X
2299-8470
Pojawia się w:
Journal of Language Modelling
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Abstrakt i adnotacja jako element opisu dokumentu w bazie iSybislaw
Abstract and annotation as an element of bibliographic description in the iSybislaw database
Autorzy:
Kowalski, Paweł
Powiązania:
https://bibliotekanauki.pl/articles/965802.pdf
Data publikacji:
2014-12-31
Wydawca:
Polska Akademia Nauk. Instytut Slawistyki PAN
Tematy:
abstract
annotation
bibliographic description
database
information retrieval system
iSybislaw
summary
Opis:
Abstract (sometimes called summary) of a scientific publication is a brief text that contains keywords. It is one of the elements of a bibliographic description in the bibliographic database iSybislaw – a modern information retrieval system. In the paper definitions of terms such as abstract, annotation and summary along with their constitutive elements are presented. A characteristics of such short texts inserted in the iSybislaw database in the fields Abstract and Abstract 2 is also given. Based on some examples excerpted from the iSybislaw system a typology of short texts, which are elements of the database bibliographic description, is proposed. The material allows to list three kinds of texts that are being used in the iSybislaw database: annotations, abstracts and biographic annotations.
Przedmiotem analizy artykułu są krótkie teksty, które stanowią jeden z elementów opisu bibliograficznego w systemie wyszukiwawczym iSybislaw. W praktyce naukowej używane są różne terminy odnoszące się do takich tekstów (abstrakt, adnotacja, streszczenie). Autor podaje ich definicje oraz wskazuje elementy konstytutywne. Na podstawie przykładów wyekscerpowanych z systemu iSybislaw przedstawia ich typologię oraz omawia miejsce i funkcje w opisie bibliograficznym.
Źródło:
Studia z Filologii Polskiej i Słowiańskiej; 2014, 49; 88-98
2392-2435
0081-7090
Pojawia się w:
Studia z Filologii Polskiej i Słowiańskiej
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
3D Medical Segmentation Visualization in Julia with MedEye3d
Autorzy:
Mitura, Jakub
Chrapko, Beata
Powiązania:
https://bibliotekanauki.pl/articles/1838173.pdf
Data publikacji:
2021-12
Wydawca:
Warszawska Wyższa Szkoła Informatyki
Tematy:
OpenGl
Computer Tomagraphy
PET/CT
medical image annotation
medical image visualization
Opis:
MedEye3d is a Julia language package designed to simplify visualizations of segmentation in three dimensional setting. Motivation to develop this application was to provide to rapidly growing Julia language scientific community tool for research in three dimensional medical images. Package is based on multiple open source software packages, yet most prominent is utilization of OpenGl specification to enable GPU acceleration.Application was tested both on Linux and Windows platforms and in both cases latency observed by the user in most common interaction like scrolling, annotation and change of displayed plane was very small.Thanks to utilization of many modern packages and methodologies developed package is providing convenient visualization in rapid prototyping with medical image segmentation algorithms. Application also is easily extendable and will be included in medical image segmentation framework that is currently in development.
Źródło:
Zeszyty Naukowe Warszawskiej Wyższej Szkoły Informatyki; 2021, 15, 25; 57-67
1896-396X
2082-8349
Pojawia się w:
Zeszyty Naukowe Warszawskiej Wyższej Szkoły Informatyki
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Encapsulation of image metadata for ease of retrieval and mobility
Autorzy:
Woods, Nancy
Robert, Charles
Powiązania:
https://bibliotekanauki.pl/articles/117866.pdf
Data publikacji:
2019
Wydawca:
Polskie Towarzystwo Promocji Wiedzy
Tematy:
automatic image annotation
image tagging
metadata
automatyczna adnotacja obrazu
znakowanie obrazów
metadane
Opis:
Increasing proliferation of images due to multimedia capabilities of hand-held devices has resulted in loss of source information resulting from inherent mobility. These images are cumbersome to search out once stored away from their original source because they drop their descriptive data. This work, developed a model to encapsulate descriptive metadata into the Exif section of image header for effective retrieval and mobility. The resulting metadata used for retrieval purposes was mobile, searchable and non-obstructive.
Źródło:
Applied Computer Science; 2019, 15, 1; 62-73
1895-3735
Pojawia się w:
Applied Computer Science
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Auditory Display Applied to Research in Music and Acoustics
Autorzy:
Kostek, B.
Powiązania:
https://bibliotekanauki.pl/articles/176425.pdf
Data publikacji:
2014
Wydawca:
Polska Akademia Nauk. Czytelnia Czasopism PAN
Tematy:
auditory display
music
acoustics
music technology
music information retrieval
sonification
music annotation
Opis:
This paper presents a relationship between Auditory Display (AD) and the domains of music and acoustics. First, some basic notions of the Auditory Display area are shortly outlined. Then, the research trends and system solutions within the fields of music technology, music information retrieval and music recommendation and acoustics that are within the scope of AD are discussed. Finally, an example of AD solution based on gaze tracking that may facilitate music annotation process is shown. The paper concludes with a few remarks about directions for further research in the domains discussed.
Źródło:
Archives of Acoustics; 2014, 39, 2; 203-214
0137-5075
Pojawia się w:
Archives of Acoustics
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Experimental Corpus of the Lithuanian Local Dialect of Punsk in Poland. Examples of the Lexical and Semantic Annotation
Autorzy:
Roszko, Danuta
Powiązania:
https://bibliotekanauki.pl/articles/677261.pdf
Data publikacji:
2013
Wydawca:
Polska Akademia Nauk. Instytut Slawistyki PAN
Tematy:
corpora
annotation
Lithuanian local dialect of Punsk in Poland
experimental dialectal corpus
Opis:
Experimental Corpus of the Lithuanian Local Dialect of Punsk in Poland. Examples of the Lexical and Semantic AnnotationIn the article the author describes the experimental corpus of the Lithuanian local dialect of Puńsk in Poland (ECorp-of-Punsk). It is the first corpus of this type for the Lithuanian local dialect. The corpus consists of three subcorpora. The first one (referred to as fundamental) contains utterances given by Lithuanians in the local dialect, the second one – utterances given by Lithuanians in Polish, the third one – aligned Polish-dialectal texts.  The texts recorded in the years 1986–2012 have been included in the Ecorp-of-Punsk resources.
Źródło:
Cognitive Studies; 2013, 13
2392-2397
Pojawia się w:
Cognitive Studies
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Trilingual aligned corpus – current state and new applications
Autorzy:
Dimitrova, Ludmila
Koseska, Violetta
Roszko, Danuta
Roszko, Roman
Powiązania:
https://bibliotekanauki.pl/articles/967220.pdf
Data publikacji:
2014
Wydawca:
Polska Akademia Nauk. Instytut Slawistyki PAN
Tematy:
aligned trilingual corpus
digital resources
event
Petri net theory
semantic annotation
state
Opis:
Trilingual aligned corpus – current state and new applicationsThis article describes current state of a trilingual parallel corpus consisted of texts in two Slavic (Bulgarian and Polish) and one Baltic language (Lithuanian). The corpus contains original literary texts (fiction, novels, and short stories) in one of the three languages with translations to the other two, and texts in other languages translated into Bulgarian, Polish, and Lithuanian. A part of the texts are aligned at the sentence level. The authors propose a semantic annotation of verbs appearing in these aligned texts that will facilitate contrastive studies of natural languages. A theoretical background for the proposed semantic annotation is briefly also discussed.
Źródło:
Cognitive Studies; 2014, 14
2392-2397
Pojawia się w:
Cognitive Studies
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Language resources for named entity annotation in the National Corpus of Polish
Autorzy:
Savary, A.
Piskorski, J.
Powiązania:
https://bibliotekanauki.pl/articles/206388.pdf
Data publikacji:
2011
Wydawca:
Polska Akademia Nauk. Instytut Badań Systemowych PAN
Tematy:
natural language processing
proper names
named entities
corpus annotation
Polish National Corpus
SProUT
Opis:
We present the named entity annotation subtask of a project aiming at creating the National Corpus of Polish. We summarize the annotation requirements defined for this corpus, and we discuss how existing lexical resources and grammars for named entity recognition for Polish have been adapted to meet those requirements. We show detailed results of the corpus annotation using the information extraction platform SProUT. We also analyze the errors committed by our knowledge-based method and suggest its further improvements.
Źródło:
Control and Cybernetics; 2011, 40, 2; 361-391
0324-8569
Pojawia się w:
Control and Cybernetics
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Multilingual digital resources with Bulgarian language
Autorzy:
Dimitrova, Ludmila
Powiązania:
https://bibliotekanauki.pl/articles/677179.pdf
Data publikacji:
2010
Wydawca:
Polska Akademia Nauk. Instytut Slawistyki PAN
Tematy:
corpora (parallel
comparable
aligned)
corpus annotation
digital dictionaries
lexical databases
morpho-syntactic specifications
Opis:
Multilingual digital resources with Bulgarian languageThe paper presents in brief Bulgarian language resources as a part of multilingual digital resources developed in the frame of some international projects, among them parallel annotated and aligned corpora, comparable corpora, morpho-syntactic specifications for corpora annotation and dictionaries encoding, lexicons, lexical databases, and electronic dictionaries.
Źródło:
Cognitive Studies; 2010, 10
2392-2397
Pojawia się w:
Cognitive Studies
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Challenges of annotation and analysis in computer-assisted language comparison: A case study on Burmish languages
Autorzy:
Hill, Nathan W.
List, Johann Mattis
Powiązania:
https://bibliotekanauki.pl/articles/1121461.pdf
Data publikacji:
2017
Wydawca:
Uniwersytet im. Adama Mickiewicza w Poznaniu
Tematy:
historical linguistics
linguistic reconstruction
burmish languages
annotation
analy-sis
computer-assisted language comparison
Opis:
The use of computational methods in comparative linguistics is growing in popularity. The increasing deployment of such methods draws into focus those areas in which they remain inadequate as well as those areas where classical approaches to language comparison are untransparent and inconsistent. In this paper we illustrate specific challenges which both computational and classical approaches encounter when studying South-East Asian languages. With the help of data from the Burmish language family we point to the challenges resulting from missing annotation standards and insufficient methods for analysis and we illustrate how to tackle these problems within a computer-assisted framework in which computational approaches are used to pre-analyse the data while linguists attend to the detailed analyses.
Źródło:
Yearbook of the Poznań Linguistic Meeting; 2017, 3, 1; 47-76
2449-7525
Pojawia się w:
Yearbook of the Poznań Linguistic Meeting
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Application of multilingual corpus in contrastive studies (on the example of the Bulgarian-Polish-Lithuanian parallel corpus)
Autorzy:
Dimitrova, Ludmila
Koseska-Toszewa, Violetta
Roszko, Danuta
Roszko, Roman
Powiązania:
https://bibliotekanauki.pl/articles/677184.pdf
Data publikacji:
2010
Wydawca:
Polska Akademia Nauk. Instytut Slawistyki PAN
Tematy:
multilingual electronic corpora
parallel and comparable corpora
corpus annotation
lexical databases
multilingual electronic dictionaries
Opis:
Application of multilingual corpus in contrastive studies (on the example of the Bulgarian-Polish-Lithuanian parallel corpus)In this paper we present applications of a trilingual corpus in language research. Comparative and contrastive studies of Polish and Bulgarian as well as Polish and Lithuanian have been already conducted, but up to the best of our knowledge no such studies exist for Bulgarian and Lithuanian. On the one hand, it is interesting to note that two Slavic languages are compared to a Baltic language (Lithuanian). On the other hand, the three languages are marginally present in the EU because of the later ascension of the three countries to the EU. The paper shortly describes the first electronic Bulgarian–Polish–Lithuanian experimental corpus, currently under development only for research. We also focus our attention on the morphosyntactic annotation of the parallel trilingual corpus according to the Corpus Encoding Standard: we present a review of the Part-of-Speech (POS) classification of the participle in the three languages – Bulgarian, Polish, and Lithuanian in comparison to another POS, the adjective. We briefly discuss tagsets for corpus annotation from the point of view of possible unification in the future with some examples.
Źródło:
Cognitive Studies; 2010, 10
2392-2397
Pojawia się w:
Cognitive Studies
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Web-Application for the Presentation of Bilingual Corpora (Focusing on Bulgarian as One of the Two Paired Languages)
Autorzy:
Dimitrova, Ludmila
Dutsova, Ralitsa
Powiązania:
https://bibliotekanauki.pl/articles/677223.pdf
Data publikacji:
2013
Wydawca:
Polska Akademia Nauk. Instytut Slawistyki PAN
Tematy:
parallel corpus
aligned corpus
concordance
linguistic annotation
lemmatization
POS-tagging
web-interface
web-application
Opis:
Web-Application for the Presentation of Bilingual Corpora (Focusing on Bulgarian as One of the Two Paired Languages)This paper briefly presents a web-application for the presentation of bilingual aligned corpora focusing on Bulgarian as one the two paired languages. The focus is given to the description of the software tools and user interface. The software is developed in IMI-BAS and will be hosted on a server there. Some examples of the usage of the web-application for the presentation of a Bulgarian-Polish aligned corpus are included.
Źródło:
Cognitive Studies; 2013, 13
2392-2397
Pojawia się w:
Cognitive Studies
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Annotating a non-model plant genome – a study on the narrow-leafed lupin
Autorzy:
Zielezinski, A.
Potarzycki, P.
Ksiazkiewicz, M.
Karlowski, W.M.
Powiązania:
https://bibliotekanauki.pl/articles/80218.pdf
Data publikacji:
2012
Wydawca:
Polska Akademia Nauk. Czytelnia Czasopism PAN
Tematy:
genome
plant genome
pipeline
software
narrow-leaved lupin
gene annotation system
gene sequence
DNA sequence
Źródło:
BioTechnologia. Journal of Biotechnology Computational Biology and Bionanotechnology; 2012, 93, 3
0860-7796
Pojawia się w:
BioTechnologia. Journal of Biotechnology Computational Biology and Bionanotechnology
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Semantics, contrastive linguistics and parallel corpora
Autorzy:
Koseska, Violetta
Powiązania:
https://bibliotekanauki.pl/articles/967225.pdf
Data publikacji:
2014
Wydawca:
Polska Akademia Nauk. Instytut Slawistyki PAN
Tematy:
contrastive studies
online dictionary
parallel corpora
direct approach to semantics
semantic interlanguage
Petri nets
semantic annotation
Opis:
Semantics, contrastive linguistics and parallel corporaIn view of the ambiguity of the term “semantics”, the author shows the differences between the traditional lexical semantics and the contemporary semantics in the light of various semantic schools. She examines semantics differently in connection with contrastive studies where the description must necessary go from the meaning towards the linguistic form, whereas in traditional contrastive studies the description proceeded from the form towards the meaning. This requirement regarding theoretical contrastive studies necessitates construction of a semantic interlanguage, rather than only singling out universal semantic categories expressed with various language means. Such studies can be strongly supported by parallel corpora. However, in order to make them useful for linguists in manual and computer translations, as well as in the development of dictionaries, including online ones, we need not only formal, often automatic, annotation of texts, but also semantic annotation - which is unfortunately manual. In the article we focus on semantic annotation concerning time, aspect and quantification of names and predicates in the whole semantic structure of the sentence on the example of the “Polish-Bulgarian-Russian parallel corpus”.
Źródło:
Cognitive Studies; 2014, 14
2392-2397
Pojawia się w:
Cognitive Studies
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
L’annotazione di testi storico-letterari al tempo dei social media
The Annotation of Historical and Literary Texts in the Age of Social Media
Autorzy:
Boschetti, Federico
Del Grosso, Angelo Mario
Powiązania:
https://bibliotekanauki.pl/articles/446669.pdf
Data publikacji:
2020-07-31
Wydawca:
Wydawnictwo Adam Marszałek
Tematy:
digital philology
collaborative annotation
communities
digital scholarly edition
formalisation
filologia digitale
annotazione collaborativa
comunità
edizione scientifica digitale
formalizzazione
Opis:
The annotation of historical and literary texts is approached differently by traditional philologists and digital philologists. The former are concentrated on the detailed study of a given text (close reading) while the latter are focused on the study of large quantities of texts (distant reading). A structured and collaborative annotation makes it possible both to add information to particular passages of individual texts, as in a traditional linear comment, and to connect data from entire textual collections through rigorous protocols. However, the standards developed by digital philologists are not highly appreciated by traditional academics, since the effort necessary to apply the proposed technologies allegedly diverts researchers’ attention from the object of study. As opposed to this objection, we intend to highlight that it is indeed possible to maintain the precision requisite for the application of computational tools to digital resources without renouncing the annotation practices established in traditional contexts. In support of the method, we report a number of case studies of digital scientific editions whose goals include both reconstructing respective texts and encouraging the dissemination of contents and public participation in the academic debate. In particular, we will discuss the following projects: a) the stylistic annotation of three different editions of Giacomo Leopardi’s translation of the "Batracomiomachia"; b) the scientific edition of Bellini’s letters; c) the multi-level annotated edition of Bassani; and d) the comparison of Umberto Eco’s variants of his "Il nome della rosa".
Nella transizione dalla stampa al digitale, l’attività di annotazione di testi storico-letterari oscilla fra le resistenze dei tradizionalisti e le nuove pratiche dei filologi (e critici) digitali. Fra le due comunità il dialogo è difficile: gli studi filologici e letterari tradizionali, innervati dal metodo storico, si sono sempre più chiusi sul particolare, mentre i nuovi approcci, animati dal metodo scientifico, si sono sempre più aperti allo studio del generale. L’annotazione strutturata e collaborativa consente di aggiungere informazioni ai passi specifici dei singoli testi come nel tradizionale commento lineare, ma permette anche di collegare dati di intere collezioni testuali tramite protocolli rigorosi. Tuttavia gli standard elaborati dai filologi digitali sono accolti con perplessità dall’accademia, poiché lo sforzo necessario ad applicare le tecnologie proposte distrae dall’oggetto di studio. Noi intendiamo invece evidenziare come sia possibile mantenere il rigore formale necessario all’applicazione di strumenti computazionali alle risorse digitali senza rinunciare alle pratiche di annotazione stabilite nei contesti tradizionali. A sostegno del metodo, saranno descritti alcuni casi di studio relativi ad edizioni scientifiche digitali, in cui l’obiettivo principale, accanto alla ricostruzione del testo, è quello di favorire la divulgazione dei contenuti e la partecipazione del pubblico al dibattito accademico. In particolare, si illustreranno i progetti relativi a) all’annotazione stilistica delle tre diverse redazioni della traduzione di Giacomo Leopardi della "Batracomiomachia", b) all’edizione scientifica delle lettere di Bellini, c) all’edizione annotata multi-livello di Bassani e d) al confronto delle varianti delle due edizioni del "Nome della rosa" di Eco.
Źródło:
Italica Wratislaviensia; 2020, 11.1; 65-99
2084-4514
Pojawia się w:
Italica Wratislaviensia
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
About Certain Semantic Annotation in Parallel Corpora
Autorzy:
Koseska-Toszewa, Violetta
Powiązania:
https://bibliotekanauki.pl/articles/677255.pdf
Data publikacji:
2013
Wydawca:
Polska Akademia Nauk. Instytut Slawistyki PAN
Tematy:
direct approach to semantics
semantic annotation
perfective aspect
inperfective aspect
event
state
Petri nets
parallel corpora
contrastive linguistics
Opis:
About Certain Semantic Annotation in Parallel CorporaThe semantic notation analyzed in this works is contained in the second stream of semantic theories presented here – in the direct approach semantics. We used this stream in our work on the Bulgarian-Polish Contrastive Grammar. Our semantic notation distinguishes quantificational meanings of names and predicates, and indicates aspectual and temporal meanings of verbs. It relies on logical scope-based quantification and on the contemporary theory of processes, known as “Petri nets”. Thanks to it, we can distinguish precisely between a language form and its contents, e.g. a perfective verb form has two meanings: an event or a sequence of events and states, finally ended with an event. An imperfective verb form also has two meanings: a state or a sequence of states and events, finally ended with a state. In turn, names are quantified universally or existentially when they are “undefined”, and uniquely (using the iota operator) when they are “defined”. A fact worth emphasizing is the possibility of quantifying not only names, but also the predicate, and then quantification concerns time and aspect.  This is a novum in elaborating sentence-level semantics in parallel corpora. For this reason, our semantic notation is manual. We are hoping that it will raise the interest of computer scientists working on automatic methods for processing the given natural languages. Semantic annotation defined like in this work will facilitate contrastive studies of natural languages, and this in turn will verify the results of those studies, and will certainly facilitate human and machine translations.
Źródło:
Cognitive Studies; 2013, 13
2392-2397
Pojawia się w:
Cognitive Studies
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Prototypical implementation of a decentralized semantic-web-based information system for specific study programs
Prototypowa implementacja zdecentralizowanego systemu informacyjnego opartego na sieci semantycznej dla specyficznych programów studiów
Autorzy:
Jetschni, Jonas
Meister, Vera G.
Powiązania:
https://bibliotekanauki.pl/articles/590808.pdf
Data publikacji:
2016
Wydawca:
Uniwersytet Ekonomiczny w Katowicach
Tematy:
Education management
Knowledge extraction
Semantic annotation
Semantic web application
Ekstrakcja wiedzy
Semantyczna adnotacja
Semantyczna aplikacja internetowa
Zarządzanie edukacją
Opis:
The paper describes the approach and the results of an agile student’s development project. Initial point was the idea of a decentralized guide for specific study programs located in the DACH (Germany, Austria, Switzerland) region. Compared with existing systems, this guide is meant to provide relevant, and moreover commensurable information about the subject-specifics features of study programs. Prospective students ought to get support in their study decision. In addition, the system may support other stakeholders, e. g. companies searching for qualified personnel in a specific field. The basic idea is to allow the semantic enrichment of any website of a university on study programs (i.e. in a decentralized manner), which is based on an ontology for Information Systems study programs. Doing so, this information become web-wide accessible and may be aggregated and visualized in a web application. The single stages of development will be described from a business as well as from a technical perspective.
W artykule przedstawiono metodę oraz wyniki zwinnego projektu rozwoju edukacji studenta. Pierwotną była idea zdecentralizowanego poradnika dla konkretnych programów studiów, zlokalizowanych w regionie DACH (Niemcy, Austria, Szwajcaria). W porównaniu z istniejącymi systemami, analizowany w tym artykule poradnik dla studentów ma na celu zapewnienie odpowiedniej i współmiernej informacji dotyczącej programów studiów, zorientowanych na poszczególne przedmioty. Podstawową ideą jest umożliwienie semantycznego wzbogacenia każdej strony internetowej uniwersytetu o treści dotyczące programów studiów i oparte na ontologii programów studiów z dziedziny Systemów Informatycznych. W artykule poszczególne etapy rozwoju zostaną opisane z biznesowego i technicznego punktu widzenia.
Źródło:
Studia Ekonomiczne; 2016, 296; 7-21
2083-8611
Pojawia się w:
Studia Ekonomiczne
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Prolegomena do tagоwania frazemów w równoległym korpusie rosyjsko-polskim (literatura piękna) w aspekcie przekładoznawczym
Prolegomena for tagging of phrasemes in a parallel Russian-Polish corpus (literature) in translation studies
Autorzy:
Fedorushkov, Yury
Powiązania:
https://bibliotekanauki.pl/articles/481965.pdf
Data publikacji:
2018-06-30
Wydawca:
Uniwersytet Warmińsko-Mazurski w Olsztynie
Tematy:
annotation tool brat v1.3
tags for phrasemes
Verb-Noun constructions; parallelization of Russian and Polish sentences; parallel corpora
Opis:
This article considers tagging methods for parallel Russian-Polish phrasemathic objects. In particular, an opinion about the annotation tool brat v1.3.is given. This online tool offers a palette of possibilities for classifying words and phrases in parallel texts. Working with this software is largely simplified by a user-friendly interface, and therefore working with the corpus does not cause difficulties for philologists and translators who do not have programming skills. As an example of such a classification, the layout of the metadata system for tagging Russian and Polish parallel phrasemes is described. These resources allow experience to be gathered and concurrent objects to be categorized in the workshop of a translator. As an example, the article presents the tagging of Verb-Noun of the text classified as collocation phrasemes, for example, погасить свет. The status of Verb-Noun constructions is also discussed, which, according to a number of factors, relate to autonomous phrases, although with the status of “free compatibility”, for example, поехать в клуб. A number of recommendations is proposed for the configuration of parallel texts at the level of single sentences.
Źródło:
Acta Polono-Ruthenica; 2018, 2, XXIII; 55-73
1427-549X
Pojawia się w:
Acta Polono-Ruthenica
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Ocena zmian profilu ekspresji genów kandydujących w podkładkach jabłoni o odmiennym stopniu tolerancji mrozowej
Evaluation of changes in the expression profile of candidate genes in apple rootstockss with a different degree of frost tolerance .
Autorzy:
Keller-Przybyłkowicz, Sylwia
Lewandowski, Mariusz
Powiązania:
https://bibliotekanauki.pl/articles/2199254.pdf
Data publikacji:
2020-12-09
Wydawca:
Instytut Hodowli i Aklimatyzacji Roślin
Tematy:
adnotacja funkcjonalna genów
Malus domestica Borkh.
sekwencjonowanie
profil ekspresji
ilościowa reakcja amplifikacji
expression profile
gene annotation
new generation sequencing (NGS)
quantitative transcript level (qRT-PCR)
Opis:
Celem przeprowadzonych badań była identyfikacja genów sprzężonych z cechą mrozoodporności podkładek jabłoni. Ocenę zmian w poziomie ekspresji wyizolowanych genów przeprowadzono metodami RNAseq i qRT-PCR, dla podkładek zróżnicowanych po względem stopnia tolerancji mrozowej: P 66 (tolerancyjna) i M.9 (wrażliwa).W wyniku przeprowadzonych odczytów sekwencji RNA (sekwencjonowanie de novo w systemie Illumina Solid) dla w/w podkładek zidentyfikowano około 167 milionów odczytów unikatowych sekwencji, z których do wstępnych badań weryfikacyjnych wytypowano 15 o zróżnicowanym profilu ekspresji. Sekwencje poddano adnotacji funkcjonalnej. Wytypowane geny kodują: białka strukturalne i integralne błon komórkowych i wakuoli komórkowych, czynników transkrypcyjnych, białek regulujących transport międzykomórkowy i wewnątrzkomórkowy, białek hydrolizujących wiązania C-O i C-N oraz białek wiążących makro- i mikroelementy. Celem weryfikacji typu regulacji sekwencji transkryptomu uzyskanych z sekwencjonowania nowej generacji (NGS), dla tych samych prób przeprowadzono ilościową analizę transkryptu genów (qRT-PCR). Spośród badanych genów, trzy reprezentowały identyczny typ regulacji w badanych układach eksperymentalnych RNA-seq i qRT-PCR. Wytypowane geny stanowią potencjalne sekwencje kandydujące do sporządzenia markerów funkcjonalnych, umożliwiających wczesną selekcję podkładek jabłoni tolerancyjnych na mróz.
The aim of presented study was to identify putative candidate genes associated with apple rootstock winter hardiness. The assessment of changes in expression profile of isolated differentially expressed genes, was performed using two subsequent experiments: RNAseq (based on New Generation Sequencing, NGS) and qRT-PCR (Real Time transcript amplification). In terms of traits of interests two apple rootstocks P 66 (frost tolerant) and M.9 (frost sensitive) were evaluated. As a result of the RNA sequence readings (de novo sequencing, Illumina Solid system), approximately 167 million reads of unique sequences were identified. Finally, fifteen functionally annotated expressed tags, representing different expression profile, were chosen. Selected putative genes coding: structural and integral proteins of cell membranes and cellular vacuoles, transcription factors, proteins regulating intercellular and intracellular transport, C-O and C-N bonds hydrolyzes, and proteins binding macro- and microelements. In order to verify the type of regulation of the transcriptome sequences obtained in NGS technology, qRT-PCR tests were carried out for the same samples layout. Three of studied sequences, represented identical type of regulation in both RNA-seq and qRT-PCR experiments. The selected genes seems to represent potential candidate sequences (functional molecular markers), enabling the early selection of frost-tolerant apple rootstocks.
Źródło:
Biuletyn Instytutu Hodowli i Aklimatyzacji Roślin; 2020, 291; 21-32
0373-7837
2657-8913
Pojawia się w:
Biuletyn Instytutu Hodowli i Aklimatyzacji Roślin
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Zastosowanie gier skierowanych na cel do anotacji korpusów językowych
The applications of games with a purpose used for obtaining annotated language resources
Autorzy:
Włodarczyk, Wojciech
Powiązania:
https://bibliotekanauki.pl/articles/460019.pdf
Data publikacji:
2015
Wydawca:
Fundacja Pro Scientia Publica
Tematy:
gry skierowane na cel
GWAP
crowdsourcing
human computation
przetwarzanie języka naturalnego
sztuczna inteligencja, AI-zupełne
anotacja korpusu
Wordrobe
game with a purpose
natural language processing
artificial intelligence, AI-complete
corpus annotation
Opis:
Istnienie problemów AI-zupełnych przyczyniło się do poszukiwań alternatywnych sposobów rozwiązywania problemów sztucznej inteligencji, nie opartych wyłącznie na pracy komputera. Pomimo że komunikacja jest dla ludzi czymś oczywistym, nadal nie istnieje sposób jej automatyzacji. Aktualnie powszechnie stosowanym podejściem w rozwiązywaniu problemów NLP jest podejście statystyczne, którego powodzenie zależy od wielkości korpusu językowego. Przygotowanie rzetelnego zbioru danych jest zatem kluczowym aspektem tworzenia statystycznego systemu sztucznej inteligencji. Z uwagi na zaangażowanie specjalistów jest to proces czasochłonny i kosztowny. Jednym z obiecujących podejść, pomagających zredukować czas i koszt tworzenia otagowanego korpusu, jest korzystanie z gier skierowanych na cel. Ambicją niniejszej pracy jest przybliżenie poszczególnych etapów tworzenia gry przeznaczonej do pozyskania zasobów językowych oraz omówienie skuteczności jej działania. Analiza ta zostanie przeprowadzona na podstawie kolekcji gier Wordrobe wspierających anotacje korpusu języka naturalnego.
The existence of AI-complete problems has led to a growth in research of alternative ways of solving artificial intelligence problems, which are not based solely on the computer. Although for us communication is obvious, there is still no way automate it. The current widely-used approach to solving the problems of NLP is a statistical one, whose success depends on the size of the training corpus. The preparation of a reliable set of data is therefore a key aspect in creating an artificial intelligence statistical system. Due to the involvement of a large number of specialists this is a very time-consuming and expensive process. One promising approache in helping reduce the time and cost of creating a tagged corpus is the use of games with a purpose. The objective of this paper is to present the stages of creating games with a purpose used for obtaining annotated language resources and to discuss its effectiveness. This analysis will be done based on the Wordrobe project, a collection of games created to support the gathering of an annotated corpus of natural language.
Źródło:
Ogrody Nauk i Sztuk; 2015, 5; 112-220
2084-1426
Pojawia się w:
Ogrody Nauk i Sztuk
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
On Semantic Annotation in Clarin-PL Parallel Corpora
Autorzy:
Koseska-Toszewa, Violetta
Roszko, Roman
Powiązania:
https://bibliotekanauki.pl/articles/677121.pdf
Data publikacji:
2015
Wydawca:
Polska Akademia Nauk. Instytut Slawistyki PAN
Tematy:
manual semantic annotation
semantic definiteness/indefiniteness category
logical quantification
uniqueness
existentiality
universality
elements of the semantic category of time
event
state
sequence of events and states finally ended with an event
Opis:
On Semantic Annotation in Clarin-PL Parallel CorporaIn the article, the authors present a proposal for semantic annotation in Clarin-PL parallel corpora: Polish-Bulgarian-Russian and Polish-Lithuanian ones. Semantic annotation of quantification is a novum in developing sentence level semantics in multilingual parallel corpora. This is why our semantic annotation is manual. The authors hope it will be interesting to IT specialists working on automatic processing of the given natural languages. Semantic annotation defined the way it is defined here will make contrastive studies of natural languages more efficient, which in turn will help verify the results of those studies, and will certainly improve human and machine translations.
Źródło:
Cognitive Studies; 2015, 15
2392-2397
Pojawia się w:
Cognitive Studies
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
[Nota o książce:] Emmanuel Desurvire, „Karol Edmund Chojecki, polski patriota, odkrywca, żołnierz, dramaturg, powieściopisarz, publicysta, bibliotekarz…”
[Annotation about the Book:] Emmanuel Desurvire, ‘Karol Edmund Chojecki, Polish Patriot, Explorer, Soldier, Poet, Playwright, Novelist, Publicist, Librarian…’
Autorzy:
Chaplain, Jacques
Powiązania:
https://bibliotekanauki.pl/articles/690304.pdf
Data publikacji:
2014
Wydawca:
Łódzkie Towarzystwo Naukowe
Tematy:
Karol Edmund Chojecki (Charles Edmond)
biografia
literatura polska XIX wieku
literatura francuska XIX wieku
Emmanuel Desurvire
polonica zagraniczne
nota o książce
przekład
biography
Polish literature of 19th Century
French literature of 19th Century;
foreign polonica
annotation about the book
translation
Opis:
This text is a complement to the presented a year ago in the translation into Polish, Jacques Chaplain’s review which is a fragmentary overview of the three volume monograph by a worldknown physicist Emmanuel Desurvire, devoted to his ancestor, Karol Edmund Chojecki (1822−1899). Source materials, meticulously collected by the French biographer — family archives and other documents that were not printed before — enable us to get to know closely this Polish emigrant of the era of Romanticism, known in France under the pseudonym Charles Edmond, inter alia of the side of various, closer or further relationships with the representatives of French culture, science and politics of that time (such as Gustave Flaubert, Georges Sand, Louis Blanc or Georges Clemenceau), as well as his own creative achievements (in the fields of playwriting, dramaturgy, publicism and art of translation) and their twentieth century reception. Desurvire’s work — of impressive size and substantive content — has been currently published for the second time. The second edition, revised and bearing indices is completed with the two-volume supplement.
Źródło:
Prace Polonistyczne; 2014, 69; 223-225
0079-4791
Pojawia się w:
Prace Polonistyczne
Dostawca treści:
Biblioteka Nauki
Artykuł
    Wyświetlanie 1-40 z 40

    Ta witryna wykorzystuje pliki cookies do przechowywania informacji na Twoim komputerze. Pliki cookies stosujemy w celu świadczenia usług na najwyższym poziomie, w tym w sposób dostosowany do indywidualnych potrzeb. Korzystanie z witryny bez zmiany ustawień dotyczących cookies oznacza, że będą one zamieszczane w Twoim komputerze. W każdym momencie możesz dokonać zmiany ustawień dotyczących cookies