You are searching for the phrase "annotation" by criterion: Subject


Tytuł:
A COMPARISON OF THE EFFECT OF TEXTUAL, AUDIO AND TEXTUAL-PICTORIAL AND AUDIO–PICTORIAL ANNOTATIONS ON ENHANCING READING COMPREHENSION AMONG IRANIAN EFL LEARNERS
Autorzy:
Karbalaei, Alireza
Zare, Amaneh
Powiązania:
https://bibliotekanauki.pl/articles/955805.pdf
Data publikacji:
2019
Wydawca:
Uniwersytet Marii Curie-Skłodowskiej w Lublinie. IATEFL Poland Computer Special Interest Group
Tematy:
textual annotation
audio annotation
textual-pictorial annotation
audio-pictorial annotation
reading comprehension
Opis:
This study aimed to investigate the interaction between L2 readers and a reading text equipped with four different annotations or glosses: text-only, audio-only, text-picture and audio-picture annotations. The participants in the study were selected from four intact classes consisting of 100 students studying English at intermediate level at the Kish Institute of Science & Technology (olom va fonon) in Iran. After they were given a reading comprehension text, the four experimental groups were given the same reading comprehension texts with different annotations. Then, they were asked to take the same reading test as a posttest. The results of the study demonstrated that text-only and audio-only annotations were more effective than the other kinds of annotation. The results suggested that providing the new words, whether in audio or text annotation, during reading comprehension can help students to comprehend reading in an effective way. Educational implications suggest that provision of different kinds of glosses is beneficial for L2 students, although they need some scaffolding for utilizing glosses in a beneficial way.
Źródło:
Teaching English with Technology; 2019, 19, 3; 40-67
1642-1027
Pojawia się w:
Teaching English with Technology
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Bulgarian sense-annotated corpus – between the tradition and novelty
Autorzy:
Koeva, Svetla
Powiązania:
https://bibliotekanauki.pl/articles/677294.pdf
Data publikacji:
2012
Wydawca:
Polska Akademia Nauk. Instytut Slawistyki PAN
Tematy:
corpus studies
corpus annotation
annotation principles
Opis:
The Bulgarian Sense-annotated Corpus (BulSemCor) is compiled according to the general methodology established by the SemCor project. It is a subset of the Brown Corpus of Bulgarian semantically annotated with a corresponding synonym set (synset) in the Bulgarian wordnet. Unlike the bulk of sense-annotated corpora where only (sets of) content words are annotated, in BulSemCor each lexical unit has been assigned a sense. The main contributions achieved in the work on BulSemCor are briefly described in this paper: definition of an annotation schema, compilation of an input corpus, development of a sense-annotated corpus, Bulgarian wordnet enlargement.
Źródło:
Cognitive Studies; 2012, 12
2392-2397
Pojawia się w:
Cognitive Studies
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Multi-level annotation of the specialized Corpus of Dialogs of Disabled Polish Speakers
Autorzy:
Trzebińska, Joanna
Bartoszewicz, Jakub
Powiązania:
https://bibliotekanauki.pl/articles/677159.pdf
Data publikacji:
2014
Wydawca:
Polska Akademia Nauk. Instytut Slawistyki PAN
Tematy:
speech corpus
pragmatic annotation
semantic annotation
disability
Opis:
While the Polish language is relatively well represented in general-purpose corpora such as the National Polish Language Corpus, some groups of speakers are still underrepresented in reference corpora. One such group is the community of disabled people. At the same time, there is a growing need to understand how disability influences social and cognitive abilities, language in particular. In this paper, we present a specialized Corpus of Dialogs of Disabled Speakers. The process of compiling, transcribing and annotating pragmatic, semantic and morphosyntactic features is described, and applications of the Corpus are discussed.
Źródło:
Cognitive Studies; 2014, 14
2392-2397
Pojawia się w:
Cognitive Studies
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
A French corpus annotated for multiword expressions and named entities
Autorzy:
Candito, Marie
Constant, Mathieu
Ramisch, Carlos
Savary, Agata
Guillaume, Bruno
Parmentier, Yannick
Cordeiro, Silvio Ricardo
Powiązania:
https://bibliotekanauki.pl/articles/1818889.pdf
Data publikacji:
2020
Wydawca:
Polska Akademia Nauk. Instytut Podstaw Informatyki PAN
Tematy:
multiword expressions
annotation
corpus
French
Opis:
We present the enrichment of a French treebank of various genres with a new annotation layer for multiword expressions (MWEs) and named entities (NEs). Our contribution with respect to previous work on NE and MWE annotation is the particular care taken to use formal criteria, organized into decision flowcharts, shedding some light on the interactions between NEs and MWEs. Moreover, in order to cope with the well-known difficulty to draw a clear-cut frontier between compositional expressions and MWEs, we chose to use sufficient criteria only. As a result, annotated MWEs satisfy a varying number of sufficient criteria, accounting for the scalar nature of the MWE status. In addition to the span of the elements, annotation includes the subcategory of NEs (e.g., person, location) and one matching sufficient criterion for non-verbal MWEs (e.g., lexical substitution). The 3,099 sentences of the treebank were double-annotated and adjudicated, and we paid attention to cross-type consistency and compatibility with the syntactic layer. Overall inter-annotator agreement on non-verbal MWEs and NEs reached 71.1%. The released corpus contains 3,112 annotated NEs and 3,440 MWEs, and is distributed under an open license.
Źródło:
Journal of Language Modelling; 2020, 8, 2; 415-479
2299-856X
2299-8470
Pojawia się w:
Journal of Language Modelling
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
A French corpus annotated for multiword expressions and named entities
Autorzy:
Candito, Marie
Constant, Mathieu
Ramisch, Carlos
Savary, Agata
Guillaume, Bruno
Parmentier, Yannick
Cordeiro, Silvio Ricardo
Powiązania:
https://bibliotekanauki.pl/articles/1818891.pdf
Data publikacji:
2020
Wydawca:
Polska Akademia Nauk. Instytut Podstaw Informatyki PAN
Tematy:
multiword expressions
annotation
corpus
French
Opis:
We present the enrichment of a French treebank of various genres with a new annotation layer for multiword expressions (MWEs) and named entities (NEs). Our contribution with respect to previous work on NE and MWE annotation is the particular care taken to use formal criteria, organized into decision flowcharts, shedding some light on the interactions between NEs and MWEs. Moreover, in order to cope with the well-known difficulty to draw a clear-cut frontier between compositional expressions and MWEs, we chose to use sufficient criteria only. As a result, annotated MWEs satisfy a varying number of sufficient criteria, accounting for the scalar nature of the MWE status. In addition to the span of the elements, annotation includes the subcategory of NEs (e.g., person, location) and one matching sufficient criterion for non-verbal MWEs (e.g., lexical substitution). The 3,099 sentences of the treebank were double-annotated and adjudicated, and we paid attention to cross-type consistency and compatibility with the syntactic layer. Overall inter-annotator agreement on non-verbal MWEs and NEs reached 71.1%. The released corpus contains 3,112 annotated NEs and 3,440 MWEs, and is distributed under an open license.
Źródło:
Journal of Language Modelling; 2020, 8, 2; 415-479
2299-856X
2299-8470
Pojawia się w:
Journal of Language Modelling
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Design and analysis of a lean interface for Sanskrit corpus annotation
Autorzy:
Goyal, P.
Huet, G.
Powiązania:
https://bibliotekanauki.pl/articles/103855.pdf
Data publikacji:
2016
Wydawca:
Polska Akademia Nauk. Instytut Podstaw Informatyki PAN
Tematy:
Sanskrit
text segmentation
annotation
interface
Opis:
We describe an innovative computer interface designed to assist annotators in the efficient selection of segmentation solutions for proper tagging of Sanskrit corpora. The proposed solution uses a compact representation of the shared forest of all segmentations. The main idea is to represent the union of all segmentations, abstracting from the sandhi rules used, and aligning with the input sentence. We show that this representation provides an exponential saving, in both space and time. The segmentation methodology is lexicon-directed. When the lexicon does not have full coverage of the corpus vocabulary, some chunks of the input may fail to be recognized. We designed a lexicon acquisition facility, which remedies this incompleteness and makes the interface more robust. This interface has been implemented, and is currently being applied to the annotation of the Sanskrit Library corpus. Evaluation over 1,500 sentences from the Pañcatantra text shows the effectiveness of the proposed interface on real corpus data.
Źródło:
Journal of Language Modelling; 2016, 4, 2; 145-182
2299-856X
2299-8470
Pojawia się w:
Journal of Language Modelling
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Towards an event annotated corpus of Polish
Autorzy:
Marcińczuk, Michał
Oleksy, Marcin
Bernaś, Tomasz
Kocoń, Jan
Wolski, Michał
Powiązania:
https://bibliotekanauki.pl/articles/677125.pdf
Data publikacji:
2015
Wydawca:
Polska Akademia Nauk. Instytut Slawistyki PAN
Tematy:
information extraction
event recognition
corpus annotation
Opis:
The paper presents a typology of events built on the basis of the TimeML specification adapted to the Polish language. Some changes were introduced to the definition of the event categories and a motivation for event categorization was formulated. The event annotation task is presented on two levels – ontology level (language independent) and text mentions (language dependent). The various types of event mentions in Polish text are discussed. A procedure for annotation of event mentions in Polish texts is presented and evaluated. In the evaluation, a randomly selected set of documents from the Corpus of Wrocław University of Technology (called KPWr) was annotated by two linguists and the inter-annotator agreement was calculated. The evaluation was done in two iterations. After the first evaluation we revised and improved the annotation procedure. The second evaluation showed a significant improvement of the agreement between annotators. The current work was focused on annotation and categorisation of event mentions in text. Future work will focus on the description of events with a set of attributes, arguments and relations.
Źródło:
Cognitive Studies; 2015, 15
2392-2397
Pojawia się w:
Cognitive Studies
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Języki słowiańskie i litewski w korpusach równoległych Clarin-PL
Autorzy:
Koseska-Toszewa, Violetta
Roszko, Roman
Powiązania:
https://bibliotekanauki.pl/articles/678946.pdf
Data publikacji:
2016
Wydawca:
Polska Akademia Nauk. Instytut Slawistyki PAN
Tematy:
multilingual parallel corpora
semantic annotation
scope quantification
Opis:
Slavic languages and the Lithuanian language in the Clarin-PL parallel corpora
The strategic scientific purpose of Clarin ERIC and Clarin-PL is to support research in the humanities in a multicultural and multilingual Europe. Polish researchers put the emphasis on building a bridge between the Polish language and Polish linguistic technologies and other European languages and their linguistic technologies. So far, the Polish scientific community has mainly focused on Polish-English connections. Clarin-PL has been developing the first and only multilingual corpora of the Polish language in conjunction with other Slavic languages and the Lithuanian language: the Polish-Bulgarian-Russian Parallel Corpus and the Polish-Lithuanian Parallel Corpus. The parallel corpora created by the ISS PAS Corpus Linguistics and Semantics Team break through the existing “canons” and give scientists access to interlinked multilingual language resources, in the first phase limited to the languages of the three Slavic groups and the Lithuanian language. In the article, the authors present very detailed information on their original system of the semantic annotation of scope quantification in multilingual parallel corpora, hitherto unused in the subject literature. Because of the system's broad scope and originality, the semantic annotation is carried out manually. Identification of particular values of scope quantification in a sentence, and the attempts to record it presented here, are supported by long-term research conducted by an international team of linguists and computer scientists/mathematicians working on the quantification of names, time and aspect in natural languages.
Źródło:
Studia z Filologii Polskiej i Słowiańskiej; 2016, 51
2392-2435
0081-7090
Pojawia się w:
Studia z Filologii Polskiej i Słowiańskiej
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Construction of a medical corpus based on information extraction results
Autorzy:
Marciniak, M.
Mykowiecka, A.
Powiązania:
https://bibliotekanauki.pl/articles/206379.pdf
Data publikacji:
2011
Wydawca:
Polska Akademia Nauk. Instytut Badań Systemowych PAN
Tematy:
corpus
semantic annotation
clinical data
information extraction
Opis:
The paper presents a method of automatic construction of a semantically annotated corpus using the results of a rule-based information extraction (IE) application. Construction of the corpus is based on using existing programs for text tokenization and morphological analysis and combining their results with domain-related correction rules. We reuse the specialized IE system to obtain a corpus annotated on the semantic level. The texts included within the corpus are Polish free-text clinical data. We present the documents (diabetic patients' discharge records), the structure of the corpus annotation and the methods for obtaining the annotations. Initial evaluations based on the results of manual verification of a selected data subset are also presented. The corpus, once manually corrected, is designed to be used for developing supervised machine learning models for IE applications.
Źródło:
Control and Cybernetics; 2011, 40, 2; 337-360
0324-8569
Pojawia się w:
Control and Cybernetics
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Information searching for an experience management platform of the EU Pellucid project
Autorzy:
Majewska, M.
Krawczyk, K.
Słota, R.
Kitowski, J.
Hluchy, L.
Lambert, S.
Powiązania:
https://bibliotekanauki.pl/articles/1964213.pdf
Data publikacji:
2004
Wydawca:
Politechnika Gdańska
Tematy:
experience management
ontologies
information retrieval
semantic annotation
Opis:
The EU Pellucid project is developing an experience management system for public organizations with staff mobility. The paper presents an activity within the project focused on searching for information in repositories of documents. The project's background and the process of information searching are described. Ontological methods such as semantic annotation and similarity searching, as well as ontology- and full-text-based searching are presented. Monitoring of organizational repositories is discussed.
Źródło:
TASK Quarterly. Scientific Bulletin of Academic Computer Centre in Gdansk; 2004, 8, 4; 513-523
1428-6394
Pojawia się w:
TASK Quarterly. Scientific Bulletin of Academic Computer Centre in Gdansk
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Automatyczna anotacja genomu jako narzędzie biologii systemów
Automatic genome annotation as a tool of systems biology
Autorzy:
Bizukojć, M.
Powiązania:
https://bibliotekanauki.pl/articles/2070453.pdf
Data publikacji:
2009
Wydawca:
Stowarzyszenie Inżynierów i Techników Mechaników Polskich
Tematy:
anotacja
genom
Aspergillus
KAAS
KEGG
annotation
genome
Opis:
The paper presents a method for analysing the metabolism of organisms by reconstructing the metabolic network from a fully or partially sequenced genome. The analysis was performed for seven species of filamentous fungi of the genus Aspergillus using an automatic annotation server, and its results were compared with selected physiological data.
Źródło:
Inżynieria i Aparatura Chemiczna; 2009, 3; 25-27
0368-0827
Pojawia się w:
Inżynieria i Aparatura Chemiczna
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Badania aspektu w językach polskim, czeskim i rosyjskim za pomocą korpusów i baz danych (pierwsze podsumowanie tematu)
Autorzy:
Wiemer, Björn
Wrzesień-Kwiatkowska, Joanna
Łaziński, Marek
Powiązania:
https://bibliotekanauki.pl/articles/1036249.pdf
Data publikacji:
2020-11-20
Wydawca:
Wydawnictwo Uniwersytetu Śląskiego
Tematy:
aspect
verbal prefixes
aspect triples
electronic corpora
annotation
Opis:
The article is connected to the project “DiAsPol250” in which Polish is compared to Czech and Russian from the perspective of the evolution of their aspect systems (http://www.diaspol.uw.edu.pl/). We make use of existing synchronic and diachronic electronic corpora and are building a corpus of our own with annotated aspect pairs; we also create a database of aspect triplets whose role we consider as particularly important for the system. We want to assess which changes have occurred since the mid-18th century in prefixing and suffixing strategies of verb stems, both in general and by comparing particular prefixes and suffixes, especially so-called natural (vs. specialized) prefixes (according to Janda et al., 2013). The article supplies a sketch of the general premises of the project, and it summarizes our experience with existing large corpora and databases which we have been employing. We also present a case study in order to demonstrate a procedure designed to compare the distribution of the Czech prefix z- in triplets and in the corpus. This procedure is meant to check more general tendencies; it also illustrates why electronic corpora cannot be replaced in research on distributional properties and why their role does not consist simply in providing examples for the illustration of hypotheses.
Źródło:
Forum Lingwistyczne; 2020, 7; 45-58
2449-9587
2450-2758
Pojawia się w:
Forum Lingwistyczne
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
KIS: An automated attribute induction method for classification of DNA sequences
Autorzy:
Biedrzycki, R.
Arabas, J.
Powiązania:
https://bibliotekanauki.pl/articles/330979.pdf
Data publikacji:
2012
Wydawca:
Uniwersytet Zielonogórski. Oficyna Wydawnicza
Tematy:
klasyfikacja
optymalizacja
anotacja
wzorzec
classification
optimization
annotation
patterns
Opis:
This paper presents an application of methods from the machine learning domain to solving the task of DNA sequence recognition. We present an algorithm that learns to recognize groups of DNA sequences sharing common features such as sequence functionality. We demonstrate application of the algorithm to find splice sites, i.e., to properly detect donor and acceptor sequences. We compare the results with those of reference methods that have been designed and tuned to detect splice sites. We also show how to use the algorithm to find a human readable model of the IRE (Iron-Responsive Element) and to find IRE sequences. The method, although universal, yields results which are of quality comparable to those obtained by reference methods. In contrast to reference methods, this approach uses models that operate on sequence patterns, which facilitates interpretation of the results by humans.
Źródło:
International Journal of Applied Mathematics and Computer Science; 2012, 22, 3; 711-721
1641-876X
2083-8492
Pojawia się w:
International Journal of Applied Mathematics and Computer Science
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Enhancing grammar and valence resources for Akan and Ga
Autorzy:
Beermann, Dorothee
Hellan, Lars
Linde-Usiekniewicz, Jadwiga
Storch, Anne
Powiązania:
https://bibliotekanauki.pl/chapters/1040110.pdf
Data publikacji:
2020
Wydawca:
Uniwersytet Warszawski. Wydawnictwa Uniwersytetu Warszawskiego
Tematy:
digital resources
lexicon
valence
corpus annotation
Akan
Ga
Opis:
We present a case study in valence comparison between closely related Kwa languages, assessing frames and meanings of the verb ba (‘come’) in Akan with a homophonous corresponding item in Ga. The discussion draws on the Akan dictionary (Christaller 1881), a Ga valence dictionary based on (Dakubu 2009), and an online annotated corpus of Akan hosted in TypeCraft (Beermann & Mihaylov 2014). With a view to the possibility of making use of resources for one language in the development of resources for another, we demonstrate how digital resources and linguistic specifications can inform each other.
Źródło:
West African languages. Linguistic theory and communication; 166-185
9788323546313
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Annogene: Restful Web Service for Annotating Genomic Features
Autorzy:
Tomski, A.
Piechota, M.
Przewłocki, R.
Powiązania:
https://bibliotekanauki.pl/articles/108641.pdf
Data publikacji:
2014
Wydawca:
Społeczna Akademia Nauk w Łodzi
Tematy:
BED annotation
RESTful web service
ChIP-seq peaks
Opis:
Modern high-throughput sequencing techniques generate a constantly increasing amount of genomic data from eukaryotes. The main problem is quickly identifying the data that may provide information about the nature of intracellular processes, such as the targeting of transcription factor-binding sites. Typically, thousands of peaks or signals are found across the genome and the nearby genes must be annotated. We introduce AnnoGene, a web service for annotating genomic features. AnnoGene was implemented in a representational state transfer (REST) architectural style. The program searches for the gene nearest to the center of a genomic position. Subsequently, the location and annotations of the gene are shown. The tool can be downloaded and run on a local computer, but it was designed to be a web service. AnnoGene is freely available through a web browser. Moreover, our paper covers examples of the REST clients written in the Python, R and Java programming languages. AnnoGene only requires genomic positions from the user. Even when annotating several thousand positions, the output is typically ready in a few seconds. Moreover, this tool supports Seqinspector, a web tool for finding regulators of the genes.
Źródło:
Journal of Applied Computer Science Methods; 2014, 6 No. 2; 101-110
1689-9636
Pojawia się w:
Journal of Applied Computer Science Methods
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Experimental Polish-Lithuanian Corpus with the Semantic Annotation Elements
Autorzy:
Roszko, Danuta
Roszko, Roman
Powiązania:
https://bibliotekanauki.pl/articles/677259.pdf
Data publikacji:
2013
Wydawca:
Polska Akademia Nauk. Instytut Slawistyki PAN
Tematy:
corpora
parallel and comparable corpora
annotation
Polish
Lithuanian
Opis:
In the article the authors present the experimental Polish-Lithuanian corpus (ECorpPL-LT), created to support Polish-Lithuanian theoretical contrastive studies and a Polish-Lithuanian electronic dictionary, and as an aid for sworn translators. The semantic annotation being introduced into ECorpPL-LT is extremely useful in Polish-Lithuanian contrastive studies and also proves helpful in translation work.
Źródło:
Cognitive Studies; 2013, 13
2392-2397
Pojawia się w:
Cognitive Studies
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
The method of automatic summarization from different sources
Autorzy:
Shakhovska, N.
Cherna, T.
Powiązania:
https://bibliotekanauki.pl/articles/411243.pdf
Data publikacji:
2016
Wydawca:
Polska Akademia Nauk. Oddział w Lublinie PAN
Tematy:
annotation
abstracting
national system of abstracting
heterogeneous data
analysis
Opis:
This article analyzes the technology of automatic text abstracting and annotation. The role of annotation in automatic search and classification of scientific articles is described. An algorithm for summarizing natural language documents using the concept of importance coefficients is developed. This concept makes it possible to take into account the peculiarities of the subject areas and topics found in different kinds of documents. A method for generating abstracts of a single document based on frequency analysis is developed. The recognition elements for unstructured text analysis are given. A method for pre-processing analysis of several documents is developed. This technique simultaneously considers both statistical approaches to abstracting and the importance of terms in a particular subject domain. The quality of the generated abstracts is evaluated. An expert evaluation of the developed system was conducted, for texts in Ukrainian only. The abstract generated by the developed system received a higher aggregate score on all criteria. The architecture of the summarization system is constructed. The CASE tool AllFusion ERwin Data Modeler was used to build the information system model. The database schema for storing information was built. The system is designed to work primarily with Ukrainian texts, which gives a significant advantage, since most modern systems are still oriented toward English texts.
Źródło:
ECONTECHMOD : An International Quarterly Journal on Economics of Technology and Modelling Processes; 2016, 5, 1; 103-109
2084-5715
Pojawia się w:
ECONTECHMOD : An International Quarterly Journal on Economics of Technology and Modelling Processes
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Robust Audio Watermarks in Frequency Domain
Autorzy:
Dymarski, P.
Markiewicz, R.
Powiązania:
https://bibliotekanauki.pl/articles/308461.pdf
Data publikacji:
2014
Wydawca:
Instytut Łączności - Państwowy Instytut Badawczy
Tematy:
annotation watermarking
audio watermarking
digital signature
dirty paper codes
LDPC
Opis:
In this paper an audio watermarking technique is presented, using log-spectrum, dirty paper codes and LDPC for watermark embedding. This technique may be used as a digital communication channel, transmitting data at about 40 b/s. It may also be applied for hiding a digital signature, e.g., for copyright protection purposes. Robustness of the watermarks against audio signal compression, resampling and transmission through an acoustic channel is tested.
Źródło:
Journal of Telecommunications and Information Technology; 2014, 2; 12-21
1509-4553
1899-8852
Pojawia się w:
Journal of Telecommunications and Information Technology
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Erotetic Reasoning Corpus. A data set for research on natural question processing
Autorzy:
Łupkowski, P.
Urbański, M.
Wiśniewski, A.
Błądek, W.
Juska, A.
Kostrzewa, A.
Pankow, D.
Paluszkiewicz, K.
Ignaszak, O.
Urbańska, J.
Żyluk, N.
Gajda, A.
Marciniak, B.
Powiązania:
https://bibliotekanauki.pl/articles/103809.pdf
Data publikacji:
2017
Wydawca:
Polska Akademia Nauk. Instytut Podstaw Informatyki PAN
Tematy:
question
logic of question
question processing
erotetic reasoning
corpus annotation
Opis:
The aim of this paper is to present the Erotetic Reasoning Corpus (ERC) which constitutes a data set for research on natural question processing. We describe the theoretical background, linguistic data and tags used for the annotation process. We also discuss the potential areas in which the ERC can be exploited.
Źródło:
Journal of Language Modelling; 2017, 5, 3; 607-631
2299-856X
2299-8470
Pojawia się w:
Journal of Language Modelling
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Abstrakt i adnotacja jako element opisu dokumentu w bazie iSybislaw
Abstract and annotation as an element of bibliographic description in the iSybislaw database
Autorzy:
Kowalski, Paweł
Powiązania:
https://bibliotekanauki.pl/articles/965802.pdf
Data publikacji:
2014-12-31
Wydawca:
Polska Akademia Nauk. Instytut Slawistyki PAN
Tematy:
abstract
annotation
bibliographic description
database
information retrieval system
iSybislaw
summary
Opis:
An abstract (sometimes called a summary) of a scientific publication is a brief text that contains keywords. It is one of the elements of a bibliographic description in the bibliographic database iSybislaw, a modern information retrieval system. The paper presents definitions of terms such as abstract, annotation and summary, along with their constitutive elements. The characteristics of such short texts entered in the iSybislaw database in the fields Abstract and Abstract 2 are also given. Based on examples excerpted from the iSybislaw system, a typology of short texts that form part of the database's bibliographic description is proposed. The material makes it possible to distinguish three kinds of texts used in the iSybislaw database: annotations, abstracts and biographic annotations.
The article analyses short texts that constitute one of the elements of the bibliographic description in the iSybislaw retrieval system. In scholarly practice, various terms are used for such texts (abstract, annotation, summary). The author gives their definitions and identifies their constitutive elements. On the basis of examples excerpted from the iSybislaw system, he presents their typology and discusses their place and functions in the bibliographic description.
Źródło:
Studia z Filologii Polskiej i Słowiańskiej; 2014, 49; 88-98
2392-2435
0081-7090
Pojawia się w:
Studia z Filologii Polskiej i Słowiańskiej
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
3D Medical Segmentation Visualization in Julia with MedEye3d
Autorzy:
Mitura, Jakub
Chrapko, Beata
Powiązania:
https://bibliotekanauki.pl/articles/1838173.pdf
Data publikacji:
2021-12
Wydawca:
Warszawska Wyższa Szkoła Informatyki
Tematy:
OpenGL
Computer Tomography
PET/CT
medical image annotation
medical image visualization
Description:
MedEye3d is a Julia package designed to simplify the visualization of segmentations in a three-dimensional setting. The motivation for developing it was to provide the rapidly growing Julia scientific community with a tool for research on three-dimensional medical images. The package builds on multiple open-source software packages, most prominently using the OpenGL specification to enable GPU acceleration. The application was tested on both Linux and Windows, and in both cases the latency observed by the user in the most common interactions, such as scrolling, annotation, and changing the displayed plane, was very small. Thanks to its use of many modern packages and methodologies, the package provides convenient visualization for rapid prototyping of medical image segmentation algorithms. The application is also easily extendable and will be included in a medical image segmentation framework that is currently in development.
Source:
Zeszyty Naukowe Warszawskiej Wyższej Szkoły Informatyki; 2021, 15, 25; 57-67
1896-396X
2082-8349
Appears in:
Zeszyty Naukowe Warszawskiej Wyższej Szkoły Informatyki
Content provider:
Biblioteka Nauki
Article
Title:
Encapsulation of image metadata for ease of retrieval and mobility
Authors:
Woods, Nancy
Robert, Charles
Related links:
https://bibliotekanauki.pl/articles/117866.pdf
Publication date:
2019
Publisher:
Polskie Towarzystwo Promocji Wiedzy
Subjects:
automatic image annotation
image tagging
metadata
automatyczna adnotacja obrazu
znakowanie obrazów
metadane
Description:
The increasing proliferation of images, driven by the multimedia capabilities of hand-held devices, has led to a loss of source information as a result of their inherent mobility. Such images are cumbersome to find once stored away from their original source because they lose their descriptive data. This work developed a model that encapsulates descriptive metadata in the Exif section of the image header for effective retrieval and mobility. The resulting metadata, used for retrieval purposes, was mobile, searchable and non-obstructive.
Source:
Applied Computer Science; 2019, 15, 1; 62-73
1895-3735
Appears in:
Applied Computer Science
Content provider:
Biblioteka Nauki
Article
Title:
Auditory Display Applied to Research in Music and Acoustics
Authors:
Kostek, B.
Related links:
https://bibliotekanauki.pl/articles/176425.pdf
Publication date:
2014
Publisher:
Polska Akademia Nauk. Czytelnia Czasopism PAN
Subjects:
auditory display
music
acoustics
music technology
music information retrieval
sonification
music annotation
Description:
This paper presents the relationship between Auditory Display (AD) and the domains of music and acoustics. First, some basic notions of the Auditory Display area are briefly outlined. Then, the research trends and system solutions within the fields of music technology, music information retrieval, music recommendation and acoustics that fall within the scope of AD are discussed. Finally, an example of an AD solution based on gaze tracking that may facilitate the music annotation process is shown. The paper concludes with a few remarks about directions for further research in the domains discussed.
Source:
Archives of Acoustics; 2014, 39, 2; 203-214
0137-5075
Appears in:
Archives of Acoustics
Content provider:
Biblioteka Nauki
Article
Title:
Experimental Corpus of the Lithuanian Local Dialect of Punsk in Poland. Examples of the Lexical and Semantic Annotation
Authors:
Roszko, Danuta
Related links:
https://bibliotekanauki.pl/articles/677261.pdf
Publication date:
2013
Publisher:
Polska Akademia Nauk. Instytut Slawistyki PAN
Subjects:
corpora
annotation
Lithuanian local dialect of Punsk in Poland
experimental dialectal corpus
Description:
In this article the author describes the experimental corpus of the Lithuanian local dialect of Puńsk in Poland (ECorp-of-Punsk). It is the first corpus of this type for the Lithuanian local dialect. The corpus consists of three subcorpora. The first (referred to as fundamental) contains utterances given by Lithuanians in the local dialect, the second contains utterances given by Lithuanians in Polish, and the third contains aligned Polish-dialectal texts. Texts recorded in the years 1986–2012 have been included in the ECorp-of-Punsk resources.
Source:
Cognitive Studies; 2013, 13
2392-2397
Appears in:
Cognitive Studies
Content provider:
Biblioteka Nauki
Article
Title:
Trilingual aligned corpus – current state and new applications
Authors:
Dimitrova, Ludmila
Koseska, Violetta
Roszko, Danuta
Roszko, Roman
Related links:
https://bibliotekanauki.pl/articles/967220.pdf
Publication date:
2014
Publisher:
Polska Akademia Nauk. Instytut Slawistyki PAN
Subjects:
aligned trilingual corpus
digital resources
event
Petri net theory
semantic annotation
state
Description:
This article describes the current state of a trilingual parallel corpus consisting of texts in two Slavic languages (Bulgarian and Polish) and one Baltic language (Lithuanian). The corpus contains original literary texts (fiction, novels, and short stories) in one of the three languages with translations into the other two, as well as texts in other languages translated into Bulgarian, Polish, and Lithuanian. Some of the texts are aligned at the sentence level. The authors propose a semantic annotation of verbs appearing in these aligned texts that will facilitate contrastive studies of natural languages. The theoretical background for the proposed semantic annotation is also briefly discussed.
Source:
Cognitive Studies; 2014, 14
2392-2397
Appears in:
Cognitive Studies
Content provider:
Biblioteka Nauki
Article
