Search results for the phrase "annotation" by criterion: Subject


Title:
Annotating a non-model plant genome – a study on the narrow-leafed lupin
Authors:
Zielezinski, A.
Potarzycki, P.
Ksiazkiewicz, M.
Karlowski, W.M.
Links:
https://bibliotekanauki.pl/articles/80218.pdf
Publication date:
2012
Publisher:
Polska Akademia Nauk. Czytelnia Czasopism PAN
Subjects:
genome
plant genome
pipeline
software
narrow-leaved lupin
gene annotation system
gene sequence
DNA sequence
Source:
BioTechnologia. Journal of Biotechnology Computational Biology and Bionanotechnology; 2012, 93, 3
0860-7796
Appears in:
BioTechnologia. Journal of Biotechnology Computational Biology and Bionanotechnology
Content provider:
Biblioteka Nauki
Article
Title:
Encapsulation of image metadata for ease of retrieval and mobility
Authors:
Woods, Nancy
Robert, Charles
Links:
https://bibliotekanauki.pl/articles/117866.pdf
Publication date:
2019
Publisher:
Polskie Towarzystwo Promocji Wiedzy
Subjects:
automatic image annotation
image tagging
metadata
Description:
The increasing proliferation of images due to the multimedia capabilities of hand-held devices has resulted in a loss of source information caused by the images' inherent mobility. These images are cumbersome to find once stored away from their original source because they lose their descriptive data. This work developed a model that encapsulates descriptive metadata in the Exif section of the image header for effective retrieval and mobility. The resulting metadata used for retrieval purposes was mobile, searchable and non-obstructive.
Source:
Applied Computer Science; 2019, 15, 1; 62-73
1895-3735
Appears in:
Applied Computer Science
Content provider:
Biblioteka Nauki
Article
Title:
Zastosowanie gier skierowanych na cel do anotacji korpusów językowych
The applications of games with a purpose used for obtaining annotated language resources
Authors:
Włodarczyk, Wojciech
Links:
https://bibliotekanauki.pl/articles/460019.pdf
Publication date:
2015
Publisher:
Fundacja Pro Scientia Publica
Subjects:
GWAP
crowdsourcing
human computation
Wordrobe
game with a purpose
natural language processing
artificial intelligence, AI-complete
corpus annotation
Description:
The existence of AI-complete problems has led to growing research into alternative ways of solving artificial intelligence problems that are not based solely on the work of a computer. Although communication is obvious to us as humans, there is still no way to automate it. The approach currently in wide use for solving NLP problems is a statistical one, whose success depends on the size of the training corpus. The preparation of a reliable data set is therefore a key aspect of creating a statistical artificial intelligence system. Because it involves a large number of specialists, this is a very time-consuming and expensive process. One promising approach to reducing the time and cost of creating a tagged corpus is the use of games with a purpose. The objective of this paper is to present the stages of creating a game with a purpose used for obtaining annotated language resources and to discuss its effectiveness. This analysis is based on the Wordrobe project, a collection of games created to support the annotation of a natural language corpus.
Source:
Ogrody Nauk i Sztuk; 2015, 5; 112-220
2084-1426
Appears in:
Ogrody Nauk i Sztuk
Content provider:
Biblioteka Nauki
Article
Title:
Badania aspektu w językach polskim, czeskim i rosyjskim za pomocą korpusów i baz danych (pierwsze podsumowanie tematu)
[Research on aspect in Polish, Czech and Russian using corpora and databases (a first summary of the topic)]
Authors:
Wiemer, Björn
Wrzesień-Kwiatkowska, Joanna
Łaziński, Marek
Links:
https://bibliotekanauki.pl/articles/1036249.pdf
Publication date:
2020-11-20
Publisher:
Wydawnictwo Uniwersytetu Śląskiego
Subjects:
aspect
verbal prefixes
aspect triples
electronic corpora
annotation
Description:
The article is connected to the project “DiAsPol250” in which Polish is compared to Czech and Russian from the perspective of the evolution of their aspect systems (http://www.diaspol.uw.edu.pl/). We make use of existing synchronic and diachronic electronic corpora and are building a corpus of our own with annotated aspect pairs; we also create a database of aspect triplets whose role we consider as particularly important for the system. We want to assess which changes have occurred since the mid-18th century in prefixing and suffixing strategies of verb stems, both in general and by comparing particular prefixes and suffixes, especially so-called natural (vs. specialized) prefixes (according to Janda et al., 2013). The article supplies a sketch of the general premises of the project, and it summarizes our experience with existing large corpora and databases which we have been employing. We also present a case study in order to demonstrate a procedure designed to compare the distribution of the Czech prefix z- in triplets and in the corpus. This procedure is meant to check more general tendencies; it also illustrates why electronic corpora cannot be replaced in research on distributional properties and why their role does not consist simply in providing examples for the illustration of hypotheses.
Source:
Forum Lingwistyczne; 2020, 7; 45-58
2449-9587
2450-2758
Appears in:
Forum Lingwistyczne
Content provider:
Biblioteka Nauki
Article
Title:
Multi-level annotation of the specialized Corpus of Dialogs of Disabled Polish Speakers
Authors:
Trzebińska, Joanna
Bartoszewicz, Jakub
Links:
https://bibliotekanauki.pl/articles/677159.pdf
Publication date:
2014
Publisher:
Polska Akademia Nauk. Instytut Slawistyki PAN
Subjects:
speech corpus
pragmatic annotation
semantic annotation
disability
Description:
While the Polish language is relatively well represented in general-purpose corpora such as the National Polish Language Corpus, some groups of speakers are still underrepresented in reference corpora. One such sub-group is the community of disabled people. At the same time, there is a growing need to understand how disability influences social and cognitive abilities, language in particular. In this paper, we present a specialized Corpus of Dialogs of Disabled Speakers. We describe the process of compiling, transcribing and annotating pragmatic, semantic and morphosyntactic features, and discuss applications of the Corpus.
Source:
Cognitive Studies; 2014, 14
2392-2397
Appears in:
Cognitive Studies
Content provider:
Biblioteka Nauki
Article
Title:
Annogene: Restful Web Service for Annotating Genomic Features
Authors:
Tomski, A.
Piechota, M.
Przewłocki, R.
Links:
https://bibliotekanauki.pl/articles/108641.pdf
Publication date:
2014
Publisher:
Społeczna Akademia Nauk w Łodzi
Subjects:
BED annotation
RESTful web service
ChIP-seq peaks
Description:
Modern high-throughput sequencing techniques generate a constantly increasing amount of genomic data from eukaryotes. The main problem is quickly identifying the data that may provide information about the nature of intracellular processes, such as the targeting of transcription factor-binding sites. Typically, thousands of peaks or signals are found across the genome and the nearby genes must be annotated. We introduce AnnoGene, a web service for annotating genomic features. AnnoGene was implemented in a representational state transfer (REST) architectural style. The program searches for the gene nearest to the center of a genomic position. Subsequently, the location and annotations of the gene are shown. The tool can be downloaded and run on a local computer, but it was designed to be a web service. AnnoGene is freely available through a web browser. Moreover, our paper covers examples of REST clients written in the Python, R and Java programming languages. AnnoGene only requires genomic positions from the user. Even when annotating several thousand positions, the output is typically ready in a few seconds. In addition, this tool supports Seqinspector, a web tool for finding regulators of genes.
Source:
Journal of Applied Computer Science Methods; 2014, 6 No. 2; 101-110
1689-9636
Appears in:
Journal of Applied Computer Science Methods
Content provider:
Biblioteka Nauki
Article
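The AnnoGene abstract above describes looking up the gene nearest to the center of a genomic position. The following is only a minimal sketch of that idea, not AnnoGene's actual code or API: the `Gene` record, the `nearest_gene` function, the toy gene table and the choice to measure distance to the gene interval (zero if the center falls inside the gene) are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Iterable, Optional


@dataclass(frozen=True)
class Gene:
    """Hypothetical gene record using BED-style coordinates."""
    name: str
    chrom: str
    start: int  # 0-based, inclusive
    end: int    # exclusive


def distance_to_gene(pos: int, gene: Gene) -> int:
    """Distance from a point to a gene interval; 0 if the point lies inside."""
    if pos < gene.start:
        return gene.start - pos
    if pos >= gene.end:
        return pos - (gene.end - 1)
    return 0


def nearest_gene(chrom: str, start: int, end: int,
                 genes: Iterable[Gene]) -> Optional[Gene]:
    """Return the gene on `chrom` closest to the center of [start, end)."""
    center = (start + end) // 2
    candidates = [g for g in genes if g.chrom == chrom]
    if not candidates:
        return None  # no gene annotated on this chromosome
    return min(candidates, key=lambda g: distance_to_gene(center, g))


# Toy annotation table (made-up coordinates, for illustration only).
GENES = [
    Gene("geneA", "chr1", 1_000, 2_000),
    Gene("geneB", "chr1", 5_000, 6_000),
    Gene("geneC", "chr2", 1_500, 2_500),
]

# A peak at chr1:4200-4400 has its center at 4300, which is closer to geneB.
peak_hit = nearest_gene("chr1", 4_200, 4_400, GENES)
```

A linear scan is enough for a sketch; a service handling thousands of positions would more plausibly sort genes per chromosome and use binary search or an interval index.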
Title:
The method of automatic summarization from different sources
Authors:
Shakhovska, N.
Cherna, T.
Links:
https://bibliotekanauki.pl/articles/411243.pdf
Publication date:
2016
Publisher:
Polska Akademia Nauk. Oddział w Lublinie PAN
Subjects:
annotation
abstracting
national system of abstracting
heterogeneous data
analysis
Description:
This article analyzes the technology of automatic text abstracting and annotation. The role of annotation in automatic search and classification of scientific articles is described. An algorithm for summarizing natural-language documents using the concept of importance coefficients is developed. This concept makes it possible to take into account the peculiarities of the subject areas and topics found in different kinds of documents. A method for generating abstracts of a single document based on frequency analysis is developed. The recognition elements for unstructured text analysis are given. A method for pre-processing analysis of several documents is developed. This technique simultaneously considers both statistical approaches to abstracting and the importance of terms in a particular subject domain. The quality of the generated abstracts is evaluated. An expert evaluation of the developed system was conducted, only for texts in Ukrainian. The abstract produced by the developed system received a higher aggregate score on all criteria. The architecture of the summarization system is built. The CASE tool AllFusion ERwin Data Modeler was used to build the information system model. A database schema for storing the information was built. The system is designed to work primarily with Ukrainian texts, which gives a significant advantage, since most modern systems are still oriented toward English texts.
Source:
ECONTECHMOD : An International Quarterly Journal on Economics of Technology and Modelling Processes; 2016, 5, 1; 103-109
2084-5715
Appears in:
ECONTECHMOD : An International Quarterly Journal on Economics of Technology and Modelling Processes
Content provider:
Biblioteka Nauki
Article
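The abstract above mentions summarization based on frequency analysis combined with importance coefficients for terms. The sketch below shows one generic way such a scheme could work; it is not the authors' algorithm. The stop list, the `importance` mapping (a stand-in for domain-specific coefficients) and the scoring rule (mean weighted term frequency per sentence) are all assumptions.

```python
import re
from collections import Counter
from typing import Dict, List, Optional

# Hypothetical, deliberately tiny stop list.
STOPWORDS = {"the", "a", "an", "of", "and", "is", "in", "to", "for", "are"}


def split_sentences(text: str) -> List[str]:
    """Split on whitespace that follows sentence-ending punctuation."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def tokenize(sentence: str) -> List[str]:
    """Lowercase word tokens with stop words removed."""
    return [w for w in re.findall(r"\w+", sentence.lower()) if w not in STOPWORDS]


def summarize(text: str, n_sentences: int = 1,
              importance: Optional[Dict[str, float]] = None) -> List[str]:
    """Pick the top-scoring sentences, returned in document order.

    A sentence's score is the mean of (term frequency * importance
    coefficient) over its tokens; unlisted terms get coefficient 1.0.
    """
    importance = importance or {}
    sentences = split_sentences(text)
    freq = Counter(w for s in sentences for w in tokenize(s))

    def score(sentence: str) -> float:
        words = tokenize(sentence)
        if not words:
            return 0.0
        return sum(freq[w] * importance.get(w, 1.0) for w in words) / len(words)

    ranked = sorted(range(len(sentences)),
                    key=lambda i: score(sentences[i]), reverse=True)
    return [sentences[i] for i in sorted(ranked[:n_sentences])]


TEXT = ("Corpus annotation is costly. "
        "Annotation tools reduce the cost of corpus annotation. "
        "The weather was nice.")

plain = summarize(TEXT)
weighted = summarize(TEXT, importance={"tools": 5.0})
```

Raising the coefficient of a domain term (here the made-up weight for "tools") shifts the selection toward sentences containing it, which is the role the abstract assigns to subject-area importance.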
Title:
Language resources for named entity annotation in the National Corpus of Polish
Authors:
Savary, A.
Piskorski, J.
Links:
https://bibliotekanauki.pl/articles/206388.pdf
Publication date:
2011
Publisher:
Polska Akademia Nauk. Instytut Badań Systemowych PAN
Subjects:
natural language processing
proper names
named entities
corpus annotation
Polish National Corpus
SProUT
Description:
We present the named entity annotation subtask of a project aiming at creating the National Corpus of Polish. We summarize the annotation requirements defined for this corpus, and we discuss how existing lexical resources and grammars for named entity recognition for Polish have been adapted to meet those requirements. We show detailed results of the corpus annotation using the information extraction platform SProUT. We also analyze the errors committed by our knowledge-based method and suggest further improvements.
Source:
Control and Cybernetics; 2011, 40, 2; 361-391
0324-8569
Appears in:
Control and Cybernetics
Content provider:
Biblioteka Nauki
Article
Title:
Experimental Corpus of the Lithuanian Local Dialect of Punsk in Poland. Examples of the Lexical and Semantic Annotation
Authors:
Roszko, Danuta
Links:
https://bibliotekanauki.pl/articles/677261.pdf
Publication date:
2013
Publisher:
Polska Akademia Nauk. Instytut Slawistyki PAN
Subjects:
corpora
annotation
Lithuanian local dialect of Punsk in Poland
experimental dialectal corpus
Description:
In the article the author describes the experimental corpus of the Lithuanian local dialect of Puńsk in Poland (ECorp-of-Punsk). It is the first corpus of this type for the Lithuanian local dialect. The corpus consists of three subcorpora. The first one (referred to as fundamental) contains utterances given by Lithuanians in the local dialect, the second one contains utterances given by Lithuanians in Polish, and the third one contains aligned Polish-dialectal texts. The texts recorded in the years 1986–2012 have been included in the ECorp-of-Punsk resources.
Source:
Cognitive Studies; 2013, 13
2392-2397
Appears in:
Cognitive Studies
Content provider:
Biblioteka Nauki
Article
Title:
Experimental Polish-Lithuanian Corpus with the Semantic Annotation Elements
Authors:
Roszko, Danuta
Roszko, Roman
Links:
https://bibliotekanauki.pl/articles/677259.pdf
Publication date:
2013
Publisher:
Polska Akademia Nauk. Instytut Slawistyki PAN
Subjects:
corpora
parallel and comparable corpora
annotation
Polish
Lithuanian
Description:
In the article the authors present the experimental Polish-Lithuanian corpus (ECorpPL-LT), created to support Polish-Lithuanian theoretical contrastive studies and a Polish-Lithuanian electronic dictionary, and as an aid for sworn translators. The semantic annotation being introduced into ECorpPL-LT is extremely useful in Polish-Lithuanian contrastive studies, and also proves helpful in translation work.
Source:
Cognitive Studies; 2013, 13
2392-2397
Appears in:
Cognitive Studies
Content provider:
Biblioteka Nauki
Article
Title:
3D Medical Segmentation Visualization in Julia with MedEye3d
Authors:
Mitura, Jakub
Chrapko, Beata
Links:
https://bibliotekanauki.pl/articles/1838173.pdf
Publication date:
2021-12
Publisher:
Warszawska Wyższa Szkoła Informatyki
Subjects:
OpenGL
Computer Tomography
PET/CT
medical image annotation
medical image visualization
Description:
MedEye3d is a Julia language package designed to simplify the visualization of segmentations in a three-dimensional setting. The motivation for developing this application was to provide the rapidly growing Julia scientific community with a tool for research on three-dimensional medical images. The package is based on multiple open-source software packages, most prominently the OpenGL specification, which it uses to enable GPU acceleration. The application was tested on both Linux and Windows platforms, and in both cases the latency observed by the user in the most common interactions, such as scrolling, annotation and changing the displayed plane, was very small. Thanks to the use of many modern packages and methodologies, the developed package provides convenient visualization for rapid prototyping of medical image segmentation algorithms. The application is also easily extendable and will be included in a medical image segmentation framework that is currently in development.
Source:
Zeszyty Naukowe Warszawskiej Wyższej Szkoły Informatyki; 2021, 15, 25; 57-67
1896-396X
2082-8349
Appears in:
Zeszyty Naukowe Warszawskiej Wyższej Szkoły Informatyki
Content provider:
Biblioteka Nauki
Article
Title:
Construction of a medical corpus based on information extraction results
Authors:
Marciniak, M.
Mykowiecka, A.
Links:
https://bibliotekanauki.pl/articles/206379.pdf
Publication date:
2011
Publisher:
Polska Akademia Nauk. Instytut Badań Systemowych PAN
Subjects:
corpus
semantic annotation
clinical data
information extraction
Description:
The paper presents a method of automatic construction of a semantically annotated corpus using the results of a rule-based information extraction (IE) application. Construction of the corpus is based on using existing programs for text tokenization and morphological analysis and combining their results with domain-related correction rules. We reuse the specialized IE system to obtain a corpus annotated on the semantic level. The texts included in the corpus are Polish free-text clinical data. We present the documents (diabetic patients' discharge records), the structure of the corpus annotation and the methods for obtaining the annotations. Initial evaluations based on the results of manual verification of a selected data subset are also presented. The corpus, once manually corrected, is designed to be used for developing supervised machine learning models for IE applications.
Source:
Control and Cybernetics; 2011, 40, 2; 337-360
0324-8569
Appears in:
Control and Cybernetics
Content provider:
Biblioteka Nauki
Article
Towards an event annotated corpus of Polish
Autorzy:
Marcińczuk, Michał
Oleksy, Marcin
Bernaś, Tomasz
Kocoń, Jan
Wolski, Michał
Powiązania:
https://bibliotekanauki.pl/articles/677125.pdf
Data publikacji:
2015
Wydawca:
Polska Akademia Nauk. Instytut Slawistyki PAN
Tematy:
information extraction
event recognition
corpus annotation
Opis:
Towards an event annotated corpus of PolishThe paper presents a typology of events built on the basis of TimeML specification adapted to Polish language. Some changes were introduced to the definition of the event categories and a motivation for event categorization was formulated. The event annotation task is presented on two levels – ontology level (language independent) and text mentions (language dependant). The various types of event mentions in Polish text are discussed. A procedure for annotation of event mentions in Polish texts is presented and evaluated. In the evaluation a randomly selected set of documents from the Corpus of Wrocław University of Technology (called KPWr) was annotated by two linguists and the annotator agreement was calculated. The evaluation was done in two iterations. After the first evaluation we revised and improved the annotation procedure. The second evaluation showed a significant improvement of the agreement between annotators. The current work was focused on annotation and categorisation of event mentions in text. The future work will be focused on description of event with a set of attributes, arguments and relations.
Źródło:
Cognitive Studies; 2015, 15
2392-2397
Pojawia się w:
Cognitive Studies
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Information searching for an experience management platform of the EU Pellucid project
Autorzy:
Majewska, M.
Krawczyk, K.
Słota, R.
Kitowski, J.
Hluchy, L.
Lambert, S.
Powiązania:
https://bibliotekanauki.pl/articles/1964213.pdf
Data publikacji:
2004
Wydawca:
Politechnika Gdańska
Tematy:
experience management
ontologies
information retrieval
semantic annotation
Opis:
The EU Pellucid project is developing an experience management system for public organizations with staff mobility. The paper presents an activity whitin the project focused on searching for information in repositories of documents. The project's background and the process of information searching are described. Ontological methods such as semantic annotation and similarity searching, as well as ontology- and full-text-based searching are presented. Monitoring of organizational repositories is discussed.
Źródło:
TASK Quarterly. Scientific Bulletin of Academic Computer Centre in Gdansk; 2004, 8, 4; 513-523
1428-6394
Pojawia się w:
TASK Quarterly. Scientific Bulletin of Academic Computer Centre in Gdansk
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Erotetic Reasoning Corpus. A data set for research on natural question processing
Autorzy:
Łupkowski, P.
Urbański, M.
Wiśniewski, A.
Błądek, W.
Juska, A.
Kostrzewa, A.
Pankow, D.
Paluszkiewicz, K.
Ignaszak, O.
Urbańska, J.
Żyluk, N.
Gajda, A.
Marciniak, B.
Powiązania:
https://bibliotekanauki.pl/articles/103809.pdf
Data publikacji:
2017
Wydawca:
Polska Akademia Nauk. Instytut Podstaw Informatyki PAN
Tematy:
question
logic of question
question processing
erotetic reasoning
corpus annotation
Opis:
The aim of this paper is to present the Erotetic Reasoning Corpus (ERC) which constitutes a data set for research on natural question processing. We describe the theoretical background, linguistic data and tags used for the annotation process. We also discuss the potential areas in which the ERC can be exploited.
Źródło:
Journal of Language Modelling; 2017, 5, 3; 607-631
2299-856X
2299-8470
Pojawia się w:
Journal of Language Modelling
Dostawca treści:
Biblioteka Nauki
Artykuł
