Temat: automatic extraction - Katalog OPAC zbiorów

Skocz do pozycji: 1.

Tytuł:: Collocations terminologiques et extraction automatique : une étude pilote dans le domaine du commerce électronique
Terminological Collocations and Automatic Extraction: A Pilot Study on E-Commerce
Autorzy:: Calvi, Silvia
Powiązania:: https://bibliotekanauki.pl/articles/2015042.pdf
Data publikacji:: 2021
Wydawca:: Komisja Nauk Filologicznych Polskiej Akademii Nauk, Oddział we Wrocławiu
Tematy:: collocations
terminology
automatic extraction
e-commerce
Opis:: This article concentrates on the automatic extraction of collocations, defined by the Explanatory and Combinatorial Lexicology as phraseological units composed by two elements – the base and the collocate. The aim of this article is to propose a methodology to follow in order to automatically extract collocations from a terminological corpus. This method takes into account different measures: the syntactic dependences between the items of the collocation, their frequency, their tendency to co-occur (PMI) and their specificity to the e-commerce domain. After having explained the theoretical framework, the methodology is illustrated using a pilot study of the French terminology of e-commerce. In the pilot study, data were extracted from a corpus made up of e-commerce texts, which are drawn from a larger corpus called DIACOM-fr, a corpus in the process of being built at the University of Verona within the project Digital Humanities Applied to Foreign Languages and Literatures. Data extraction was primarily done using two tools: Stanza a Python natural language analysis package developed by the Stanford NLP group and TermoStat an automatic extractor tool developed at the Observatoire de Linguistique Sens-Texte of the University of Montreal.
Źródło:: Academic Journal of Modern Philology; 2021, 13; 75-82
2299-7164
2353-3218
Pojawia się w:: Academic Journal of Modern Philology
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 2.

Tytuł:: A type-logical treebank for French
Autorzy:: Moot, R.
Powiązania:: https://bibliotekanauki.pl/articles/103845.pdf
Data publikacji:: 2015
Wydawca:: Polska Akademia Nauk. Instytut Podstaw Informatyki PAN
Tematy:: type-logical grammar
categorial grammar
semi-automatic grammar extraction
Opis:: This paper describes the TLGbank, a treebank developed in the framework of (multimodal) type-logical grammar. Using the French Treebank as a starting point, a combination of automated and manual techniques are applied to obtain type-logical derivations (parses) corresponding to the phrases of the French Treebank. The TLGbank has been developped with applications to wide-coverage semantics in mind. This means that the TLGbank has richer structure than the original French Treebank, especially where it concerns semantically relevant information such as passives, coordination, extraction and gapping.
Źródło:: Journal of Language Modelling; 2015, 3, 1; 229-264
2299-856X
2299-8470
Pojawia się w:: Journal of Language Modelling
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 3.

Tytuł:: The logic and linguistic model for automatic extraction of collocation similarity
Autorzy:: Khairova, N.
Petrasova, S.
Gautam, A. P. S.
Powiązania:: https://bibliotekanauki.pl/articles/411457.pdf
Data publikacji:: 2015
Wydawca:: Polska Akademia Nauk. Oddział w Lublinie PAN
Tematy:: automatic extraction
identification of collocation similarity
finite predicates algebra
logicalalgebraic equations
grammatical and semantic features
Opis:: The article discusses the process of automatic identification of collocation similarity. The semantic analysis is one of the most advanced as well as the most difficult NLP task. The main problem of semantic processing is the determination of polysemy and synonymy of linguistic units. In addition, the task becomes complicated in case of word collocations. The paper suggests a logical and linguistic model for automatic determining semantic similarity between colocations in Ukraine and English languages. The proposed model formalizes semantic equivalence of collocations by means of semantic and grammatical characteristics of collocates. The basic idea of this approach is that morphological, syntactic and semantic characteristics of lexical units are to be taken into account for the identification of collocation similarity. Basic mathematical means of our model are logical-algebraic equations of the finite predicates algebra. Verb-noun and noun-adjective collocations in Ukrainian and English languages consist of words belonged to main parts of speech. These collocations are examined in the model. The model allows extracting semantically equivalent collocations from semi-structured and non-structured texts. Implementations of the model will allow to automatically recognize semantically equivalent collocations. Usage of the model allows increasing the effectiveness of natural language processing tasks such as information extraction, ontology generation, sentyment analysis and some others.
Źródło:: ECONTECHMOD : An International Quarterly Journal on Economics of Technology and Modelling Processes; 2015, 4, 4; 43-48
2084-5715
Pojawia się w:: ECONTECHMOD : An International Quarterly Journal on Economics of Technology and Modelling Processes
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 4.

Tytuł:: Kościelne słownictwo prawno-administracyjne w polskiej łacinie średniowiecznej – analiza z wykorzystaniem metod korpusowych
The Vocabulary of the Church’s Law and Administration in Polish Medieval Latin: An Analysis Using the Methods of Corpus Linguistics
Autorzy:: Halida, Łukasz
Powiązania:: https://bibliotekanauki.pl/articles/2015048.pdf
Data publikacji:: 2021
Wydawca:: Komisja Nauk Filologicznych Polskiej Akademii Nauk, Oddział we Wrocławiu
Tematy:: medieval Latin
electronic corpus
automatic term extraction
specialized vocabulary
ecclesiastical law and administration
Opis:: This paper concerns the vocabulary of the Church’s law and administration in Latin texts written during the Middle Ages in Poland and its automatic extraction using the methods of corpus linguistics. The first part of this article considers the basic theoretical assumptions of the automatic extraction of this specialized vocabulary and the main characteristics of the Electronic Corpus of Polish Medieval Latin. In the second part presents the methods and results of term extraction. For the purpose of this research, a specialized subcorpus, including synodal statues and documents of ecclesiastical chapters, was created and then compared with the reference corpus. As a result, a list of lexemes, which appeared relatively frequently in the subcorpus and rarely in the reference corpus, was obtained. This difference in relative frequency was the main criterion for the recognition of potential terminological units. Verification on the basis of lexicographic data demonstrated the effectiveness of the adopted methods. The aim of this paper was to present the usefulness of the Electronic Corpus of Polish Medieval Latin for the research and analysis of specialized vocabulary.
Źródło:: Academic Journal of Modern Philology; 2021, 13; 123-132
2299-7164
2353-3218
Pojawia się w:: Academic Journal of Modern Philology
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 5.

Tytuł:: On the applicability of integrated UAV photogrammetry and automatic feature extraction for cadastral mapping
Autorzy:: Ajayi, Oluibukun Gbenga
Oruma, Emmanuel
Powiązania:: https://bibliotekanauki.pl/articles/43852813.pdf
Data publikacji:: 2022
Wydawca:: Polska Akademia Nauk. Czasopisma i Monografie PAN
Tematy:: zarządzanie gruntami
segmentacja obrazu
mapowanie
land management
remote sensing applications
image segmentation
automatic boundary extraction
UAV mapping
Opis:: The applicability of integratedUnmannedAerialVehicle (UAV)-photogrammetry and automatic feature extraction for cadastral or property mapping was investigated in this research paper. Multi-resolution segmentation (MRS) algorithm was implemented on UAVgenerated orthomosaic for mapping and the findings were compared with the result obtained from conventional ground survey technique using Hi-Target Differential Global Positioning System (DGPS) receivers. The overlapping image pairs acquired with the aid of a DJI Mavic air quadcopter were processed into an orthomosaic using Agisoft metashape software while MRS algorithm was implemented for the automatic extraction of visible land boundaries and building footprints at different Scale Parameter (SPs) in eCognition developer software. The obtained result shows that the performance of MRS improves with an increase in SP, with optimal results obtained when the SP was set at 1000 (with completeness, correctness, and overall accuracy of 92%, 95%, and 88%, respectively) for the extraction of the building footprints. Apart from the conducted cost and time analysis which shows that the integrated approach is 2.5 times faster and 9 times cheaper than the conventional DGPS approach, the automatically extracted boundaries and area of land parcels were also compared with the survey plans produced using the ground survey approach (DGPS) and the result shows that about 99% of the automatically extracted spatial information of the properties fall within the range of acceptable accuracy. The obtained results proved that the integration of UAVphotogrammetry and automatic feature extraction is applicable in cadastral mapping and that it offers significant advantages in terms of project time and cost.
Źródło:: Advances in Geodesy and Geoinformation; 2022, 71, 1; art. no. e19, 2022
2720-7242
Pojawia się w:: Advances in Geodesy and Geoinformation
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 6.

Tytuł:: Utilization of ASTER data in lithological and lineament mapping of the southern flank of the Central High Atlas in Morocco
Autorzy:: Errami, Maryam
Algouti, Ahmed
Algouti, Abdellah
Farah, Abdelouhed
Agli, Saloua
Powiązania:: https://bibliotekanauki.pl/articles/2204372.pdf
Data publikacji:: 2023
Wydawca:: Uniwersytet im. Adama Mickiewicza w Poznaniu
Tematy:: Amezri-Amassine area
PCA
Principal Component Analysis
band ratio
MNF
Minimum Noise Fraction
automatic lineament extraction
Amezri
Amassine
analiza głównych składowych
Opis:: Geological mapping undoubtedly plays an important role in several studies and remote sensing data are of great significance in geological mapping, particularly in poorly mapped areas situated in inaccessible regions. In the present study, Principal Component Analysis (PCA), Band Rationing (BR) and Minimum Noise Fraction (MNF) algorithms are applied to map lithological units and extract lineaments in the Amezri-Amassine area, by using multispectral ASTER image and global digital elevation model (GDEM) data for the first time. Following preprocessing of ASTER images, advanced image algorithms such as PCA, BR and MNF analyses are applied to the 9ASTER bands. Validation of the resultant maps has relied on matching lithological boundaries and faults in the study area and on the basis of pre-existing geological maps. In addition to the PCA image, a new band-ratio image, 4/6–5/8–4/5, as adopted in the present work, provides high accuracy in discriminating lithological units. The MNF transformation reveals improvement over previous enhancement techniques, in detailing most rock units in the area. Hence, results derived from the enhancement techniques show a good correlation with the existing litho-structural map of the study area. In addition, the present results have allowed to update this map by identifying new lithological units and structural lineaments. Consequently, the methodology followed here has provided satisfactory results and has demonstrated the high potential of multispectral ASTER data for improving lithological discrimination and lineament extraction.
Źródło:: Geologos; 2023, 29, 1; 1--20
1426-8981
2080-6574
Pojawia się w:: Geologos
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 7.

Tytuł:: Automatic wrapper generation and generalization for social media websites
Autorzy:: Baziński, B.
Brzezicki, M.
Powiązania:: https://bibliotekanauki.pl/articles/206411.pdf
Data publikacji:: 2012
Wydawca:: Polska Akademia Nauk. Instytut Badań Systemowych PAN
Tematy:: automatic wrapper generation
information extraction
Opis:: The data contained within user generated kontent websites prove to be valuable in many applications, for example in social media monitoring or in acquisition of training sets for machine learning algorithms. Mining such data is especially difficult in case of web forums, because of hundreds of various forum engines used. We propose an algorithm capable of unsupervised extraction of posts from social websites, without the need to analyse more than one page in advance. Our method localizes potential data regions by repetition analysis within document structure and filtering potential results. Subsequently, the fields of data records are fund using key characteristics and series-wide dependencies. We manager to achieve 85% precision of extraction and 79% recall after experiments on single pages taken from 258 websites. Our solution is characterized by high computing efficiency, thus enabling wide applications.
Źródło:: Control and Cybernetics; 2012, 41, 4; 817-834
0324-8569
Pojawia się w:: Control and Cybernetics
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 8.

Tytuł:: Efficient Two-Step Approach for Automatic Number Plate Detection
Autorzy:: Gorovyi, I.
Powiązania:: https://bibliotekanauki.pl/articles/226386.pdf
Data publikacji:: 2015
Wydawca:: Polska Akademia Nauk. Czytelnia Czasopism PAN
Tematy:: automatic number plate recognition (ANPR)
stroke width transform
features extraction
neural network
Opis:: Intelligent transportation systems are rapidly growing mainly due to active development of novel hardware and software solutions. In the paper a problem of automatical number plate detection is considered. An efficient two-step approach based on plate candidates extraction with further classification by neural network is proposed. Stroke width transform and contours detection techniques are utilized for the image preprocessing and extraction of regions of interest. Different local feature sets are used for the final number plate extraction step. Efficiency of the developed method is tested with real datasets.
Źródło:: International Journal of Electronics and Telecommunications; 2015, 61, 4; 351-356
2300-1933
Pojawia się w:: International Journal of Electronics and Telecommunications
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 9.

Tytuł:: Modelowanie i optymalizacja generatora cech dla systemu rozpoznawania mówcy
Modeling and optimization of features generator for speaker recognition systems
Autorzy:: Majda, E.
Dobrowolski, A. P.
Smólski, B. L.
Powiązania:: https://bibliotekanauki.pl/articles/209417.pdf
Data publikacji:: 2012
Wydawca:: Wojskowa Akademia Techniczna im. Jarosława Dąbrowskiego
Tematy:: automatyczne rozpoznawanie mówcy
analiza cepstralna
ekstrakcja cech
selekcja cech
analiza składników głównych
automatic speaker recognition
cepstral analysis
features extraction
features selection
principal component analysis
Opis:: W pracy przedstawiono zagadnienia związane z modelowaniem i optymalizacją generatora cech dla systemu automatycznego rozpoznawania mówcy (ang. Automatic Speaker Recognition - ASR). Etap generacji cech (parametryzacji sygnału mowy) jest fundamentalny w tego typu systemach, z uwagi na fakt, że unikatowy wektor cech ma decydujące znaczenie w procesie rozpoznawania. Zadaniem generatora cech jest opisanie sygnału mowy za pomocą możliwie mało licznego zbioru deskryptorów, bez utraty informacji istotnych z punktu widzenia rozpoznawania mówcy. Ponadto parametryzacja powinna wykazywać odporność na warunki akustyczne i techniczne rejestracji oraz na zawartość lingwistyczną rejestrowanego materiału. Badania przedstawione w referacie koncentrowały się przede wszystkim na wielokryterialnej optymalizacji wybranych parametrów generatora cech opartego na analizie cepstralnej, uwzględniającej dodatkowo selekcję cech. Oceny otrzymanych wyników dokonano w oparciu o analizę składników głównych (ang. Principal Component Analysis - PCA) zbioru deskryptorów wyznaczonych dla próbek głosu pochodzących od 24 mówców.
The paper presents issues related to modeling and optimization of the features generator for the speaker recognition system (ASR - Automatic Speakers Recognition). Parameterization's stage of the speech signal (features generation) is fundamental in this type of systems, due to the fact that the unique vector of features is crucial in the process of recognition. The task is to describe the speech signal using descriptors as little as possible, without loss of relevant information to the speaker recognition. In addition, parametrization should have robust to acoustic and technical registration conditions and the recorded linguistic material. The research presented in this paper is focused primarily on the multicriteria optimization of selected parameters of the features generator based on cepstral analysis, additionally allowing features selection. Finally, evaluation of the results was based on the analysis of main components, a set of descriptors for the samples voice acquired from 24 speakers.
Źródło:: Biuletyn Wojskowej Akademii Technicznej; 2012, 61, 4; 153-168
1234-5865
Pojawia się w:: Biuletyn Wojskowej Akademii Technicznej
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 10.

Tytuł:: Application of Support Vector Machines in automatic human face recognition
Autorzy:: Kawulok, M.
Powiązania:: https://bibliotekanauki.pl/articles/333790.pdf
Data publikacji:: 2005
Wydawca:: Uniwersytet Śląski. Wydział Informatyki i Nauki o Materiałach. Instytut Informatyki. Zakład Systemów Komputerowych
Tematy:: automatyczne rozpoznanie twarzy
metoda wektorów nośnych
wykrywanie twarzy
wybór cech
fuzja wielometodowa
automatic face recognition
support vector machines
face detection
feature extraction
multi-method fusion
Opis:: This paper presents the possibilities of applying the Support Vector Machines (SVM) in the process of automatic human face recognition. It is described how the existing methods of face recognition can be improved by the SVM. Moreover, a new approach to the multi-method fusion utilising the SVM is proposed. Usefulness of all the methods described in the paper improving the face recognition effectiveness by the SVM is confirmed by the experimental results.
Źródło:: Journal of Medical Informatics & Technologies; 2005, 9; 143-150
1642-6037
Pojawia się w:: Journal of Medical Informatics & Technologies
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Informacja

Wyszukujesz frazę "automatic extraction" wg kryterium: Temat

Źródło danych

Dostawca treści

Kolekcja

Rok wydania

Wydawca

Temat

Autor

Typ dokumentu

Język