Searching for the phrase "n-gram" by criterion: Subject


Displaying 1-8 of 8
Title:
Using frequent pattern mining algorithms in text analysis
Authors:
Ożdżyński, P.
Zakrzewska, D.
Links:
https://bibliotekanauki.pl/articles/95011.pdf
Publication date:
2017
Publisher:
Szkoła Główna Gospodarstwa Wiejskiego w Warszawie. Wydawnictwo Szkoły Głównej Gospodarstwa Wiejskiego w Warszawie
Subjects:
GSP
SuffixArray
PrefixSpan
N-Gram
frequent sequences
Description:
In text mining, the effectiveness of methods depends on document representations. Representations based on frequent word sequences are used in tasks such as categorization, clustering and topic modelling. The paper compares different algorithms for finding frequent word sequences. It considers techniques designed for market basket analysis, such as GSP and PrefixSpan, as well as a method based on a suffix array. The investigated techniques are compared with a new approach to searching for maximum frequent word sequences in document sets. The performance of the algorithms is examined in terms of execution times on the considered test collections.
Source:
Information Systems in Management; 2017, 6, 3; 213-222
2084-5537
2544-1728
Appears in:
Information Systems in Management
Content provider:
Biblioteka Nauki
Article
Title:
The role of word and n-gram frequency analysis in inference of the content of scientific publication
Authors:
Zdonek, Iwona
Links:
https://bibliotekanauki.pl/articles/1931609.pdf
Publication date:
2020
Publisher:
Politechnika Śląska. Wydawnictwo Politechniki Śląskiej
Subjects:
text mining
R
n-grams
scientific publication analysis
eksploracja tekstu (text mining)
n-gram
analiza publikacji naukowych (scientific publication analysis)
Description:
Purpose: The paper presents an analysis of a scientific publication with regard to the frequency of words and n-grams. The research problem addressed is the extent to which text mining analysis of a scientific publication allows its content to be inferred. Design/methodology/approach: The main research method is the analysis of tokenized text using word count functions, bigrams, and trigrams in selected sections of a scientific publication. The results of the text mining analysis were compared with the classic, non-automated text analysis of the publication. The presented study is a pilot project in the form of a case study. Findings: The proposed method of analyzing a scientific text using an analysis of the frequency of words and n-grams enables inference of the content of the paper with regard to the names of variables involved in the study, the statistical apparatus used and the key literature cited. It should be observed, however, that the discussed method does not make it possible to establish which variables are moderators and which are mediators. Originality/value: In this study, the text mining technique was used differently than in previous works. The publication was not examined in its entirety, as previous researchers did, but text mining analysis was applied to individual parts of the paper, i.e. the part discussing theoretical foundations of the research and the part presenting the research method, research results, and their discussion. This allowed for obtaining more precise results regarding the content of the publication.
Source:
Zeszyty Naukowe. Organizacja i Zarządzanie / Politechnika Śląska; 2020, 142; 21-31
1641-3466
Appears in:
Zeszyty Naukowe. Organizacja i Zarządzanie / Politechnika Śląska
Content provider:
Biblioteka Nauki
Article
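The word, bigram, and trigram counting mentioned in the record above (Zdonek, 2020) can be illustrated with a short sketch. The record lists R as a subject; Python is used here only for illustration, and the tokenizer, the sample sentence, and the names `tokenize` and `ngram_counts` are assumptions rather than anything taken from the paper.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into word tokens (hyphenated words stay whole)."""
    return re.findall(r"[a-z0-9]+(?:['-][a-z0-9]+)*", text.lower())


def ngram_counts(tokens, n):
    """Count every run of n consecutive tokens; returns an empty Counter if n is too large."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


# Hypothetical sample text, not taken from the analysed publication.
sample = "text mining of the text and text mining of n-grams"
tokens = tokenize(sample)
unigrams = ngram_counts(tokens, 1)
bigrams = ngram_counts(tokens, 2)
trigrams = ngram_counts(tokens, 3)

# The most frequent bigrams hint at the recurring topics of a section.
print(bigrams.most_common(2))
```

Running the same counts separately on each section of a paper, rather than on the whole text, mirrors the section-by-section approach the abstract describes.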
Title:
Rapid Text Entry Using Mobile and Auxiliary Devices for People with Speech Disorders Communication
Authors:
Krak, Iurii V.
Barmak, Olexander V.
Bahrii, Ruslan O.
Wójcik, Waldemar
Rakhmetullina, Saule
Amirgaliyeva, Saltanat
Links:
https://bibliotekanauki.pl/articles/227152.pdf
Publication date:
2020
Publisher:
Polska Akademia Nauk. Czytelnia Czasopism PAN
Subjects:
information technology
alternative communication
ambiguous virtual keyboard
text prediction
statistical language model
N-gram
Description:
The article considers information technology for the realization of human communication using residual human capabilities, obtained by organizing text entry using mobile and auxiliary devices. The components of the proposed technology are described in detail: the method for entering text information to realize the possibility of introducing a limited number of controls and the method of predicting words that are most often encountered after words already entered in the sentence. A generalized representation of the process of entering text is described with the aid of an ambiguous virtual keyboard and the representation of control signals for the selection of control elements. The approaches to finding the optimal distribution of the set of alphabet characters for different numbers of control signals are given. The method of word prediction is generalized and improved, the statistical language model with "back-off" is used, and the approach to the formation of the training corpus of the spoken Ukrainian language is proposed.
Source:
International Journal of Electronics and Telecommunications; 2020, 66, 2; 273-279
2300-1933
Appears in:
International Journal of Electronics and Telecommunications
Content provider:
Biblioteka Nauki
Article
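The record above mentions predicting the words that most often follow those already entered, using a statistical language model with "back-off". The sketch below shows only the general back-off idea: try the longest available context first and fall back to shorter ones when it was never seen. It uses raw counts without the discounting of a full back-off model, and the toy corpus and the names `train` and `predict` are assumptions, not the authors' method or data.

```python
from collections import Counter, defaultdict


def train(sentences, max_order=3):
    """Map each context (0 to max_order-1 preceding words) to a Counter of next words."""
    model = defaultdict(Counter)
    for sentence in sentences:
        words = sentence.lower().split()
        for i, word in enumerate(words):
            for k in range(max_order):
                if i - k < 0:
                    break  # not enough preceding words for a context of length k
                model[tuple(words[i - k:i])][word] += 1
    return model


def predict(model, history, max_order=3, top=3):
    """Suggest next words, backing off from the longest context to the empty one."""
    words = history.lower().split()
    for k in range(max_order - 1, -1, -1):
        if k > len(words):
            continue  # history is shorter than this context length
        context = tuple(words[len(words) - k:])
        if context in model:
            return [w for w, _ in model[context].most_common(top)]
    return []


# Hypothetical toy corpus standing in for a real training corpus.
corpus = ["i want to eat", "i want to sleep", "i want to eat now", "you want tea"]
model = train(corpus)
print(predict(model, "i want to"))   # uses the two-word context ("want", "to")
print(predict(model, "they want"))   # backs off to the one-word context ("want",)
```

On an ambiguous keyboard, such a ranked list would typically be filtered to the candidates compatible with the keys pressed so far; that step is omitted here.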
Title:
Równoległa implementacja algorytmu winnowing dla operacji strumieniowej analizy tekstu
Parallel Winnowing Implementation for text stream analysis
Authors:
Wielgosz, M.
Żurek, D.
Pietroń, M.
Dąbrowska-Boruch, A.
Wiatr, K.
Links:
https://bibliotekanauki.pl/articles/154404.pdf
Publication date:
2014
Publisher:
Stowarzyszenie Inżynierów i Techników Mechaników Polskich
Subjects:
n-gramowy model (n-gram model)
eksploracja danych (data mining)
przetwarzanie strumieniowe (stream processing)
GPGPU
n-gram-based model
document comparison
GPU
information retrieval
Description:
This work analyzes the feasibility of using the winnowing algorithm for stream processing of textual information. Particular emphasis is placed on fingerprint generation, the fingerprint being a reduced representation of a text message. The authors conducted a series of experiments to determine the efficiency of the algorithm and the achievable computational speedup, using a node with Intel Xeon E5645 2.40 GHz processors and an Nvidia Tesla M2090 GPU card.
Several models are available for information retrieval and text analysis, but two are considered dominant, namely the Boolean model and the vector space model (VSM). A model maps the existing words or text into a new representation space. This paper presents winnowing, a Boolean n-gram-based algorithm for fast text search and comparison of documents, with the main focus on its implementation and performance analysis. The algorithm is used to generate fingerprints (i.e. a set of hashes) of the analyzed documents. A dedicated test framework was designed and implemented to handle the task of evaluating the algorithm; it utilizes the PAN test corpus and programming environment. Several tests were conducted in order to determine the comparison quality of obfuscated and non-obfuscated text for the winnowing algorithm with different window and n-gram sizes. The tests revealed interesting properties of the algorithms with respect to comparison of documents and defined the limits of their applicability. Due to their simplicity, n-gram-based algorithms are well suited for hardware implementation. Thus, the authors implemented the computationally demanding part of fingerprint generation on both CPU and GPU. Performance measurements for the Intel Xeon E5645 2.40 GHz and Nvidia Tesla M2090 implementations of the n-gram-based algorithm show approximately 14x computational speedup.
Source:
Pomiary Automatyka Kontrola; 2014, R. 60, nr 5, 5; 309-312
0032-4140
Appears in:
Pomiary Automatyka Kontrola
Content provider:
Biblioteka Nauki
Article
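The fingerprint generation described in the record above can be sketched as follows. This is a minimal illustration of winnowing as it is commonly described (hash every n-gram, slide a window over the hashes, keep the minimum of each window), not the authors' CPU or GPU implementation. Character n-grams, CRC32 hashing via `zlib`, the default sizes, the Jaccard comparison, and all function names are assumptions chosen for illustration.

```python
import zlib


def normalize(text):
    """Keep only letters and digits, lowercased (an assumed preprocessing step)."""
    return "".join(ch.lower() for ch in text if ch.isalnum())


def ngram_hashes(text, n):
    """Hash every character n-gram of the normalized text with CRC32."""
    s = normalize(text)
    return [zlib.crc32(s[i:i + n].encode("utf-8")) for i in range(len(s) - n + 1)]


def winnow(text, n=5, window=4):
    """Return a fingerprint: the set of (hash, position) minima over all hash windows.

    Texts with fewer than n + window - 1 normalized characters yield an empty set.
    """
    hashes = ngram_hashes(text, n)
    fingerprint = set()
    for start in range(len(hashes) - window + 1):
        chunk = hashes[start:start + window]
        smallest = min(chunk)
        # On ties, keep the rightmost minimum in the window.
        offset = window - 1 - chunk[::-1].index(smallest)
        fingerprint.add((smallest, start + offset))
    return fingerprint


def similarity(a, b, n=5, window=4):
    """Jaccard similarity of the selected hash values, ignoring positions."""
    ha = {h for h, _ in winnow(a, n, window)}
    hb = {h for h, _ in winnow(b, n, window)}
    union = ha | hb
    return len(ha & hb) / len(union) if union else 0.0


# Hypothetical documents sharing the phrase "quick brown fox jumps".
doc_a = "The quick brown fox jumps over the lazy dog"
doc_b = "xyz quick brown fox jumps abc"
print(len(winnow(doc_a)), similarity(doc_a, doc_b))
```

Because the minimum of every window is kept, any shared run of at least n + window - 1 normalized characters contributes at least one common hash to both fingerprints, which is why the n-gram and window sizes control the shortest match that is sure to be detected.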
Title:
Removing grammar ambiguity of word forms by statistical methods
Authors:
Karnaukh, Ganna
Links:
https://bibliotekanauki.pl/articles/967221.pdf
Publication date:
2014
Publisher:
Polska Akademia Nauk. Instytut Slawistyki PAN
Subjects:
grammatical homonyms
linguistic processor
N-gram
trigram
word form
grammatical meaning
capital / small letters
statistical methods
Description:
The research is devoted to studying the behaviour of a linguistic processor under the simultaneous application of supporting software functions (taking into account how word forms are written (capital / small letters), punctuation marks in trigrams, and the location of trigrams within a sentence). The article analyses the qualitative and quantitative characteristics of the results of removing grammatical homonyms of word forms using statistical methods in compliance with requirements. The research is based on the texts of normative legal documents.
Source:
Cognitive Studies; 2014, 14
2392-2397
Appears in:
Cognitive Studies
Content provider:
Biblioteka Nauki
Article
Title:
Synthesis, characterization and antimicrobial activity of some new dihydropyrano[c]chromenes
Authors:
Bhalu, A.
Moteriya, P.
Chanda, S.
Baluja, S.
Links:
https://bibliotekanauki.pl/articles/412452.pdf
Publication date:
2014
Publisher:
Przedsiębiorstwo Wydawnictw Naukowych Darwin / Scientific Publishing House DARWIN
Subjects:
Dihydropyrano[c]chromenes
antibacterial activity
Gram positive bacteria
Gram negative bacteria
N,N-Dimethylformamide
Description:
Some new dihydropyrano[c]chromene derivatives were synthesized from 4-hydroxycoumarin. The structure of the newly synthesized compounds was confirmed by mass, 1H NMR and IR spectroscopy. Further, antimicrobial screening of these synthesized compounds was done against some bacterial (Gram positive as well as Gram negative) and fungal strains in N,N-dimethylformamide. It is observed that some of the synthesized compounds exhibited significant antibacterial activity against Gram positive bacterial strains. The selected fungal strains were the most resistant to the studied compounds, as none of the synthesized compounds showed activity against any of the fungal strains studied. The best antibacterial activity was shown by ABR-10.
Source:
International Letters of Chemistry, Physics and Astronomy; 2014, 12; 1-6
2299-3843
Appears in:
International Letters of Chemistry, Physics and Astronomy
Content provider:
Biblioteka Nauki
Article
Title:
Synthesis, characterization and antimicrobial activity of pyrazolo chalcone compounds
Authors:
Hirapara, Asmita V.
Baluja, Shipra H.
Links:
https://bibliotekanauki.pl/articles/1076569.pdf
Publication date:
2019
Publisher:
Przedsiębiorstwo Wydawnictw Naukowych Darwin / Scientific Publishing House DARWIN
Subjects:
Dimethyl sulphoxide
Fungai
Gram negative bacteria
Gram positive bacteria
N,N-dimethyl formamide
Pyrazolo chalcone compounds
Description:
Some new pyrazolo chalcone compounds were synthesized and their structures were characterized by spectroscopic techniques such as FT-IR, 1H NMR and mass. All of the synthesized compounds were screened in vitro against some bacterial and fungal strains in dimethyl sulphoxide and N,N-dimethyl formamide using the agar well diffusion method. It is observed that N,N-dimethyl formamide is a good solvent for these compounds in the selected strains.
Source:
World Scientific News; 2019, 115; 242-259
2392-2192
Appears in:
World Scientific News
Content provider:
Biblioteka Nauki
Article
Title:
Antimicrobial activity of some novel triazolo quinoline derivatives
Authors:
Hirapara, Asmita V.
Baluja, Shipra H.
Links:
https://bibliotekanauki.pl/articles/1076047.pdf
Publication date:
2019
Publisher:
Przedsiębiorstwo Wydawnictw Naukowych Darwin / Scientific Publishing House DARWIN
Subjects:
Dimethyl sulphoxide
Fungal strains
Gram negative bacteria
Gram positive bacteria
N,N-dimethyl formamide
Triazolo quinoline derivatives
Description:
Some triazolo quinoline derivatives were synthesized and their structures were confirmed by IR, 1H NMR, 13C NMR and mass spectroscopy. Screening of all these synthesized compounds was done in vitro against bacteria and three fungal strains in dimethyl sulphoxide and N,N-dimethyl formamide. It is observed that N,N-dimethyl formamide is a good solvent for these compounds in the selected strains.
Source:
World Scientific News; 2019, 117; 102-121
2392-2192
Appears in:
World Scientific News
Content provider:
Biblioteka Nauki
Article