Subject: text clustering - OPAC collection catalog

Title:: The system developing of forming research schools basis of publication elements analysis
Authors:: Shakhovska, N.
Noha, R.
Links:: https://bibliotekanauki.pl/articles/117910.pdf
Publication date:: 2014
Publisher:: Polskie Towarzystwo Promocji Wiedzy
Subjects:: research school
clustering
text mining
Description:: This paper introduces a method for analyzing the elements of research publications: it determines the common qualities of publications and clusters them, as an instrument for selecting and sorting out information about research schools. The document structuring module is passed a tape that indicates the address of the file. Depending on where the file is located, this can be a path to a file on the local disk or a URL on the Internet.
Source:: Applied Computer Science; 2014, 10, 2; 57-66
1895-3735
Appears in:: Applied Computer Science
Content provider:: Biblioteka Nauki

Article

Title:: Text mining in practice: exploring patterns in text collections of remote work job offers
Authors:: Kuligowska, Karolina
Lasek, Mirosława
Links:: https://bibliotekanauki.pl/articles/431872.pdf
Publication date:: 2013
Publisher:: Wydawnictwo Uniwersytetu Ekonomicznego we Wrocławiu
Subjects:: text mining
text analytics
clustering
concept linking
remote work
telecommuting
Description:: The aim of this paper is to give an insight into text mining techniques in the context of unstructured text collections of location-independent job offers. In order to extract useful information and uncover interesting patterns and features of remote work, we analyze the five most popular and most visited websites containing job offers. We examine clusters of remote job offers, the keywords describing those clusters, as well as the linkages between strongly associated terms describing mobile work offers. It is interesting to observe the maturity of the text mining tools, which have broadened their applications to new research topics and have become suitable for exploring new phenomena.
Source:: Informatyka Ekonomiczna; 2013, 4(30); 181-195
1507-3858
Appears in:: Informatyka Ekonomiczna
Content provider:: Biblioteka Nauki

Article

Title:: Semantic web techniques for clinical topic detection in health care
Authors:: Raman, R.
Sahayaraj, Kishore Anthuvan
Soni, Mukesh
Nayak, Nihar Ranjan
Govindarajan, Ramya
Singh, Nikhil Kumar
Links:: https://bibliotekanauki.pl/articles/38698068.pdf
Publication date:: 2024
Publisher:: Instytut Podstawowych Problemów Techniki PAN
Subjects:: clinical text
frequent word set
feature selection
clustering
topic detection
time sequence
semantics
tekst kliniczny
częsty zestaw słów
wybór funkcji
grupowanie
wykrywanie tematu
sekwencja czasu
semantyka
Description:: This paper investigates and proposes a new clustering method that takes into account the timing characteristics of frequently used feature words and the semantic similarity of microblog short texts, and designs and implements microblog topic detection based on the clustering results. The aim of the proposed research is to provide a new cluster overlap reduction method based on the divisions of semantic memberships to solve limited semantic expression and diversify short microblog contents. First, by defining the time-series frequent word set of the microblog text, a feature word selection method for hot topics is given; then, the initial clustering of the microblog is obtained according to the time-series recurring feature word set.
Source:: Computer Assisted Methods in Engineering and Science; 2024, 31, 2; 139-155
2299-3649
Appears in:: Computer Assisted Methods in Engineering and Science
Content provider:: Biblioteka Nauki

Article

Title:: Geodesic distances for clustering linked text data
Authors:: Tekir, S.
Mansmann, F.
Keimer, D.
Links:: https://bibliotekanauki.pl/articles/91737.pdf
Publication date:: 2012
Publisher:: Społeczna Akademia Nauk w Łodzi. Polskie Towarzystwo Sieci Neuronowych
Subjects:: clustering
geodesic distance
text data
k-means algorithm
cosine distance
k-harmonic means
microprecision values
Description:: The quality of a clustering depends not only on the chosen algorithm and its parameters, but also on the definition of the similarity of two respective objects in a dataset. Applications such as clustering of web documents are traditionally built either on textual similarity measures or on link information. Due to the incompatibility of these two information spaces, combining these two information sources in one distance measure is a challenging issue. In this paper, we thus propose a geodesic distance function that combines traditional similarity measures with link information. In particular, we test the effectiveness of geodesic distances as similarity measures under the space assumption of spherical geometry in a 0-sphere. Our proposed distance measure is thus a combination of the cosine distance of the term-document matrix and some curvature values in the geodesic distance formula. To estimate these curvature values, we calculate clustering coefficient values for every document from the link graph of the data set and increase their distinctiveness by means of a heuristic, as these clustering coefficient values are rough estimates of the curvatures. To evaluate our work, we perform clustering tests with the k-means algorithm on a subset of the English Wikipedia hyperlinked data set with both traditional cosine distance and our proposed geodesic distance. Additionally, taking inspiration from the unified view of the performance functions of k-means and k-harmonic means, the min and harmonic average of the cosine and geodesic distances are taken in order to construct alternate distance forms. The effectiveness of our approach is measured by computing microprecision values of the clusters based on the provided categorical information of each article.
Source:: Journal of Artificial Intelligence and Soft Computing Research; 2012, 2, 3; 247-258
2083-2567
2449-6499
Appears in:: Journal of Artificial Intelligence and Soft Computing Research
Content provider:: Biblioteka Nauki

Article

Title:: Document Clustering : Concepts, Metrics and Algorithms
Authors:: Tarczynski, T.
Links:: https://bibliotekanauki.pl/articles/226231.pdf
Publication date:: 2011
Publisher:: Polska Akademia Nauk. Czytelnia Czasopism PAN
Subjects:: document clustering
text mining
k-means
hierarchical clustering
vector space model
Description:: Document clustering, which is also referred to as text clustering, is a technique of unsupervised document organisation. Text clustering is used to group documents into subsets that consist of texts that are similar to each other. These subsets are called clusters. Document clustering algorithms are widely used in web search engines to produce results relevant to a query. An example of the practical use of those techniques is Yahoo!'s hierarchies of documents [1]. Another application of document clustering is browsing, which is defined as a searching session without a well-specified goal. Browsing techniques rely heavily on document clustering. In this article we examine the most important concepts related to document clustering. Besides the algorithms, we present a comprehensive discussion of the representation of documents, the calculation of similarity between documents, and the evaluation of cluster quality.
Source:: International Journal of Electronics and Telecommunications; 2011, 57, 3; 271-277
2300-1933
Appears in:: International Journal of Electronics and Telecommunications
Content provider:: Biblioteka Nauki

Article

Title:: Cluster analysis of medical text documents by using semi-clustering approach based on graph representation
Authors:: Woźniak, R.
Ożdżyński, P.
Zakrzewska, D.
Links:: https://bibliotekanauki.pl/articles/94773.pdf
Publication date:: 2018
Publisher:: Szkoła Główna Gospodarstwa Wiejskiego w Warszawie. Wydawnictwo Szkoły Głównej Gospodarstwa Wiejskiego w Warszawie
Subjects:: cluster analysis
semi-clustering
text mining
Description:: The development of the Internet has resulted in an increasing number of online text repositories. In many cases, documents are assigned to more than one class and automatic multi-label classification needs to be used. When the number of labels exceeds the number of documents, effective label space dimension reduction may significantly improve classification accuracy, which is a major priority in the medical field. In this paper, we propose document clustering for label selection. We use a semi-clustering method based on a graph representation, in which documents are represented by vertices and edge weights are calculated according to their mutual similarity. Assigning documents to semi-clusters helps to reduce the number of labels that are then used in the multi-label classification process. The performance of the method is examined by experiments conducted on real medical datasets.
Source:: Information Systems in Management; 2018, 7, 3; 213-224
2084-5537
2544-1728
Appears in:: Information Systems in Management
Content provider:: Biblioteka Nauki

Article

Title:: Analysis of methods and means of text mining
Authors:: Rybchak, Z.
Basystiuk, O.
Links:: https://bibliotekanauki.pl/articles/411072.pdf
Publication date:: 2017
Publisher:: Polska Akademia Nauk. Oddział w Lublinie PAN
Subjects:: text mining
text analytics
data analysis
high-quality information
text categorization
text clustering
document summarization
sentiment analysis
sieć językowa
analiza tekstu
analiza danych
wysoka jakość informacji
klasyfikacja tekstowa
kategoryzacja tekstowa
grupowanie tekstu
streszczenie dokumentów tekstowych
technika sentiment analysis
Description:: In the Big Data era, when data volume doubles every year, analyzing all this data becomes a really complicated task. Text mining systems, techniques and tools therefore become the main instrument for analyzing tons and tons of information, selecting the information that best suits your needs, and saving your time for more interesting things. The main aims of this article are to explain the basic principles of this field and to give an overview of some interesting technologies that are widely used in text mining nowadays.
Source:: ECONTECHMOD : An International Quarterly Journal on Economics of Technology and Modelling Processes; 2017, 6, 2; 73-78
2084-5715
Appears in:: ECONTECHMOD : An International Quarterly Journal on Economics of Technology and Modelling Processes
Content provider:: Biblioteka Nauki

Article

Title:: A chi assomiglia Elena Ferrante? Un profilo stilometrico aggiornato
Who Does Elena Ferrante Look Like? A Revised Stylometric Identikit
Authors:: Cortelazzo, Michele A.
Tuzzi, Arjuna
Links:: https://bibliotekanauki.pl/articles/446396.pdf
Publication date:: 2020-07-31
Publisher:: Wydawnictwo Adam Marszałek
Subjects:: Elena Ferrante
contemporary Italian literature
authorship attribution
similarity measure
text clustering
letteratura italiana contemporanea
attribuzione d’autore
misure di similarità
classificazione dei testi
Description:: Based on a corpus including 150 novels by 40 authors, a stylometric survey was conducted to assess which modern authors were similar to Elena Ferrante, the pen name used for eight novels, including "My Brilliant Friend" (Tuzzi & Cortelazzo 2018a and 2018b). The survey proved that Elena Ferrante’s writing style is remarkably different from that of the other main contemporary Italian novelists with the notable exception of Domenico Starnone. Follow-up studies (Cortelazzo, Mikros & Tuzzi 2018 and another under way) show that non-fiction works signed by Elena Ferrante may be attributed to different authors, i.e., Anita Raja, Starnone again, and a collective author including the staff of the E/O publishing house. This study complements the results obtained by previous research by assessing Elena Ferrante’s role in modern Italian fiction following the publication of her latest novel, "The Lying Life of Adults". In addition, the analysis of her similarities to Domenico Starnone was enhanced by means of a larger corpus of his novels, thus corroborating the outcome of previous research.
Sulla base di un corpus di 150 romanzi di 40 autori, abbiamo condotto un’indagine stilometrica, per valutare quali autori contemporanei risultassero più vicini a Elena Ferrante, pseudonimo con cui sono stati firmati otto romanzi, tra cui "l’Amica geniale" (Tuzzi & Cortelazzo 2018a e 2018b). L’indagine ha dimostrato che lo stile di scrittura di Elena Ferrante è notevolmente diverso da quello degli altri principali romanzieri italiani contemporanei, con la sola eccezione di Domenico Starnone. Studi successivi (Cortelazzo, Mikros & Tuzzi 2018 e in preparazione) hanno mostrato che, invece, le opere non letterarie firmate da Elena Ferrante possono essere attribuite ad autori diversi, tra i quali Anita Raja, di nuovo Starnone e autore collettivo costituito dallo staff della casa editrice E / O. Questo studio integra i risultati delle ricerche precedenti, aggiornando la collocazione di Elena Ferrante nel panorama contemporaneo, in seguito all’uscita del suo ultimo romanzo ("La vita bugiarda degli adulti") e affinando lo studio delle similarità con Domenico Starnone, utilizzando un corpus più ampio di opere di questo autore. Risultano rafforzati i risultati delle ricerche precedenti.
Source:: Italica Wratislaviensia; 2020, 11.1; 123-141
2084-4514
Appears in:: Italica Wratislaviensia
Content provider:: Biblioteka Nauki

Article
