Temat: cluster. - Katalog OPAC zbiorów

Skocz do pozycji: 1.

Tytuł:: Modification of Hinov Method of Variable Selection for Multiple Cluster Structure Analysis
Modyfikacja metody HINoV selekcji zmiennych w analizie wielokrotnych struktur skupień
Autorzy:: Korzeniewski, Jerzy
Powiązania:: https://bibliotekanauki.pl/articles/904539.pdf
Data publikacji:: 2013
Wydawca:: Uniwersytet Łódzki. Wydawnictwo Uniwersytetu Łódzkiego
Tematy:: cluster analysis
variable choice
multiple cluster structures
Opis:: The original HINoV method (Carmone et al., 1999 ) is not robust to the presence of correlated unimodal and uniform variables among noisy variables (e.g. Korzeniewski, 2012). Moreover, HINoV can be applied only to a single cluster structure analysis. In the article, a modification is proposed consisting in grouping all variables (separately for each reference variable) into two classes. One of the classes consists of variables similar to the reference variable, the other consists of variables which are “less similar”. Similarity between two variables is based on the similarity of the data set division into an established number of clusters (from 2 to 10) measured with the modified Rand index. We arrive at a zero-one matrix describing relations between every pair of variables. Then, a set of variables creating the same (the strongest) cluster structure is selected by means of a criterion optimizing the matrix division into four blocks. After completing the first stage selection one can search another cluster structure applying the same procedure to the set of remaining variables. The modification is assessed in a broad experiment based on 2250 data sets generated from the mixtures of normal distribution.
Oryginalna metoda HINoV jest zupełnie nieodporna na występowanie wśród zmiennych zanieczyszczających strukturę skupień zmiennych skorelowanych jednomodalnych lub równomiernych. Ponadto HINoV można stosować tylko w przypadku jednej struktury skupień.W referacie zaproponowana jest modyfikacja polegająca na tym, by, oddzielnie, dla każdej ustalonej zmiennej, grupować zmienne w dwie klasy zmiennych podobnych i niepodobnych do niej w sensie podobieństwa podziału zbioru danych na daną liczbę skupień (od 2 do 10). Otrzymujemy wówczas macierz zerojedynkową opisującą związki pomiędzy każdą parą zmiennych. Następnie, podzbiór zmiennych tworzących tę samą (najsilniejszą) strukturę skupień wybierany jest za pomocą kryterium optymalizującego podział macierzy na cztery bloki. Po wybraniu zmiennych tworzących jedną strukturę skupień można, w dalszym kroku, wybierać zmienne tworzące następną strukturę skupień spośród zmiennych, które nie zostały wybrane w pierwszym kroku. W celu selekcji właściwego bloku macierzy stosowane jest kryterium stabilności podziału zbioru danych oparte na wielokrotnym losowaniu połowy zbioru i porównywaniu podziałów otrzymanych przy pomocy metody k-średnich. Modyfikacja oceniona jest w obszernym eksperymencie symulacyjnym na 2250 zbiorach danych wygenerowanych w postaci mieszanin rozkładów normalnych.
Źródło:: Acta Universitatis Lodziensis. Folia Oeconomica; 2013, 286
0208-6018
2353-7663
Pojawia się w:: Acta Universitatis Lodziensis. Folia Oeconomica
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 2.

Tytuł:: Empirical Evaluation of Oclus and Genrandomclust Algorithms of Generating Cluster Structures
Autorzy:: Korzeniewski, Jerzy
Powiązania:: https://bibliotekanauki.pl/articles/973546.pdf
Data publikacji:: 2014
Wydawca:: Główny Urząd Statystyczny
Tematy:: cluster analysis
cluster structure generation
OCLUS algorithm
genRandomClust algorithm
Opis:: The OCLUS algorithm and genRandomClust algorithm are newest proposals of generating multivariate cluster structures. Both methods have the capacity of controlling cluster overlap, but both do it quite differently. It seems that OCLUS method has much easier, intuitive interpretation. In order to verify this opinion a comparative assessment of both algorithms was carried out. For both methods multiple cluster structures were generated and each of them was grouped into the proper number of clusters using k-means. The groupings were assessed by means of divisions similarity index (modified Rand index) referring to the classification resulting from the generation. The comparison criterion is the behaviour of the overlap parameters of structures. The monotonicity of the overlap parameters with respect to the similarity index is assessed as well as the variability of the similarity index for the fixed value of overlap parameters. Moreover, particular attention is given to checking the existence of an overlap parameter limit for the classical grouping procedures as well as uniform nature of overlap control with respect to all clusters.
Źródło:: Statistics in Transition new series; 2014, 15, 3; 487-494
1234-7655
Pojawia się w:: Statistics in Transition new series
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 3.

Tytuł:: Ocena jakości wyników grupowania – przegląd bibliografii
Cluster Validity Measurement - a Bibliography Review
Autorzy:: Migdał-Najman, Kamila
Powiązania:: https://bibliotekanauki.pl/articles/1830784.pdf
Data publikacji:: 2011-12-31
Wydawca:: Główny Urząd Statystyczny
Tematy:: analiza skupień
wskaźniki jakości grupowania
cluster analysis
cluster validity indices
Opis:: W artykule dokonano syntetycznego przeglądu literatury tematu począwszy od prac P. Jaccarda z roku 1908 a skończywszy na pracach B. Mirkina z 2011 roku. Dokonano próby klasyfikacji znanych wskaźników jakości grupowania, uwzględniając kryteria pochodzące z różnych dyscyplin naukowych. W szczególności dokonano klasyfikacji wskaźników optymalnej liczby skupień jako podklasy wskaźników jakości grupowania. Wyniki prezentowanych badań powinny być użyteczne dla wszystkich zajmujących się problemami grupowania i klasyfikacji.
In the article are presented the synthetic review of the literature from P. Jaccard in 1908 to B. Mirkin, 2011. In this paper, the concept and classification of cluster validity indices are proposed. There are presented classification of validity indices to find the optimal number of clusters. The results of this study should be useful for all concerned with the problems of classification.
Źródło:: Przegląd Statystyczny; 2011, 58, 3-4; 281-299
0033-2372
Pojawia się w:: Przegląd Statystyczny
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 4.

Tytuł:: Clustering of European Countries with Respect to Food Consumption
Grupowanie państw europejskich ze względu na spożycie żywności
Autorzy:: Dudek, Hanna
Orłowski, Arkadiusz
Powiązania:: https://bibliotekanauki.pl/articles/905668.pdf
Data publikacji:: 2006
Wydawca:: Uniwersytet Łódzki. Wydawnictwo Uniwersytetu Łódzkiego
Tematy:: food consumption
cluster analysis
Opis:: Problem of clustering of European countries with respect to food consumption is considered. Data related to average yearly per capita consumption of 14 main categories of food products in 39 countries are collected and analysed. Food consumption data for two years: 2000 and 1993 are elaborated. The year 2000 was because there are no more recent data sets available. The year 1993 was chosen as a good reference point: data for that year are the oldest complete. To perform a reasonable grouping of countries the cluster analysis is performed. As a proper number of cluster is not known in advance, hierarchical methods offered by statistical packages Statgraphics are used. The desirable number of clusters is estimated by distance matrices analysis, dendrograms, and graphical representations of distance between clusters with respect to different clustering stages. Squared Euclidean distance is used as a measure of similarity. It is remarkable that all hierarchical methods applied in this paper, apart from nearest neighborhood approach, lead to very similar classification results. Therefore we believe that obtained results provide a valuable and objective insight into the problem of diversification of food consumption in Europe. It has been verified that in spite of visible changes in food consumption in investigated countries, sets of countries belonging to particular clusters obtained for 2000 and for 1993 are almost indistinguishable.
W artykule rozważono zagadnienie pogrupowania państw europejskich ze względu na konsumpcję żywności. Zgromadzono dane o rocznym spożyciu na osobę 14 głównych grup produktów żywnościowych w 39 państwach. Dane dotyczą konsumpcji żywności w latach 2000 oraz 1993. W celu pogrupowania państw wykorzystano analizę skupień. Z uwagi na brak przesłanek dotyczących liczby skupień zastosowano hierarchiczne metody aglomeracyjne, oprogramowane w pakietach statystycznych Statgraphics. Liczbę skupień ustalono na podstawie analizy macierzy odległości, dendrogramów oraz wykresów odległości skupień względem etapów grupowania. Za miarę podobieństwa przyjęto kwadrat odległości euklidesowej. Ustalono, że poza metodą najbliższego sąsiedztwa, wszystkie hierarchiczne metody aglomeracyjne prowadzą do skupień o zbliżonym zestawie państw. Na podstawie wykonanej analizy skupień stwierdzono, że mimo zmian w spożyciu produktów żywnościowych w poszczególnych krajach, zestawy państw w otrzymanych skupieniach w roku 2000 i 1993 były niemal identyczne.
Źródło:: Acta Universitatis Lodziensis. Folia Oeconomica; 2006, 196
0208-6018
2353-7663
Pojawia się w:: Acta Universitatis Lodziensis. Folia Oeconomica
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 5.

Tytuł:: Application of Density Based Clustering to Microarray Data Analysis
Autorzy:: Raczynski, L.
Wozniak, K.
Rubel, T.
Zaremba, K.
Powiązania:: https://bibliotekanauki.pl/articles/226804.pdf
Data publikacji:: 2010
Wydawca:: Polska Akademia Nauk. Czytelnia Czasopism PAN
Tematy:: microarrays
cluster analysis
DBSCAN
Opis:: In just a few years, gene expression microarrays have rapidly become a standard experimental tool in the biological and medical research. Microarray experiments are being increasingly carried out to address the wide range of problems, including the cluster analysis. The estimation of the number of clusters in datasets is one of the main problems of clustering microarrays. As a supplement to the existing methods we suggest the use of a density based clustering technique DBSCAN that automatically defines the number of clusters. The DBSCAN and other existing methods were compared using the microarray data from two datasets used for diagnosis of leukemia and lung cancer.
Źródło:: International Journal of Electronics and Telecommunications; 2010, 56, 3; 281-286
2300-1933
Pojawia się w:: International Journal of Electronics and Telecommunications
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 6.

Tytuł:: Statistical methods and algorithms for spatio-temporal cluster analysis
Autorzy:: Abramovich, M.
Mitskevich, M.
Powiązania:: https://bibliotekanauki.pl/articles/92828.pdf
Data publikacji:: 2017
Wydawca:: Uniwersytet Przyrodniczo-Humanistyczny w Siedlcach
Tematy:: cluster analysis
spatiotemporal
scan statistic
flexible
robust
algorithm
cluster construction
thyroid carcinoma
detection
Opis:: The global clusterization test and scan statistic method for studying geographical distribution of the objects are considered. The algorithm of windows set construction for the flexible spatial was developed. The robust version of spatial scan statistic method is proposed. The children carcinoma of the Belarus was analyzed using scan statistic method.
Źródło:: Studia Informatica : systems and information technology; 2017, 1-2(21); 5-14
1731-2264
Pojawia się w:: Studia Informatica : systems and information technology
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 7.

Tytuł:: Bayesian discrimination method for special covariance structure
Autorzy:: Pasewicz, Wiesław
Powiązania:: https://bibliotekanauki.pl/articles/748617.pdf
Data publikacji:: 1985
Wydawca:: Polskie Towarzystwo Matematyczne
Tematy:: Classification and discrimination
cluster analysis
Opis:: .
Let us assume that the observed random vector from population has a p-dimensional normal distribution with a mean vector and a positive definite covariance matrix. A multivariate observation is known and it belongs to one of two multivariate normal populations but it is not known to which. Let E be the pxp matrix with each element eąual to unity and let I be the p x p identity matrix. In the paper we consider a Bayesian discrimination between s.
Źródło:: Mathematica Applicanda; 1985, 13, 26
1730-2668
2299-4009
Pojawia się w:: Mathematica Applicanda
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 8.

Tytuł:: The Use of Cluster Analysis and the Theory of Mathematical Records in the Process of Planning the Production-Warehouse Flow
Autorzy:: Bujak, A.
Topolska, K.
Topolski, M.
Powiązania:: https://bibliotekanauki.pl/articles/409409.pdf
Data publikacji:: 2016
Wydawca:: Politechnika Poznańska. Wydawnictwo Politechniki Poznańskiej
Tematy:: cluster analysis
warehousing
logistics flows
Opis:: The paper proposes a new approach to the agglomeration of data in cluster analysis. These approach faces the problem of data classification where under the same conditions different conclusions are draw. Such problems occur in many areas of daily life: medicine, technology, and economic and social sciences and economics. The new approach assumes that events like the harvest are attributed to the cumulative probability of their occurrence at the same time. Such approaches will not be found in probability. Thanks to the mathematical theory of records fairly accurate classification of the object can be provided. The paper presents a method of agglomeration of measurement data using the mathematical theory of records. This is the method which can be used in the cluster analysis by agglomeration.
Źródło:: Research in Logistics & Production; 2016, 6, 3; 259-267
2083-4942
2083-4950
Pojawia się w:: Research in Logistics & Production
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 9.

Tytuł:: An entropy based non-wrapper approach for choosing variables in cluster analysis
Autorzy:: Korzeniewski, Jerzy
Powiązania:: https://bibliotekanauki.pl/articles/657955.pdf
Data publikacji:: 2011
Wydawca:: Uniwersytet Łódzki. Wydawnictwo Uniwersytetu Łódzkiego
Tematy:: cluster analysis
entropy
variable choice
Opis:: W artykule badamy sprawność algorytmu wybierania zmiennych w analizie skupień opartego na entropii (por. Dash, Liu, 2000). Ocena oparta jest na eksperymencie, w którym zbiory generowane są w postaci mieszanin rozkładów normalnych. Wyniki wskazują na to. że metoda nie radzi sobie tak dobrze jak to sugerowali Autorzy.
Źródło:: Acta Universitatis Lodziensis. Folia Oeconomica; 2011, 255
0208-6018
2353-7663
Pojawia się w:: Acta Universitatis Lodziensis. Folia Oeconomica
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 10.

Tytuł:: Cluster analysis with clusterSim computer program and R environment
Zagadnienia analizy skupień z wykorzystaniem programu komputerowego clusterSim i środowiska R
Autorzy:: Walesiak, Marek
Powiązania:: https://bibliotekanauki.pl/articles/907036.pdf
Data publikacji:: 2008
Wydawca:: Uniwersytet Łódzki. Wydawnictwo Uniwersytetu Łódzkiego
Tematy:: cluster analysis
R
clusterSim
data analysis
Opis:: The article presents auxiliary functions of clusterSim package (see Walesiak & Dudek (2006)) and selected functions of packages stats, cluster, and ade4, which are applied to solving clustering problems. In addition, the examples of the procedures for solving different clustering problems are presented. These procedures, which are not available in statistical packages (SPSS, Statistica, SAS), can help solving a broad range of classification problems.
W artykule scharakteryzowano funkcje pomocnicze pakietu clusterSim oraz wybrane funkcje pakietów stats, cluster i ade4 służące zagadnieniu analizy skupień. Ponadto zaprezentowano przykładowe procedury, wykorzystujące analizowane funkcje, ułatwiające potencjalnemu użytkownikowi realizację wielu zagadnień klasyfikacyjnych niedostępnych w podstawowych pakietach statystycznych (np. SPSS, Statistica, SAS).
Źródło:: Acta Universitatis Lodziensis. Folia Oeconomica; 2008, 216
0208-6018
2353-7663
Pojawia się w:: Acta Universitatis Lodziensis. Folia Oeconomica
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 11.

Tytuł:: Cluster Analysis as a Tool for Strategic Analysis at the State Level
Autorzy:: Dawidczyk, Andrzej
Powiązania:: https://bibliotekanauki.pl/articles/1375052.pdf
Data publikacji:: 2020
Wydawca:: Wyższa Szkoła Policji w Szczytnie
Tematy:: security methodology
strategic analysis
cluster analysis
Opis:: The article presents the evolution of the idea of approach to conducting strategic analysis of the state’s environment, used to assess its functioning in the existing and forecasted conditions, which can also be used in the analysis conducted at lower levels of administration. The author presents one of the methods of strategic analysis used in the process of strategic planning in the field of national security. This method can also be successfully used when designing a defence strategy or police development strategy, as it precedes the process of formulating objectives and action concepts. According to the author, the advantages of the proposed method called cluster analysis are its universality, transparency and simplicity of application. The proposed method was preceded by reflections on the issues of strategic analysis related to the current practice of strategic planning in Poland, as well as similar to the presented taxonomic method, to finally focus the reader’s attention on cluster analysis, a mathematical method of data grouping, which can be a proposal to conduct strategic analysis at the level of the state and its institutions.
Źródło:: Internal Security; 2020, 12(1); 97-111
2080-5268
Pojawia się w:: Internal Security
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 12.

Tytuł:: Cluster analysis of medical text documents by using semi-clustering approach based on graph representation
Autorzy:: Woźniak, R.
Ożdżyński, P.
Zakrzewska, D.
Powiązania:: https://bibliotekanauki.pl/articles/94773.pdf
Data publikacji:: 2018
Wydawca:: Szkoła Główna Gospodarstwa Wiejskiego w Warszawie. Wydawnictwo Szkoły Głównej Gospodarstwa Wiejskiego w Warszawie
Tematy:: cluster analysis
semi-clustering
text mining
Opis:: The development of Internet resulted in an increasing number of online text repositories. In many cases, documents are assigned to more than one class and automatic multi-label classification needs to be used. When the number of labels exceeds the number of the documents, effective label space dimension reduction may significantly improve classification accuracy, what is a major priority in the medical field. In the paper, we propose document clustering for label selection. We use semiclustering method, by considering graph representation, where documents are represented by vertices and edge weights are calculated according to their mutual similarity. Assigning documents to semi-clusters helps in reducing number of labels, further used in multi-label classification process. The performance of the method is examined by experiments conducted on real medical datasets.
Źródło:: Information Systems in Management; 2018, 7, 3; 213-224
2084-5537
2544-1728
Pojawia się w:: Information Systems in Management
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 13.

Tytuł:: Genetic diversity of winter wheat cultivars and strains determined by electrophoregrams of gliadin and glutenin proteins
Autorzy:: Węgrzyn, Stanisław
Waga, Jacek
Powiązania:: https://bibliotekanauki.pl/articles/2198933.pdf
Data publikacji:: 2004-06-20
Wydawca:: Instytut Hodowli i Aklimatyzacji Roślin
Tematy:: cluster analysis
electrophoresis
gliadins
glutenins
polymorphism
Opis:: Based on the polymorphism of gliadin and glutenin proteins relationships of 45 cultivars and strains of winter wheat were evaluated. The cluster analysis showed a considerable variation of the investigated genotypes. The similarity indices were calculated using the Nei and Li formula. The genetic distances between the cultivars ranged from 1.00 to 0.12. The highest similarity index - SI=1.00- being proof of the identical physicochemical composition of storage proteins, was found for the pair Farmer and Elena. The groups of similar and genetically distant cultivars have been presented in the form of a dendrogram. The possibility of using the results obtained from the cluster analysis in breeding programmes has been discussed.
Źródło:: Plant Breeding and Seed Science; 2004, 49; 51-61
1429-3862
2083-599X
Pojawia się w:: Plant Breeding and Seed Science
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 14.

Tytuł:: A Proposal of Modification of Agglomerative Clustering Algorithms
Propozycja modyfikacji alorytmów aglomeracynych konstruowania skupień
Autorzy:: Korzeniewski, Jerzy
Powiązania:: https://bibliotekanauki.pl/articles/906270.pdf
Data publikacji:: 2009
Wydawca:: Uniwersytet Łódzki. Wydawnictwo Uniwersytetu Łódzkiego
Tematy:: cluster analysis
agglomerative algorithms
silhouette indices
Opis:: W pracy przedstawiono propozycję modyfikacji dowolnego algorytmu aglomeracyjnego łączenia obserwacji w skupienia. Ideą modyfikacji jest położenie większego nacisku na łączenie skupień w tych obszarach, w których lokalna gęstość rozkładu obserwacji jest większa. Modyfikację zastosowano do czterech klasycznych algorytmów: aglomeracji pojedynczego połączenia, całkowitego połączenia, środka ciężkości i średniej odległości klasowej. Jakość otrzymywanych grupowań była oceniana przy pomocy odsetka obserwacji o ujemnym indeksie sylwetkowym. Wyniki pokazują, że zaproponowane modyfikacje prawie zawsze poprawiają tradycyjne algorytmy.
In the paper, a modification o f agglomerative clustering algorithms is proposed which can be applied to any kind o f agglomeraitve algorithm. The idea o f die modification is to stress the local density o f observations’ distribution, while performing clustering based on the dissimilarity matrix. The following clustering algorithms are examined: single link, complete link, group average link and centroid link. The quality o f clustering is assessed by means o f the silhouette indices on subsets generated with the Milligan’s Clustgen software. The results prove that the Author’s modifications almost always improve the standard methods.
Źródło:: Acta Universitatis Lodziensis. Folia Oeconomica; 2009, 228
0208-6018
2353-7663
Pojawia się w:: Acta Universitatis Lodziensis. Folia Oeconomica
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 15.

Tytuł:: Przegląd technik grupowania danych i obszary zastosowań
Autorzy:: Sala, Karolina
Powiązania:: https://bibliotekanauki.pl/articles/2157869.pdf
Data publikacji:: 2017
Wydawca:: Instytut Studiów Międzynarodowych i Edukacji Humanum
Tematy:: cluster analysis
hierarchical clustering
k-means
Opis:: The paper presents an overview of various clustering techniques used in data mining. Clustering is an unsupervised learning problem that is used to identify groups in a set of unlabeled data. Data is grouped by probability so that objects of the same group / cluster have similar properties / characteristics [1]. This article aims at exploring and comparing different clustering algorithms. Grouping is used in many areas, including machine learning, pattern recognition, image analysis, information retrieval.
Źródło:: Społeczeństwo i Edukacja. Międzynarodowe Studia Humanistyczne; 2017, 2(25); 141-145
1898-0171
Pojawia się w:: Społeczeństwo i Edukacja. Międzynarodowe Studia Humanistyczne
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Informacja

Wyszukujesz frazę "cluster." wg kryterium: Temat

Źródło danych

Dostawca treści

Kolekcja

Rok wydania

Wydawca

Temat

Autor

Typ dokumentu

Język