Temat: grupowanie danych - Katalog OPAC zbiorów

Skocz do pozycji: 1.

Tytuł:: Linguistically defined clustering of data
Autorzy:: Leski, J. M.
Kotas, M. P.
Powiązania:: https://bibliotekanauki.pl/articles/329995.pdf
Data publikacji:: 2018
Wydawca:: Uniwersytet Zielonogórski. Oficyna Wydawnicza
Tematy:: data clustering
possibility theory
linguistic rules
data analysis
grupowanie danych
teoria możliwości
analiza danych
Opis:: This paper introduces a method of data clustering that is based on linguistically specified rules, similar to those applied by a human visually fulfilling a task. The method endeavors to follow these remarkable capabilities of intelligent beings. Even for most complicated data patterns a human is capable of accomplishing the clustering process using relatively simple rules. His/her way of clustering is a sequential search for new structures in the data and new prototypes with the use of the following linguistic rule: search for prototypes in regions of extremely high data densities and immensely far from the previously found ones. Then, after this search has been completed, the respective data have to be assigned to any of the clusters whose nuclei (prototypes) have been found. A human again uses a simple linguistic rule: data from regions with similar densities, which are located exceedingly close to each other, should belong to the same cluster. The goal of this work is to prove experimentally that such simple linguistic rules can result in a clustering method that is competitive with the most effective methods known from the literature on the subject. A linguistic formulation of a validity index for determination of the number of clusters is also presented. Finally, an extensive experimental analysis of benchmark datasets is performed to demonstrate the validity of the clustering approach introduced. Its competitiveness with the state-of-the-art solutions is also shown.
Źródło:: International Journal of Applied Mathematics and Computer Science; 2018, 28, 3; 545-557
1641-876X
2083-8492
Pojawia się w:: International Journal of Applied Mathematics and Computer Science
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 2.

Tytuł:: Wyznaczanie wartości granicznych z wykorzystaniem metod grupowania danych
The determination of limit values using methods of data clustering
Autorzy:: Targosz, M.
Timofiejczuk, A.
Powiązania:: https://bibliotekanauki.pl/articles/257429.pdf
Data publikacji:: 2011
Wydawca:: Sieć Badawcza Łukasiewicz - Instytut Technologii Eksploatacji - Państwowy Instytut Badawczy
Tematy:: wartości graniczne
grupowanie danych
modelowanie uszkodzeń
limits values
data clustering
failure modeling
Opis:: W artykule zaproponowano podejście do wyznaczenia wartości granicznych za pomocą algorytmów rozmytego grupowania danych. Wykorzystano algorytmy FCM, PCM oraz algorytm Gustafsona-Kessela. Eksperyment przeprowadzano na danych symulacyjnych. W tym celu zbudowano model numeryczny maszyny wirnikowej, symulującej określone stany i wielkości niewyważenia. Wyznaczone wartości graniczne porównano z wartościami otrzymanymi przy pomocy metody statystycznej. Wszystkie obliczenia wykonywano w środowisku Matlab-Simulink.
The paper describes a methodology for estimating the limit values of char-icteristics of diagnostic signals using methods of fuzzy data clustering (FCM, PCM and Gustafson-Kessel algorithms). The experiment was conducted on simulated data, using a numerical model of a rotor machine, simulating given inbalanced states. Limits were compared with value estimating using the statistical method.
Źródło:: Problemy Eksploatacji; 2011, 2; 213-221
1232-9312
Pojawia się w:: Problemy Eksploatacji
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 3.

Tytuł:: A quaternion clustering framework
Autorzy:: Piórek, Michał
Jabłoński, Bartosz
Powiązania:: https://bibliotekanauki.pl/articles/330038.pdf
Data publikacji:: 2020
Wydawca:: Uniwersytet Zielonogórski. Oficyna Wydawnicza
Tematy:: data clustering
quaternions data processing
human gait
data processing
grupowanie danych
chód człowieka
przetwarzanie danych
Opis:: Data clustering is one of the most popular methods of data mining and cluster analysis. The goal of clustering algorithms is to partition a data set into a specific number of clusters for compressing or summarizing original values. There are a variety of clustering algorithms available in the related literature. However, the research on the clustering of data parametrized by unit quaternions, which are commonly used to represent 3D rotations, is limited. In this paper we present a quaternion clustering methodology including an algorithm proposal for quaternion based k-means along with quaternion clustering quality measures provided by an enhancement of known indices and an automated procedure of optimal cluster number selection. The validity of the proposed framework has been tested in experiments performed on generated and real data, including human gait sequences recorded using a motion capture technique.
Źródło:: International Journal of Applied Mathematics and Computer Science; 2020, 30, 1; 133-147
1641-876X
2083-8492
Pojawia się w:: International Journal of Applied Mathematics and Computer Science
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 4.

Tytuł:: Unsupervised clustering for fetal state assessment based on selected features of the cardiotocographic signals
Autorzy:: Przybyła, T.
Jeżewski, J.
Roj, D.
Powiązania:: https://bibliotekanauki.pl/articles/333112.pdf
Data publikacji:: 2009
Wydawca:: Uniwersytet Śląski. Wydział Informatyki i Nauki o Materiałach. Instytut Informatyki. Zakład Systemów Komputerowych
Tematy:: klasyfikacja algorytmów bez nadzoru
grupowanie danych
monitoring płodu
kardiotokografia
unsupervised classification
fuzzy clustering
principal component analysis
fetal monitoring
Opis:: In modern obstetrics the cardiotocography is a routine method of fetal condition assessment based mainly on analysis of the fetal heart rate signals. The correct interpretation of recorded traces from a bedside monitor is very difficult even for experienced clinicians. Therefore, computerized fetal monitoring systems are used to yield the quantitative description of the signal. However, the effective techniques enabling automated conclusion generation based on cardiotocograms are still being searched. The paper presents an attempt to diagnose the fetal state basing on seventeen features describing the cardiotocographic records. The proposed method applies the unsupervised classification of signals. During our research we tried to classify the fetal state using the fuzzy c-means (FCM) clustering. We also tested how the efficiency of classification could be influenced by application of principal component analysis (PCA) algorithm. The obtained results showed that unsupervised classification cannot be considered as a support to fetal state assessment.
Źródło:: Journal of Medical Informatics & Technologies; 2009, 13; 157-162
1642-6037
Pojawia się w:: Journal of Medical Informatics & Technologies
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 5.

Tytuł:: Estimation of parameters of Gaussian mixture models by a hybrid method combining a self-adaptive differential evolution with the EM algorithm
Estymacja parametrów modeli mieszanin rozkładów normalnych przy pomocy metody hybrydowej łączącej samoadaptacyjną ewolucję różnicową z algorytmem EM
Autorzy:: Kwedlo, W.
Powiązania:: https://bibliotekanauki.pl/articles/88410.pdf
Data publikacji:: 2014
Wydawca:: Politechnika Białostocka. Oficyna Wydawnicza Politechniki Białostockiej
Tematy:: mieszaniny rozkładów normalnych
ewolucja różnicowa
algorytm EM
grupowanie danych
Gaussian mixture models
differential evolution
expectation maximization
model-based clustering
Opis:: In the paper the problem of learning of Gaussian mixture models (GMMs) is considered. A new approach based on hybridization of a self-adaptive version of differential evolution (DE) with the classical EM algorithm is described. In this approach, called DEEM, the EM algorithm is run until convergence to fine-tune each solution obtained by the mutation and crossover operators of DE. To avoid the problem with parameter representation and infeasible solutions we use a method in which the covariance matrices are encoded using their Cholesky factorizations. In a simulation study GMMs were used to cluster synthetic datasets differing by a degree of separation between clusters. The results of experiments indicate that DE-EM outperforms the standard multiple restart expectation-maximization algorithm (MREM). For datasets with high number of features it also outperforms the state of-the-art random swap EM (RSEM).
W pracy poruszono problem uczenia modeli mieszanin rozkładów normalnych. Zaproponowano nowe podejście, nazwane DE-EM, oparte na hybrydyzacji samoadaptacyjnego algorytmu ewolucji różnicowej i klasycznego algorytmu EM. W nowej metodzie rozwiązanie otrzymane jako wynik operatorów mutacji i krzyżowania jest poddawane optymalizacji lokalnej, prowadzonej aż do momentu uzyskania zbieżności, przez algorytm EM. Aby uniknąć problemu z reprezentacją macierzy kowariancji i niedopuszczalności rozwiązań użyto metody, w której macierze kowariancji są kodowane przy pomocy dekompozycji Cholesky’ego. W badaniach symulacyjnych modele mieszanin rozkładów normalnych zastosowano do grupowania danych syntetycznych. Wyniki eksperymentów wskazują, że metoda DE-EM osiąga lepsze wyniki niż standardowa technika wielokrotnego startu algorytmu ˙ EM. Dla zbiorów danych z dużą liczbą cech, metoda osiąga lepsze wyniki niż technika losowej wymiany rozwiązań połączona z algorytmem EM.
Źródło:: Advances in Computer Science Research; 2014, 11; 109-123
2300-715X
Pojawia się w:: Advances in Computer Science Research
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 6.

Tytuł:: Wybrane aspekty aktywności zawodowej kobiet w państwach UE. Analiza dla lat 2006–2014
Participation of women in the labormarket in the years 2006–2014. EU ststes clustering
Autorzy:: Matuszewska-Janica, Aleksandra
Powiązania:: https://bibliotekanauki.pl/articles/658551.pdf
Data publikacji:: 2016
Wydawca:: Uniwersytet Łódzki. Wydawnictwo Uniwersytetu Łódzkiego
Tematy:: rynek pracy
aktywność zawodowa kobiet
grupowanie danych
metoda k-średnich
labour market
female employment activity
data clustering
k-mean method
Opis:: The main aim of the analysis was European Union's states clustering in point of view of female activity on the labour market. The data was obtained from the Eurosat's Labour Force Surveys from years 2006–2014. K-means algorithms was the main analysis method. Obtained results indicate to a wide variety of different forms of women's labour market participation among European Union countries. We can mention three phenomena that have been most highlighted. (1) We found a high percentage of women working in part-time work, especially in the Netherlands. (2) The results show a high proportion of full-time employed women, especially in the EU countries from the former Eastern Bloc. (3) We noted a significant proportion of selfemployed among the employed women in the countries of Southern Europe (Greece, Italy, Portugal, Spain) and in Poland. So, in terms of female economic activity, Poland has a similar pattern as the Southern European countries (also in terms of self-employment).
Jednym z założeń strategii Europa 2020 jest podwyższenie odsetka osób zatrudnionych w wieku 20–64 lata na terenie Unii Europejskiej (UE) do 75%. Realizacja tej strategii ma nastąpić między innymi poprzez zwiększenie aktywności zawodowej kobiet i osób starszych. Powiększenie zasobów siły roboczej ma swoje ekonomiczne uzasadnienie. Związane jest ono przede wszystkim z procesami demograficznymi, a zwłaszcza z powiększaniem się grupy osób w wieku poprodukcyjnym. W różnych państwach UE sytuacja związana z aktywnością zawodową kobiet kształtuje się odmiennie, dlatego też powstało pytanie o podobieństwa i różnice w zaangażowaniu kobiet na rynku pracy w poszczególnych regionach UE. Celem analizy jest grupowanie państw Unii Europejskiej ze względu na różne formy aktywności zawodowej kobiet. Klasyfikacja ta pozwoli na ocenę wzorców dotyczących aktywności zawodowej kobiet w poszczególnych krajach UE. W analizie wykorzystano metodę k-średnich. Dane pochodzą z zasobów Eurostatu z Badania Aktywności Ekonomicznej Ludności. Analizowany okres to lata 2006–2014.
Źródło:: Acta Universitatis Lodziensis. Folia Oeconomica; 2016, 5, 325
0208-6018
2353-7663
Pojawia się w:: Acta Universitatis Lodziensis. Folia Oeconomica
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 7.

Tytuł:: Selection of clustering methods for wind turbines operational data
Dobór metod grupowania danych procesowych dla turbin wiatrowych
Autorzy:: Gibiec, M.
Barszcz, T.
Bielecka, M.
Powiązania:: https://bibliotekanauki.pl/articles/327686.pdf
Data publikacji:: 2010
Wydawca:: Polska Akademia Nauk. Polskie Towarzystwo Diagnostyki Technicznej PAN
Tematy:: diagnostyka maszyn
turbina wiatrowa
eksploracja danych
grupowanie
machine diagnostics
wind turbine
data mining
clustering
Opis:: Quickly growing number of monitored wind turbines has changed the needs for monitoring and diagnostic algorithms. The data from hundreds of monitoring systems are transferred to the diagnostic centers, where the data should be analyzed. High cost of labor created the need for automated diagnostic methods. The first task in this wide discipline is classification of the data and detection of malfunction states. The paper investigates application of data mining methods for classification of operational data from wind turbines. It is shown, that combination of the agglomeration method with the C-means clustering yields very good results and can be used for automated diagnostics of wind farms.
Szybko rosnąca liczba monitorowanych turbin wiatrowych zmieniła potrzeby w zakresie algorytmów monitorowania diagnostyki. Obecnie dane z setek systemów monitorowania przesyłane są do centrów diagnostycznych, gdzie muszą zostać przeanalizowane. Wysokie koszty pracy ekspertów spowodowały potrzebę zautomatyzowania metod diagnostycznych. Pierwszym zadaniem stała się automatyczna klasyfikacja danych i wykrywanie stanów niesprawności. Artykuł przedstawia zastosowanie metod "data mining" do klasyfikacji danych procesowych z turbin wiatrowych. Pokazano, że połączenie metody aglomeracji danych z metodą K-means daje bardzo dobre wyniki i może być zastosowane do zautomatyzowanej diagnostyki farm wiatrowych.
Źródło:: Diagnostyka; 2010, 4(56); 37-42
1641-6414
2449-5220
Pojawia się w:: Diagnostyka
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 8.

Tytuł:: Inefficiency of data mining algorithms and its architecture: with emphasis to the shortcoming of data mining algorithms on the output of the researches
Autorzy:: Tesema, Workineh
Powiązania:: https://bibliotekanauki.pl/articles/118221.pdf
Data publikacji:: 2019
Wydawca:: Polskie Towarzystwo Promocji Wiedzy
Tematy:: data mining
classification
clustering
association
regression
algorithms bottleneck
pozyskiwanie danych
klasyfikacja
grupowanie
asocjacja
regresja
wąskie gardło algorytmów
Opis:: This review paper presents a shortcoming associated to data mining algorithm(s) classification, clustering, association and regression which are highly used as a tool in different research communities. Data mining researches has successfully handling large amounts of dataset to solve the problems. An increase in data sizes was brought a bottleneck on algorithms to retrieve hidden knowledge from a large volume of datasets. On the other hand, data mining algorithm(s) has been unable to analysis the same rate of growth. Data mining algorithm(s) must be efficient and visual architecture in order to effectively extract information from huge amounts of data in many data repositories or in dynamic data streams. Data visualization researchers believe in the importance of giving users an overview and insight into the data distributions. The combination of the graphical interface is permit to navigate through the complexity of statistical and data mining techniques to create powerful models. Therefore, there is an increasing need to understand the bottlenecks associated with the data mining algorithms in modern architectures and research community. This review paper basically to guide and help the researchers specifically to identify the shortcoming of data mining techniques with domain area in solving a certain problems they will explore. It also shows the research areas particularly a multimedia (where data can be sequential, audio signal, video signal, spatio-temporal, temporal, time series etc) in which data mining algorithms not yet used.
Źródło:: Applied Computer Science; 2019, 15, 3; 73-86
1895-3735
Pojawia się w:: Applied Computer Science
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 9.

Tytuł:: A complete gradient clustering algorithm formed with kernel estimators
Autorzy:: Kulczycki, P.
Charytanowicz, M.
Powiązania:: https://bibliotekanauki.pl/articles/907781.pdf
Data publikacji:: 2010
Wydawca:: Uniwersytet Zielonogórski. Oficyna Wydawnicza
Tematy:: analiza danych
eksploracja danych
grupowanie
metoda statystyczna
estymacja jądrowa
obliczenia numeryczne
data analysis
data mining
clustering
gradient procedures
nonparametric statistical methods
kernel estimators
numerical calculations
Opis:: The aim of this paper is to provide a gradient clustering algorithm in its complete form, suitable for direct use without requiring a deeper statistical knowledge. The values of all parameters are effectively calculated using optimizing procedures. Moreover, an illustrative analysis of the meaning of particular parameters is shown, followed by the effects resulting from possible modifications with respect to their primarily assigned optimal values. The proposed algorithm does not demand strict assumptions regarding the desired number of clusters, which allows the obtained number to be better suited to a real data structure. Moreover, a feature specific to it is the possibility to influence the proportion between the number of clusters in areas where data elements are dense as opposed to their sparse regions. Finally, the algorithm-by the detection of one-element clusters-allows identifying atypical elements, which enables their elimination or possible designation to bigger clusters, thus increasing the homogeneity of the data set.
Źródło:: International Journal of Applied Mathematics and Computer Science; 2010, 20, 1; 123-134
1641-876X
2083-8492
Pojawia się w:: International Journal of Applied Mathematics and Computer Science
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 10.

Tytuł:: Zone estimation with cluster analysis of public transport stops
Ocena strefowa z analizą skupień przystanków transportu publicznego
Autorzy:: Horváth, B.
Nagy, V.
Powiązania:: https://bibliotekanauki.pl/articles/192544.pdf
Data publikacji:: 2016
Wydawca:: Stowarzyszenie Inżynierów i Techników Komunikacji Rzeczpospolitej Polskiej
Tematy:: public transport
big data
time series
similarity matrix
clustering
transport publiczny
duże zbiory danych
szeregi czasowe
podobieństwa macierzy
grupowanie
Opis:: Nowadays the data sets are spreading continually, generated by different devices and systems. The public transport is also not an exception in this. The modern GPS based tracking systems and the electronic tickets are producing lot of data, and we could use them for improving the service level. In the right case, these data are storing, and the service suppliers are not dealing with its information content, but, on the other hand, maybe they are just deleting these, in the interest of the avoidance of digital space occupancy. However, these data are processable with the modern devices and methods, and we can use them for information obtaining. Thanks to the spread of data mining, these tools are not appearing only in marketing research but in the most various kind of scientific area too, and they are advertising a new scientific revolution. Although the importance of these data sources is essential it is not widespread in general in transport planning, only in some specific areas [1] as described by Csiszár et al. This article presents possible application of the digital raw materials, taking the public transport passengers boarding information as base. We created a three step method which could be useful in automatic zone shaping or to supervise the manually created zone borders. It is also able to give help in land-usage examinations. The procedure is effective in making traveling chains from smart card data and in creation of origin destination matrix from check-in data. In this article we are showing how the zone distribution is possible with the assistance of different distance measurement methods and clustering procedures, and we are presenting the results on the example of a selected city.
Obecnie nieustannie powstają różne zbiory danych, generowane przez różne urządzenia i systemy. Transport publiczny nie jest w tym zakresie wyjątkiem. Nowoczesne systemy monitorowania oparte na GPS i bilety elektroniczne wytwarzają duże ilości danych, i moglibyśmy je wykorzystać dla poprawy poziomu usług. Z jednej strony dane te są przechowywane, a dostawcy usług nie mają do czynienia z zawartością informacji, ale z drugiej strony, być może są one po prostu usuwane, aby ograniczyć obciążanie cyfrowej przestrzeni. Dane te mogą być przetwarzane dzięki nowoczesnym urządzeniom i metodom, i możemy je wykorzystać do uzyskania informacji. Dzięki rozprzestrzenianiu eksploracji danych, narzędzia te pojawiają się nie tylko w badaniach marketingowych, ale w większości różnych działań badawczych, i reklamują one nową rewolucję naukową. Chociaż znaczenie tych źródeł danych jest zasadnicze, nie jest to rozpowszechnione w planowaniu transportu, a jedynie w pewnych określonych obszarach [1], tak jak pisze Csiszár i in. W artykule przedstawiono możliwości zastosowania materiałów nieprzetworzonych, biorąc jako podstawę informacje o pasażerach podróżujących transportem publicznym. Stworzono metodę trzech kroków, która może być przydatna do automatycznego kształtowania strefy lub do nadzorowania granic stref utworzonych konwencjonalnie. Może też przydać się przy kontroli gruntów użytkowych. Procedura ta jest skuteczna w tworzeniu łańcuchów podróży z danych z kart inteligentnych oraz w tworzeniu macierzy żródło-cel na podstawie danych zameldowania. W artykule pokazano, w jaki sposób możliwa jest dystrybucja stref, za pomocą różnych metod pomiaru odległości i procedur gromadzenia, i zaprezentowano tego efekty na przykładzie wybranego miasta.
Źródło:: Transport Miejski i Regionalny; 2016, 7; 9-12
1732-5153
Pojawia się w:: Transport Miejski i Regionalny
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 11.

Tytuł:: Visualization of medical rule-based knowledge bases
Autorzy:: Nowak-Brzezińska, A.
Rybotycki, T.
Powiązania:: https://bibliotekanauki.pl/articles/333277.pdf
Data publikacji:: 2015
Wydawca:: Uniwersytet Śląski. Wydział Informatyki i Nauki o Materiałach. Instytut Informatyki. Zakład Systemów Komputerowych
Tematy:: data mining
medical knowledge bases
cluster visualization
hierarchical clustering
treemaps
eksploracja danych
medyczne bazy wiedzy
wizualizacja skupień
grupowanie hierarchiczne
diagramy
Opis:: In this work the topic of applying clustering as a knowledge extraction method from real-world data is discussed. The authors propose hierarchical clustering method and visualization technique for knowledge base representation in the context of medical knowledge bases for which data mining techniques are successfully employed and may resolve different problems. What is more, the authors analyze the impact of different clustering parameters on the result of searching through such a structure. Particular attention was also given to the problem of cluster visualization. Authors review selected, two-dimensional approaches, stating their advantages and drawbacks in the context of representing complex cluster structures.
Źródło:: Journal of Medical Informatics & Technologies; 2015, 24; 91-98
1642-6037
Pojawia się w:: Journal of Medical Informatics & Technologies
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 12.

Tytuł:: PaX-DBSCAN: a proposed algorithm for improved clustering
PaX-DBSCAN: propozycja algorytmu dla doskonalonego grupowania
Autorzy:: Samson, Grace L.
Lu, Joan
Powiązania:: https://bibliotekanauki.pl/articles/592926.pdf
Data publikacji:: 2016
Wydawca:: Uniwersytet Ekonomiczny w Katowicach
Tematy:: Bulk-loading
Clustering
Parallel computing
Partition
Spatial database
Spatial index
X-tree
Algorytm bulk loading
Grupowanie
Indeks przestrzenny
Przestrzenne bazy danych
Przetwarzanie równoległe
Rozdzielanie
Struktura drzewiasta X-tree
Opis:: We focused on applying parallel computing technique to the bulk loading of X-tree in other to improve the performance of DBSCAN clustering algorithm. We have given a full description of how the system can be archived. We proposed a new parallel algorithm for DBSCAN and another algorithm to extend the X-tree spatial indexing structure. Spatial database systems incorporate space in database systems, they support nontraditional data types and more complex queries, therefore in order to optimise such systems for efficient information processing and retrieval, appropriate techniques must be adopted to facilitate the construction of suitable index structures.
W artykule autorzy skupiają swoją uwagę na zastosowaniu techniki przetwarzania równoległego przy wykorzystaniu struktur drzewiastych X-tree i algorytmu bulk loading. Zaproponowano nowy algorytm przetwarzania równoległego DBSCAN i drugi algorytm dla rozszerzania struktur indeksowania przestrzennego. Algorytm grupowania DBSCAN jest efektywnym algorytmem grupowania dla Systemów Przestrzennych Baz Danych, który ma możliwość wykrywania zakłóceń i nie wymaga znacznej liczby skupień wcześniej ustalonych, jednakże działanie algorytmu zmienia się, gdy rozmiar danych jest duży. Ten algorytm może nie działać optymalnie, jeśli niewłaściwe wartości są wybrane dla minpts i eps. Dlatego nowy zaproponowany algorytm powinien eliminować te ograniczenia.
Źródło:: Studia Ekonomiczne; 2016, 296; 86-121
2083-8611
Pojawia się w:: Studia Ekonomiczne
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 13.

Tytuł:: Wykorzystanie gradacyjnej analizy danych do klasyfikacji podregionów pod względem struktury agrarnej
Use gradation data analysis to the classification of sub-regions in terms of the agrarian structure
Использование градационного анализа данных для классификации субрегионов в отношении к аграрной структуре
Autorzy:: Koszela, Grzegorz
Powiązania:: https://bibliotekanauki.pl/articles/543759.pdf
Data publikacji:: 2016-06
Wydawca:: Główny Urząd Statystyczny
Tematy:: podobieństwo struktur
mapa nadreprezentacji
grupowanie obiektów
gradacyjna analiza danych
similarity of structures
map the over-representation
grouping objects
gradation data analysis
сходство структур
карта перепредставленности
группировка объектов
градационный анализ данных
Opis:: W artykule podjęto problem dotyczący pomiaru niepodobieństwa struktur. Okazało się, że zaproponowana w nim miara, zbudowana na zasadzie analogii ze współczynnikiem Giniego, pozwala wychwycić subtelności, na które nie są czułe powszechnie stosowane w literaturze miary oparte na metrykach. W opracowaniu przedstawiono sposób wizualizacji struktur przy wykorzystaniu map nadreprezentacji oraz sposoby grupowania obiektów za pomocą gradacyjnej analizy danych. Otrzymane wyniki grupowania przedstawiono na tle wyników uzyskanych innymi metodami.
The article deals with the problem of measuring the dissimilarity structures. It turned out that the proposed measure, built by analogy with the Gini coefficient, can capture the subtleties, to which are not sensitive measure based on the metrics, commonly used in the literature. This paper presents a way to visualize structures using maps over-representation and ways of grouping objects by gradation data analysis. The groupings results are shown in comparison with the results obtained by other methods.
В статье была предпринята проблема измерения несходства структур. Оказалось, что предлагаемая мера, разработанная по принципу аналогии с коэффициентом Джини, позволяет обнаружить тонкости, на которые не чувствительны обычно используемые меры основанные на метриках. В разработке был представлен способ визуализации структур с использованием карт перепредставленности, а также способы группировки объектов с помощью градационного анализа данных. Полученные результаты группировки представляются на фоне результатов полученных другими методами.
Źródło:: Wiadomości Statystyczne. The Polish Statistician; 2016, 6; 10-30
0043-518X
Pojawia się w:: Wiadomości Statystyczne. The Polish Statistician
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 14.

Tytuł:: Analysis of methods and means of text mining
Autorzy:: Rybchak, Z.
Basystiuk, O.
Powiązania:: https://bibliotekanauki.pl/articles/411072.pdf
Data publikacji:: 2017
Wydawca:: Polska Akademia Nauk. Oddział w Lublinie PAN
Tematy:: text mining
text analytics
data analysis
high-quality information
text categorization
text clustering
document summarization
sentiment analysis
sieć językowa
analiza tekstu
analiza danych
wysoka jakość informacji
klasyfikacja tekstowa
kategoryzacja tekstowa
grupowanie tekstu
streszczenie dokumentów tekstowych
technika sentiment analysis
Opis:: In Big Data era when data volume doubled every year analyzing of all this data become really complicated task, so in this case text mining systems, techniques and tools become main instrument of analyzing tones and tones of information, selecting that information that suit the best for your needs and just help save your time for more interesting thing. The main aims of this article are explain basic principles of this field and overview some interesting technologies that nowadays are widely used in text mining.
Źródło:: ECONTECHMOD : An International Quarterly Journal on Economics of Technology and Modelling Processes; 2017, 6, 2; 73-78
2084-5715
Pojawia się w:: ECONTECHMOD : An International Quarterly Journal on Economics of Technology and Modelling Processes
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Informacja

Wyszukujesz frazę "grupowanie danych" wg kryterium: Temat

Źródło danych

Dostawca treści

Kolekcja

Rok wydania

Wydawca

Temat

Autor

Typ dokumentu

Język