- Tytuł:
- Heterogeneous distance functions for prototype rules : influence of parameters on probability estimation
- Autorzy:
-
Blachnik, M.
Duch, W.
Wieczorek, T. - Powiązania:
- https://bibliotekanauki.pl/articles/92882.pdf
- Data publikacji:
- 2006
- Wydawca:
- Uniwersytet Przyrodniczo-Humanistyczny w Siedlcach
- Tematy:
-
prototype rules
probability estimation
heterogeneous distance functions
similarity-based methods
classification
data mining - Opis:
- An interesting and little explored way to understand data is based on prototype rules (P-rules). The goal of this approach is to find optimal similarity (or distance) functions and position of prototypes to which unknown vectors are compared. In real applications similarity functions frequently involve different types of attributes, such as continuous, discrete, binary or nominal. Heterogeneous distance functions that may handle such diverse information are usually based on probability distance measure, such as the Value Difference Metrics (VDM). For continuous attributes calculation of probabilities requires estimations of probability density functions. This process requires careful selection of several parameters that may have important impact on the overall classification of accuracy. In this paper, various heterogeneous distance function based on VDM measure are presented, among them some new heterogeneous distance functions based on different types of probability estimation. Results of many numerical experiments with such distance functions are presented on artificial and real datasets, and quite simple P-rules for several heterogeneous databases extracted.
- Źródło:
-
Studia Informatica : systems and information technology; 2006, 1(7); 19-30
1731-2264 - Pojawia się w:
- Studia Informatica : systems and information technology
- Dostawca treści:
- Biblioteka Nauki