- Tytuł:
- Discretization of data using Boolean transformations and information theory based evaluation criteria
- Autorzy:
-
Jankowski, C.
Reda, D.
Mańkowski, M.
Borowik, G. - Powiązania:
- https://bibliotekanauki.pl/articles/200750.pdf
- Data publikacji:
- 2015
- Wydawca:
- Polska Akademia Nauk. Czytelnia Czasopism PAN
- Tematy:
-
machine learning
discretization
discernibility function
logic minimization
information theory
entropy
nauczanie maszynowe
dyskretyzacja
minimalizacja funkcji logicznych
teoria informacji
entropia - Opis:
- Discretization is one of the most important parts of decision table preprocessing. Transforming continuous values of attributes into discrete intervals influences further analysis using data mining methods. In particular, the accuracy of generated predictions is highly dependent on the quality of discretization. The paper contains a description of three new heuristic algorithms for discretization of numeric data, based on Boolean reasoning. Additionally, an entropy-based evaluation of discretization is introduced to compare the results of the proposed algorithms with the results of leading university software for data analysis. Considering the discretization as a data compression method, the average compression ratio achieved for databases examined in the paper is 8.02 while maintaining the consistency of databases at 100%.
- Źródło:
-
Bulletin of the Polish Academy of Sciences. Technical Sciences; 2015, 63, 4; 923-932
0239-7528 - Pojawia się w:
- Bulletin of the Polish Academy of Sciences. Technical Sciences
- Dostawca treści:
- Biblioteka Nauki