High Speed Data Streams Using Data Mining Techniques


Title: High Speed Data Streams Using Data Mining Techniques
Authors: Sangeetha, R.
Links: https://bibliotekanauki.pl/articles/1193580.pdf
Publication date: 2016
Publisher: Przedsiębiorstwo Wydawnictw Naukowych Darwin / Scientific Publishing House DARWIN
Subjects: University of Washington; VFDT; data-bases
Source: World Scientific News; 2016, 41; 247-252
ISSN: 2392-2192
Language: English
Rights: CC BY-NC: Creative Commons Attribution - NonCommercial 4.0
Content provider: Biblioteka Nauki
Type: Article


Many organizations today have more than very large databases; they have databases that grow without limit at a rate of several million records per day. Mining these continuous data streams brings unique opportunities, but also new challenges. This paper describes and evaluates VFDT, an anytime system that builds decision trees using constant memory and constant time per example. VFDT can incorporate tens of thousands of examples per second using off-the-shelf hardware. It uses Hoeffding bounds to guarantee that its output is asymptotically nearly identical to that of a conventional learner. The study examines VFDT's properties and demonstrates its utility through an extensive set of experiments on synthetic data. VFDT is also applied to mining the continuous stream of Web access data from the whole University of Washington main campus.
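
The abstract says that VFDT relies on Hoeffding bounds but does not spell them out. As a minimal sketch, the standard Hoeffding bound states that after n independent observations of a variable with range R, the observed mean lies within epsilon = sqrt(R^2 ln(1/delta) / (2n)) of the true mean with probability at least 1 - delta. The split rule shown here (split once the gain difference between the two best attributes exceeds epsilon) follows the usual description of Hoeffding trees and is an assumption, not something stated in this record. The function names and parameter values are hypothetical.

```python
import math


def hoeffding_bound(value_range: float, delta: float, n: int) -> float:
    """Return epsilon for a variable with the given range after n samples.

    With probability at least 1 - delta, the true mean is within
    epsilon of the mean observed over the n samples.
    """
    if n <= 0:
        raise ValueError("n must be a positive sample count")
    if not 0.0 < delta < 1.0:
        raise ValueError("delta must lie strictly between 0 and 1")
    return math.sqrt(value_range ** 2 * math.log(1.0 / delta) / (2.0 * n))


def should_split(best_gain: float, second_gain: float,
                 value_range: float, delta: float, n: int) -> bool:
    """Assumed split rule: split when the best attribute leads by more than epsilon."""
    return (best_gain - second_gain) > hoeffding_bound(value_range, delta, n)


if __name__ == "__main__":
    # Hypothetical values: gain range 1.0, delta 1e-7, 1000 examples at a leaf.
    eps = hoeffding_bound(1.0, 1e-7, 1000)
    print(f"epsilon after 1000 examples: {eps:.4f}")
    print("split?", should_split(0.30, 0.15, 1.0, 1e-7, 1000))
```

Because epsilon shrinks as n grows, a leaf that cannot yet separate its two best attributes simply waits for more examples, which is consistent with the constant memory and constant time per example described above.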
