A study of parallel techniques for dimensionality reduction and its impact on the quality of text processing algorithms

Szczegóły
Opis

Tytuł:: A study of parallel techniques for dimensionality reduction and its impact on the quality of text processing algorithms
Autorzy:: Pietroń, M.
Wielgosz, M.
Karwatowski, M.
Wiatr, K.
Powiązania:: https://bibliotekanauki.pl/articles/114190.pdf
Data publikacji:: 2015
Wydawca:: Stowarzyszenie Inżynierów i Techników Mechaników Polskich
Tematy:: singular value decomposition
vector space model
TFIDF
Źródło:: Measurement Automation Monitoring; 2015, 61, 7; 352-353
2450-2855
Język:: angielski
Prawa:: CC BY: Creative Commons Uznanie autorstwa 3.0 PL
Dostawca treści:: Biblioteka Nauki
: Artykuł

Przejdź do źródła

The presented algorithms employ the Vector Space Model (VSM) and its enhancements such as TFIDF (Term Frequency Inverse Document Frequency) with Singular Value Decomposition (SVD). TFIDF were applied to emphasize the important features of documents and SVD was used to reduce the analysis space. Consequently, a series of experiments were conducted. They revealed important properties of the algorithms and their accuracy. The accuracy of the algorithms was estimated in terms of their ability to match the human classification of the subject. For unsupervised algorithms the entropy was used as a quality evaluation measure. The combination of VSM, TFIDF, and SVD came out to be the best performing unsupervised algorithm with entropy of 0.16.

Informacja

Powiązane pozycje