- Tytuł:
- A study of parallel techniques for dimensionality reduction and its impact on the quality of text processing algorithms
- Autorzy:
-
Pietroń, M.
Wielgosz, M.
Karwatowski, M.
Wiatr, K. - Powiązania:
- https://bibliotekanauki.pl/articles/114190.pdf
- Data publikacji:
- 2015
- Wydawca:
- Stowarzyszenie Inżynierów i Techników Mechaników Polskich
- Tematy:
-
singular value decomposition
vector space model
TFIDF - Opis:
- The presented algorithms employ the Vector Space Model (VSM) and its enhancements such as TFIDF (Term Frequency Inverse Document Frequency) with Singular Value Decomposition (SVD). TFIDF were applied to emphasize the important features of documents and SVD was used to reduce the analysis space. Consequently, a series of experiments were conducted. They revealed important properties of the algorithms and their accuracy. The accuracy of the algorithms was estimated in terms of their ability to match the human classification of the subject. For unsupervised algorithms the entropy was used as a quality evaluation measure. The combination of VSM, TFIDF, and SVD came out to be the best performing unsupervised algorithm with entropy of 0.16.
- Źródło:
-
Measurement Automation Monitoring; 2015, 61, 7; 352-353
2450-2855 - Pojawia się w:
- Measurement Automation Monitoring
- Dostawca treści:
- Biblioteka Nauki