- Tytuł:
- Using frequent pattern mining algorithms in text analysis
- Autorzy:
-
Ożdżyński, P.
Zakrzewska, D. - Powiązania:
- https://bibliotekanauki.pl/articles/95011.pdf
- Data publikacji:
- 2017
- Wydawca:
- Szkoła Główna Gospodarstwa Wiejskiego w Warszawie. Wydawnictwo Szkoły Głównej Gospodarstwa Wiejskiego w Warszawie
- Tematy:
-
GSP
SuffixArray
PrefixSpan
N-Gram
frequent sequences - Opis:
- In text mining, effectiveness of methods depends on document representations. The ones based on frequent word sequences are used in such tasks as categorization, clustering and topic modelling. In the paper a comparison of different algorithms for finding frequent word sequences is presented. There are considered techniques dedicated for market basket analysis such as GSP and PrefixSpan as well as a method based on a suffix array. The investigated techniques are compared with the new approach of searching maximum frequent word sequences in document sets. Performance of the algorithms is examined taking into account execution times for the considered test collections.
- Źródło:
-
Information Systems in Management; 2017, 6, 3; 213-222
2084-5537
2544-1728 - Pojawia się w:
- Information Systems in Management
- Dostawca treści:
- Biblioteka Nauki