Informacja

Drogi użytkowniku, aplikacja do prawidłowego działania wymaga obsługi JavaScript. Proszę włącz obsługę JavaScript w Twojej przeglądarce.

Wyszukujesz frazę "bag of words" wg kryterium: Temat


Wyświetlanie 1-2 z 2
Tytuł:
Bag of words and embedding text representation methods for medical article classification
Autorzy:
Cichosz, Paweł
Powiązania:
https://bibliotekanauki.pl/articles/24403007.pdf
Data publikacji:
2023
Wydawca:
Uniwersytet Zielonogórski. Oficyna Wydawnicza
Tematy:
text representation
text classification
bag of words
word embedding
reprezentacja tekstu
klasyfikacja tekstu
osadzanie słów
Opis:
Text classification has become a standard component of automated systematic literature review (SLR) solutions, where articles are classified as relevant or irrelevant to a particular literature study topic. Conventional machine learning algorithms for tabular data which can learn quickly from not necessarily large and usually imbalanced data with low computational demands are well suited to this application, but they require that the text data be transformed to a vector representation. This work investigates the utility of different types of text representations for this purpose. Experiments are presented using the bag of words representation and selected representations based on word or text embeddings: word2vec, doc2vec, GloVe, fastText, Flair, and BioBERT. Four classification algorithms are used with these representations: a naive Bayes classifier, logistic regression, support vector machines, and random forest. They are applied to datasets consisting of scientific article abstracts from systematic literature review studies in the medical domain and compared with the pre-trained BioBERT model fine-tuned for classification. The obtained results confirm that the choice of text representation is essential for successful text classification. It turns out that, while the standard bag of words representation is hard to beat, fastText word embeddings make it possible to achieve roughly the same level of classification quality with the added benefit of much lower dimensionality and capability of handling out-of-vocabulary words. More refined embeddings methods based on deep neural networks, while much more demanding computationally, do not appear to offer substantial advantages for the classification task. The fine-tuned BioBERT classification model performs on par with conventional algorithms when they are coupled with their best text representation methods.
Źródło:
International Journal of Applied Mathematics and Computer Science; 2023, 33, 4; 603--621
1641-876X
2083-8492
Pojawia się w:
International Journal of Applied Mathematics and Computer Science
Dostawca treści:
Biblioteka Nauki
Artykuł
Tytuł:
Categorization of Similar Objects Using Bag of Visual Words and k - Nearest Neighbour Classifier
Autorzy:
Artiemjew, P.
Górecki, P.
Sopyła, K.
Powiązania:
https://bibliotekanauki.pl/articles/298103.pdf
Data publikacji:
2012
Wydawca:
Uniwersytet Warmińsko-Mazurski w Olsztynie
Tematy:
kategoryzacja obrazu
metoda k najbliższych sąsiadów
zbiór słów wizualnych
Image categorization
k-Nearest Neighbor Classifier
Bag of Visual Words
Opis:
Image categorization is one of the fundamental tasks in computer vision, it has wide application in methods of artificial intelligence, robotic vision and many others. There are a lot of difficulties in computer vision to overcome, one of them appears during image recognition and classification. The difficulty arises from an image variance, which may be caused by scaling, rotation, changes in a perspective, illumination levels, or partial occlusions. Due to these reasons, the main task is to represent represent images in such way that would allow recognizing them even if they have been modified. Bag of Visual Words (BoVW) approach, which allows for describing local characteristic features of images, has recently gained much attention in the computer vision community. In this article we have presented the results of image classification with the use of BoVW and k - Nearest Neighbor classifier with different kinds of metrics and similarity measures. Additionally, the results of k - NN classification are compared with the ones obtained from a Support Vector Machine classifier.
Źródło:
Technical Sciences / University of Warmia and Mazury in Olsztyn; 2012, 15(2); 293-305
1505-4675
2083-4527
Pojawia się w:
Technical Sciences / University of Warmia and Mazury in Olsztyn
Dostawca treści:
Biblioteka Nauki
Artykuł
    Wyświetlanie 1-2 z 2

    Ta witryna wykorzystuje pliki cookies do przechowywania informacji na Twoim komputerze. Pliki cookies stosujemy w celu świadczenia usług na najwyższym poziomie, w tym w sposób dostosowany do indywidualnych potrzeb. Korzystanie z witryny bez zmiany ustawień dotyczących cookies oznacza, że będą one zamieszczane w Twoim komputerze. W każdym momencie możesz dokonać zmiany ustawień dotyczących cookies