Temat: small data set - Katalog OPAC zbiorów

Skocz do pozycji: 1.

Tytuł:: Alternative Approach to Evaluating Interpolation Methods of Small and Imbalanced Data Sets
Nowe podejście do oceny jakości metod interpolacji niewielkich i zróżnicowanych zestawów danych
Autorzy:: Gonet, T.
Gonet, K.
Powiązania:: https://bibliotekanauki.pl/articles/385300.pdf
Data publikacji:: 2017
Wydawca:: Akademia Górniczo-Hutnicza im. Stanisława Staszica w Krakowie. Wydawnictwo AGH
Tematy:: interpolacja
Surfer
walidacja krzyżowa
mały zestaw danych
interpolation
cross-validation
small data set
Opis:: Badania dotyczą alternatywnego podejścia do oceny jakości metod interpolacji niewielkich i zróżnicowanych zestawów danych. Podstawowa analiza statystyczna oparta na klasycznej walidacji krzyżowej nie zawsze daje jednoznaczne wnioski. W przypadku analizowanego zestawu danych (niezgodnego z rozkładem normalnym) trzy metody interpolacji zostały wybrane jako najlepsze (zgodnie z procedurą klasycznej walidacji krzyżowej). Niemniej jednak mapy powstałe na podstawie tych metod wyraźnie się od siebie różnią. To jest powód, dla którego dogłębna analiza statystyczna była konieczna. Zaproponowano alternatywne podejście do tego zagadnienia, które uwzględnia szersze spektrum parametrów opisujących badany zestaw danych. Głównym założeniem tej metodyki jest porównanie nie tylko odchylenia standardowego estymatora, ale również trzech dodatkowych parametrów. To powoduje, iż końcowa ocena jest znacznie dokładniejsza. Analizę wykonano za pomocą programu Surfer (Golden Software). Zapewnia on możliwość wykorzystania wielu metod interpolacji wraz z różnorakimi, regulowanymi parametrami.
The research concerns an alternative approach to the evaluation of interpolation methods for mapping small and imbalanced data sets. A basic statistical analysis of the standard cross-validation procedure is not always conclusive. In the case of the investigated data set (which is inconsistent with normal distribution), three interpolation methods have been selected as the most reliable (according to standard cross-validation). However, maps resulting from the aforementioned methods clearly differ from each other. This is the reason why a comprehensive statistical analysis of the studied data is a necessity. We propose an alternative approach that evaluates a broadened scope of parameters describing the data distribution. The general idea of the methodology is to compare not only the standard deviation of the estimator but also three additional parameters to make the final assessment much more accurate. The analysis has been carried out with the use of Golden Software Surfer. It provides a wide range of interpolation methods and numerous adjustable parameters.
Źródło:: Geomatics and Environmental Engineering; 2017, 11, 3; 49-65
1898-1135
Pojawia się w:: Geomatics and Environmental Engineering
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 2.

Tytuł:: Artificial intelligence-based analysis of small data sets in medicine
Analiza małych zbiorów danych medycznych oparta na sztucznej inteligencji
Autorzy:: Mikołajewski, Dariusz
Mikołajewska, Emilia
Powiązania:: https://bibliotekanauki.pl/articles/41205822.pdf
Data publikacji:: 2023
Wydawca:: Uniwersytet Kazimierza Wielkiego w Bydgoszczy
Tematy:: artificial intelligence
small data set
clinical applications
sztuczna inteligencja
małe zbiory danych
zastosowanie kliniczne
Opis:: AI-based computing of small data sets are a step towards edge computing and further personalization of diagnostics, therapy and predictions in clinical practice. However, this still requires many intermediate steps, both in hardware and software. The aim of the work is to assess to what extent current achievements in the area of AI-based small sets analysis constitute the basis for the development of a new group of clinical and programming solutions.
Obliczenia małych zbiorów danych w oparciu o sztuczną inteligencję stanowią krok w kierunku obliczeń brzegowych i dalszej personalizacji diagnostyki, terapii i prognoz w praktyce klinicznej.Jednak nadal wymaga to wielu etapów pośrednich, zarówno sprzętowych, jak i programowych. Celem pracy jest ocena, w jakim stopniu dotychczasowe osiągnięcia w obszarze analizy małych zbiorów w oparciu o sztuczną inteligencję stanowią podstawę do opracowania nowej grupy rozwiązań klinicznych i programistycznych.
Źródło:: Studia i Materiały Informatyki Stosowanej; 2023, 15, 2; 18-23
1689-6300
Pojawia się w:: Studia i Materiały Informatyki Stosowanej
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 3.

Tytuł:: Adaptation of domain-specific transformer models with text oversampling for sentiment analysis of social media posts on Covid-19 vaccine
Autorzy:: Bansal, Anmol
Choudhry, Arjun
Sharma, Anubhav
Susan, Seba
Powiązania:: https://bibliotekanauki.pl/articles/27312860.pdf
Data publikacji:: 2023
Wydawca:: Akademia Górniczo-Hutnicza im. Stanisława Staszica w Krakowie. Wydawnictwo AGH
Tematy:: Covid-19
vaccine
transformer
Twitter
BERTweet
CT-BERT
BERT
XLNet
RoBERTa
text oversampling
LMOTE
class imbalance
small sample data set
Opis:: Covid-19 has spread across the world and many different vaccines have been developed to counter its surge. To identify the correct sentiments associated with the vaccines from social media posts, we fine-tune various state-of-the-art pretrained transformer models on tweets associated with Covid-19 vaccines. Specifically, we use the recently introduced state-of-the-art RoBERTa, XLNet, and BERT pre-trained transformer models, and the domain-specific CT-BER and BERTweet transformer models that have been pre-trained on Covid-19 tweets. We further explore the option of text augmentation by oversampling using the language model-based oversampling technique (LMOTE) to improve the accuracies of these models - specifically, for small sample data sets where there is an imbalanced class distribution among the positive, negative and neutral sentiment classes. Our results summarize our findings on the suitability of text oversampling for imbalanced, small-sample data sets that are used to fine-tune state-of-the-art pre-trained transformer models as well as the utility of domain-specific transformer models for the classification task.
Źródło:: Computer Science; 2023, 24 (2); 163--182
1508-2806
2300-7036
Pojawia się w:: Computer Science
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 4.

Tytuł:: Convolutional neural networks in the SSI analysis for mine-induced vibrations
Autorzy:: Zając, Maciej
Kuźniar, Krystyna
Powiązania:: https://bibliotekanauki.pl/articles/38707462.pdf
Data publikacji:: 2024
Wydawca:: Instytut Podstawowych Problemów Techniki PAN
Tematy:: deep learning
convolutional neural network
shallow neural network
small data set
soil-structure interaction
mine-induced vibrations
głęboka nauka
splotowa sieć neuronowa
płytka sieć neuronowa
mały zestaw danych
interakcja gleba-struktura
wibracje wywołane minami
Opis:: Deep neural networks (DNNs) have recently become one of the most often used softcomputational tools for numerical analysis. The huge success of DNNs in the field of imageprocessing is associated with the use of convolutional neural networks (CNNs). CNNs,thanks to their characteristic structure, allow for the effective extraction of multi-layerfeatures. In this paper, the application of CNNs to one of the important soil-structureinteraction (SSI) problems, i.e., the analysis of vibrations transmission from the free-field next to a building to the building foundation, is presented in the case of mine-induced vibrations. To achieve this, the dataset from in-situ experimental measurements,containing 1D ground acceleration records, was converted into 2D spectrogram imagesusing either Fourier transform or continuous wavelet transform. Next, these images wereused as input for a pre-trained CNN. The output is a ratio of maximal vibration valuesrecorded simultaneously on the building foundation and on the ground. Therefore, the lastlayer of the CNN had to be changed from a classification to a regression one. The obtainedresults indicate the suitability of CNN for the analyzed problem.
Źródło:: Computer Assisted Methods in Engineering and Science; 2024, 31, 1; 3-28
2299-3649
Pojawia się w:: Computer Assisted Methods in Engineering and Science
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 5.

Tytuł:: Probable annual floods in a small lowland river estimated with the use of various sets of data
Powodzie prawdopodobne w malej rzece nizinnej okreslone przy wykorzystaniu roznych zbiorow danych
Autorzy:: Banasik, K.
Byczkowski, A.
Powiązania:: https://bibliotekanauki.pl/articles/81795.pdf
Data publikacji:: 2007
Wydawca:: Szkoła Główna Gospodarstwa Wiejskiego w Warszawie. Wydawnictwo Szkoły Głównej Gospodarstwa Wiejskiego w Warszawie
Tematy:: flood
small catchment
catchment
design flood
annual flood
lowland river
river
small river
data set
probable flood flow
hydrology
Opis:: The results of estimation of the probable annual fl ood fl ows with the use of various sets of data from a small agricultural lowland river are presented. The traditional statistical series are formed from the annual maximum (AM) fl ows of hydrological years. After examination of the homogeneity of the series, two sets of data were formed (AM1 of 34 elements and AM2 of 40 elements), which differed in the signifi cance level of one of the stationary tests. The other series for fl ood frequency analysis has been formed by selecting peaks over threshold discharges (POT). A computer program, developed by IMGW (Institute of Hydrology and Water Management), and spreadsheet were applied for frequency analysis with the use of AM series and POT series, espectively. Results of computations with the use of AM1 and AM2 data indicate for nonsignifi cant differences in probable fl ood fl ows, and signifi cant with the probable fl ood fl ow estimated with the POT data.
Na podstawie 40-letniego ciągu przepływów dobowych zarejestrowanych w profi lu Płachty Stare na Zagożdżonce (A = 82,4 km2) utworzono zbiory danych do analizy częstotliwości przepływów powodziowych. Po statystycznej analizie jednorodności rocznych przepływów maksymalnych utworzono dwa zbiory AM1 i AM2 (annual maxima), odpowiednio 34- i 40-elementowe, różniące się poziomem zgodności jednego z testów stacjonarności ciągów. Trzeci zbiór danych utworzono ze wszystkich wezbrań okresu 40-letniego, nie mniejszych niż NWQ. Obliczenia wykonano stosując do zbiorów AM1 i AM2 program komputerowy IMGW, identyfi kujący dla obydwu zbiorów rozkład logarytmiczno-normalny (spośród czterech) jako najbardziej zgodny z danymi pomiarowymi. Analizę trzeciego zbioru danych (zawierajacych 261 przepływów) przeprowadzono przy wykorzystaniu arkusza kalkulacyjnego. Na podstawie wyników obliczeń stwierdzono znikome różnice w wartościach przepływów maksymalnych prawdopodobnych przy wykorzystaniu zbiorów AM1 i AM2 oraz znaczne różnice w porównaniu z wartościami uzyskanymi przy wykorzystaniu trzeciego zbioru danych.
Źródło:: Annals of Warsaw University of Life Sciences - SGGW. Land Reclamation; 2007, 38; 3-10
0208-5771
Pojawia się w:: Annals of Warsaw University of Life Sciences - SGGW. Land Reclamation
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Informacja

Wyszukujesz frazę "small data set" wg kryterium: Temat

Źródło danych

Dostawca treści

Kolekcja

Rok wydania

Wydawca

Temat

Autor

Typ dokumentu

Język