Temat: random forest regression - Katalog OPAC zbiorów

Skocz do pozycji: 1.

Tytuł:: A Study on the Optimization of Metalloid Contents of Fe-Si-B-C Based Amorphous Soft Magnetic Materials Using Artificial Intelligence Method
Autorzy:: Choi, Young-Sin
Kwon, Do-Hun
Lee, Min_Woo
Cha, Eun-Ji
Jeon, Junhyub
Lee, Seok-Jae
Kim, Jongryoul
Kim, Hwi-Jun
Powiązania:: https://bibliotekanauki.pl/articles/2174571.pdf
Data publikacji:: 2022
Wydawca:: Polska Akademia Nauk. Czytelnia Czasopism PAN
Tematy:: Fe-based amorphous
soft magnetic properties
artificial intelligence
machine learning
random forest regression
Opis:: The soft magnetic properties of Fe-based amorphous alloys can be controlled by their compositions through alloy design. Experimental data on these alloys show some discrepancy, however, with predicted values. For further improvement of the soft magnetic properties, machine learning processes such as random forest regression, k-nearest neighbors regression and support vector regression can be helpful to optimize the composition. In this study, the random forest regression method was used to find the optimum compositions of Fe-Si-B-C alloys. As a result, the lowest coercivity was observed in Fe80.5Si3.63B13.54C2.33 at.% and the highest saturation magnetization was obtained Fe81.83Si3.63B12.63C1.91at.% with R2 values of 0.74 and 0.878, respectively.
Źródło:: Archives of Metallurgy and Materials; 2022, 67, 4; 1459--1463
1733-3490
Pojawia się w:: Archives of Metallurgy and Materials
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 2.

Tytuł:: A System for Filling Store Displays: Pitting a Single Model against a Set of Demand Forecasting Models
System zapełnienia ekspozycji sklepowych: pojedynczy model a zespół modeli prognozowania popytu
Autorzy:: Myna, Artur
Myna, Jacek
Powiązania:: https://bibliotekanauki.pl/articles/2206342.pdf
Data publikacji:: 2023
Wydawca:: Wydawnictwo Uniwersytetu Ekonomicznego we Wrocławiu
Tematy:: Extreme Gradient Boosting
logistic regression
random forest
regresja logistyczna
las losowy
Opis:: The aim of the paper was to develop the concept of retail display space allocation as a system and to assess the quality of very slow-moving products demand forecasting models (that have not yet been used by retail companies in Poland) as its key subsystem. Forecasts were made using the example of a clothing company. The quality of these models was assessed using the Weighted Mean Absolute Percentage Error. The first step was to build the individual models. Later, the authors built separate models for brick-and-mortar and online stores as well as brands, creating a set of six models. The findings show that the classification approach for very slow movers provides as precise results as the regression approach. No single model or set of models (built with a particular machine learning method) could be identified that made the best demand forecasts for brick-and-mortar stores, as statistical tests generally did not confirm the significance of the differences between the median forecasts.
Celem artykułu jest opracowanie koncepcji zapełnienia ekspozycji sklepowych jako sys- temu oraz ocena jakości modeli prognozowania popytu (które w Polsce nie są jeszcze wykorzystywane przez sieci handlowe) bardzo wolno rotujących produktów jako jego kluczowego podsystemu. Jakość modeli oceniono za pomocą miary Weighted Mean Absolute Percentage Error na różnych poziomach szczegółowości: dla całej sieci sprzedaży i określonego miesiąca oraz na „na przecięciu” sklepu, produk- tu i rozmiaru produktu. Najpierw zbudowano pojedyncze modele, następnie zaś odrębne modele dla sklepów stacjonarnych i internetowych, jak również marek, tworząc zespół sześciu modeli. Poprawę dopasowania modeli osiągnięto tylko dla sklepów internetowych. Wyniki pracy wskazują, że podejście klasyfikacyjne dla bardzo wolno rotujących produktów charakteryzują równie precyzyjne wyniki pro- gnoz jak podejście regresyjne. Nie można wskazać jednego modelu lub zespołu modeli (zbudowanego określoną metodą uczenia maszynowego), który wykonał najlepsze prognozy popytu dla sklepów sta- cjonarnych, gdyż istotności różnic median prognoz na ogół nie potwierdzono testami statystycznymi.
Źródło:: Prace Naukowe Uniwersytetu Ekonomicznego we Wrocławiu; 2023, 67, 2; 96-106
1899-3192
Pojawia się w:: Prace Naukowe Uniwersytetu Ekonomicznego we Wrocławiu
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 3.

Tytuł:: Classification and modelling of sound emission signals in selected tribosystems
Klasyfikacja i modelowanie sygnałów dżwięków w wybranych systemach tribologicznych
Autorzy:: Kekez, Michał
Jurczak, Wojciech
Ozimina, Dariusz
Powiązania:: https://bibliotekanauki.pl/articles/2055620.pdf
Data publikacji:: 2021
Wydawca:: Stowarzyszenie Inżynierów i Techników Mechaników Polskich
Tematy:: tribosystem
sound level
regression trees
random forest
system tribologiczny
poziom dźwięku
drzewa regresji
las losowy
Opis:: The paper presents an analysis of the sound level recorded during dry sliding friction conditions. Balls with a diameter of 6 mm placed on pins were made of 100Cr6 steel, silicon carbide (SiC), and corundum (Al2 O3 ), while rotating discs with a height of 6 mm and a diameter of 42 mm were made of 100Cr6 steel. Each pin and disc system was tested for two values of the relative humidity of the air (50 ± 5% and 90 ± 5%). Models of the A-sound level were developed using regression trees and random forest. The paper presents an analysis of the accuracy of the models obtained. Classifications of the six tests performed on the basis of sound level descriptors were also carried out.
W pracy przedstawiono analizę poziomu dźwięku zarejestrowanego podczas tarcia technicznie suchego w ruchu ślizgowym. Podczas sześciu testów tribologicznych stosowano próbkę wykonaną ze stali 100Cr6 oraz trzy przeciwpróbki, wykonane ze stali 100Cr6, węglika krzemu (SiC) i korundu (Al2 O3 ), przy czym każdy układ próbka – przeciwpróbka był testowany dla dwóch wartości wilgotności względnej powietrza (50 ± 5% i 90 ± 5%). Opracowano modele poziomu dźwięku A z użyciem drzew regresji i lasu losowego. W pracy zamieszczono analizę dokładności otrzymanych modeli. Została również przeprowadzona klasyfikacja sześciu wykonanych testów w oparciu o deskryptory poziomu dźwięku.
Źródło:: Tribologia; 2021, 297, 3; 19--26
0208-7774
Pojawia się w:: Tribologia
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 4.

Tytuł:: Application of selected data mining techniques in unintentional accounting error detection
Autorzy:: Papík, Mário
Papíková, Lenka
Powiązania:: https://bibliotekanauki.pl/articles/22444352.pdf
Data publikacji:: 2021
Wydawca:: Instytut Badań Gospodarczych
Tematy:: financial fraud
unintentional accounting errors
financial restatements
decision tree
classification and regression tree
random forest
Opis:: Research background: Even though unintentional accounting errors leading to financial restatements look like less serious distortion of publicly available information, it has been shown that financial restatements impacts on financial markets are similar to intentional fraudulent activities. Unintentional accounting errors leading to financial restatements then affect value of company shares in the short run which negatively impacts all shareholders. Purpose of the article: The aim of this manuscript is to predict unintentional accounting errors leading to financial restatements based on information from financial statements of companies. The manuscript analysis if financial statements include sufficient information which would allow detection of unintentional accounting errors. Methods: Method of classification and regression trees (decision tree) and random forest have been used in this manuscript to fulfill the aim of this manuscript. Data sample has consisted of 400 items from financial statements of 80 selected international companies. The results of developed prediction models have been compared and explained based on their accuracy, sensitivity, specificity, precision and F1 score. Statistical relationship among variables has been tested by correlation analysis. Differences between the group of companies with and without unintentional accounting error have been tested by means of Kruskal-Wallis test. Differences among the models have been tested by Levene and T-tests. Findings & value added: The results of the analysis have provided evidence that it is possible to detect unintentional accounting errors with high levels of accuracy based on financial ratios (rather than the Beneish variables) and by application of random forest method (rather than classification and regression tree method).
Źródło:: Equilibrium. Quarterly Journal of Economics and Economic Policy; 2021, 16, 1; 185-201
1689-765X
2353-3293
Pojawia się w:: Equilibrium. Quarterly Journal of Economics and Economic Policy
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 5.

Tytuł:: Extracting relevant predictors of the severity of mental illnesses from clinical information using regularisation regression models
Autorzy:: Kaushik, Sakshi
Sabharwal, Alka
Grover, Gurprit
Powiązania:: https://bibliotekanauki.pl/articles/2107145.pdf
Data publikacji:: 2022-06-14
Wydawca:: Główny Urząd Statystyczny
Tematy:: adaptive LASSO
group LASSO
mental disorder
multicollinearity
random forest imputation
ridge regression
severity of an illness
Opis:: Mental disorders are common non-communicable diseases whose occurrence rises at epidemic rates globally. The determination of the severity of a mental illness has important clinical implications and it serves as a prognostic factor for effective intervention planning and management. This paper aims to identify the relevant predictors of the severity of mental illnesses (measured by psychiatric rating scales) from a wide range of clinical variables consisting of information on both laboratory test results and psychiatric factors . The laboratory test results collectively indicate the measurements of 23 components derived from vital signs and blood tests results for the evaluation of the complete blood count. The 8 psychiatric factors known to affect the severity of mental illnesses are considered, viz. the family history, course and onset of an illness, etc. Retrospective data of 78 patients diagnosed with mental and behavioural disorders were collected from the Lady Hardinge Medical College & Smt. S.K, Hospital in New Delhi, India. The observations missing in the data are imputed using the non-parametric random forest algorithm. The multicollinearity is detected based on the variance inflation factor. Owing to the presence of multicollinearity, regularisation techniques such as ridge regression and extensions of the least absolute shrinkage and selection operator (LASSO), viz. adaptive and group LASSO are used for fitting the regression model. Optimal tuning parameter λ is obtained through 13-fold cross-validation. It was observed that the coefficients of the quantitative predictors extracted by the adaptive LASSO and the group of predictors extracted by the group LASSO were comparable to the coefficients obtained through ridge regression.
Źródło:: Statistics in Transition new series; 2022, 23, 2; 129-152
1234-7655
Pojawia się w:: Statistics in Transition new series
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 6.

Tytuł:: Predicting the stability of open stopes using Machine Learning
Autorzy:: Szmigiel, Alicja
Apel, Derek B.
Powiązania:: https://bibliotekanauki.pl/articles/2201415.pdf
Data publikacji:: 2022
Wydawca:: Główny Instytut Górnictwa
Tematy:: open stope
machine learning
logistic regression
random forest
system otwartych komór
uczenie maszynowe
regresja logistyczna
las losowy
Opis:: The Mathews stability graph method was presented for the first time in 1980. This method was developed to assess the stability of open stopes in different underground conditions, and it has an impact on evaluating the safety of underground excavations. With the development of technology and growing experience in applying computer sciences in various research disciplines, mining engineering could significantly benefit by using Machine Learning. Applying those ML algorithms to predict the stability of open stopes in underground excavations is a new approach that could replace the original graph method and should be investigated. In this research, a Potvin database that consisted of 176 historical case studies was passed to the two most popular Machine Learning algorithms: Logistic Regression and Random Forest, to compare their predicting capabilities. The results obtained showed that those algorithms can indicate the stability of underground openings, especially Random Forest, which, in examined data, performed slightly better than Logistic Regression.
Źródło:: Journal of Sustainable Mining; 2022, 21, 3; 241--248
2300-1364
2300-3960
Pojawia się w:: Journal of Sustainable Mining
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 7.

Tytuł:: Explicit and implicit description of the factors impact on the NO2 concentration in the traffic corridor
Jawny i niejawny opis wpływu czynników na stężenie NO2 w kanionie komunikacyjnym
Autorzy:: Kamińska, Joanna Amelia
Turek, Tomasz
Powiązania:: https://bibliotekanauki.pl/articles/204830.pdf
Data publikacji:: 2020
Wydawca:: Polska Akademia Nauk. Czytelnia Czasopism PAN
Tematy:: nitrogen dioxide
traffic flow
meteorological conditions
random forest
linear regression
dwutlenek azotu
ruch uliczny
warunki meteorologiczne
losowy las
regresja liniowa
Opis:: High concentrations of nitrogen dioxide in the air, particularly in heavily urbanized areas, have an adverse effect on many aspects of residents’ health. A method is proposed for modelling daily average, minimal and maximal atmospheric NO2 concentrations in a conurbation, using two types of modelling: multiple linear regression (LR) an advanced data mining technique – Random Forest (RF). It was shown that Random Forest technique can be successfully applied to predict daily NO₂ concentration based on data from 2015–2017 years and gives better fi t than linear models. The best results were obtained for predicting daily average NO₂ values with R²=0.69 and RMSE=7.47 μg/m³. The cost of receiving an explicit, interpretable function is a much worse fit (R² from 0.32 to 0.57). Verification of models on independent material from the first half of 2018 showed the correctness of the models with the mean average percentage error equal to 16.5% for RF and 28% for LR modelling daily average concentration. The most important factors were wind conditions and traffic flow. In prediction of maximal daily concentration, air temperature and air humidity take on greater importance. Prevailing westerly and south-westerly winds in Wrocław effectively implement the idea of ventilating the city within the studied intersection. Summarizing: when modeling natural phenomena, a compromise should be sought between the accuracy of the model and its interpretability.
Celem pracy jest zbadanie możliwości prognozowania dziennego stężenia NO2 za pomocą metody losowego lasu – RF i porównanie wyników z wielowymiarową regresją liniową (LR) w oparciu o ten sam zestaw danych. Ponadto zbadano wpływ zwiększenia interpretowalności modelu na jego dokładność. W pracy przedstawiono dwie metody modelowania dziennych wartości minimalnych, średnich oraz maksymalnych stężeń NO₂ w aglomeracji miejskiej: wielowymiarowa regresja liniowa (LR) oraz losowy las (RF). Wykazano, że metoda Lasu Losowego (Random Forest) może być skutecznie wykorzystywana do przewidywania dziennych wartości stężenia NO₂. Największą dokładność otrzymano dla przewidywania średnich wartości dziennych stężenia z R²=0.69 oraz RMSE=7.47 μg/m³. Kosztem otrzymania jawnej postaci funkcji w modeli liniowym (LR) jest znacząco niższa dokładność przewidywania wartości stężenia (R² od 0.32 do 0.57). Weryfikacja modeli na niezależnym materiale z pierwszej połowy 2018 roku potwierdziła poprawność modeli ze średnim błędem względnym dla średnich wartości dobowych stężeń równym 16.5% dla RF oraz 28% dla LR. Największy wpływ na stężenia NO₂ w kanionie komunikacyjnym ma wiatr oraz natężenie ruchu. W modelowaniu maksymalnych wartości dobowych nabierają znaczenia temperatura powietrza oraz wilgotność względna powietrza. Przeważające zachodnie i północno-zachodnie wiatry we Wrocławiu skutecznie realizują koncepcję przewietrzania miasta w zakresie rozważanego skrzyżowania.
Źródło:: Archives of Environmental Protection; 2020, 46, 1; 93-99
2083-4772
2083-4810
Pojawia się w:: Archives of Environmental Protection
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 8.

Tytuł:: Evaluation of the impact of explanatory variables on the accuracy of prediction of daily inflow to the sewage treatment plant by selected models nonlinear
Ocena wpływu zmiennych objaśniających na dokładność predykcji dobowego dopływu do oczyszczalni ścieków wybranymi modelami nieliniowymi
Autorzy:: Szeląg, B.
Bartkiewicz, L.
Studziński, J.
Barbusiński, K.
Powiązania:: https://bibliotekanauki.pl/articles/205349.pdf
Data publikacji:: 2017
Wydawca:: Polska Akademia Nauk. Czytelnia Czasopism PAN
Tematy:: wastewater treatment plant
data mining
random forest
forecasting inflow
k-nearest neighbour
Kernel regression
oczyszczalnia ścieków
wydobywanie danych
las losowy
dopływ ścieków
modelowanie
k-najbliższy sąsiad
regresja Kernela
Opis:: The aim of the study was to evaluate the possibility of applying different methods of data mining to model the inflow of sewage into the municipal sewage treatment plant. Prediction models were elaborated using methods of support vector machines (SVM), random forests (RF), k-nearest neighbour (k-NN) and of Kernel regression (K). Data consisted of the time series of daily rainfalls, water level measurements in the clarified sewage recipient and the wastewater inflow into the Rzeszow city plant. Results indicate that the best models with one input delayed by 1 day were obtained using the k-NN method while the worst with the K method. For the models with two input variables and one explanatory one the smallest errors were obtained if model inputs were sewage inflow and rainfall data delayed by 1 day and the best fit is provided using RF method while the worst with the K method. In the case of models with three inputs and two explanatory variables, the best results were reported for the SVM and the worst for the K method. In the most of the modelling runs the smallest prediction errors are obtained using the SVM method and the biggest ones with the K method. In the case of the simplest model with one input delayed by 1 day the best results are provided using k-NN method and by the models with two inputs in two modelling runs the RF method appeared as the best.
Celem pracy jest ocena możliwości zastosowania różnych metod data mining do modelowania dopływu ścieków do komunalnej oczyszczalni ścieków. Do opracowania modeli statystycznych metodą wektorów nośnych, lasów losowych, k – najbliższego sąsiada i regresji Kernela wykorzystano szeregi pomiarowe dobowych wartości opadów deszczu, stanów wody w odbiorniku oraz dopływów do komunalnej oczyszczalni ścieków w Rzeszowie. Z obliczeń wykonanych metodami SVM, RF, k-NN i K wynika, że dla modeli z jedną zmienną objaśniającą opóźnioną o dobę w stosunku do wartości dopływu, najlepsze wyniki otrzymano modelem autoregresyjnym bazującym na metodzie k-NN a najgorsze regresją Kernela. W przypadku modeli z dwoma zmiennymi objaśniającymi najmniejsze wartości błędów uzyskano, dla modeli uwzględniających dopływ ścieków i całkowitą wysokość opadu deszczu z jednodobowym opóźnieniem; najlepsze wyniki uzyskano metodą RF a najgorsze regresji Kernela. Dla modeli z dwiema zmiennymi objaśniającymi, ale trzema sygnałami wejściowymi, najmniejsze błędy dopływu ścieków do OŚ uzyskano metodą SVM, a najgorsze regresji Kernela. Z wykonanych symulacji stwierdzono, że w większości przypadków najmniejsze wartości błędów dopływu ścieków do oczyszczalni otrzymano metodą SVM a największe metodą K. W przypadku najprostszego modelu z jednym sygnałem wejściowym opóźnionym o 1 dobę najlepsze wyniki obliczeń uzyskano metodą k-NN, a w dwóch przypadkach modeli, gdzie ujęto 2 sygnały wejściowe, najlepsza okazała się metoda RF.
Źródło:: Archives of Environmental Protection; 2017, 43, 3; 74-81
2083-4772
2083-4810
Pojawia się w:: Archives of Environmental Protection
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 9.

Tytuł:: Zastosowanie metod czarnej skrzynki do prognozowania wartości wybranych wskaźników jakości ścieków dopływających do oczyszczalni komunalnej
Black-box forecasting of selected indicator values for influent wastewater quality in municipal treatment plant
Autorzy:: Szeląg, B.
Bartkiewicz, L.
Studziński, J.
Powiązania:: https://bibliotekanauki.pl/articles/236740.pdf
Data publikacji:: 2016
Wydawca:: Polskie Zrzeszenie Inżynierów i Techników Sanitarnych
Tematy:: ścieki komunalne
modelowanie
prognozowanie jakości ścieków metoda MARS
metoda lasów losowych (RF)
metoda samoorganizujących się sieci neuronowych (SOM)
metoda drzew wzmacnianych (BT) metoda analizy składowych
głównych (PCA)
sewage
modeling
sewage quality forecasting
MARS (multivariate adaptive regression spline)
random forest (RF)
self-organizing map (SOM)
boosted trees (BT)
principal component analysis (PCA)
Opis:: Prognozowanie ilości i jakości ścieków dopływających do oczyszczalni komunalnej z odpowiednim wyprzedzeniem czasowym daje możliwość optymalnego sterowania wieloma parametrami procesów oczyszczania ścieków. Dlatego prowadzi się badania mające na celu opracowanie modeli matematycznych (fizykalnych deterministycznych i operatorowych statystycznych), prognozujących zarówno ilość, jak i jakość ścieków dopływających do oczyszczalni. W artykule zbadano możliwość zastosowania prostszych modeli operatorowych do prognozowania wartości wybranych wskaźników jakości ścieków na dopływie do oczyszczalni (BZT5, zawiesiny ogólne, azot ogólny i amonowy, fosfor ogólny) jedynie na podstawie wyników pomiarów natężenia przepływu ścieków oraz – w celu porównania – na podstawie ich zmierzonych wartości. Do tego celu zastosowano metody czarnej skrzynki typu MARS oraz lasy losowe (RF). Dodatkowo przedstawiono możliwość połączenia metody lasów losowych z modelem klasyfikacyjnym (RF+SOM). Do identyfikacji danych określających zmienność wybranych wskaźników jakości ścieków zastosowano metody drzew wzmacnianych (BT) i analizy składowych głównych (PCA). Modele opracowano na podstawie wyników ciągłych pomiarów dobowych przeprowadzonych w latach 2013–2015 w oczyszczalni ścieków komunalnych w Rzeszowie.
Forecasting the amount and quality of wastewater flowing into a treatment plant sufficiently in advance, enables effective control of numerous treatment process parameters. Therefore, mathematical (physical deterministic and time series statistical) models forecasting both the amount and quality of wastewater inflow into a sewage treatment plant are under development. In this paper, a possibility of simpler time series models application to forecasting values of selected indicators (biochemical oxygen demand (BOD5), total suspended solids (TSS), total nitrogen (TN), total phosphorus (TP) and ammonium (NH4+)) of sewage quality in the inflow into a treatment plant was investigated. The research was based solely on sewage flow rate data and – for the purpose of comparison – the actual measured indicator values. For this purpose, MARS type black-box and random forest (RF) methods were used. Also, a possibility of combining the RF method with a classification model (RF+SOM) was investigated. Boosted trees (BT) and principal component analysis (PCA) methods were applied for identification of data that determine variability of the selected sewage quality indicators. The models were developed on the basis of continuous daily measurements performed in the period of 2013–2015 in the municipal sewage treatment plant in Rzeszow.
Źródło:: Ochrona Środowiska; 2016, 38, 4; 39-46
1230-6169
Pojawia się w:: Ochrona Środowiska
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Informacja

Wyszukujesz frazę "random forest regression" wg kryterium: Temat

Źródło danych

Dostawca treści

Kolekcja

Rok wydania

Wydawca

Temat

Autor

Typ dokumentu

Język