Temat: Random Forest - Katalog OPAC zbiorów

Skocz do pozycji: 1.

Tytuł:: Zastosowanie metod czarnej skrzynki do prognozowania wartości wybranych wskaźników jakości ścieków dopływających do oczyszczalni komunalnej
Black-box forecasting of selected indicator values for influent wastewater quality in municipal treatment plant
Autorzy:: Szeląg, B.
Bartkiewicz, L.
Studziński, J.
Powiązania:: https://bibliotekanauki.pl/articles/236740.pdf
Data publikacji:: 2016
Wydawca:: Polskie Zrzeszenie Inżynierów i Techników Sanitarnych
Tematy:: ścieki komunalne
modelowanie
prognozowanie jakości ścieków metoda MARS
metoda lasów losowych (RF)
metoda samoorganizujących się sieci neuronowych (SOM)
metoda drzew wzmacnianych (BT) metoda analizy składowych
głównych (PCA)
sewage
modeling
sewage quality forecasting
MARS (multivariate adaptive regression spline)
random forest (RF)
self-organizing map (SOM)
boosted trees (BT)
principal component analysis (PCA)
Opis:: Prognozowanie ilości i jakości ścieków dopływających do oczyszczalni komunalnej z odpowiednim wyprzedzeniem czasowym daje możliwość optymalnego sterowania wieloma parametrami procesów oczyszczania ścieków. Dlatego prowadzi się badania mające na celu opracowanie modeli matematycznych (fizykalnych deterministycznych i operatorowych statystycznych), prognozujących zarówno ilość, jak i jakość ścieków dopływających do oczyszczalni. W artykule zbadano możliwość zastosowania prostszych modeli operatorowych do prognozowania wartości wybranych wskaźników jakości ścieków na dopływie do oczyszczalni (BZT5, zawiesiny ogólne, azot ogólny i amonowy, fosfor ogólny) jedynie na podstawie wyników pomiarów natężenia przepływu ścieków oraz – w celu porównania – na podstawie ich zmierzonych wartości. Do tego celu zastosowano metody czarnej skrzynki typu MARS oraz lasy losowe (RF). Dodatkowo przedstawiono możliwość połączenia metody lasów losowych z modelem klasyfikacyjnym (RF+SOM). Do identyfikacji danych określających zmienność wybranych wskaźników jakości ścieków zastosowano metody drzew wzmacnianych (BT) i analizy składowych głównych (PCA). Modele opracowano na podstawie wyników ciągłych pomiarów dobowych przeprowadzonych w latach 2013–2015 w oczyszczalni ścieków komunalnych w Rzeszowie.
Forecasting the amount and quality of wastewater flowing into a treatment plant sufficiently in advance, enables effective control of numerous treatment process parameters. Therefore, mathematical (physical deterministic and time series statistical) models forecasting both the amount and quality of wastewater inflow into a sewage treatment plant are under development. In this paper, a possibility of simpler time series models application to forecasting values of selected indicators (biochemical oxygen demand (BOD5), total suspended solids (TSS), total nitrogen (TN), total phosphorus (TP) and ammonium (NH4+)) of sewage quality in the inflow into a treatment plant was investigated. The research was based solely on sewage flow rate data and – for the purpose of comparison – the actual measured indicator values. For this purpose, MARS type black-box and random forest (RF) methods were used. Also, a possibility of combining the RF method with a classification model (RF+SOM) was investigated. Boosted trees (BT) and principal component analysis (PCA) methods were applied for identification of data that determine variability of the selected sewage quality indicators. The models were developed on the basis of continuous daily measurements performed in the period of 2013–2015 in the municipal sewage treatment plant in Rzeszow.
Źródło:: Ochrona Środowiska; 2016, 38, 4; 39-46
1230-6169
Pojawia się w:: Ochrona Środowiska
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 2.

Tytuł:: Vibroacoustic Real Time Fuel Classification in Diesel Engine
Autorzy:: Bąkowski, A.
Kekez, M.
Radziszewski, L.
Sapietova, A.
Powiązania:: https://bibliotekanauki.pl/articles/177686.pdf
Data publikacji:: 2018
Wydawca:: Polska Akademia Nauk. Czytelnia Czasopism PAN
Tematy:: fuel recognition
classification trees
particle swarm optimization (PSO)
random forest
Opis:: Five models and methodology are discussed in this paper for constructing classifiers capable of recognizing in real time the type of fuel injected into a diesel engine cylinder to accuracy acceptable in practical technical applications. Experimental research was carried out on the dynamic engine test facility. The signal of in-cylinder and in-injection line pressure in an internal combustion engine powered by mineral fuel, biodiesel or blends of these two fuel types was evaluated using the vibro-acoustic method. Computational intelligence methods such as classification trees, particle swarm optimization and random forest were applied.
Źródło:: Archives of Acoustics; 2018, 43, 3; 385-395
0137-5075
Pojawia się w:: Archives of Acoustics
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 3.

Tytuł:: Using a GEOBIA framework for integrating different data sources and classification methods in context of land use/land cover mapping
Autorzy:: Osmólska, A.
Hawryło, P.
Powiązania:: https://bibliotekanauki.pl/articles/145304.pdf
Data publikacji:: 2018
Wydawca:: Polska Akademia Nauk. Czytelnia Czasopism PAN
Tematy:: mapa użytkowanych gruntów
mapa pokrycia terenu
mapa leśna
data fusion
random forest
supervised classification
Sentinel-2
Opis:: Land use/land cover (LULC) maps are important datasets in various environmental projects. Our aim was to demonstrate how GEOBIA framework can be used for integrating different data sources and classification methods in context of LULC mapping.We presented multi-stage semi-automated GEOBIA classification workflow created for LULC mapping of Tuszyma Forestry Management area based on multi-source, multi-temporal and multi-resolution input data, such as 4 bands- aerial orthophoto, LiDAR-derived nDSM, Sentinel-2 multispectral satellite images and ancillary vector data. Various classification methods were applied, i.e. rule-based and Random Forest supervised classification. This approach allowed us to focus on classification of each class ‘individually’ by taking advantage from all useful information from various input data, expert knowledge, and advanced machine-learning tools. In the first step, twelve classes were assigned in two-steps rule-based classification approach either vector-based, ortho- and vector-based or orthoand Lidar-based. Then, supervised classification was performed with use of Random Forest algorithm. Three agriculture-related LULC classes with vegetation alternating conditions were assigned based on aerial orthophoto and Sentinel-2 information. For classification of 15 LULC classes we obtained 81.3% overall accuracy and kappa coefficient of 0.78. The visual evaluation and class coverage comparison showed that the generated LULC layer differs from the existing land cover maps especially in relative cover of agriculture-related classes. Generally, the created map can be considered as superior to the existing data in terms of the level of details and correspondence to actual environmental and vegetation conditions that can be observed in RS images.
Źródło:: Geodesy and Cartography; 2018, 67, 1; 99-116
2080-6736
2300-2581
Pojawia się w:: Geodesy and Cartography
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 4.

Tytuł:: The use of data mining models in solving the problem of imbalanced classes based on the example of an online marketing campaign
Wykorzystanie modeli data mining w rozwiązywaniu problemu niezrównoważonych klas na przykładzie kampanii marketingowych w Internecie
Autorzy:: Łapczyński, Mariusz
Surma, Jerzy
Powiązania:: https://bibliotekanauki.pl/articles/424980.pdf
Data publikacji:: 2015
Wydawca:: Wydawnictwo Uniwersytetu Ekonomicznego we Wrocławiu
Tematy:: C&RT
Random Forest
imbalanced class problem
online social network
banner ad campaign
Opis:: While building predictive models in analytical CRM, researchers often encounter the problem of imbalanced classes (skewed distributions of dependent variables), which consists in the fact that the number of observations belonging to one category of the dependent variable is much lower than the number of observations belonging to the second category of that variable. This is related to such areas as churn analysis, customer acquisition models and cross and up-selling models. The purpose of the paper is to present a predictive model that was built to predict the response of Internet users to banner advertising. The dataset used in the study came from an online social network which offers advertisers banner campaigns targeting its users. The advertising campaign of a cosmetics company was carried out in the autumn of 2010 and was mainly targeted at young women. A user of this service was described by 115 independent variables – 3 out of which were demographic variables (sex, age, education), and the remaining 112 referred to the user’s online activity. While building the model there appeared the problem of imbalanced classes due to the low number of users who clicked on the banner ad. The number of cases amounted to 81,000, while the number of positive reactions to the banner was 207, which constitutes approximately 0.25% of the dependent variable. During the study, two popular data mining tools were utilized – the decision trees C&RT and Random Forest. The second goal of this paper is to compare the performance of the predictive models based on both these analytical tools.
Źródło:: Econometrics. Ekonometria. Advances in Applied Data Analytics; 2015, 3 (49); 9-19
1507-3866
Pojawia się w:: Econometrics. Ekonometria. Advances in Applied Data Analytics
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 5.

Tytuł:: The increase of the performance of ultrafine coal flotation by using emulsified kerosene and the prediction of the flotation parameters by random forest and genetic algorithm
Poprawa efektywności flotacji węgla drobnoziarnistego przy wykorzystaniu emulsji naftowej oraz prognozowanie parametrów procesu flotacji przy użyciu metody lasów losowych oraz algorytmu genetycznego
Autorzy:: Oney, Ozcan
Powiązania:: https://bibliotekanauki.pl/articles/219716.pdf
Data publikacji:: 2019
Wydawca:: Polska Akademia Nauk. Czytelnia Czasopism PAN
Tematy:: flotacja węgla drobnoziarnistego
emulsja naftowa
metoda lasów losowych
algorytm genetyczny
ultrafine coal flotation
emulsified kerosene
random forest
genetic algorithm
Opis:: In this study, emulsified kerosene was investigated to improve the flotation performance of ultrafine coal. For this purpose, NP-10 surfactant was used to form the emulsified kerosene. Results showed that the emulsified kerosene increased the recovery of ultrafine coal compared to kerosene. This study also revealed the effect of independent variables (emulsified collector dosage (ECD), frother dosage (FD) and impeller speed (IS)) on the responses (concentrate yield (γC %), concentrate ash content (%) and combustible matter recovery (ε %)) based on Random Forest (RF) model and Genetic Algorithm (GA). The proposed models for γC %, % and ε% showed satisfactory results with R2. The optimal values of three test variables were computed as ECD = 330.39 g/t, FD = 75.50 g/t and IS = 1644 rpm by using GA. Responses at these experimental optimal conditions were γC % = 58.51%, % = 21.7% and ε % = 82.83%. The results indicated that GA was a beneficial method to obtain the best values of the operating parameters. According to results obtained from optimal flotation conditions, kerosene consumption was reduced at the rate of about 20% with using the emulsified kerosene.
W pracy zbadano możliwość wykorzystania emulsji naftowej do poprawy efektywności flotacji węgla drobnoziarnistego. W tym celu wykorzystano środek powierzchniowo czynny NP.-10 do utworzenia emulsji naftowej. Badania wykazały, że zastosowanie nafty w formie emulsji poprawiło wskaźniki odzysku węgla w porównaniu do procesów z wykorzystaniem nafty. W pracy badano także wpływ zmiennych zależnych (dozowanie emulsji w kolektorze ECD, dozowanie środka pianotwórczego FD, prędkość wirnika IS na wyniki procesu (uzysk koncentratu (γC %), zawartość popiołów (%) i stopień odzysku materii palnej (ε%), w oparciu o metodę lasów losowych i algorytm genetyczny. Proponowane modele pozwoliły na uzyskanie zadawalających wyników dla wskaźników γC %, %, ε %, w odniesieniu do współczynnika R2. Optymalne wartości badanych zmiennych ECD = 330.39 g/t, FD = 75.50 g/t and IS = 1644 obrotów na minutę obliczono przy wykorzystaniu algorytmu genetycznego. Wyniki procesu prowadzonego w wa-runkach optymalnych, określonych eksperymentalnie to γC % = 58.81 %; % = 21.7 %; ε % = 82.83 %. Uzyskane wyniki wskazują, że wykorzystanie algorytmu genetycznego jest metodą umożliwiającąotrzymanie najkorzystniejszych wartości parametrów pracy. Na podstawie wyników flotacji uzyskanych w najkorzystniejszych warunkach stwierdzono, że zużycie nafty obniżone zostało o ok. 20% dzięki zastosowaniu nafty w postaci emulsji.
Źródło:: Archives of Mining Sciences; 2019, 64, 1; 119-130
0860-7001
Pojawia się w:: Archives of Mining Sciences
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 6.

Tytuł:: Sparse data classifier based on first-past-the-post voting system
Autorzy:: Cudak, Magdalena
Piech, Mateusz
Marcjan, Robert
Powiązania:: https://bibliotekanauki.pl/articles/27312911.pdf
Data publikacji:: 2022
Wydawca:: Akademia Górniczo-Hutnicza im. Stanisława Staszica w Krakowie. Wydawnictwo AGH
Tematy:: POI
machine learning
geospatial data
data science
first-past-the-post
random forest
point of interest
Opis:: A point of interest (POI) is a general term for objects that describe places from the real world. The concept of POI matching (i.e., determining whether two sets of attributes represent the same location) is not a trivial challenge due to the large variety of data sources. The representations of POIs may vary depending on the basis of how they are stored. A manual comparison of objects is not achievable in real time; therefore, there are multiple solutions for automatic merging. However, there is no yet the efficient solution solves the missing of the attributes. In this paper, we propose a multi-layered hybrid classifier that is composed of machine-learning and deep-learning techniques and supported by a first-past-the-post voting system. We examined different weights for the constituencies that were taken into consideration during a majority (or supermajority) decision. As a result, we achieved slightly higher accuracy than the best current model (random forest), which also is based on voting.
Źródło:: Computer Science; 2022, 23 (2); 277--296
1508-2806
2300-7036
Pojawia się w:: Computer Science
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 7.

Tytuł:: Space-Time-Frequency Machine Learning for Improved 4G/5G Energy Detection
Autorzy:: Wasilewska, Małgorzata
Bogucka, Hanna
Powiązania:: https://bibliotekanauki.pl/articles/226216.pdf
Data publikacji:: 2020
Wydawca:: Polska Akademia Nauk. Czytelnia Czasopism PAN
Tematy:: spectrum sensing
cognitive radio
machine learning
energy detection
4G
LTE
5G
k-nearest neighbors
random forest
Opis:: In this paper, the future Fifth Generation (5G New Radio) radio communication system has been considered, coexisting and sharing the spectrum with the incumbent Fourth Generation (4G) Long-Term Evolution (LTE) system. The 4G signal presence is detected in order to allow for opportunistic and dynamic spectrum access of 5G users. This detection is based on known sensing methods, such as energy detection, however, it uses machine learning in the domains of space, time and frequency for sensing quality improvement. Simulation results for the considered methods: k-Nearest Neighbor sand Random Forest show that these methods signiﬁcantly improves the detection probability.
Źródło:: International Journal of Electronics and Telecommunications; 2020, 66, 1; 217-223
2300-1933
Pojawia się w:: International Journal of Electronics and Telecommunications
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 8.

Tytuł:: Semantic Segmentation of Diseases in Mushrooms using Enhanced Random Forest
Autorzy:: Yacharam, Rakesh Kumar
Sekhar, Dr. V. Chandra
Powiązania:: https://bibliotekanauki.pl/articles/31339414.pdf
Data publikacji:: 2023
Wydawca:: Szkoła Główna Gospodarstwa Wiejskiego w Warszawie. Instytut Informatyki Technicznej
Tematy:: mushroom diseases
semantic segmentation
computer aided
Machine Learning
significant feature extraction
Random Forest classifier
Opis:: Mushrooms are a rich source of antioxidants and nutritional values. Edible mushrooms, however, are susceptible to various diseases such as dry bubble, wet bubble, cobweb, bacterial blotches, and mites. Farmers face significant production losses due to these diseases affecting mushrooms. The manual detection of these diseases relies on expertise, knowledge of diseases, and human effort. Therefore, there is a need for computer-aided methods, which serve as optimal substitutes for detecting and segmenting diseases. In this paper, we propose a semantic segmentation approach based on the Random Forest machine learning technique for the detection and segmentation of mushroom diseases. Our focus lies in extracting a combination of different features, including Gabor, Bouda, Kayyali, Gaussian, Canny edge, Roberts, Sobel, Scharr, Prewitt, Median, and Variance. We employ constant mean-variance thresholding and the Pearson correlation coefficient to extract significant features, aiming to enhance computational speed and reduce complexity in training the Random Forest classifier. Our results indicate that semantic segmentation based on Random Forest outperforms other methods such as Support Vector Machine (SVM), Naïve Bayes, K-means, and Region of Interest in terms of accuracy. Additionally, it exhibits superior precision, recall, and F1 score compared to SVM. It is worth noting that deep learning-based semantic segmentation methods were not considered due to the limited availability of diseased mushroom images.
Źródło:: Machine Graphics & Vision; 2023, 32, 2; 129-146
1230-0535
2720-250X
Pojawia się w:: Machine Graphics & Vision
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 9.

Tytuł:: Random forest based power sustainability and cost optimization in smart grid
Autorzy:: Durairaj, Danalakshmi
Wróblewski, Łukasz
Sheela, A.
Hariharasudan, A.
Urbański, Mariusz
Powiązania:: https://bibliotekanauki.pl/articles/23966623.pdf
Data publikacji:: 2022
Wydawca:: Stowarzyszenie Menedżerów Jakości i Produkcji
Tematy:: smart grid
las losowy
internet rzeczy
zarządzanie energią
uczenie maszynowe
licznik inteligentny
random forest
Internet of things
power management
machine learning
smart meter
priority power scheduling
Opis:: Presently power control and management play a vigorous role in information technology and power management. Instead of non-renewable power manufacturing, renewable power manufacturing is preferred by every organization for controlling resource consumption, price reduction and efficient power management. Smart grid efficiently satisfies these requirements with the integration of machine learning algorithms. Machine learning algorithms are used in a smart grid for power requirement prediction, power distribution, failure identification etc. The proposed Random Forest-based smart grid system classifies the power grid into different zones like high and low power utilization. The power zones are divided into number of sub-zones and map to random forest branches. The sub-zone and branch mapping process used to identify the quantity of power utilized and the non-utilized in a zone. The non-utilized power quantity and location of power availabilities are identified and distributed the required quantity of power to the requester in a minimal response time and price. The priority power scheduling algorithm collect request from consumer and send the request to producer based on priority. The producer analysed the requester existing power utilization quantity and availability of power for scheduling the power distribution to the requester based on priority. The proposed Random Forest based sustainability and price optimization technique in smart grid experimental results are compared to existing machine learning techniques like SVM, KNN and NB. The proposed random forest-based identification technique identifies the exact location of the power availability, which takes minimal processing time and quick responses to the requestor. Additionally, the smart meter based smart grid technique identifies the faults in short time duration than the conventional energy management technique is also proven in the experimental results.
Źródło:: Production Engineering Archives; 2022, 28, 1; 82--92
2353-5156
2353-7779
Pojawia się w:: Production Engineering Archives
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 10.

Tytuł:: Prognozowanie przedziału czasowego z maksymalnym w ciągu doby z użyciem gazu przez kotłownię
Forecasting the time interval of the day with the maximum boilers gas consumption
Autorzy:: Nowak, Bogdan
Bartnicki, Grzegorz
Powiązania:: https://bibliotekanauki.pl/articles/394678.pdf
Data publikacji:: 2019
Wydawca:: Polska Akademia Nauk. Instytut Gospodarki Surowcami Mineralnymi i Energią PAN
Tematy:: zużycie gazu
model prognostyczny
random forest
gas consumption
prognostic model
Opis:: Działania mające na celu poprawę efektywności energetycznej systemów zaopatrzenia w ciepło wymagają korzystania z coraz bardziej złożonych metod. Podstawowe sposoby zmniejszenia zużycia ciepła poprzez stosowanie lepszej izolacji cieplnej mają coraz bardziej ograniczone możliwości iwymagają stosunkowo dużych nakładów finansowych. Dobre efekty mogą być osiągane przez coraz lepsze dopasowanie rozwiązań technicznych, sposobów regulacji czy zasad eksploatacji źródła ciepła do warunków konkretnego obiektu zasilanego wciepło. Wymaga to jednak zarówno badań identyfikujących skuteczność takich metod, jak inarzędzi służących do opisu wybranych elementów systemu czy jego całości. Artykuł przedstawia wyniki badań przeprowadzonych dla kotłowni gazowej zasilającej w ciepło grupę budynków mieszkalnych. Celem było zbudowanie modelu, który prognozowałby dla konkretnego dnia przedział czasowy, w którym występuje maksymalne zużycie gazu. Dysponując pomiarami zużycia gazu wkolejnych godzinach doby, zdecydowano się zbudować model prognostyczny wyznaczający tę część doby, w której takie maksimum wystąpi. W opracowanym modelu zdecydowano się zastosować procedurę lasów losowych (random forest). Do utworzenia modelu zastosowano pakiet mlr (Kassambara), w którym przeprowadzono również strojenie hiperparametrów modelu na bazie danych historycznych. W oparciu o odrębne dane dla innego okresu działania kotłowni przedstawiono wyniki oceny jego jakości. Uzyskano skuteczność niemal 44%. Strojenie modelu wpłynęło na poprawę jego zdolności predykcyjnych.
The heat supply systems energy efficiency improvement requires the use of increasingly complex methods. The basic ways to reduce heat consumption is by using better thermal insulation, although they have more and more limited possibilities and need relatively large financial outlays. Good effects can be achieved by the better heat source adaptation to the conditions of aspecific facility supplied with heat. However, this requires research that identifies the effectiveness of such solutions as well as the tools used to describe selected elements of the system or its entirety. The article presents the results of tests carried out for agas boiler room supplying heat to agroup of residential buildings. The goal was to build amodel that would forecast the day range in which the maximum gas consumption occurs for agiven day. Having measurements of gas consumption in subsequent hours of the day, it was decided to build aforecasting model determining the part of the day in which such amaximum would occur. To create the model the random forest procedure was used along with the mlr (Kassambara) package. The model’s hyperparameters were tuned based on historical data. Based on data for another period of boilerroom operation, the results of the model’s quality assessment were presented. Close to 44% efficiency was achieved. Tuning the model improved its predictive ability.
Źródło:: Zeszyty Naukowe Instytutu Gospodarki Surowcami Mineralnymi i Energią PAN; 2019, 109; 93-109
2080-0819
Pojawia się w:: Zeszyty Naukowe Instytutu Gospodarki Surowcami Mineralnymi i Energią PAN
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 11.

Tytuł:: Predictive Business Process Monitoring with Tree-based Classification Algorithms
Autorzy:: Owczarek, Tomasz
Janke, Piotr
Powiązania:: https://bibliotekanauki.pl/articles/503954.pdf
Data publikacji:: 2018
Wydawca:: Międzynarodowa Wyższa Szkoła Logistyki i Transportu
Tematy:: business process
prediction
classification
random forest
gradient boosting
Opis:: Predictive business process monitoring is a current research area which purpose is to predict the outcome of a whole process (or an element of a process i.e. a single event or task) based on available data. In the article we explore the possibility of use of the machine learning classification algorithms based on trees (CART, C5.0, random forest and extreme gradient boosting) in order to anticipate the result of a process. We test the application of these algorithms on real world event-log data and compare it with the known approaches. Our results show that.
Źródło:: Logistics and Transport; 2018, 40, 4; 73-82
1734-2015
Pojawia się w:: Logistics and Transport
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 12.

Tytuł:: Prediction of flexural strength of FRC pavements by soft computing techniques
Autorzy:: Kimteta, A.
Thakur, M.S.
Sihag, P.
Upadhya, A.
Sharma, N.
Powiązania:: https://bibliotekanauki.pl/articles/24200582.pdf
Data publikacji:: 2022
Wydawca:: Stowarzyszenie Komputerowej Nauki o Materiałach i Inżynierii Powierzchni w Gliwicach
Tematy:: flexural strength
fibre reinforced concrete
artificial neural network
random forest
random tree
M5P based model
wytrzymałość na zginanie
beton zbrojony włóknami
sztuczna sieć neuronowa
las losowy
drzewo losowe
model oparty na M5P
Opis:: Purpose: The mechanical characteristics of concrete used in rigid pavements can be improved by using fibre-reinforced concrete. The purpose of the study was to predict the flexural strength of the fibre-reinforced concrete for ten input variables i.e., cement, fine aggregate, coarse aggregate, water, superplasticizer/high range water reducer, glass fibre, polypropylene fibre, steel fibres, length and diameter of fibre and further to perform the sensitivity analysis to determine the most sensitive input variable which affects the flexural strength of the said fibre-reinforced concrete. Design/methodology/approach: The data used in the study was acquired from the published literature to create the soft computing modes. Four soft computing techniques i.e., Artificial neural networks (ANN), Random forests (RF), Random trees RT), and M5P, were applied to predict the flexural strength of fibre-reinforced concrete for rigid pavement using ten significant input variables as stated in the ‘purpose’. The most performing algorithm was determined after evaluating the applied models on the threshold of five statistical indices, i.e., the coefficient of correlation, mean absolute error, root mean square error, relative absolute error, and root relative squared error. The sensitivity analysis for most sensitive input variable was performed with out-performing model, i.e., ANN. Findings: The testing stage findings show that the Artificial neural networks model outperformed other applicable models, having the highest coefficient of correlation (0.9408), the lowest mean absolute error (0.8292), and the lowest root mean squared error (1.1285). Furthermore, the sensitivity analysis was performed using the artificial neural networks model. The results demonstrate that polypropylene fibre-reinforced concrete significantly influences the prediction of the flexural strength of fibre-reinforced concrete. Research limitations/implications: Large datasets may enhance machine learning technique performance. Originality/value: The article's novelty is that the most suitable model amongst the four applied techniques has been identified, which gives far better accuracy in predicting flexural strength.
Źródło:: Archives of Materials Science and Engineering; 2022, 117, 1; 13--24
1897-2764
Pojawia się w:: Archives of Materials Science and Engineering
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 13.

Tytuł:: Prediction model of public houses’ heating systems:a comparison of support vector machine methodand random forest method
Model prognozowania systemów grzewczych budynków użyteczności publicznej: porównanie metody support vector machine i random forest
Autorzy:: Perekrest, Andrii
Chenchevoi, Vladimir
Chencheva, Olga
Kovalenko, Alexandr
Kushch-Zhyrko, Mykhailo
Kalizhanova, Aliya
Amirgaliyev, Yedilkhan
Powiązania:: https://bibliotekanauki.pl/articles/2174707.pdf
Data publikacji:: 2022
Wydawca:: Politechnika Lubelska. Wydawnictwo Politechniki Lubelskiej
Tematy:: building heat supply
random forest
support vector machine
zaopatrzenie w ciepło budynku
metoda wektorów wspierających
Opis:: Data analysis and predicting play an important role in managing heat-supplying systems. Applying the models of predicting the systems’ parameters is possible for qualitative management, accepting appropriate decisions relating control that will be aimed at increasing energy efficiency and decreasing the amount of the consumed power source, diagnosing and defining non-typical processes in the functioning of the systems. The article deals with comparing two methods of ma-chine learning: random forest (RF) and support vector machine (SVM) for predicting the temperature of the heat-carrying agent in the heating system based on the data of electronic weather-dependent controller. The authors use the following parameters to compare the models: accuracy, source cost and the opportunity to interpret the results and non-obvious interrelations. The time spent for defining the optimal hyperparameters and conducting the SVM model training is deter-mined to exceed significantly the data of the RF parameter despite the close meanings of the root mean square error (RMSE). The change from 15-min data to once-a-minute ones is done to improve the RF model accuracy. RMSE of the RF model on the test data equals 0.41°С. The article studies the importance of the contribution of variables to the prediction accuracy.
Analiza danych i prognozowanie odgrywają ważną rolę w zarządzaniu systemami zaopatrzenia w ciepło. Wykorzystanie modeli do przewidywania parametrów systemu jest możliwe do zarządzania jakością, podejmowania odpowiednich decyzji sterujących, które będą miały na celu poprawę efektywności energetycznej i zmniejszenie ilości zużywanego źródła energii elektrycznej, diagnozowania i wykrywania nietypowych procesów w funkcjonowaniu systemu. W artykule porównano dwie metody uczenia maszynowego: Random Forest (RF) i Support Vector Machine (SVM) do przewidywania temperatury czynnika grzewczego w systemie grzewczym na podstawie danych elektronicznego regulatora pogodowego. Do porównania modeli autorzy wykorzystują następujące parametry: dokładność, koszt początkowy oraz możliwość interpretacji wyników i nieoczywistych zależności. Ustalono, że czas poświęcony na wyznaczenie optymalnych hiperparametrów i wytrenowanie modelu SVM znacznie przekracza dane parametru RF, pomimo zbliżonych wartości błędu średniokwadratowego (RMSE). Zmiana z danych 15-minutowych na dane raz na minutę została dokonana w celu poprawy dokładności modelu RF. RMSE modelu RF z danych testowych wynosi 0,41°C. W pracy zbadano znaczenie wkładu zmiennych w dokładność prognozy.
Źródło:: Informatyka, Automatyka, Pomiary w Gospodarce i Ochronie Środowiska; 2022, 12, 3; 34--39
2083-0157
2391-6761
Pojawia się w:: Informatyka, Automatyka, Pomiary w Gospodarce i Ochronie Środowiska
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 14.

Tytuł:: Predicting the stability of open stopes using Machine Learning
Autorzy:: Szmigiel, Alicja
Apel, Derek B.
Powiązania:: https://bibliotekanauki.pl/articles/2201415.pdf
Data publikacji:: 2022
Wydawca:: Główny Instytut Górnictwa
Tematy:: open stope
machine learning
logistic regression
random forest
system otwartych komór
uczenie maszynowe
regresja logistyczna
las losowy
Opis:: The Mathews stability graph method was presented for the first time in 1980. This method was developed to assess the stability of open stopes in different underground conditions, and it has an impact on evaluating the safety of underground excavations. With the development of technology and growing experience in applying computer sciences in various research disciplines, mining engineering could significantly benefit by using Machine Learning. Applying those ML algorithms to predict the stability of open stopes in underground excavations is a new approach that could replace the original graph method and should be investigated. In this research, a Potvin database that consisted of 176 historical case studies was passed to the two most popular Machine Learning algorithms: Logistic Regression and Random Forest, to compare their predicting capabilities. The results obtained showed that those algorithms can indicate the stability of underground openings, especially Random Forest, which, in examined data, performed slightly better than Logistic Regression.
Źródło:: Journal of Sustainable Mining; 2022, 21, 3; 241--248
2300-1364
2300-3960
Pojawia się w:: Journal of Sustainable Mining
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 15.

Tytuł:: Predicting immunogenicity in murine hosts with use of Random Forest classifier
Przewidywanie immunogenności u myszy przy użyciu klasyfikatora Random Forest
Autorzy:: Marciniak, Anna
Tarczewska, Martyna
Kloska, Sylwester
Powiązania:: https://bibliotekanauki.pl/articles/2016293.pdf
Data publikacji:: 2020
Wydawca:: Politechnika Bydgoska im. Jana i Jędrzeja Śniadeckich. Wydawnictwo PB
Tematy:: Random Forest Classifier
immunogenicity
machine learning
entropy
Gini index
klasyfikator Random Forest
immunogenność
uczenie maszynowe
entropia
Opis:: Biomedical data are difficult to interpret due to their large amount. One of the solutions to cope with this problem is to use machine learning. Machine learning can be used to capture previously unnoticed dependencies. The authors performed random forest classifier with entropy and Gini index criteria on immunogenicity data. Input data consisted of 3 columns: epitope (8-11 amino acids long peptide), major histocompatibility complex (MHC) and immune response. Presented model can predict the immune response based on epitope-MHC complex. Achieved results had accuracy of 84% for entropy and 83% for Gini index. The results are not fully satisfying but are a fair start for more complexed experiments and could be used as an indicator for further research.
Dane biomedyczne są trudne do interpretacji ze względu na ich dużą ilość. Jednym z rozwiązań radzenia sobie z tym problemem jest wykorzystanie uczenia maszynowego. Techniki te umożliwiają wychwycenie wcześniej niezauważonych zależności. W artykule przedstawiono wykorzystanie klasyfikatora Random Forest z kryterium entropii i indeksem Gini na danych dotyczących immunogenności. Dane wejściowe składają się z 3 kolumn: epitop (peptyd o długości 8-11 aminokwasów), główny kompleks zgodności tkankowej (MHC) i odpowiedź immunologiczna. Zaprezentowany model przewiduje odpowiedź immunologiczną na podstawie kompleksu epitop-MHC. Uzyskane wyniki osiągnęły dokładność na poziomie 84% (entropia) i 83% (indeks Gini). Wyniki nie są w pełni satysfakcjonujące, ale stanowią dobry początek dla bardziej złożonych eksperymentów i wyznacznik do dalszych badań.
Źródło:: Zeszyty Naukowe. Telekomunikacja i Elektronika / Uniwersytet Technologiczno-Przyrodniczy w Bydgoszczy; 2020, 24; 31-43
1899-0088
Pojawia się w:: Zeszyty Naukowe. Telekomunikacja i Elektronika / Uniwersytet Technologiczno-Przyrodniczy w Bydgoszczy
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Informacja

Wyszukujesz frazę "Random Forest" wg kryterium: Temat

Źródło danych

Dostawca treści

Kolekcja

Rok wydania

Wydawca

Temat

Autor

Typ dokumentu

Język