Subject: Random Forest - OPAC collection catalog


Title:: Predicting immunogenicity in murine hosts with use of Random Forest classifier
Przewidywanie immunogenności u myszy przy użyciu klasyfikatora Random Forest
Authors:: Marciniak, Anna
Tarczewska, Martyna
Kloska, Sylwester
Links:: https://bibliotekanauki.pl/articles/2016293.pdf
Publication date:: 2020
Publisher:: Politechnika Bydgoska im. Jana i Jędrzeja Śniadeckich. Wydawnictwo PB
Subjects:: Random Forest Classifier
immunogenicity
machine learning
entropy
Gini index
Description:: Biomedical data are difficult to interpret because of their sheer volume. One way to cope with this problem is to use machine learning, which can capture previously unnoticed dependencies. The authors applied a random forest classifier with the entropy and Gini index criteria to immunogenicity data. The input data consisted of 3 columns: epitope (a peptide 8-11 amino acids long), major histocompatibility complex (MHC) and immune response. The presented model predicts the immune response based on the epitope-MHC complex. It achieved an accuracy of 84% with entropy and 83% with the Gini index. The results are not fully satisfying, but they are a fair start for more complex experiments and could be used as an indicator for further research.
Source:: Zeszyty Naukowe. Telekomunikacja i Elektronika / Uniwersytet Technologiczno-Przyrodniczy w Bydgoszczy; 2020, 24; 31-43
1899-0088
Appears in:: Zeszyty Naukowe. Telekomunikacja i Elektronika / Uniwersytet Technologiczno-Przyrodniczy w Bydgoszczy
Content provider:: Biblioteka Nauki

Article
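
The two split criteria named in the description above can be illustrated with a minimal sketch. It uses the textbook definitions of Gini impurity and Shannon entropy; the function names and the toy labels are assumptions made for illustration and are not taken from the article.

```python
import math
from collections import Counter


def gini_index(labels):
    """Gini impurity: 1 - sum(p_k^2) over the class proportions p_k."""
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((count / n) ** 2 for count in Counter(labels).values())


def entropy(labels):
    """Shannon entropy in bits: -sum(p_k * log2(p_k))."""
    n = len(labels)
    if n == 0:
        return 0.0
    return -sum((count / n) * math.log2(count / n)
                for count in Counter(labels).values())


def split_score(left, right, criterion):
    """Size-weighted impurity of a candidate split; lower is better."""
    n = len(left) + len(right)
    return (len(left) / n) * criterion(left) + (len(right) / n) * criterion(right)


# Hypothetical immune-response labels (1 = response, 0 = no response).
parent = [1, 1, 0, 0]
print(gini_index(parent), entropy(parent))
print(split_score([1, 1], [0, 0], gini_index))
```

A tree in the forest picks, at each node, the split with the lowest weighted impurity; the two criteria usually rank splits similarly, which is consistent with the close accuracies reported above.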


Title:: Artificial Intelligence Based Flood Forecasting for River Hunza at Danyor Station in Pakistan
Authors:: Yaseen, Muhammad Waseem
Awais, Muhammad
Riaz, Khuram
Rasheed, Muhammad Babar
Waqar, Muhammad
Rasheed, Sajid
Links:: https://bibliotekanauki.pl/articles/31340346.pdf
Publication date:: 2022
Publisher:: Polska Akademia Nauk. Instytut Budownictwa Wodnego PAN
Subjects:: hydrometeorology
random forest
support vector
multilayer perceptron
machine learning
flood forecasting
Description:: Floods can cause significant problems for humans and can damage the economy. Implementing a reliable flood monitoring and warning system in risk areas can help to reduce the negative impacts of these natural disasters. Researchers employ artificial intelligence algorithms and statistical approaches to enhance flood forecasting. In this study, a dataset was created using unique features measured by sensors along the Hunza River in Pakistan over the past 31 years. The dataset was used for classification and regression problems. Two types of machine learning algorithms were tested for classification: classical algorithms (Random Forest, RF, and Support Vector Classifier, SVC) and deep learning algorithms (Multi-Layer Perceptron, MLP). For the regression problem, the results of the MLP and Support Vector Regression (SVR) algorithms were compared using their mean squared, root mean squared and mean absolute errors. The results show that the accuracy of the RF classifier is 0.99, while the accuracies of the SVC and MLP methods are 0.98; moreover, in the case of flood prediction, the SVR algorithm outperforms the MLP approach.
Source:: Archives of Hydro-Engineering and Environmental Mechanics; 2022, 69, 1; 59-77
1231-3726
Appears in:: Archives of Hydro-Engineering and Environmental Mechanics
Content provider:: Biblioteka Nauki

Article
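
The three regression error measures mentioned in the description above can be computed as in the following sketch. It uses the standard definitions; the function name and the sample values are illustrative assumptions, not data from the study.

```python
import math


def regression_errors(y_true, y_pred):
    """Return the mean squared, root mean squared and mean absolute errors."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    residuals = [t - p for t, p in zip(y_true, y_pred)]
    mse = sum(r * r for r in residuals) / len(residuals)
    mae = sum(abs(r) for r in residuals) / len(residuals)
    return {"mse": mse, "rmse": math.sqrt(mse), "mae": mae}


# Hypothetical observed and predicted discharge values.
observed = [2.0, 4.0, 6.0]
predicted = [1.0, 4.0, 8.0]
print(regression_errors(observed, predicted))
```

Because the squared errors weight large residuals more heavily, MSE and RMSE penalize occasional large misses more than MAE does, which is why the three are often reported together.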


Title:: A random forest model for the prediction of spudcan penetration resistance in stiff-over-soft clays
Authors:: Gao, Pan
Liu, Zhihui
Zeng, Ji
Zhan, Yiting
Wang, Fei
Links:: https://bibliotekanauki.pl/articles/1573798.pdf
Publication date:: 2020
Publisher:: Politechnika Gdańska. Wydział Inżynierii Mechanicznej i Okrętownictwa
Subjects:: machine learning
random forest
jack-up
penetration resistance
stiff-over-soft clays
Description:: Punch-through is a major threat to the jack-up unit, especially at well sites with layered stiff-over-soft clays. A model based on the random forest (RF) method is proposed to predict the spudcan penetration resistance in stiff-over-soft clays. The RF model was trained and tested with numerical simulation results obtained from a Finite Element model implemented with the Coupled Eulerian Lagrangian (CEL) approach. With the proposed CEL model, the effects of the stiff layer thickness, the undrained shear strength ratio, and the undrained shear strength of the soft layer on the bearing characteristics, as well as the soil failure mechanism, were studied numerically. A simplified resistance profile model of penetration in stiff-over-soft clays is proposed, divided into three sections by the peak point and the transition point. The importance of the soil parameters to the penetration resistance was analysed. The trained RF model was then evaluated on the test set, showing a good prediction of the numerical cases. Finally, the trained RF model was validated against centrifuge tests. The RF model successfully captured the punch-through potential and was verified using data recorded in the field, showing advantages over the SNAME guideline. The trained RF model is expected to give a good prediction of the spudcan penetration resistance profile, especially if trained with more field data.
Source:: Polish Maritime Research; 2020, 4; 130-138
1233-2585
Appears in:: Polish Maritime Research
Content provider:: Biblioteka Nauki

Article


Title:: Application of machine learning algorithms to predict permeability in tight sandstone formations
Zastosowanie metod uczenia maszynowego do przewidywania przepuszczalności w formacjach zwięzłych piaskowców typu tight gas
Authors:: Topór, Tomasz
Links:: https://bibliotekanauki.pl/articles/2143653.pdf
Publication date:: 2021
Publisher:: Instytut Nafty i Gazu - Państwowy Instytut Badawczy
Subjects:: machine learning
random forest
permeability
prediction
Description:: The application of machine learning algorithms in petroleum geology has opened a new chapter in oil and gas exploration. Machine learning algorithms have been successfully used to predict crucial petrophysical properties when characterizing reservoirs. This study uses machine learning to predict permeability under confining stress conditions for samples from tight sandstone formations. The models were constructed using two machine learning algorithms of varying complexity (multiple linear regression [MLR] and random forests [RF]) and trained on a dataset that combined basic well information, basic petrophysical data, and rock type from macroscopic and microscopic descriptions of the core material. The rock type was encoded and subjected to feature engineering to increase the number of predictors in the models. To check the robustness of the trained models, 10-fold cross-validation was performed. The MLR and RF applications demonstrated that both algorithms can accurately predict permeability under constant confining pressure (R² of 0.800 for MLR vs. 0.834 for RF). The RF accuracy was about 3% better than that of the MLR and about 6% better than the linear reference regression (LR) that used only porosity. Porosity was the most influential feature for the performance of both models. In the case of RF, depth was also significant in the permeability predictions, which could be evidence of hidden interactions between porosity and depth. The local interpretation revealed features common to the outliers: in both the training and testing sets, the poorly predicted samples had moderate-to-low porosity (3–10%) and lacked fractures. In the test set, calcite or quartz cementation also led to poor permeability predictions. The workflow, which is built on the tidymodels concept, will be further applied in more complex examples to predict spatial petrophysical features from seismic attributes using various machine learning algorithms.
Source:: Nafta-Gaz; 2021, 77, 5; 283-292
0867-8871
Appears in:: Nafta-Gaz
Content provider:: Biblioteka Nauki

Article
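
The description above reports R² scores and 10-fold cross-validation. The sketch below shows, in plain Python, how an R² score is computed and how sample indices can be partitioned into k folds. It is only an illustration of the two concepts: the study itself used the tidymodels workflow in R, and the function names here are hypothetical. The folds are contiguous and unshuffled, which is a simplifying assumption.

```python
def r_squared(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot.

    Undefined (division by zero) when y_true is constant.
    """
    mean_y = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_y) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


def k_fold_indices(n_samples, k=10):
    """Yield (train_idx, test_idx) pairs; each sample is tested exactly once."""
    indices = list(range(n_samples))
    # Spread the remainder over the first folds so sizes differ by at most one.
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0)
                  for i in range(k)]
    start = 0
    for size in fold_sizes:
        test_idx = indices[start:start + size]
        train_idx = indices[:start] + indices[start + size:]
        yield train_idx, test_idx
        start += size


# A model that always predicts the mean scores R^2 = 0; a perfect one scores 1.
print(r_squared([1.0, 2.0, 3.0], [2.0, 2.0, 2.0]))
print([len(test) for _, test in k_fold_indices(23, k=10)])
```

In practice the per-fold scores are averaged, so a model that only fits one lucky subset of the data is exposed by a wide spread between folds.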


Title:: Developing a data-driven soft sensor to predict silicate impurity in iron ore flotation concentrate
Authors:: Pural, Yusuf Enes
Links:: https://bibliotekanauki.pl/articles/24148677.pdf
Publication date:: 2023
Publisher:: Politechnika Wrocławska. Oficyna Wydawnicza Politechniki Wrocławskiej
Subjects:: soft sensor
machine learning
random forest
multi-layer perceptron
flotation
grade estimation
Description:: Soft sensors are mathematical models that estimate the value of a process variable that is difficult or expensive to measure directly. They can be based on first-principle models, data-based models, or a combination of both. These models are increasingly used in mineral processing to estimate and optimize important performance parameters such as mill load, mineral grades, and particle size. This study investigates the development of a data-driven soft sensor to predict the silicate content in iron ore reverse flotation concentrate, a crucial indicator of plant performance. The proposed soft sensor model employs a dataset obtained from Kaggle, which includes measurements of iron and silicate content in the feed to the plant, reagent dosages, weight and pH of pulp, as well as the amount of air and froth levels in the flotation units. To reduce the dimensionality of the dataset, Principal Component Analysis, an unsupervised machine learning method, was applied. The soft sensor model was developed using three machine learning algorithms, namely Ridge Regression, Multi-Layer Perceptron, and Random Forest. The Random Forest model, created with non-reduced data, demonstrated superior performance, with an R-squared value of 96.5% and a mean absolute error of 0.089. The results suggest that the proposed soft sensor model can accurately predict the silicate content in the iron ore flotation concentrate using machine learning algorithms. Moreover, the study highlights the importance of selecting appropriate algorithms for soft sensor development in mineral processing plants.
Source:: Physicochemical Problems of Mineral Processing; 2023, 59, 5; art. no. 169823
1643-1049
2084-4735
Appears in:: Physicochemical Problems of Mineral Processing
Content provider:: Biblioteka Nauki

Article


Title:: A Study on the Optimization of Metalloid Contents of Fe-Si-B-C Based Amorphous Soft Magnetic Materials Using Artificial Intelligence Method
Authors:: Choi, Young-Sin
Kwon, Do-Hun
Lee, Min-Woo
Cha, Eun-Ji
Jeon, Junhyub
Lee, Seok-Jae
Kim, Jongryoul
Kim, Hwi-Jun
Links:: https://bibliotekanauki.pl/articles/2174571.pdf
Publication date:: 2022
Publisher:: Polska Akademia Nauk. Czytelnia Czasopism PAN
Subjects:: Fe-based amorphous
soft magnetic properties
artificial intelligence
machine learning
random forest regression
Description:: The soft magnetic properties of Fe-based amorphous alloys can be controlled by their compositions through alloy design. Experimental data on these alloys, however, show some discrepancy with predicted values. To further improve the soft magnetic properties, machine learning methods such as random forest regression, k-nearest neighbors regression and support vector regression can help to optimize the composition. In this study, the random forest regression method was used to find the optimum compositions of Fe-Si-B-C alloys. As a result, the lowest coercivity was observed in Fe80.5Si3.63B13.54C2.33 at.% and the highest saturation magnetization was obtained in Fe81.83Si3.63B12.63C1.91 at.%, with R2 values of 0.74 and 0.878, respectively.
Source:: Archives of Metallurgy and Materials; 2022, 67, 4; 1459-1463
1733-3490
Appears in:: Archives of Metallurgy and Materials
Content provider:: Biblioteka Nauki

Article


Title:: Assessment of Approaches for the Extraction of Building Footprints from Pléiades Images
Authors:: Taha, Lamyaa Gamal El-deen
Ibrahim, Rania Elsayed
Links:: https://bibliotekanauki.pl/articles/1837996.pdf
Publication date:: 2021
Publisher:: Akademia Górniczo-Hutnicza im. Stanisława Staszica w Krakowie. Wydawnictwo AGH
Subjects:: ensemble classifiers
machine learning
random forest
maximum likelihood
support vector machines
backpropagation
image classification
Description:: The Marina area represents an official new gateway of entry to Egypt, and the development of infrastructure is proceeding rapidly in this region. The objective of this research is to obtain building data by means of automated extraction from Pléiades satellite images, motivated by the need for efficient mapping and updating of geodatabases for urban planning and touristic development. The study compares the performance of the random forest algorithm with other classifiers (maximum likelihood, support vector machines, and backpropagation neural networks) on the well-organized buildings that appear in the satellite images. Images were classified into two classes: buildings and non-buildings. In addition, basic morphological operations such as opening and closing were used to enhance the smoothness and connectedness of the classified imagery. The overall accuracies for random forest, maximum likelihood, support vector machines, and backpropagation were 97%, 95%, 93% and 92%, respectively. Random forest was found to be the best option, followed by maximum likelihood, while the least effective was the backpropagation neural network. The completeness and correctness of the detected buildings were evaluated. Experiments confirmed that the four classification methods can effectively and accurately detect 100% of buildings from very high-resolution images. The use of machine learning algorithms for object detection and extraction from very high-resolution images is encouraged.
Source:: Geomatics and Environmental Engineering; 2021, 15, 4; 101-116
1898-1135
Appears in:: Geomatics and Environmental Engineering
Content provider:: Biblioteka Nauki

Article
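
The morphological opening and closing mentioned in the description above can be sketched on a small binary mask. The 3x3 square structuring element, the treatment of pixels outside the image as background, and all function names are assumptions chosen for illustration; the article does not state which element or software was used.

```python
def _neighbourhood(img, r, c):
    """Yield the 3x3 neighbourhood of (r, c); pixels outside the image count as 0."""
    rows, cols = len(img), len(img[0])
    for dr in (-1, 0, 1):
        for dc in (-1, 0, 1):
            rr, cc = r + dr, c + dc
            yield img[rr][cc] if 0 <= rr < rows and 0 <= cc < cols else 0


def erode(img):
    """A pixel stays 1 only if its whole 3x3 neighbourhood is 1."""
    return [[int(all(_neighbourhood(img, r, c))) for c in range(len(img[0]))]
            for r in range(len(img))]


def dilate(img):
    """A pixel becomes 1 if any pixel in its 3x3 neighbourhood is 1."""
    return [[int(any(_neighbourhood(img, r, c))) for c in range(len(img[0]))]
            for r in range(len(img))]


def opening(img):
    """Erosion followed by dilation: removes small isolated specks."""
    return dilate(erode(img))


def closing(img):
    """Dilation followed by erosion: fills small holes and gaps."""
    return erode(dilate(img))


# Hypothetical 7x7 building mask: a 3x3 building plus one noisy pixel.
mask = [[0] * 7 for _ in range(7)]
for r in range(2, 5):
    for c in range(2, 5):
        mask[r][c] = 1
mask[0][6] = 1  # isolated misclassified pixel
for row in opening(mask):
    print(row)
```

Opening suits the removal of salt-like false positives, while closing suits the repair of small gaps inside a detected roof, which is how the two operations improve smoothness and connectedness respectively.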


Title:: Performance comparison of machine learning algorithms for predictive maintenance
Porównanie skuteczności algorytmów uczenia maszynowego dla konserwacji predykcyjnej
Authors:: Gęca, Jakub
Links:: https://bibliotekanauki.pl/articles/1841332.pdf
Publication date:: 2020
Publisher:: Politechnika Lubelska. Wydawnictwo Politechniki Lubelskiej
Subjects:: machine learning
random forest
predictive maintenance
neural networks
Description:: The consequences of failures and unscheduled maintenance are the reasons why engineers have been trying for years to increase the reliability of industrial equipment. In modern solutions, predictive maintenance is used alongside traditional methods. It makes it possible to forecast failures and to warn of their likelihood. This paper presents a summary of the machine learning algorithms that can be used in predictive maintenance and compares their performance. The analysis was based on a data set from the Microsoft Azure AI Gallery. The paper presents a comprehensive approach to the issue, including feature engineering, preprocessing, dimensionality reduction techniques, and tuning of model parameters in order to obtain the highest possible performance. The research showed that, in the analysed case, the best algorithm achieved 99.92% accuracy on more than 122 thousand test data records. In conclusion, predictive maintenance based on machine learning represents the future of machine reliability in industry.
Source:: Informatyka, Automatyka, Pomiary w Gospodarce i Ochronie Środowiska; 2020, 10, 3; 32-35
2083-0157
2391-6761
Appears in:: Informatyka, Automatyka, Pomiary w Gospodarce i Ochronie Środowiska
Content provider:: Biblioteka Nauki

Article


Title:: Sparse data classifier based on first-past-the-post voting system
Authors:: Cudak, Magdalena
Piech, Mateusz
Marcjan, Robert
Links:: https://bibliotekanauki.pl/articles/27312911.pdf
Publication date:: 2022
Publisher:: Akademia Górniczo-Hutnicza im. Stanisława Staszica w Krakowie. Wydawnictwo AGH
Subjects:: POI
machine learning
geospatial data
data science
first-past-the-post
random forest
point of interest
Description:: A point of interest (POI) is a general term for objects that describe places from the real world. POI matching (i.e., determining whether two sets of attributes represent the same location) is not a trivial challenge due to the large variety of data sources. The representations of POIs may vary depending on how they are stored. A manual comparison of objects is not achievable in real time; therefore, there are multiple solutions for automatic merging. However, no efficient solution yet handles missing attributes. In this paper, we propose a multi-layered hybrid classifier that is composed of machine-learning and deep-learning techniques and supported by a first-past-the-post voting system. We examined different weights for the constituencies that were taken into consideration during a majority (or supermajority) decision. As a result, we achieved slightly higher accuracy than the best current model (random forest), which is also based on voting.
Source:: Computer Science; 2022, 23 (2); 277-296
1508-2806
2300-7036
Appears in:: Computer Science
Content provider:: Biblioteka Nauki

Article
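
The weighted first-past-the-post vote with a majority or supermajority threshold, as outlined in the description above, might look roughly like the sketch below. The voter names, weights, labels and the `required_share` parameter are all hypothetical; the article's actual multi-layered classifier is not reproduced here.

```python
from collections import defaultdict


def weighted_vote(votes, weights=None, required_share=0.0):
    """Pick the label with the largest total weight (first-past-the-post).

    votes:          mapping of voter name -> predicted label
    weights:        optional mapping of voter name -> weight (default 1.0)
    required_share: minimum share of the total weight the winner needs,
                    e.g. 0.5 for a majority; returns None if it is not reached
    """
    if not votes:
        raise ValueError("at least one vote is required")
    weights = weights or {}
    tally = defaultdict(float)
    for voter, label in votes.items():
        tally[label] += weights.get(voter, 1.0)
    # Sorting first makes ties resolve deterministically (smallest label wins).
    winner = max(sorted(tally), key=tally.get)
    share = tally[winner] / sum(tally.values())
    return winner if share >= required_share else None


# Hypothetical base classifiers voting on whether two POIs match.
votes = {"random_forest": "match", "mlp": "no_match", "svm": "match"}
print(weighted_vote(votes))                          # unweighted plurality
print(weighted_vote(votes, weights={"mlp": 3.0}))    # one voter weighted up
print(weighted_vote(votes, required_share=0.75))     # supermajority not met
```

Returning `None` when the threshold is not met leaves room for a fallback layer, which is one plausible way a multi-layered design could handle undecided cases.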


Title:: Predicting the stability of open stopes using Machine Learning
Authors:: Szmigiel, Alicja
Apel, Derek B.
Links:: https://bibliotekanauki.pl/articles/2201415.pdf
Publication date:: 2022
Publisher:: Główny Instytut Górnictwa
Subjects:: open stope
machine learning
logistic regression
random forest
Description:: The Mathews stability graph method was presented for the first time in 1980. This method was developed to assess the stability of open stopes in different underground conditions, and it has an impact on evaluating the safety of underground excavations. With the development of technology and growing experience in applying computer science in various research disciplines, mining engineering could benefit significantly from Machine Learning. Applying ML algorithms to predict the stability of open stopes in underground excavations is a new approach that could replace the original graph method and should be investigated. In this research, the Potvin database, which consists of 176 historical case studies, was passed to two of the most popular Machine Learning algorithms, Logistic Regression and Random Forest, to compare their predictive capabilities. The results showed that these algorithms can indicate the stability of underground openings, especially Random Forest, which performed slightly better than Logistic Regression on the examined data.
Source:: Journal of Sustainable Mining; 2022, 21, 3; 241-248
2300-1364
2300-3960
Appears in:: Journal of Sustainable Mining
Content provider:: Biblioteka Nauki

Article


Title:: Space-Time-Frequency Machine Learning for Improved 4G/5G Energy Detection
Authors:: Wasilewska, Małgorzata
Bogucka, Hanna
Links:: https://bibliotekanauki.pl/articles/226216.pdf
Publication date:: 2020
Publisher:: Polska Akademia Nauk. Czytelnia Czasopism PAN
Subjects:: spectrum sensing
cognitive radio
machine learning
energy detection
4G
LTE
5G
k-nearest neighbors
random forest
Description:: In this paper, the future Fifth Generation (5G New Radio) radio communication system is considered, coexisting and sharing the spectrum with the incumbent Fourth Generation (4G) Long-Term Evolution (LTE) system. The presence of the 4G signal is detected in order to allow opportunistic and dynamic spectrum access for 5G users. This detection is based on known sensing methods, such as energy detection; however, it uses machine learning in the domains of space, time and frequency to improve sensing quality. Simulation results for the considered methods, k-Nearest Neighbors and Random Forest, show that these methods significantly improve the detection probability.
Source:: International Journal of Electronics and Telecommunications; 2020, 66, 1; 217-223
2300-1933
Appears in:: International Journal of Electronics and Telecommunications
Content provider:: Biblioteka Nauki

Article
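
Classical energy detection, the baseline sensing method named in the description above, compares the average received signal energy with a threshold. The sketch below illustrates only that baseline; the threshold value, the sample data and the function names are assumptions, and the machine learning stage described in the article is not shown.

```python
def energy_statistic(samples):
    """Average energy of complex baseband samples: mean of |x|^2."""
    if not samples:
        raise ValueError("at least one sample is required")
    return sum(abs(s) ** 2 for s in samples) / len(samples)


def detect_signal(samples, threshold):
    """Declare the channel occupied when the average energy exceeds the threshold."""
    return energy_statistic(samples) > threshold


# Hypothetical observations: one with a signal present, one nearly silent.
occupied = [1 + 1j, -1 + 1j, 1 - 1j, -1 - 1j]
idle = [0.1 + 0.0j, 0.0 - 0.1j, -0.1 + 0.0j, 0.0 + 0.1j]
print(detect_signal(occupied, threshold=1.0))
print(detect_signal(idle, threshold=1.0))
```

A fixed threshold trades missed detections against false alarms; feeding energy values gathered across space, time and frequency to a classifier is one way to replace that single cut-off with a learned decision.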


Title:: Application of machine learning tools for seismic reservoir characterization study of porosity and saturation type
Zastosowanie metod uczenia maszynowego do charakterystyki porowatości i typu nasycenia przy użyciu atrybutów sejsmicznych
Authors:: Topór, Tomasz
Sowiżdżał, Krzysztof
Links:: https://bibliotekanauki.pl/articles/2143329.pdf
Publication date:: 2022
Publisher:: Instytut Nafty i Gazu - Państwowy Instytut Badawczy
Subjects:: machine learning
random forest
XGBoost
seismic attributes
reservoir properties prediction
Description:: The application of machine learning (ML) tools and data-driven modeling has become a standard approach for solving many problems in exploration geology and has contributed to the discovery of new reservoirs. This study explores an application of two machine learning ensemble methods, random forest (RF) and extreme gradient boosting (XGBoost), to derive porosity and saturation type (gas/water) in multihorizon sandstone formations from Miocene deposits of the Carpathian Foredeep. The training of the ML algorithms was divided into two stages. First, the RF algorithm was used to compute porosity based on seismic attributes and well location coordinates. The obtained results were used as an extra feature for saturation type modeling with the XGBoost algorithm. XGBoost was run with and without well location coordinates to evaluate the influence of spatial information on the modeling performance. The hyperparameters for each model were tuned using the Bayesian optimization algorithm. To check the robustness of the trained models, 10-fold cross-validation was performed. The results were evaluated using standard metrics for regression and classification on the training and testing sets. The root mean squared error (RMSE) for porosity prediction with RF was close to 0.053 for both training and testing, providing no evidence of overfitting. Feature importance analysis revealed that the most influential variables for porosity prediction were the spatial coordinates and the seismic attribute sweetness. The results of XGBoost modeling (variant 1) demonstrated that the algorithm could accurately predict saturation type despite the class imbalance issue. The sensitivity of XGBoost in detecting potential gas zones was high on both training and testing data, at 0.862 and 0.920, respectively. The XGBoost model relied mainly on computed porosity and spatial coordinates. The sensitivity on both the training and testing sets dropped significantly, by about 10%, when well location coordinates were removed (variant 2). In this case, the three most influential features were computed porosity, seismic amplitude contrast, and the iso-frequency component (15 Hz) attribute. The obtained results were imported into Petrel software to present the spatial distribution of porosity and saturation type. The latter parameter was given with a probability distribution, which allows potential target zones enriched in gas to be identified.
Source:: Nafta-Gaz; 2022, 78, 3; 165-175
0867-8871
Appears in:: Nafta-Gaz
Content provider:: Biblioteka Nauki

Article
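
Sensitivity, the classification metric quoted in the description above, is the share of truly positive samples that the model labels as positive. The sketch below uses the standard definition with gas as the positive class; the labels and the function name are illustrative assumptions, not data from the study.

```python
def sensitivity(y_true, y_pred, positive="gas"):
    """Sensitivity (recall) = TP / (TP + FN) for the chosen positive class."""
    true_pos = sum(1 for t, p in zip(y_true, y_pred)
                   if t == positive and p == positive)
    false_neg = sum(1 for t, p in zip(y_true, y_pred)
                    if t == positive and p != positive)
    if true_pos + false_neg == 0:
        raise ValueError("no positive samples in y_true")
    return true_pos / (true_pos + false_neg)


# Hypothetical saturation types at five locations.
actual = ["gas", "gas", "gas", "water", "water"]
predicted = ["gas", "gas", "water", "water", "gas"]
print(sensitivity(actual, predicted))
```

With imbalanced classes, overall accuracy can look good even when the rare class is mostly missed, so reporting sensitivity for the class of interest gives a more direct view of how many gas zones are found.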


Title:: Random forest based power sustainability and cost optimization in smart grid
Authors:: Durairaj, Danalakshmi
Wróblewski, Łukasz
Sheela, A.
Hariharasudan, A.
Urbański, Mariusz
Links:: https://bibliotekanauki.pl/articles/23966623.pdf
Publication date:: 2022
Publisher:: Stowarzyszenie Menedżerów Jakości i Produkcji
Subjects:: smart grid
random forest
Internet of things
power management
machine learning
smart meter
priority power scheduling
Description:: Power control and management currently play a vital role in information technology. Organizations prefer renewable to non-renewable power generation in order to control resource consumption, reduce prices and manage power efficiently. A smart grid satisfies these requirements efficiently by integrating machine learning algorithms, which are used for power requirement prediction, power distribution, failure identification, etc. The proposed Random Forest-based smart grid system classifies the power grid into zones such as high and low power utilization. The power zones are divided into a number of sub-zones, which are mapped to random forest branches. This sub-zone and branch mapping is used to identify the quantities of utilized and non-utilized power in a zone. The quantity and location of non-utilized power are identified, and the required quantity of power is distributed to the requester with minimal response time and price. The priority power scheduling algorithm collects requests from consumers and sends them to the producer based on priority. The producer analyses the requester's existing power utilization and the availability of power in order to schedule power distribution to the requester based on priority. The experimental results of the proposed Random Forest-based sustainability and price optimization technique are compared with existing machine learning techniques such as SVM, KNN and NB. The proposed technique identifies the exact location of available power, requires minimal processing time and responds quickly to the requester. The experimental results also show that the smart meter-based smart grid technique identifies faults in a shorter time than the conventional energy management technique.
Source:: Production Engineering Archives; 2022, 28, 1; 82-92
2353-5156
2353-7779
Appears in:: Production Engineering Archives
Content provider:: Biblioteka Nauki

Article
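
The priority-based handling of power requests outlined in the description above can be approximated with a priority queue. This is only a loose sketch: the request format, the rule that a lower number means higher priority, the skipping of requests that exceed the remaining power, and all names are assumptions, since the description does not specify the algorithm's details.

```python
import heapq


def schedule_requests(requests, available_power):
    """Serve requests in priority order while unused power remains.

    requests: list of (priority, consumer, amount); a lower priority number
              is served first (an assumption made for this sketch)
    Returns the granted (consumer, amount) pairs and the power left over.
    """
    # The running index breaks ties so equal priorities keep arrival order.
    heap = [(priority, index, consumer, amount)
            for index, (priority, consumer, amount) in enumerate(requests)]
    heapq.heapify(heap)
    granted = []
    while heap:
        _, _, consumer, amount = heapq.heappop(heap)
        if amount <= available_power:
            available_power -= amount
            granted.append((consumer, amount))
    return granted, available_power


# Hypothetical requests against 90 units of non-utilized power.
requests = [(2, "B", 30), (1, "A", 50), (3, "C", 40)]
print(schedule_requests(requests, available_power=90))
```

A heap keeps each pop at logarithmic cost, which matters when many consumers submit requests at once; a real scheduler would also need a policy for the requests that are skipped here.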


Information

You are searching for the phrase "Random Forest" by criterion: Subject
