نوع مقاله : مقاله پژوهشی
نویسندگان
1 گروه سازههای آبی، دانشکده مهندسی آب و محیط زیست، دانشگاه شهید چمران اهواز، اهواز، ایران.
2 گروه سازه های آبی، دانشکده مهندسی آب و محیط زیست، دانشگاه شهید چمران اهواز، اهواز، ایران.
3 گروه هیدرولوژی و منابع آب، دانشکده مهندسی آب و محیط زیست، دانشگاه شهید چمران اهواز، اهواز، ایران
4 گروه سازههای آبی، دانشکده مهندسی آّ و محیط زیست، دانشگاه شهید چمران اهواز، اهواز، ایران.
چکیده
کلیدواژهها
موضوعات
عنوان مقاله [English]
نویسندگان [English]
In this study, the performance of machine learning-based methods for predicting the maximum scour depth around a Bandallike spur-dike is evaluated. For this purpose, three methods of Random Forest (RF) model, Support Vector Machine (SVM), and Gene Expression Programming (GEP) were used. To train and test the models, 108 data series (87 series for training and 21 series for testing) were extracted from the results of an experimental study. The models were evaluated with four different combinations of inputs (Fr: flow Froude number, S/L: ratio of distance to breakwater length, θ: spur-dike installation angle relative to the bank, and α: porosity of the permeable structure). The results showed that for all methods in the one input mode, the parameters with the most and least impact were, in order, α and S/L. In the SVM model, the average MAE index increased by about 2 times when the number of inputs increased from one input mode. In the GEP model, the average MAE index increased by about 3.5 times when the number of inputs increased from three to four inputs mode. However, in the RF method, increasing the number of inputs led to an increase in model accuracy, and the average MAE index decreased by 83% in the four inputs mode compared to the three inputs mode. Finally, it was found that the RF method had much better performance (MAE = 0.006 and RMSE = 0.009) in estimating the scour depth around the Bandal-like spur-dike than the other methods, and this model had less error spread with the same inputs.
کلیدواژهها [English]
Evaluation of the performance of machine learning methods for estimating the maximum scour depth around the bandallike spur-dike
EXTENDED ABSTRACT
Various methods have been proposed to control riverbank erosion, with one of the most common methods being the use of a spur-dike. One of the key issues with these structures is the scouring around their foundations due to changes in flow patterns. Scouring around the foundation can lead to structural instability and ultimately its destruction. To reduce the depth of scouring around the foundation of the dam structure, a new type of spur-dike "BandalLike" has been introduced. This structure consists of a permeable part at the bottom and an impermeable part on top. Experimental relationships have been established to estimate the maximum depth of scouring around the foundation of the impermeable spur-dike to consider precautions in design. However, there are not many relationships provided for the permeable dam structure. Therefore, the aim of the current research is to evaluate the performance of machine learning-based methods to estimate the maximum depth of scouring around the "BandalLike" structure.
In the current research, the results of a laboratory experiment were used. Dimensionless influential variables (input variables) on the scouring around the bandallike were considered, including Froude number (Fr), angle of installation of the bandallike relative to the flume wall (θ), percentage of permeability of the porous section (α), and the distance ratio to the length of the bandallike (S/L). It is worth mentioning that these variables were determined after dimensional analysis. Additionally, the ratio of maximum scouring depth to the flow depth (hs/h) was considered as the output variable. For each variable, a total of 108 data series were extracted. 80% of the data series (86 series) were used for model training, and the remaining 20% (22 series) were used for model evaluation. Furthermore, GEP, SVM, and RF methods were used in this research.
For all models, the best performance for single-variable scenarios was determined to be scenario S2 with input parameter α. In this regard, the MAE index for GEP, SVM, and RF models was estimated to be 0.077, 0.067, and 0.073, respectively. Additionally, the lowest performance level was determined for scenario S3 with input parameter S/L. For two-variable scenarios, the results showed that the best input combination for all models is scenario S5 with variables α and Fr. Based on this, the RMSE index for GEP, SVM, and RF models was determined to be 0.015, 0.015, and 0.009, respectively. With an increase in the number of inputs to two variables, the average MAE index for GEP and RF models decreased by 12% and 24%, respectively, indicating increased accuracy with an increase in inputs for these two models. However, the average MAE index for the SVM model increased by 12%, indicating a decrease in evaluation accuracy for this model with an increase in inputs. For three-variable scenarios, the GEP model achieved the highest accuracy for scenario S14 with parameters α, Fr, and θ, with RMSE, MAE, and CC statistical indices of 0.013, 0.017, and 0.990, respectively. In SVM and RF models, the best three-variable scenario was S11 with parameters α, Fr, S/L, with RMSE evaluation indices of 0.009 and 0.008, respectively. It was found that the RF model, like the GEP model, increased accuracy (a 41% decrease in average MAE index) with an increase in the number of input variables. However, increasing the number of inputs did not significantly change the accuracy of the SVM model. For four-variable scenarios (S15), the SVM model faced a significant decrease in accuracy, with an approximate 140% increase in the average MAE index. Meanwhile, the GEP and RF models also experienced increased accuracy for scenario S15.
All methods in single-variable input mode had the most and least impact respectively related to parameters α and S/L. In the SVM model, increasing the number of inputs from single-variable to two-variable mode resulted in an almost 2-fold increase in the average MAE index. In the GEP model, increasing the number of inputs from three variables to four variables led to an approximately 3.5-fold increase in the average MAE index. However, in the RF method, increasing the number of inputs resulted in improved model accuracy, with the average MAE index in the four-variable mode decreasing by 83% compared to the three-variable mode. Ultimately, it was evident that the RF method provided much better performance in estimating the depth of sedimentation around the bandallike spur-dike compared to other methods.