Document Type : Research Paper
Authors
1 Department of Hydraulic Structures, Faculty of Water and Environmental Engineering, Shahid Chamran University of Ahvaz, Ahvaz, Iran.
2 Department of Hydrology and Water Resources , Faculty of Water and Environmental Engineering, Shahid Chamran University of Ahvaz, Ahvaz, Iran.
Abstract
Keywords
Main Subjects
Evaluation of the performance of machine learning methods for estimating the maximum scour depth around the bandallike spur-dike
EXTENDED ABSTRACT
Various methods have been proposed to control riverbank erosion, with one of the most common methods being the use of a spur-dike. One of the key issues with these structures is the scouring around their foundations due to changes in flow patterns. Scouring around the foundation can lead to structural instability and ultimately its destruction. To reduce the depth of scouring around the foundation of the dam structure, a new type of spur-dike "BandalLike" has been introduced. This structure consists of a permeable part at the bottom and an impermeable part on top. Experimental relationships have been established to estimate the maximum depth of scouring around the foundation of the impermeable spur-dike to consider precautions in design. However, there are not many relationships provided for the permeable dam structure. Therefore, the aim of the current research is to evaluate the performance of machine learning-based methods to estimate the maximum depth of scouring around the "BandalLike" structure.
In the current research, the results of a laboratory experiment were used. Dimensionless influential variables (input variables) on the scouring around the bandallike were considered, including Froude number (Fr), angle of installation of the bandallike relative to the flume wall (θ), percentage of permeability of the porous section (α), and the distance ratio to the length of the bandallike (S/L). It is worth mentioning that these variables were determined after dimensional analysis. Additionally, the ratio of maximum scouring depth to the flow depth (hs/h) was considered as the output variable. For each variable, a total of 108 data series were extracted. 80% of the data series (86 series) were used for model training, and the remaining 20% (22 series) were used for model evaluation. Furthermore, GEP, SVM, and RF methods were used in this research.
For all models, the best performance for single-variable scenarios was determined to be scenario S2 with input parameter α. In this regard, the MAE index for GEP, SVM, and RF models was estimated to be 0.077, 0.067, and 0.073, respectively. Additionally, the lowest performance level was determined for scenario S3 with input parameter S/L. For two-variable scenarios, the results showed that the best input combination for all models is scenario S5 with variables α and Fr. Based on this, the RMSE index for GEP, SVM, and RF models was determined to be 0.015, 0.015, and 0.009, respectively. With an increase in the number of inputs to two variables, the average MAE index for GEP and RF models decreased by 12% and 24%, respectively, indicating increased accuracy with an increase in inputs for these two models. However, the average MAE index for the SVM model increased by 12%, indicating a decrease in evaluation accuracy for this model with an increase in inputs. For three-variable scenarios, the GEP model achieved the highest accuracy for scenario S14 with parameters α, Fr, and θ, with RMSE, MAE, and CC statistical indices of 0.013, 0.017, and 0.990, respectively. In SVM and RF models, the best three-variable scenario was S11 with parameters α, Fr, S/L, with RMSE evaluation indices of 0.009 and 0.008, respectively. It was found that the RF model, like the GEP model, increased accuracy (a 41% decrease in average MAE index) with an increase in the number of input variables. However, increasing the number of inputs did not significantly change the accuracy of the SVM model. For four-variable scenarios (S15), the SVM model faced a significant decrease in accuracy, with an approximate 140% increase in the average MAE index. Meanwhile, the GEP and RF models also experienced increased accuracy for scenario S15.
All methods in single-variable input mode had the most and least impact respectively related to parameters α and S/L. In the SVM model, increasing the number of inputs from single-variable to two-variable mode resulted in an almost 2-fold increase in the average MAE index. In the GEP model, increasing the number of inputs from three variables to four variables led to an approximately 3.5-fold increase in the average MAE index. However, in the RF method, increasing the number of inputs resulted in improved model accuracy, with the average MAE index in the four-variable mode decreasing by 83% compared to the three-variable mode. Ultimately, it was evident that the RF method provided much better performance in estimating the depth of sedimentation around the bandallike spur-dike compared to other methods.