نوع مقاله : مقاله پژوهشی
نویسندگان
1 گروه علوم و مهندسی آب، دانشکده کشاورزی و محیطزیست، دانشگاه اراک، اراک، ایران
2 گروه مهندسی آبیاری و آبادانی، دانشکدگان کشاورزی و منابع طبیعی، دانشگاه تهران، کرج، ایران
3 گروه علوم و مهندسی آب، دانشکده کشاورزی و محیطزیست، دانشگاه اراک، اراک، ایران.
چکیده
کلیدواژهها
موضوعات
عنوان مقاله [English]
نویسندگان [English]
The vertical distribution of suspended sediment concentration (SSC) is one of the most important parameters in the hydraulics of sediment transport in rivers. This parameter plays an important role in calculating the total sediment discharge in channels and rivers. For this reason, accurate measurement of this parameter has always been one of the goals of researchers. One way to accurately predict this parameter is to use intelligent models. For this purpose, in this study, four data mining models, KNN, KNN-PSO, GPR, and GPR-PSO, have been used to predict the distribution of sediment concentration (C/Ca). All models were coded in the MATLAB software environment. According to the results, it was found that the optimization performed on the KNN and GPR models was effective and increased the performance of these models. By comparing the models, it was shown that the GPR-PSO model has more accuracy than other models. The accuracy of this model in the training phase is equal to RMSE = 0.0297, R2 = 0.9878, and KGE = 0.9776, and in the testing phase equal to RMSE = 0.0226, R2 = 0.9907, and KGE = 0.9715. After GPR-PSO, the KNN-PSO model was ranked with RMSE = 0.0295, R2 = 0.9870, and KGE = 0.9864 in the training phase and RMSE = 0.0374, R2 = 0.9808, and KGE = 0.9569 in the testing phase. After the aforementioned models, GPR and KNN were respectively ranked. Also, by analyzing the results, it was determined that the two parameters y/D and y/a are the most important parameters in determining the most accurate results.
کلیدواژهها [English]
The vertical distribution of suspended sediment concentration (SSC) is one of the most fundamental parameters in the hydraulics of sediment transport in rivers. This parameter plays an important role in calculating the total sediment discharge in channels and rivers. For this reason, accurate measurement of this parameter has always been one of the goals of researchers. One way to accurately estimate this parameter is to use intelligent models.
The main aim of the present study is to present and evaluate highly accurate models for predicting suspended sediment concentration under laboratory conditions. Calculation of suspended sediment distribution is one of the important parameters in river engineering, and its accurate calculation leads to an acceptable estimate of suspended sediment discharge. Considering the necessity of accurate measurement of suspended sediment distribution and the difficulty of laboratory works, the use of intelligent models can be useful. To solve this problem, four intelligent models of KNN, KNN-PSO, GPR, GPR-PSO have been used in the present study.
In this study, four intelligent models KNN, KNN-PSO, GPR, GPR-PSO were used to predict the distribution of suspended sediment concentration. For this purpose, all models were coded in the MATLAB software environment. Laboratory data of Vanoni (1946) and Einstein and Chien (1955) were used for modeling (Table 1). These data include 203-point data of sediment concentration distribution. Einstein and Chien’s experiments were conducted in a flume with a width of 30.7 cm and a smooth bed with different slopes of 0.00185 and 0.0025. Sediment particles with diameters of 1.3, 0.94 and 0.274 mm were used. In addition, Vanoni’s experiments were conducted in a flume with a rough bed with a width of 84.5 cm and a constant slope of 0.0025. The diameters of sediment particles used in this study were 0.1, 0.13, and 0.16 mm.
The data were first normalized and adjusted between 0 and 1. Then, 80% of the data were used for training and 20% of them were used for testing. The results showed that the optimization by PSO method has increased the accuracy of GPR and KNN models. It was found that the superior model is the GPR-PSO model, which has an accuracy of RMSE = 0.0297, R2 = 0.9878 and KGE = 0.9776 in the training phase and RMSE = 0.0226, R2 = 0.9907 and KGE = 0.9715 in the testing phase. After the aforementioned model, the KNN-PSO model was ranked. These results show that if the optimal values are selected for the GPR model, it will have more accuracy than KNN model. This is because the prediction level in KNN is discrete, which makes the search space for PSO limited, while in the GPR model the prediction level is continuous, which makes the PSO optimization search a larger space of parameters, which makes it more accurate in regression and interpolation problems. Also, the modeling method in KNN is linear, which makes the optimization performed less effective; while the GPR model has a greater ability to model nonlinear relationships such as sediment problems, which makes this model more flexible than other models. In addition, the GPR model has the ability to consider uncertainty in modeling, while this is not possible with KNN. This is because the GPR model is a probabilistic model that models the distribution of the output variable according to the input variable, while KNN only considers the K-nearest neighbors for the prediction.
According to the results obtained, it was found that PSO optimization had an important effect on the performance of the GPR and KNN models and increased the accuracy of the models. Among the studied models, the GPR-PSO model was recognized as the best model. The accuracy of this model in the test phase was equal to RMSE = 0.0226, R2 = 0.9907 and KGE = 0.9715. Also, by analyzing the results of the studied models, it was determined that the parameters y/D and y/a in most models were recognized as important parameters in determining the highest accuracy.
The results indicate that PSO optimization impacted the performance of the GPR and KNN models, resulting in increased accuracy. The GPR-PSO model was identified as the most effective model, with test phase accuracy metrics of RMSE = 0.0226, R2 = 0.9907, and KGE = 0.9715. Analysis of the results also revealed that the parameters y/D and y/a were consistently identified as crucial factors in achieving optimal accuracy across most models.
“Conceptualization, Y.M. and M.N.; methodology, A.S.; software, Y.M.; validation, Y.M., M.N. and M.J.N.; formal analysis, Y.M.; investigation, M.N.; resources, A.S.; data curation, M.N.; writing—original draft preparation, Y.M.; writing—review and editing, M.N.; visualization, Y.M.; supervision, M.N.;
All authors have read and agreed to the published version of the manuscript.”
Not applicable
The authors would like to thank all participants of the present study.
The study was approved by the Ethics Committee of the Arak University. The authors avoided data fabrication, falsification, plagiarism, and misconduct.
The author declares no conflict of interest.