Development of Regional Models for Predicting Water Soil Erosion Using Sediment Gauging Data and Advanced Statistical Methods (Case Study: Urmia Lake Watershed)

Document Type : Research Paper

Author

Soil Conservation and Watershed Management Research Institute (SCWMRI), Agricultural Research, Education and Extension Organization (AREEO), Tehran, Iran.

Abstract

Soil erosion prediction at the watershed scale remains a critical challenge in sustainable land and water resource management. Although many studies have applied various empirical and statistical models, most have not adequately integrated long-term sediment records with a regional approach that accounts for both environmental and hydrological variability.The aim of this study is to develop regional models for predicting soil erosion rates through the integration of long-term sediment monitoring data and advanced statistical techniques, including Factor Analysis (FA), Principal Component Analysis (PCA), and Clustering methods, across the Urmia Lake watershed. Sediment data from 30 selected gauging stations within the major sub-basins were collected over a 20-year period and correlated with hydrological, topographic, geological, climatic, and land-use parameters of the study area. Initially, PCA was applied to reduce multicollinearity among input variables and to identify the principal factors controlling sediment transport. Subsequently, using hierarchical clustering based on hydrological behavior and sediment load, the sub-basins were divided into two distinct groups. Finally, the results of the factor analysis were employed as optimized inputs for multivariate regression modeling, leading to the development of regional predictive models. The validation results demonstrated that the developed models possess a high explanatory capacity, with an average coefficient of determination (R²) of 0.89 at the regional scale. The findings indicated that three primary factors — agricultural land area, erosion-prone lithological formations, and mean annual discharge — exerted the greatest influence on variations of the Sediment Delivery Ratio (SDR) across the sub-basins. The developed regional models provide strong potential for rapid assessment of erosion risk in newly monitored sub-basins without the need for extensive field measurements, offering an effective decision-support tool for sustainable land management, soil erosion control, and watershed policy planning within the Urmia Lake basin.

Keywords

Main Subjects


Introduction

Urmia Lake basin, the largest inland water body in Iran, represents a vital socio-environmental resource and is currently facing rapid decline due to both climatic and anthropogenic stressors. Water-induced soil erosion and subsequent sedimentation processes in its tributaries drastically reduce soil fertility, degrade water quality, and lead to siltation that impairs lake water inflows and increases environmental risks for the entire basin. Identifying the key drivers of sediment yield and mapping spatial erosion potential are crucial for effective watershed management. This study addresses the challenge of limited sediment data in the region by integrating statistical and GIS-based techniques to develop robust, spatially-explicit prediction models for erosion and sediment delivery.

Method

Study Area and Data:

The Urmia Lake watershed covers approximately 52,000 km² and comprises several major sub-basins, including Zarrineh Rood, Simineh Rood, Aji Chay, and Lesser Zab. Thirty sediment gauging stations with consistent records of annual streamflow and suspended sediment (covering 1991–2021) were selected for analysis.

Variable Selection and Extraction:

Thirty predictor variables covering five thematic categories—physiography, climate, geology/geomorphology, land use/vegetation, and soil physical properties—were extracted using advanced GIS platforms and remote sensing (Landsat NDVI).

- Physiography: sub-basin area, mean slope, shape index, elevation.

- Climate: mean annual precipitation, maximum rainfall intensity, aridity index.

- Geology/Geomorphology: percentage of erodible formations, lithology, drainage density.

- Land Use/Vegetation: agricultural land and rangeland area, NDVI.

- Soil: dominant soil texture, infiltration rate.

Statistical Analysis and Modeling:

To address multicollinearity and reduce model complexity, Principal Component Analysis (PCA) was applied, transforming the suite of 30 interrelated variables into a set of orthogonal principal components. Hierarchical Cluster Analysis was then used to stratify sub-basins into two homogeneous regional groups based on physiographic and hydrological features. Within each cluster, stepwise multiple regression models were established to predict the natural logarithm of the Sediment Delivery Ratio (SDR), the main indicator of basin-scale erosion risk. Model performance was validated using the coefficient of determination (R²), Nash-Sutcliffe Efficiency (NSE), and Relative Root Mean Square Error (RRMSE). Seven widely-used empirical SDR equations (e.g., Roehl, Fournier, Garbrecht) were evaluated to select the most spatially-accurate formulation for final erosion mapping.

Results

Principal Drivers and Model Outcomes:

PCA revealed that just three principal components (agricultural land area, abundance of erodible lithology, and mean annual discharge) explained over 85% of the SDR variance across sub-basins. This demonstrates that spatial SDR variability can be reliably captured using a compact set of landscape, land use, and hydrologic indicators.

Regression models developed for the two clusters yielded strong predictive power:

- Cluster 1 (northern/cooler sub-basins): R² = 0.89, NSE = 0.85

- Cluster 2 (southern/semi-arid sub-basins): R² = 0.77, NSE = 0.72

Spatial SDR maps generated using the best-fitted empirical equation matched the ground-measured sediment data from stations with >81% accuracy.

Spatial Patterns and Management Insights:

Critical erosion hotspots (SDR > 0.4) are notably concentrated in the southern and southwestern regions of the basin, particularly in the Simineh Rood and Godarchai sub-basins. These areas are typified by steep slopes, prevalence of easily erodible geological units, and intensive agriculture—all factors that elevate erosion potential. In contrast, central and northern sub-basins displayed moderate or lower SDR values, often benefiting from greater natural vegetation cover and more stable geomorphological setting.

Conclusions

The study demonstrates that regional modeling, combining PCA and cluster-based regression, offers a high-accuracy, cost-effective solution for spatial erosion assessment in data-limited basins. The identification of key predictor variables and high mapping reliability provide valuable guidance for policy makers and practitioners. Future studies can further improve predictive capabilities by incorporating near real-time remote sensing and exploring machine learning approaches for capturing non-linear sediment transport dynamics.

Integrating advanced statistical approaches (PCA, Cluster Analysis, multivariate regression) with hydrological and spatial data proves effective for predicting and mapping water soil erosion in large and complex catchments. The resulting SDR regional model and spatial erosion maps offer actionable tools for prioritizing erosion control and watershed conservation efforts within the Urmia Lake basin.

Author Contributions

Writing—original draft preparation, methodology, software, Conceptualization, funding acquisition, resources, M.T.

Data Availability Statement

Data available on request from the authors.

Acknowledgements

The authors gratefully acknowledge the financial support of the The Vice-Presidency for Science, Technology and Knowledge (Project code: 24-29-29-034-980445) for conducting this research.

Ethical considerations

The authors avoided data fabrication, falsification, plagiarism, and misconduct. The study involved only soil sampling and laboratory analysis, without any interaction with humans or animals; hence, ethical approval was not applicable.

Conflict of interest

The author declares no conflict of interest

Alatorre, L. C., Beguería, S., & García-Ruiz, J. M. (2010). Regional-scale modeling of hillslope sediment delivery: A case study in the Barasona Reservoir Watershed (Spain) using WATEM/SEDEM. Journal of Hydrology, 391, 109–123.
Asadi, H. (2022). A review of the history of activities of universities, research centers, and executive organizations related to soil erosion and conservation in Iran. Iranian Water and Soil Research, 53(2), 411–433. (In Persian)
Chitsaz, V., Nazari Samani, A. A., Soltani, S., & Feyzniya, S. (2021). Modeling suspended sediment and determining its controlling factors in the Great Karun and Karkheh watersheds. Rangeland and Watershed Management, 73(2), 293–303. (In Persian)
Hakim Khani, Sh., & Arab-Khoderi, M. (2006). Regression analysis between suspended sediment and hydro-geomorphological characteristics of the Urmia Lake Basin. Iranian Journal of Agricultural Sciences, 37(2), 85–93. (In Persian)
Honarbakhsh, A., Niazi, A., Soltani Koupaei, S., & Tahmasebi, P. (2019). Modeling the relationship between suspended sediment and hydrological and environmental characteristics of a basin: A case study of the Dez Dam Basin. Quantitative Geomorphological Research, 8(1), 105–117. (In Persian)
Honarbakhsh, A., Niazi, A., Soltani, S., & Tahmasebi, P. (2019). Modeling the relationship between suspended sediments and hydrological and environmental characteristics of a basin (case study: basin of Dez Dam). Journal of Quantitative Geomorphological Research, 8(1), 105–117.
Jansson, M. B. (1996). Estimating a sediment rating curve of the Reventazón River at Palomo using logged mean loads within discharge classes. Journal of Hydrology, 183(4), 227–241.
Kamari Yekdangeh, F., Khaledi Darvishan, A., & Aghabeygi Amin, S. (2024). Evaluation of the performance of G₂ and IntErO models in estimating soil erosion and annual sediment yield in the Keselyan representative watershed, Mazandaran Province. Watershed Management Research, 37(2), 55–73. (In Persian)
Kumar, A., & Das, G. (2010). Dynamic modeling of daily rainfall, runoff, and sediment yield for a Himalayan watershed. Agricultural Water Management, 75(2), 189–193.
Lamb, E., & Toniolo, H. (2016). Initial quantification of suspended sediment loads for three Alaska North Slope rivers. Water, 8(4), 119. https://doi.org/10.3390/w8040119
Nash, J. E., & Sutcliffe, J. V. (1970). River flow forecasting through conceptual models: Part 1, A discussion of principles. Journal of Hydrology, 10(2), 282–290.
Nik-Kami, D. (2017). Development of the national strategic plan for soil conservation. Final Research Project Report, Soil Conservation and Watershed Management Research Institute. (In Persian)
Nik-Kami, D., & Shadfar, S. (2021). Soil erosion mapping in watersheds equipped with sediment gauging stations in Iran. Watershed Engineering and Management, 13(2), 479–496. (In Persian)
Nik-Kami, D., Shadfar, S., & Jafari Ardekani, A. (2021). Preparation and integration of digital maps and datasets of watersheds and development of spatial, temporal, and thematic databases related to erosion and sediment yield in seventh-order basins of Iran. Final Research Project Report, Soil Conservation and Watershed Management Research Institute. (In Persian)
Noor, H., & Arab-Khoderi, M. (2023). Estimation of soil erosion and sediment delivery ratio using the RUSLE model in the Sanganeh Soil Conservation Research Station. Soil and Water Modeling and Management, 3(1), 42–53. (In Persian)
Sarangi, A., & Bhattacharya, A. K. (2005). Comparison of artificial neural network and regression models for sediment loss prediction from the Banha watershed in India. Agricultural Water Management, 78(3), 195–208.
Saghafian, B., Ghermezcheshmeh, B., Samiei, M., & Asheghi, R. (2009). Factors affecting suspended sediment yield in the southwestern watersheds of Iran. Watershed Engineering and Management, 1(3), 140–152. (In Persian)
Shabani, M., Feyzniya, S., Ahmadi, H., & Ghodousi, J. (2007). Investigation and determination of factors affecting sediment yield in watershed areas: A case study from the Taleqan Watershed. Iranian Journal of Natural Resources, 60(3), 759–771. (In Persian)
Taghvai Najib, H. R., Tajrishi, M. M., & Moghaddas, M. R. (2020). Application of multi-criteria decision-making methods to evaluate the effectiveness of wetland restoration projects for supplying environmental water needs. Proceedings of the 12th National Civil Engineering Congress, Tabriz, Iran. (In Persian)
Wang, Y., Zhang, L., Chen, J., & Li, Q. (2023). Assessing watershed sediment connectivity and spatial clustering using hierarchical analysis and PCA: Implications for soil erosion modeling. Catena, 226, 107093. https://doi.org/10.1016/j.catena.2023.107093
Zanganeh, H., & Naeimi, M. (2021). Spatial variability of sediment delivery ratio and its controlling factors using GIS-based modeling in the Urmia Lake Basin, Iran. Journal of Water and Soil Conservation Research, 28(3), 245–258. https://doi.org/10.22059/jwsc.2021.312451
Zangeneh Asadi, M. A., & Naemi Tabar, M. (2021). Relationships between hydro-geomorphic characteristics and suspended sediment load in the sub-watersheds of the Kashafrud Basin. Spatial Analysis of Environmental Hazards, 8(1), 111–128. (In Persian).