Document Type : Research Paper
Authors
1 Department of Irrigation and Drainage, Faculty of Water and Environmental Engineering, Shahid Chamran University of Ahvaz, Ahvaz, Iran
2 Centre Eau Terre Environnement, Institut National de la Recherche Scientifique (INRS), Quebec, Canada
Abstract
Keywords
Main Subjects
EXTENDED ABSTRACT
Spatiotemporal estimation and monitoring of soil moisture based on remote sensing observations are essential for managing water resources, improving agricultural land productivity, increasing water use efficiency, and assessing crop drought conditions. In this regard, methods based on optical and thermal remote sensing data have successfully estimated surface soil moisture at different scales. However, the physical nature of these data has limited and challenged their application in dense vegetation conditions, necessitating improving and increasing the estimation accuracy in these areas.
This research aims to develop a new approach to estimating surface soil moisture in agricultural fields with dense vegetation conditions, such as sugarcane fields, using machine learning algorithms by incorporating optical and thermal remote sensing data and soil physical properties.
This study used 16 Landsat-8 images during the sugarcane crop’s growth period in 2018-2019 at Hakim Farabi Sugarcane Agro-Industrial company in the Khuzestan province of Iran. Soil moisture measurements were collected simultaneously as the satellite passed through the study area at more than 430 control locations during the period. A set of 10 scenarios of various unique combinations of the available input variables were developed. Five popular machine learning algorithms evaluated the scenarios, including multiple linear regression (MLR), decision tree-based algorithms (Classification and Regression Trees (CART) and M5-pruned (M5P)), and ensemble learning-based algorithms (gradient-boosted regression trees (GBRT) and random forest regression (RFR)).
According to the results, the highest correlation between input variables and surface soil moisture was observed in Soil Wetness Index (SWI) and Normalized Soil Moisture Index (NSMI) with R values of 0.79 and 0.69, respectively. While the NIR band with an R-value of 0.56 showed the lowest correlation. The obtained results showed the high ability of machine learning algorithms to estimate surface soil moisture in the area. The highest accuracy of machine learning algorithms based on R2, RMSE, and MAE results was obtained in GBRT (0.99, 0.011, and 0.006) in scenario 9, RFR (0.99, 0.014, and 0.007) in scenario 9, M5P (0.90, 0.054, and 0.042) in scenario 9, CART (0.87, 0.058, and 0.046) and MLR (0.70, 0.07, and 0.056) in scenario 6, respectively. The importance of incorporating soil physical properties, especially clay percentage, with remote sensing data was observed only in the MLR algorithm. While in CART, M5P, GBRT, and RFR algorithms, the use of soil physical properties in combination with optical bands and different vegetation, humidity, and temperature indices did not lead to proper surface soil moisture predictions.
In general, the findings of this research show the importance of using variables based on Landsat-8 remote sensing data (NTR, NSMI, NDVI, SWIR2, NIR, LST, and SWI) in combination with ensemble learning algorithms (RFR and GBRT) that can be independent of any ground measurements. The proposed method provides valuable results for estimating and monitoring surface soil moisture in high-vegetation areas.