Digital Mapping of Soil Quality Using Satellite Imagery and Machine Learning Algorithms (A Case Study of Lushan, Guilan Province, Iran)

Document Type: Research Paper

Authors

1 Department of Soil Science, Faculty of Agriculture, University of Zanjan, Zanjan, Iran

2 Department of Soil Science, Faculty of Agriculture, University of Zanjan, Zanjan, Iran

3 Soil and Water Research Institute, Agricultural Research Education and Extension Organization (AREEO), Karaj, Iran

Abstract

Soil quality is a fundamental indicator for assessing ecosystem sustainability and land productivity, and it is influenced by a combination of natural and anthropogenic factors. This study aimed to analyze the spatial variability of the soil quality index (SQI) at the regional scale using the random forest (RF) machine learning algorithm and multiple linear regression (MLR), based on environmental variables, in the lands of the Loshan region in Guilan Province. For this purpose, 76 soil samples were collected from the 0–30 cm soil layer, and soil physical, chemical, and biological properties were measured using standard laboratory methods. In addition, remote sensing-based indices, including the normalized difference vegetation index (NDVI), normalized difference water index (NDWI), normalized difference moisture index (NDMI), normalized difference built-up index (NDBI), bare soil index (BSI), and land surface temperature (LST), were derived as environmental auxiliary variables. The SQI was calculated using both the total data set (TDS) and minimum data set (MDS) approaches, combined with fuzzy scoring functions. The results showed that the random forest model predicted the spatial variability of SQI with higher accuracy (R² = 0.75) than multiple linear regression (R² = 0.53). Moreover, spectral indices, particularly NDVI, NDWI, and BSI, played the most important roles in explaining the spatial variation of soil quality. This study demonstrates that the proposed digital soil quality mapping framework can serve as an effective tool for sustainable land management, soil conservation, and decision support in precision agriculture.

Objective

Soil quality, as a fundamental pillar of sustainable agricultural production systems and the maintenance of soil ecosystem functions, plays a decisive role in food security, environmental sustainability, and the sustainable management of natural resources. In recent decades, increasing anthropogenic pressures resulting from land-use change, unsustainable exploitation, overgrazing, improper tillage practices, and inefficient use of agricultural inputs have led to a gradual decline in soil quality across many regions of Iran. Therefore, quantitative and spatial monitoring of soil quality and the identification of its driving factors are essential prerequisites for sustainable land management planning and precision agriculture. The Soil Quality Index (SQI) provides an integrated framework for the simultaneous incorporation of soil physical, chemical, and biological properties, enabling quantitative assessment of the functional health status of soils across different spatial scales. The main objective of this study was to assess soil quality and map its spatial distribution in the study area by applying the SQI and integrating field-based soil data with remote sensing information and advanced machine-learning models.

Methods

A total of 76 composite soil samples were collected from the 0–30 cm soil layer. A suite of physical, chemical, and biological properties, including soil texture, bulk density, aggregate stability, water-dispersible clay, pH, electrical conductivity, soil organic carbon, total nitrogen, available phosphorus and potassium, soluble calcium and magnesium, cation exchange capacity, calcium carbonate equivalent, microbial respiration, microbial biomass carbon, metabolic quotient (qCO₂), and microbial quotient (MQ), were measured. Soil organic carbon stock was also calculated as a key indicator of soil ecosystem functioning. To derive environmental covariates, Sentinel-2 satellite imagery was obtained from Google Earth Engine and processed to extract spectral indices including NDVI, NDWI, NDMI, and BSI, as well as land surface temperature (LST). These variables were used as auxiliary predictors in digital soil quality mapping. Principal component analysis (PCA) combined with correlation analysis was applied to reduce the number of indicators and to determine the minimum data set (MDS). The PCA results indicated that several principal components with eigenvalues >1 explained most of the data variance; accordingly, soil organic carbon, bulk density, available phosphorus, magnesium, and pH were selected as key indicators. In parallel, the total data set (TDS), comprising all measured indicators, was used to compute SQI. Indicator scoring was performed using fuzzy membership functions according to the response type of each variable (more-is-better, less-is-better, or optimum range), and indicator weights in the TDS and MDS approaches were derived based on factor analysis and the proportion of variance explained by principal components, respectively. The SQI was ultimately calculated as a weighted additive index for each sampling point. For spatial prediction of SQI, multiple linear regression (MLR) and random forest (RF) models were employed.
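The scoring and aggregation steps described above can be sketched in a few lines of Python. This is only an illustrative sketch, not the authors' implementation: the linear (trapezoidal) membership shapes, the threshold values, the sample values, and the weights are all hypothetical placeholders, since the abstract does not report them. Only the three response types and the weighted additive form of the index come from the text.

```python
def more_is_better(x, lower, upper):
    """Linear membership rising from 0 at `lower` to 1 at `upper`."""
    if x <= lower:
        return 0.0
    if x >= upper:
        return 1.0
    return (x - lower) / (upper - lower)


def less_is_better(x, lower, upper):
    """Mirror image of more_is_better: 1 at `lower`, 0 at `upper`."""
    return 1.0 - more_is_better(x, lower, upper)


def optimum_range(x, lower, opt_low, opt_high, upper):
    """Trapezoid: 1 inside [opt_low, opt_high], falling to 0 at lower/upper."""
    if opt_low <= x <= opt_high:
        return 1.0
    if x < opt_low:
        return more_is_better(x, lower, opt_low)
    return less_is_better(x, opt_high, upper)


def weighted_sqi(scores, weights):
    """Weighted additive index; weights are normalized to sum to 1."""
    total = sum(weights.values())
    return sum(weights[name] * scores[name] for name in weights) / total


# Hypothetical measurements for one sampling point (MDS indicators).
sample = {"SOC": 1.5, "BD": 1.4, "P": 12.0, "Mg": 3.0, "pH": 7.5}

# Hypothetical thresholds; real ones would come from the literature or data.
scores = {
    "SOC": more_is_better(sample["SOC"], 0.5, 2.5),
    "BD": less_is_better(sample["BD"], 1.2, 1.6),
    "P": more_is_better(sample["P"], 4.0, 20.0),
    "Mg": more_is_better(sample["Mg"], 1.0, 5.0),
    "pH": optimum_range(sample["pH"], 5.0, 6.5, 7.5, 9.0),
}

# Hypothetical weights; in the study they are derived from PCA or factor analysis.
weights = {"SOC": 0.30, "BD": 0.20, "P": 0.20, "Mg": 0.10, "pH": 0.20}

sqi = weighted_sqi(scores, weights)
print(round(sqi, 3))
```

Normalizing by the sum of the weights keeps the index between 0 and 1 even when the supplied weights do not add up exactly to one.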

Findings

The results showed that the mean SQI obtained from the TDS approach was slightly higher than that derived from the MDS approach. However, the very strong correlation between SQI values obtained from the two approaches indicates that, despite the substantial reduction in the number of input variables, the MDS approach was able to reproduce the spatial pattern of soil quality with acceptable accuracy. The wider range of SQI values under the MDS approach reflects its higher sensitivity in discriminating different soil quality levels and identifying areas with more severe limitations. These findings confirm the efficiency of the MDS approach as a cost-effective and practical method for regional-scale soil quality monitoring. Model performance evaluation further demonstrated that the random forest algorithm outperformed multiple linear regression in both TDS and MDS scenarios, yielding higher coefficients of determination and lower prediction errors. This superiority highlights the greater capability of machine-learning algorithms to capture complex and nonlinear relationships between soil quality indices and environmental covariates. In contrast, the multiple linear regression model, due to its reliance on linear assumptions, showed limited ability to represent the inherent complexity of soil quality controlling processes. Accordingly, the application of machine-learning approaches, particularly random forest, is recommended for digital soil quality mapping in heterogeneous landscapes. Variable importance analysis in the random forest model revealed that moisture- and vegetation-related spectral indices, including NDWI, NDVI, NDMI, and the bare soil index (BSI), contributed most to explaining the spatial variability of soil quality. These results emphasize the key role of soil moisture conditions and vegetation cover in regulating soil biological and chemical properties and, ultimately, soil quality. 
Higher NDVI and NDWI values were generally associated with increased soil organic carbon content, enhanced microbial activity, and improved structural stability, whereas higher BSI values reflected more exposed and degraded soil surfaces, which were directly linked to soil quality degradation. Therefore, remote sensing indices can serve as rapid, cost-effective, and efficient tools for spatial monitoring of soil quality at regional scales.
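For readers unfamiliar with the two evaluation criteria mentioned above, the sketch below shows one common way to compute the coefficient of determination and the root mean square error for two competing sets of predictions. It is a minimal illustration under stated assumptions: the observed and predicted SQI values are invented for the example and are not the study's data, the names `pred_model_a` and `pred_model_b` are hypothetical, and R² is defined here as 1 − SS_res/SS_tot, which may differ from the definition the authors used (for example, a squared correlation coefficient).

```python
import math


def r_squared(observed, predicted):
    """Coefficient of determination, defined as 1 - SS_res / SS_tot."""
    mean_obs = sum(observed) / len(observed)
    ss_res = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    ss_tot = sum((o - mean_obs) ** 2 for o in observed)
    return 1.0 - ss_res / ss_tot


def rmse(observed, predicted):
    """Root mean square error between observed and predicted values."""
    squared_errors = [(o - p) ** 2 for o, p in zip(observed, predicted)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


# Invented SQI values at five validation points (not the study's data).
observed = [0.40, 0.50, 0.60, 0.70, 0.80]
pred_model_a = [0.42, 0.49, 0.62, 0.68, 0.79]  # closer to the observations
pred_model_b = [0.50, 0.55, 0.60, 0.62, 0.70]  # compressed toward the mean

for name, pred in [("model A", pred_model_a), ("model B", pred_model_b)]:
    print(name, round(r_squared(observed, pred), 3), round(rmse(observed, pred), 3))
```

A model is judged better on these criteria when its R² is higher and its RMSE is lower on data that were not used for fitting.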

Soil quality maps indicated that a considerable proportion of the study area falls within moderate to low soil quality classes, highlighting the need for conservation-oriented management strategies at the watershed scale. In contrast, areas under sustainable land-use systems such as olive orchards, particularly in low-slope and downslope positions, exhibited higher soil quality. This pattern confirms the positive role of permanent vegetation cover, reduced tillage intensity, and increased organic matter inputs in improving soil quality. Accordingly, conservation management practices, including residue management, enhancement of surface cover, reduction of intensive tillage, optimized irrigation management, and erosion control, are recommended as effective strategies for improving soil quality in the region.

Conclusion

Overall, the findings demonstrate that integrating the Soil Quality Index with remote sensing data and machine-learning algorithms provides an efficient framework for regional-scale assessment and digital mapping of soil quality. The MDS approach can be recommended as a practical, cost-effective, and reliable alternative to the TDS approach for soil quality monitoring, while the random forest algorithm, due to its superior predictive performance, represents a robust tool for supporting management decision-making in soil conservation, precision agriculture, and sustainable land-use planning. This integrated approach can serve as a scientific basis for policy-making related to soil and water resources management at local and regional scales.

Funding

This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors.

Authorship contribution

Conceptualization, S.H., K.M., M.N., and A.G.; methodology, S.H., K.M., N.N., A.G., and M.A.; software, S.H. and M.N.; validation, A.G., K.M., M.N., and M.A.; formal analysis, S.H.; investigation, S.H. and A.G.; resources, S.H. and M.A.; data curation, S.H., M.N., A.G., K.M., and M.A.; writing (original draft preparation), S.H.; writing (review and editing), S.H., K.M., A.G., and M.N.; visualization, A.G., M.A., and M.N.; supervision, K.M.; project administration, S.H., K.M., and M.N.; funding acquisition, S.H. and K.M. All authors have read and agreed to the published version of the manuscript.

All authors contributed equally to the conceptualization of the article and writing of the original and subsequent drafts.

Declaration of Generative AI and AI-assisted technologies in the writing process

The authors did not use any artificial intelligence tools in preparing this manuscript.

Data availability statement

Data are available from the authors on request.

Acknowledgements

The authors would like to thank all participants in the present study.

Ethical considerations

The authors avoided data fabrication, falsification, plagiarism, and any other form of misconduct.

Conflict of interest

The authors declare no conflict of interest.
