Document Type : Research Paper
Authors
1 Department of Soil Science, Faculty of Agriculture, Ferdowsi University of Mashhad Mashhad Iran
2 Assistant professor, Soil Reclamation and Sustainable Land Management Department,, Soil and Water Research Institute of Iran, Karaj, Iran
3 Computer Department, Jihad Daneshgahi, of Khorasan Razavi, Mashhad Iran
Abstract
Keywords
Main Subjects
Climate change is having profound effects on soil health and its functioning, particularly by influencing microbial respiration and enhancing carbon release through rising temperatures. The Q10 index, which indicates the temperature sensitivity of respiration, is a crucial metric for understanding soil carbon dynamics in the context of global warming. This study leverages machine learning models and explainable AI (using SHAP) to predict Q10 values based on a combination of biological, chemical, and physical soil properties. By employing a three-level classification system for Q10, the study offers a more nuanced perspective on microbial response and delivers interpretable insights into soil-climate interactions.
The objective of this study was to predict microbial Q10 temperature sensitivity using various machine learning algorithms, incorporating biological, chemical, and physical soil data. Additionally, the research aims to pinpoint the most influential factors impacting Q10 through interpretable AI methods such as SHAP, thereby improving our understanding of soil carbon behavior in response to climate change.
This research utilized a dataset containing soil biological, chemical, and physical property data to classify microbial Q10 values into three sensitivity levels. Multiple machine learning models, including Random Forest, XGBoost, SVM, and MLP, were applied, with class imbalance addressed using resampling techniques like SMOTE, SMOTEENN, and ADASYN. Model performance was assessed using precision, recall, and F1-score, leveraging nested cross-validation. SHAP analysis was conducted to interpret feature importance and model decisions.
Ensemble models, particularly Extra Trees, Random Forest, and XGBoost, outperformed other models in accuracy and AUC. Extra Trees achieved the best results, with an accuracy of 0.653 ± 0.117, AUROC of 0.815 ± 0.080, and AUPRC of 0.753 ± 0.096. Decision Tree and KNN performed the weakest. Feature importance analysis, both embedded and SHAP, highlighted microbial traits such as Bacteria_Positive and Mean_Glucose as key predictors. SHAP plots also revealed threshold effects for chemical features like SOC and Alkane. Statistical tests confirmed the superiority of ensemble models over simpler approaches (p < 0.05).
This research employs machine learning and explainable AI to identify critical factors affecting Q10 in soils. Key microbial traits, including Bacteria_Positive and Mean_Glucose, and chemical features like SOC and Alkane, were found to be influential. SHAP analysis provided transparency in model predictions, uncovering complex relationships between soil properties and microbial respiration. These insights can inform soil management practices and climate adaptation strategies. Future research could focus on incorporating physical and hydraulic soil properties to refine Q10 predictions for specific ecological conditions.
KeshikNevisRazavi, S.R. contributed to the conceptualization, methodology, software development, investigation, data curation, writing of the original draft, and manuscript review and editing. Farahani, E. was involved in validation, writing of the original draft, review and editing of the manuscript, supervision of the research process, and overall project administration. Abedinzadeh, N. participated in software development, data curation, and writing of the original draft. Abdolahi, M. contributed to the methodology, software implementation, validation, formal analysis, investigation, data curation, writing of the original draft, review and editing, visualization of results, and research supervision. All authors have read and approved the final version of the manuscript.
The dataset used in this study is publicly available on:
https://figshare.com/articles/dataset/The_soil_microbiome_governs_the_response_of_microbial_respiration_to_warming_across_the_globe/20776243?file=42356520
The authors would like to thank the original authors of the publicly available dataset used in this study. We also appreciate the support provided by our affiliated institutions and colleagues who contributed insightful discussions during the research process.
This study is based on secondary data analysis of a publicly available dataset. No human or animal subjects were involved, and therefore no ethical approval was required.
The authors declare that there is no conflict of interest regarding the publication of this paper.