افزایش دقت داده‌های بارش ماهواره‌ای: روش ریزمقیاس سازی داده‌های PERSIANN با استفاده از NDVI، LST و DEM

نوع مقاله : مقاله پژوهشی

نویسنده

استادیار گروه مهندسی آب، مجتمع آموزش عالی کشاورزی و دامپروری تربت‌جام، ایران

10.22059/ijswr.2025.384891.669827

چکیده

این مطالعه بر پایه داده‌های بارانسنجی شهرستان تربت جام و طی یک دوره آماری ۲3 ساله (۲۰۰۱ تا ۲۰۲۳) انجامشده است. در این پژوهش، داده‌های بارش ماهوارهای PERSIANN با وضوح مکانی ۲۷ کیلومتر با استفاده از داده‌های NDVI، دمای سطح زمین (LST)  و مدل ارتفاعی (DEM) و با کمک الگوریتم یادگیری ماشین جنگل تصادفی (RF) به وضوح مکانی یک کیلومتر ارتقا یافت. برای ارزیابی دقت ریزمقیاس‌سازی داده‌های بارش ماهواره‌ای نسبت به داده‌های ایستگاه‌های زمینی، از معیارهای آماری شامل ضریب همبستگی (CC)، ریشه میانگین مربعات خطا (RMSE)  و میانگین خطای مطلق (MAE)  استفاده شد. علاوه بر این، روش تصحیح باقیمانده‌ها برای افزایش دقت پیش‌بینی مدل به کار گرفته شد. نتایج نشان داد که روش تغییر مقیاس همراه با داده‌های کمکی مکانی و الگوریتم جنگل تصادفی، دقت مدلسازی بارش را به طور چشمگیری بهبود بخشیده است. نتایج نشان داد که در تمامی ایستگاه‌های موردمطالعه و در مقیاس‌های زمانی ماهانه و سالانه استفاده از روش اصلاح باقیمانده باعث بهبود قابل توجهی در دقت پیش‌بینی‌ها شد. به طوری که در مقیاس بارش ماهیانه بعد از اصلاح باقیمانده ضریب همبستگی از 29-22 درصد افزایش، ریشه میانگین مربعات خطا از 64-61% کاهش و میانگین مطلق خطا به میزان 68-60 درصد کاهش نشان داد. در مقیاس سالانه ضریب همبستگی از 35-7 درصد، ریشه میانگین مربعات خطا از 74-69 درصد کاهش و میانگین مطلق خطا به میزان 76-69 درصد کاهش نشان داد. این مطالعه بر اثربخشی روش مذکور در بهبود دقت پیش‌بینی در مقیاس‌های زمانی مختلف در منطقه مورد بررسی تأکید دارد. علاوه بر این، مفاهیم عملی این پژوهش بینش‌های ارزشمندی را برای مدلسازی هیدرولوژیکی و مدیریت منابع آب، به ویژه در مناطقی که داده‌های ایستگاه زمینی محدود است، ارائه می‌دهد. یافته‌های این تحقیق میتواند به بهبود مدیریت منابع آبی و برنامه‌ریزی‌های اقلیمی، به خصوص در مناطق خشک و نیمه خشک، کمک قابل توجهی نماید. 

کلیدواژه‌ها

موضوعات


عنوان مقاله [English]

Improving Precipitation Accuracy: A Rescaling Method for PERSIANN Using NDVI, LST, and DEM Data

نویسنده [English]

  • Zahra Shirmohammadi -Aliakbarkhani
Department of Water Science and Engineering, Faculty of Agriculture and Animal Sciences, University of Torbat-e Jam, Iran
چکیده [English]

This study is based on the rain gauge data from Torbat-e Jam over a 23-year period (2001–2023). PERSIANN satellite rainfall data with a spatial resolution of 27 kilometers were enhanced to a 1-kilometer resolution using NDVI, land surface temperature (LST), and digital elevation model (DEM) data, aided by the random forest (RF) algorithm. To evaluate the accuracy of satellite rainfall downscaling compared to ground station data, statistical metrics such as correlation coefficient (CC), root mean square error (RMSE), and mean absolute error (MAE) were utilized. Additionally, a residual correction method was implemented to refine model predictions further. Results demonstrated that integrating spatial datasets with the RF algorithm significantly improved rainfall modeling accuracy. Applying the residual correction method led to substantial improvements in forecasting accuracy across all studied stations on both monthly and annual timescales. On the monthly scale, the correlation coefficient increased by 22-29%, while RMSE and MAE decreased by 61-64% and 60-68%, respectively. On an annual scale, the correlation coefficient showed an increase of 7-35%, with RMSE and MAE reductions of 69-74% and 69-76%, respectively. This study underscores the effectiveness of the applied method in enhancing prediction accuracy across various temporal scales within the studied region. Additionally, the practical implications of this research provide valuable insights for hydrological modeling and water resource management, especially in regions with limited ground station data. The findings of this research can significantly aid in better water resource management and climatic planning, particularly in arid and semi-arid areas.

کلیدواژه‌ها [English]

  • Machine Learning
  • PERSIANN-CDR
  • Precipitation
  • Spatial Resolution Enhancement

Introduction

Precipitation plays a critical role in the global water cycle and in processes involving material and energy exchanges. High-resolution precipitation data are essential for accurate hydrological, meteorological, and ecological studies, especially at regional scales. Traditionally, rain gauges provide precise point data, but their irregular distribution and the high spatial-temporal variability of precipitation make them insufficient for generating fine-resolution datasets. Satellite observations offer valuable insights into water and energy exchanges between land and atmosphere. They are particularly effective in estimating precipitation across vast areas, including mountainous and sparsely gauged regions. For example, remotely sensed data such as the PERSIANN-CDR product are widely used in hydrological and meteorological studies. However, the 27 km spatial resolution of PERSIANN data often limits its applicability in detailed hydrological simulations and environmental assessments at local scales. To address this limitation, this study introduces a novel approach by combining land surface temperature (LST), normalized difference vegetation index (NDVI), and digital elevation model (DEM) data with the Random Forest (RF) algorithm to downscale annual PERSIANN precipitation data. The study focuses on southeastern Khorasan Razavi from 2000 to 2023—a region with limited ground-based observations and complex topography. To the best of our knowledge, this combination of variables and techniques has not been applied in this region before. The outcomes of this research have significant practical implications. High-resolution precipitation maps generated by this method can improve water resource management, enhance flood prediction accuracy, and support sustainable development in arid and semi-arid regions. By addressing the limitations of traditional precipitation datasets, this study provides a foundation for more reliable hydrological and environmental analyses.

Method

Data Collection

 This study utilizes data from the Torbat Jam rain gauge stations over a 23-year period (2001–2023). The time period of 2001-2023 was chosen due to the availability of continuous data and to capture long-term climatic trends. While some data gaps and noise were present, they were addressed through preprocessing techniques to ensure the robustness of the analysis. The PERSIANN dataset, accessible on Google Earth Engine as ee. ImageCollection("NOAA/PERSIANN-CDR"), provided precipitation data for the period from 2001 to 2023. To obtain NDVI data, MOD13Q1 data (16-day NDVI at 250-meter resolution) was accessed through Google Earth Engine under the code ee. ImageCollection("MODIS/061/MOD13Q1"). For land surface temperature, the study used MOD11A2 data, which includes 8-day LST at a 1 km resolution, available since 2000 as ee. ImageCollection("MODIS/061/MOD11A2"). Elevation data was sourced from the SRTM version 3 (SRTM Plus) product, with an approximate 30-meter accuracy, accessible via ee. Image("USGS/SRTMGL1_003").

Preprocessing

 The study focuses on downscaling PERSIANN satellite precipitation data, originally at a 27 km resolution, to a finer 1 km resolution. This scaling was achieved using NDVI (Normalized Difference Vegetation Index), LST (Land Surface Temperature), and DEM (Digital Elevation Model) data, applying the Random Forest (RF) machine learning algorithm.

Modeling

The Random Forest algorithm implemented in Google Earth Engine was chosen for its robustness and ability to handle complex datasets. This platform provides optimized parameters for the algorithm, making it efficient for large-scale environmental data processing.

Residual Correction

To correct the residuals, the difference between the actual precipitation values (ground data) and the predicted values from the microscaling model (using NDVI, LST, and DEM data) was first calculated. Next, a machine learning method called Random Forest was applied to model these residuals, as it is capable of identifying and modeling complex patterns in the residual data. Finally, the residuals predicted by the machine learning models were added to the initial predicted values to obtain the adjusted predictions. The residual correction technique offers significant advantages:

Enhanced Model Accuracy

 The residual correction technique offers significant advantages: by addressing the differences (residuals) between observed and predicted values, the model's accuracy is significantly improved. This method reduces both systematic and random errors in the initial predictions, leading to more reliable results. Additionally, the Random Forest algorithm, used for residual correction, is capable of capturing complex relationships within the residuals that simpler models might miss.

Evaluation

The model’s performance, before and after the modification, was evaluated using root mean square error (RMSE), mean absolute error (MAE), and correlation coefficient (CC) metrics. The residual correction technique significantly improved model performance, highlighting its utility in enhancing prediction accuracy. Specifically, it led to reductions in RMSE and MAE, and increases in CC, demonstrating the method's effectiveness in refining precipitation predictions.

Results

Statistical analyses indicate that precipitation downscaling yielded favorable results at Firouzkouh and Robat Samanegan stations on monthly and annual scales, and at Torbat-e Jam station on an annual scale. The limited improvement in monthly downscaling at Torbat Jam can be attributed to the high spatial-temporal variability of precipitation in this region, which makes accurate downscaling more challenging. Similarly, studies by Cho et al. (2013) on the Korean Peninsula achieved improved local precipitation accuracy by downscaling TRMM precipitation data to a resolution of 1 km, while Zhang et al. (2018) demonstrated that 1 km maps in mountainous areas offer higher precision. Research by Ghorbanpour et al. (2021) also found that finer resolutions, such as 1 km, are beneficial in arid regions like Lake Urmia, especially where ground stations are sparse. Noor et al. (2023) confirmed that using 1 km downscaling improved precipitation accuracy in downstream regions of the Indus River. In Torbat Jam, the residual correction method was applied due to the lack of significant improvement in monthly downscaling, leading to enhanced model accuracy across all stations and time scales. Overall, residual correction proved highly effective in improving downscaling models’ accuracy, especially in areas with limited data or spatial anomalies. This technique enables models to correct for initial downscaling errors, resulting in better rainfall prediction by incorporating real data. Several studies support the efficacy of residual correction with machine learning methods like Random Forest (RF) and Kriging. For example, Zhan et al. (2018) demonstrated enhanced accuracy in semi-arid regions, and Chen et al. (2020) highlighted RF’s ability to reduce errors in arid environments. Zhao (2021) showed that residual correction led to a significant reduction in MAE and RMSE errors by 19% and 21%, respectively, in mountainous areas. In this study, applying the residual correction method significantly improved prediction accuracy across all studied stations and time scales. At Torbat Jam station, monthly correlation increased by 22% and annual correlation by 34%, with RMSE and MAE reductions of 64% and 66% on a monthly scale, and 71% and 69% on an annual scale, respectively. At Firouzkouh station, monthly correlation improved by 29% and annual correlation by 35%, with RMSE reductions of 63% monthly and 69% annually, and MAE reductions of 68% monthly and 76% annually. At Robat Samanegan station, monthly correlation increased by 26% and annual correlation by 7%, with RMSE reductions of 61% monthly and 74% annually, and MAE reductions of 60% monthly and 74% annually. These results underscore the efficacy of residual correction in enhancing model accuracy and reducing prediction errors across varying temporal and spatial scales. Consequently, combining RF downscaling with residual correction provides a suitable method for achieving high-resolution precipitation data (1 km), especially when using auxiliary variables like NDVI, LST, and DEM data. Satellite data offers extensive spatial and temporal coverage, making it valuable for precipitation modeling and climate analysis. Downscaling and residual correction techniques improve model accuracy significantly, supporting more precise hydrological and climate studies.

Conclusions

High-resolution precipitation maps at 1 km provide more precise insights into precipitation distribution, allowing for the identification of local patterns, climate anomalies, and precipitation-sensitive areas. These finer maps are instrumental in water resource management, flood and drought mitigation, and agricultural and urban planning, while 27 km maps are more suitable for regional-scale analyses. Machine learning techniques, particularly when paired with residual correction, significantly enhance precipitation prediction accuracy in regions with sparse ground data or complex topographies. However, this study has certain limitations. The effectiveness of downscaling methods can be sensitive to the quality and availability of input data, such as auxiliary variables (e. g. , NDVI, LST, DEM). Additionally, computational challenges in training advanced machine learning models must be considered. Future research should focus on improving machine learning models, incorporating newer and higher-resolution satellite data, and testing these methods across diverse regions with varying climatic and topographical characteristics. Addressing these areas can further enhance the robustness and applicability of downscaling approaches for hydrological, ecological, and climate studies.

Author Contributions

The author contributed to the conceptualization of the article and writing of the original and subsequent drafts.

Data Availability Statement

Data available on request from the author.

Acknowledgements

We acknowledge the financial support received from the University of Torbat-e Jam.

Ethical considerations

The author avoided data fabrication, falsification, plagiarism, and misconduct.

Conflict of interest

The author declares no conflict of interest.

Chen, C. , Chen, Q. , Qin, B. , Zhao, S. , & Duan, Z. (2020). Comparison of Different Methods for Spatial Downscaling of GPM IMERG V06B Satellite Precipitation Product Over a Typical Arid to Semi-Arid Area. Frontiers in Earth Science, 8, 536337. https://doi.org/10.3389/feart. 2020.536337
Chen, F. , Gao, Y. , Wang, Y. , & Li, X. (2020). A downscaling-merging method for high-resolution daily precipitation estimation. Journal of Hydrology, 581, 124414. https://doi.org/10.1016/j. jhydrol. 2019.124414
Cho, Herin, Hwang, Seok Hwan, Cho, Yong-Sik, & Choi, Minha. (2013). Analysis of Spatial Precipitation Field Using Downscaling on the Korean Peninsula. Journal of Korea Water Resources Association, 46(11), 1129–1140. https://doi.org/10.3741/JKWRA. 2013.46.11.1129
De Kauwe, M. G. , Taylor, C. M. , Harris, P. P. , Weedon, G. P. , & Ellis, Richard. J. (2013). Quantifying Land Surface Temperature Variability for Two Sahelian Mesoscale Regions during the Wet Season. Journal of Hydrometeorology, 14(5), 1605–1619. https://doi.org/10.1175/JHM-D-12-0141.1
Ding, M. , Zhang, Y. , Liu, L. , Zhang, W. , Wang, Z. , & Bai, W. (2007). The relationship between NDVI and precipitation on the Tibetan Plateau. Journal of Geographical Sciences, 17(3), 259–268. https://doi.org/10.1007/s11442-007-0259-7
Farr, T. G. , Rosen, P. A. , Caro, E. , Crippen, R. , Duren, R. , Hensley, S. , Kobrick, M. , Paller, M. , Rodriguez, E. , Roth, L. , Seal, D. , Shaffer, S. , Shimada, J. , Umland, J. , Werner, M. , Oskin, M. , Burbank, D. , & Alsdorf, D. (2007). The Shuttle Radar Topography Mission. Reviews of Geophysics, 45(2), 2005RG000183. https://doi.org/10.1029/2005RG000183
Fensholt, R. , Rasmussen, K. , Nielsen, T. T. , & Mbow, C. (2009). Evaluation of earth observation based long term vegetation trends—Intercomparing NDVI time series trend analysis consistency of Sahel from AVHRR GIMMS, Terra MODIS and SPOT VGT data. Remote Sensing of Environment, 113(9), 1886–1898. https://doi.org/10.1016/j. rse. 2009.04.004
Grohmann, C. H. , Smith, M. J. , & Riccomini, C. (2011). Multiscale Analysis of Topographic Surface Roughness in the Midland Valley, Scotland. IEEE Transactions on Geoscience and Remote Sensing, 49(4), 1200–1213. https://doi.org/10.1109/TGRS. 2010.2053546
Guanghua Xu, Xu, G. , Xianli Xu, Xu, X. , Meixian Liu, Liu, M. , Alexander Y. Sun, Sun, A. Y. , Kelin Wang, & Wang, K. (2015). Spatial Downscaling of TRMM Precipitation Product Using a Combined Multifractal and Regression Approach: Demonstration for South China. Water, 7(6), 3083–3102. https://doi.org/10.3390/w7063083
Immerzeel, W. W. , Rutten, M. M. , & Droogers, P. (2009). Spatial downscaling of TRMM precipitation using vegetative response on the Iberian Peninsula. Remote Sensing of Environment, 113(2), 362–370. https://doi.org/10.1016/j. rse. 2008.10.004
Jia, S. , Zhu, W. , Lű, A. , & Yan, T. (2011). A statistical spatial downscaling algorithm of TRMM precipitation based on NDVI and DEM in the Qaidam Basin of China. Remote Sensing of Environment, 115(12), 3069–3079. https://doi.org/10.1016/j. rse. 2011.06.009
Jing, W. , Yang, Y. , Yue, X. , & Zhao, X. (2016a). A Comparison of Different Regression Algorithms for Downscaling Monthly Satellite-Based Precipitation over North China. Remote Sensing, 8(10), 835. https://doi.org/10.3390/rs8100835
Jing, W. , Yang, Y. , Yue, X. , & Zhao, X. (2016b). A Spatial Downscaling Algorithm for Satellite-Based Precipitation over the Tibetan Plateau Based on NDVI, DEM, and Land Surface Temperature. Remote Sensing, 8(8). https://doi.org/10.3390/rs8080655
Karbalaye Ghorbanpour, A. , Hessels, T. , Moghim, S. , & Afshar, A. (2021). Comparison and assessment of spatial downscaling methods for enhancing the accuracy of satellite-based precipitation over Lake Urmia Basin. Journal of Hydrology, 596, 126055. https://doi.org/10.1016/j. jhydrol. 2021.126055
Li, Y. , Zhang, Y. , He, D. , Luo, X. , & Ji, X. (2019). Spatial Downscaling of the Tropical Rainfall Measuring Mission Precipitation Using Geographically Weighted Regression Kriging over the Lancang River Basin, China. Chinese Geographical Science, 29(3), 446–462. https://doi.org/10.1007/s11769-019-1033-3
Lima, A. R. , Cannon, A. J. , & Hsieh, W. W. (2012). Downscaling temperature and precipitation using support vector regression with evolutionary strategy. The 2012 International Joint Conference on Neural Networks (IJCNN), 1–8. https://doi.org/10.1109/IJCNN. 2012.6252383
Ma, Z. , Shi, Z. , Zhou, Y. , Xu, J. , Yu, W. , & Yang, Y. (2017). A spatial data mining algorithm for downscaling TMPA 3B43 V7 data over the Qinghai–Tibet Plateau with the effects of systematic anomalies removed. Remote Sensing of Environment, 200, 378–395. https://doi.org/10.1016/j. rse. 2017.08.023
Martiny, N. , Camberlin, P. , Richard, Y. , & Philippon, N. (2006). Compared regimes of NDVI and rainfall in semi‐arid regions of Africa. International Journal of Remote Sensing, 27(23), 5201–5223. https://doi.org/10.1080/01431160600567787
Noor, R. , Arshad, A. , Shafeeque, M. , Liu, J. , Baig, A. , Ali, S. , Maqsood, A. , Pham, Q. B. , Dilawar, A. , Khan, S. N. , Anh, D. T. , & Elbeltagi, A. (2023). Combining APHRODITE Rain Gauges-Based Precipitation with Downscaled-TRMM Data to Translate High-Resolution Precipitation Estimates in the Indus Basin. Remote Sensing, 15(2), 318. https://doi.org/10.3390/rs15020318
Shaodan Chen, Chen, S. , Liping Zhang, Zhang, L. , Dunxian She, Dunxian She, She, D. , Dunxian She, Jie Chen, & Chen, J. (2019). Spatial Downscaling of Tropical Rainfall Measuring Mission (TRMM) Annual and Monthly Precipitation Data over the Middle and Lower Reaches of the Yangtze River Basin, China. Water, 11(3), 568. https://doi.org/10.3390/w11030568
Shi, Y. , Song, L. , Xia, Z. , Lin, Y. , Myneni, R. , Choi, S. , Wang, L. , Ni, X. , Lao, C. , & Yang, F. (2015). Mapping Annual Precipitation across Mainland China in the Period 2001–2010 from TRMM3B43 Product Using Spatial Downscaling Approach. Remote Sensing, 7(5), 5849–5878. https://doi.org/10.3390/rs70505849
Sobrino, J. A. , Jiménez-Muñoz, J. C. , & Paolini, L. (2004). Land surface temperature retrieval from LANDSAT TM 5. Remote Sensing of Environment, 90(4), 434–440. https://doi.org/10.1016/j. rse. 2004.02.003
Sorooshian, S. , Hsu, K. -L. , Gao, X. , Gupta, H. V. , Imam, B. , & Braithwaite, D. (2000). Evaluation of PERSIANN System Satellite–Based Estimates of Tropical Rainfall. Bulletin of the American Meteorological Society, 81(9), 2035–2046. https://doi.org/10.1175/1520-0477(2000)081<2035: EOPSSE>2.3. CO; 2
Trenberth, K. E. , & Shea, D. J. (2005). Relationships between precipitation and surface temperature. Geophysical Research Letters, 32(14), 2005GL022760. https://doi.org/10.1029/2005GL022760
Xu, S. , Wu, C. , Wang, L. , Gonsamo, A. , Shen, Y. , & Niu, Z. (2015). A new satellite-based monthly precipitation downscaling algorithm with non-stationary relationship between precipitation and land surface characteristics. Remote Sensing of Environment, 162, 119–140. https://doi.org/10.1016/j. rse. 2015.02.024
Zhan, C. , Han, J. , Hu, S. , Liu, L. , & Dong, Y. (2018). Spatial Downscaling of GPM Annual and Monthly Precipitation Using Regression-Based Algorithms in a Mountainous Area. Advances in Meteorology, 2018, 1–13. https://doi.org/10.1155/2018/1506017
Zhang, Y. , Li, Y. , Ji, X. , Luo, X. , & Li, X. (2018). Fine-Resolution Precipitation Mapping in a Mountainous Watershed: Geostatistical Downscaling of TRMM Products Based on Environmental Variables. Remote Sensing, 10(1), 119. https://doi.org/10.3390/rs10010119
Zhao, N. (2021). An Efficient Downscaling Scheme for High-Resolution Precipitation Estimates over a High Mountainous Watershed. Remote Sensing, 13(2), 234. https://doi.org/10.3390/rs13020234
Zheng, X. , & Zhu, J. (2015). A methodological approach for spatial downscaling of TRMM precipitation data in North China. International Journal of Remote Sensing, 36(1), 144–169. https://doi.org/10.1080/01431161.2014.995275
Zhengming Wan, & Dozier, J. (1996). A generalized split-window algorithm for retrieving land-surface temperature from space. IEEE Transactions on Geoscience and Remote Sensing, 34(4), 892–905. https://doi.org/10.1109/36.508406
Zhu, H. , Liu, H. , Zhou, Q. , & Cui, A. (2023). Towards an Accurate and Reliable Downscaling Scheme for High-Spatial-Resolution Precipitation Data. Remote Sensing, 15(10), 2640. https://doi.org/10.3390/rs15102640