Skip to main content

Machine Learning Procedures for Daily Interpolation of Rainfall in Navarre (Spain)

  • Chapter
  • First Online:
Trends in Mathematical, Information and Data Sciences

Abstract

Kriging is by far the most well known and widely used statistical method for interpolating data in spatial random fields. The main reason is that it provides the best linear unbiased predictor and it is an exact interpolator when normality is assumed. The robustness of this method allows small departures from normality, however, many meteorological, pollutant and environmental variables have extremely asymmetrical distributions and Kriging cannot be used. Machine learning techniques such as neural networks, random forest, and k-nearest neighbor can be used instead, because they do not require specific distributional assumptions. The drawback is that they do not take account of the spatial dependence, and for an optimal performance in spatial random fields more complex machine learning techniques could be considered. These techniques also require a relatively large amount of training data and they are computationally challenging to implement. For a reduced number of observations, we illustrate the performance of the aforementioned procedures using daily rainfall data of manual meteorological gauge stations in Navarre, where the only auxiliary variables available are the spatial coordinates and the altitude. The quality of the predictions is carefully checked through three versions of the relative root mean squared error (RRMSE). The conclusion is that when we cannot use Kriging, random forest and neural networks outperform k-nearest neighbor technique, and provide reliable predictions of rainfall daily data with scarce auxiliary information.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 189.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 249.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 249.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Agilan, A., Umamahesh, N.V.: Rainfall generator for gonstationary extreme rainfall condition. J. Hydrol. Eng. 24(9), 04019027 (2019)

    Article  Google Scholar 

  2. Breiman, L.: Bagging predictors. Mach. Learn. 24, 123–140 (1996)

    MATH  Google Scholar 

  3. Breiman, L.: Random Forest. Mach. Learn. 45(1), 5–32 (2001)

    Article  Google Scholar 

  4. Chen, F., Gao, Y., Wang, Y., Li, X.: A downscaling-merging method for high-resolution daily precipitation estimation. J. Hydrol. 581, 124414 (2020)

    Google Scholar 

  5. Cover, T., Hart, P.: Nearest neighbor pattern classification. IEEE Trans. Inf. Theor. 13(1), 21–27 (1967)

    Article  Google Scholar 

  6. Czernecki, B., Taszarek, M., Marosz, M., Półrolniczak, M., Kolendowicz, L., Wyszogrodzki, A., Szturc, J.: Application of machine learning to large hail prediction - The importance of radar reflectivity, lightning occurrence and convective parameters derived from ERA5. Atmos. Res. 227, 249–262 (2019)

    Google Scholar 

  7. Díez-Sierra, J., del Jesús, M.: Subdaily rainfall estimation through daily rainfall downscaling using random forests in Spain. Water 11(1), 125:w11010125

    Google Scholar 

  8. Efron, F., Hastie, T.: Computer Age Statistical Inference. Cambridge University Press, Cambridge (2016)

    Book  Google Scholar 

  9. Fix, E., Hodges, J.L.: Discriminatory analysis. Nonparametric discrimination; consistency properties. Tech Rep 4, USAF School of Aviation Medicine, Randolph Field, TX (1951)

    Google Scholar 

  10. Fox, E.W., Ver Hoef, J.M., Olsen, A.R.: Comparing spatial regression to random forests for large environmental data sets. PLoS ONE 15(3), e0229509 (2020)

    Google Scholar 

  11. Friedman, J.H., Hastie, T., Tibshirani, R.: The Elements of Statistical Learning. Springer Series in Statistics, Springer, New York (2001)

    MATH  Google Scholar 

  12. Goodfellow, I., Bengio, Y., Courville, A.: Deep Learning. MIT Press, Cambridge (2016)

    MATH  Google Scholar 

  13. Grimes, D.I.F., Pardo-Igúzquiza, E.: Geostatistical analysis of rainfall. Geogr. Anal. 42(2), 136–160 (2010)

    Article  Google Scholar 

  14. Hashimoto, H., Wang, W., Melton, F.S., Moreno, A.L., Ganguly, S., Michaelis, A.R., Nemani, R.R.: Highresolution mapping of daily climate variables by aggregating multiple spatial data sets with the random forest algorithm over the conterminous United States. Int. J. Climatol. 39, 2964–2983 (2019)

    Article  Google Scholar 

  15. Hu, MJ-C.: Application of the adaline system to weather forecasting. Doctoral Thesis. Department of Electrical Engineering, Stanford University (1964)

    Google Scholar 

  16. Kataria, A., Singh, M.D.: A review of data classification using \(k\)-nearest neighbour algorithm. Int. J. Emerg. Tech. Adv. Eng. 3(6), 354–360 (2013)

    Google Scholar 

  17. Khedhaouiria, D., Mailhot, A., Favre, A.C.: Regional modeling of daily precipitation fields across the Great Lakes region (Canada) using the CFSR reanalysis. Stoch. Environ. Res. Risk Assess 34, 1385–1405 (2019)

    Google Scholar 

  18. Kilibarda, M., Hengl, T., Heuvelink, G.B.M., Gräler, B., Pebesma, E., Perčec Tadić, M., Bajat, B.: Spatiotemporal interpolation of daily temperatures for global land areas at 1 km resolution. J. Geophys. Res. Atmos. 119, 2294–2313 (2014)

    Google Scholar 

  19. Kuhn, M.: The caret package (2019). https://topepo.github.io/caret/index.html

  20. Kuhn, M.: Contributions from Wing, J., Weston, S., Williams, A., Keefer, C., Engelhardt, A., Cooper, T., Mayer, Z., Kenkel, B., the R Core Team, Benesty, M., Lescarbeau, R., Ziem, A., Scrucca, L., Tang, Y., Candan, C., Hunt, T.: caret: Classification and Regression Training. R package version 6.0-81 (2018). https://CRAN.R-project.org/package=caret

  21. Lazri, M., Ameur, S.: Combination of support vector machine, artificial neural network and random forest for improving the classification of convective and stratiform rain using spectral features of SEVIRI data. Atmos. Res. 203, 118–129 (2018)

    Article  Google Scholar 

  22. Li, S., Song, W., Fang, L., Chen, Y., Ghamisi, P., Benediktsson, J.A.: Deep Learning for hyperspectral image classification: an overview. IEEE Trans. Geosci. Remote Sens. 57(9), 6690–6709 (2019)

    Article  Google Scholar 

  23. Liaw, A., Wiener, M.: Classification and regression by randomForest. R News 2, 18–19 (2002)

    Google Scholar 

  24. Lu, Y., Qin, X.S.: A coupled \(K\) nearest neighbor and Bayesian neural network model for daily rainfall downscaling. Int. J. Climatol. 34, 3221–3236 (2014)

    Article  Google Scholar 

  25. Meteorology of Navarre Government. http://meteo.navarra.es

  26. Meyer, H., Kühnlein, M., Appelhans, T., Nauss, T.: Comparison of four machine learning algorithms for their applicability in satellite-based optical rainfall retrievals. Atmos. Res. 169, Part B, 424–433 (2016)

    Google Scholar 

  27. Militino, A.F., Ugarte, M.D., Goicoa, T., Genton, M.: Interpolation of daily rainfall using spatiotemporal models and clustering. Int. J. Climatol. 35(7), 1453–1464 (2015)

    Article  Google Scholar 

  28. Ouallouche, F., Lazri, M., Ameur, S.: Improvement of rainfall estimation from MSG data using Random Forests classification and regression. Atmos. Res 211, 62–72 (2018)

    Article  Google Scholar 

  29. Partal, T., Cigizoglu, H.K., Kahya, E.: Daily precipitation predictions using three different wavelet neural network algorithms by meteorological data. Stoch. Environ. Res. Risk Assess 29, 1317–1329 (2015)

    Article  Google Scholar 

  30. Pellicone, G., Caloiero, T., Modica, G., Guagliardi, I.: Application of several spatial interpolation techniques to monthly rainfall data in the Calabria region (southern Italy). Int. J. Climatol. 38, 3651–3666 (2018)

    Article  Google Scholar 

  31. Pham, Q.B., Yang, T.-C., Kuo, C.-M., Tseng, H.-W., Yu, P.-S.: Combing random forest and least square support vector regression for improving extreme rainfall downscaling. Water 11(451), w11030451 (2019)

    Google Scholar 

  32. Sekulić, A., Kilibarda, M., Heuvelink, G.B.M., Nikolić, M., Bajat, B.: Random forest spatial interpolation. Remote Sens. 12, 1687:rs12101687 (2020)

    Google Scholar 

  33. Shapire, R., Freund, Y., Bartlett, P., Lee, W.: Boosting the margin: a new explanation for the effectiveness of voting method. Ann. Stat. 26(5), 1651–1686 (1998)

    MathSciNet  MATH  Google Scholar 

  34. Venables, W.N., Ripley, B.D.: Modern Applied Statistics with S, 4th Ed. Springer, New York (2002)

    Google Scholar 

  35. Vu, T.M., Mishra, A.K.: Performance of multisite stochastic precipitation models for a tropical monsoon region. Stoch. Environ. Res. Risk Assess 34, 2159–2177 (2020)

    Article  Google Scholar 

  36. Wang, B., Zheng, L., Liu, D.L., Ji, F., Clark, A., Yu, Q.: Using multimodel ensembles of CMIP5 global climate models to reproduce observed monthly rainfall and temperature with machine learning methods in Australia. Int. J. Climatol. 38, 4891–4902 (2018)

    Article  Google Scholar 

  37. Wikle, C.K.: Comparison of deep neural networks and deep hierarchical models for spatio-temporal data. J. Agric. Biol. Environ. Stat. 24, 175–203 (2019)

    Article  MathSciNet  Google Scholar 

  38. Wu, J.: A novel artificial neural network ensemble model based on \(K\)-nearest neighbor nonparametric estimation of regression function and its application for rainfall forecasting. In: International Joint Conference on Computational Sciences and Optimization, pp. 44–489 (2009)

    Google Scholar 

  39. Zammit-Mangion, A., Wikle, C.K.: Deep integro-difference equation models for spatio-temporal forecasting. Spat Stat. 37, 100408 (2020)

    Article  MathSciNet  Google Scholar 

  40. Zhang, J., Fan, H., He, D., Chen, J.: Integrating precipitation zoning with random forest regression for the spatial downscaling of satellite based precipitation: A case study of the Lancang-Mekong River basin. Int. J. Climatol. 39, 3947–3961 (2019)

    Google Scholar 

  41. Zhang, G., Su, X., Ayantobo, O., Feng, K., Guo, J.: Spatial interpolation of daily precipitation based on modified ADW method for gauge-scarce mountainous regions: a case study in the Shiyang River Basin. Atmos. Res. 247, 105167 (2021)

    Article  Google Scholar 

Download references

Acknowledgements

This research was supported by the Spanish Research Agency (PID2020-113125RB-I00/MCIN/AEI/10.13039/501100011033 project). It has also received funding from la Caixa Foundation (ID1000010434), Caja Navarra Foundation, and UNED Pamplona, under agreement LCF/PR/PR15/51100007.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ana F. Militino .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Militino, A.F., Ugarte, M.D., Pérez-Goya, U. (2023). Machine Learning Procedures for Daily Interpolation of Rainfall in Navarre (Spain). In: Balakrishnan, N., Gil, M.Á., Martín, N., Morales, D., Pardo, M.d.C. (eds) Trends in Mathematical, Information and Data Sciences. Studies in Systems, Decision and Control, vol 445. Springer, Cham. https://doi.org/10.1007/978-3-031-04137-2_34

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-04137-2_34

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-04136-5

  • Online ISBN: 978-3-031-04137-2

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics