A STUDY OF THE STANDARDIZED PRECIPITATION INDEX (SPI) AND MACHINE LEARNING TECHNIQUES FOR DROUGHT PREDICTION IN THE STATE OF PARAÍBA, BRAZIL

Authors

DOI:

https://doi.org/10.47820/recima21.v5i10.5736

Keywords:

SPI, Drought, Machine Learnig

Abstract

This study aimed to identify and analyze droughts in Paraíba, using the Standardized Precipitation Index (SPI) and machine learning algorithms for predicting SPI for the subsequent years (2020-2021) at six rainfall stations distributed across the mesoregions of Paraíba. The Precipitation data were downloaded from the Global Precipitation Climatology Centre (GPCC) and the National Oceanic and Atmospheric Administration (NOAA) database, covering the period from 1991 to 2019. Three machine learning algorithms were selected based on their ability to fit historical SPI data: Extra Trees Regressor, Gradient Boosting Regressor, and Random Forest Regressor. The applied machine learning models yielded satisfactory results, with the Extra Trees Regressor consistently producing the highest R² value across all stations, indicating high data explainability. The predictions were analyzed to determine their accuracy and reliability, providing valuable insights into precipitation variability and drought occurrence in different mesoregions of Paraíba. In conclusion, this study contributed to understanding climate variability and its implications in Paraíba, offering valuable insights into drought occurrence and the importance of adaptive approaches to mitigate adverse impacts. The application of SPI and machine learning techniques proved effective in analyzing and predicting precipitation, providing an objective approach to characterizing drought and rainfall intensity in specific regions.

Downloads

Download data is not yet available.

Author Biographies

Jefferson Vieria dos Santos

Universidade Federal Rural de Pernambuco. Mestrando no programa de Pós Graduação em Biometria e Estatística Aplicada.

Viviane Farias Felipe

Universidade Federal Rural de Pernambuco.

Erika Fialho Morais Xavier

Pesquisadora na CIDACS-FIOCRUZ.

Tiago Almeida de Oliveira

Professor do Departamento de Estatística - UEPB - Universidade Estadual da Paraiba.

Jader Silva Jale

Professor do Departamento de Estatística e Informática - UFRPE.

Silvio Fernando Alves Xavier Junior

Universidade Estadual da Paraíba.

References

ALFIAN, G.; SYAFRUDIN, M.; FAHRURROZI, I.; FITRIYANI, N. L.; ATMAJI, F. T. D.; WIDODO, T.; BAHIYAH, N.; BENES, F.; RHEE, J. Predicting Breast Cancer from Risk Factors Using SVM and Extra-Trees-Based Feature Selection Method. Computers, v. 11, p. 136, 2022. Disponível em: https://doi.org/10.3390/computers11090136. Acesso em: 28 set. 2023. DOI: https://doi.org/10.3390/computers11090136

BREIMAN, L. Random Forests. Machine Learning, v. 45, p. 5-32, 2001. DOI: https://doi.org/10.1023/A:1010933404324

CUTLER, A.; CUTLER, D. R.; STEVENS, J. R. Random Forests. In: ZHANG, C.; MA, Y. (Ed.). Ensemble Machine Learning: Methods and Applications. [S. l.]: Springer, 2011. cap. 5, p. 157-176. DOI: 10.1007/978-1-4419-9326-7_5. DOI: https://doi.org/10.1007/978-1-4419-9326-7_5

DA SILVA, A. S. A.; MENEZES, R. S. C.; STOSIC, T. Análise multifractal do índice de precipitação padronizado. Research, Society and Development, v. 10, n. 7, p.e24710716535-e24710716535, 2021. DOI: https://doi.org/10.33448/rsd-v10i7.16535

DE FARIAS, A. A.; SOUZA, J. T. A.; De Sousa, F. D. A. S. IDENTIFICAÇÃO E ANÁLISE DE SECAS SEVERAS E EXTREMAS NO MUNICÍPIO DE TAPEROÁ-PB. Revista Brasileira de Geografia Física, v. 07, n. 05, especial- VIWMCRHPE, p. 818-826, 2014. DOI: https://doi.org/10.26848/rbgf.v7.5.p818-826

DE SOUZA GUEDES, R. V.; MACEDO, M. J. H.; DE SOUZA, F. D. A. S. Análise espacial de eventos de secas com base no índice padronizado de precipitação e análise de agrupamento. Brazilian Journal of Environmental Sciences (Online), v. 23, p. 55-65, 2012.

DIDÁTICA TECH. Como funciona o algoritmo ExtraTrees. [S. l.]: Didativa Tech, s. d. Disponível em: https://didatica.tech/como-funciona-o-algoritmo-extratrees/. Acesso em: 28 set. 2023.

FILHO, M. MAE (Erro Médio Absoluto) Em Machine Learning. [S. l.: s. n.], s. d. Disponível em: https://mariofilho.com/mae-erro-medio-absoluto-em-machine-learning/. Acesso em: 28 set. 2023.

FRIEDMAN, Jerome H. “Greedy Function Approximation: A Gradient Boosting Machine.” The Annals of Statistics, v. 29, n. 5, p. 1189–1232, 2001. http://www.jstor.org/stable/2699986. DOI: https://doi.org/10.1214/aos/1013203451

HASAN, H.; RADI, N. A.; KASSIM, S. Modeling of extreme temperature using generalized extreme value (GEV) distribution: A case study of Penang. In: Proceedings of the world congress on engineering, v. 1, p. 181-186, july 2012.

ICMC JÚNIOR. Random Forest. [S. L.]: ICMC Junior, s. d. Disponível em: https://icmcjunior.com.br/random-forest/. Acesso em: Acesso em: 28 set. 2023.

MACEDO, M. J. H.; DE SOUZA GUEDES, R. V.; DE ASSIS SALVIANO, F. D. A. Monitoramento e intensidade das secas e chuvas na cidade de Campina Grande/PB. Revista Brasileira de Climatologia, v. 8, 2011. DOI: https://doi.org/10.5380/abclima.v8i0.25797

MACEDO, M. J. H.; DE SOUZA GUEDES, R. V.; DE SOUZA, F. D. A. S.; DA CUNHA DANTAS, F. R. Análise do índice padronizado de precipitação para o estado da Paraíba, Brasil. Ambiente & Água - An Interdisciplinary Journal of Applied Science, v. 5, n. 1, p. 204-214, 2010.

MASUI, T. All You Need to Know about Gradient Boosting Algorithm − Part 1. Regression. Towards Data Science, [S. l.], 20 jan. 2022. Disponível em: https://towardsdatascience.com/all-you-need-to-know-about-gradient-boosting-algorithm-part-1-regression-2520a34a502. Acesso em: 28 set. 2023.

MATHBOUT, S.; LOPEZ-BUSTINS, J. A.; MARTIN-VIDE, J.; BECH, J.; RODRIGO, F. S. Spatial and temporal analysis of drought variability at several time scales in Syria during 1961–2012. Atmospheric Research, v. 200, p. 153-168, 2018. DOI: https://doi.org/10.1016/j.atmosres.2017.09.016

MATPLOTLIB. Documentação estável. [S. l.]: MATPLOTLIB, s. d. Disponível em: https://matplotlib.org/stable/index.html. Acesso em: 20 ago. 2023.

MCKEE, T. B.; DOESKEN, N. J.; KLEIST, J. The relationship of drought frequency and duration to time scales. In: Proceedings of the 8th Conference on Applied Climatology, v. 17, n. 22, p. 179-183, 1993, jan. 1993.

OTCHERE, D. A.; GANAT, T. O. A.; OJERO, J. O.; TACKIE-OTOO, B. N.; TAKI, M. Y. Application of gradient boosting regression model for the evaluation of feature selection techniques in improving reservoir characterisation predictions. Journal of Petroleum Science and Engineering, v. 208, Part E, p. 109244, 2022. ISSN 0920-4105. Disponível em: https://doi.org/10.1016/j.petrol.2021.109244. Acesso em: 28 set. 2023. DOI: https://doi.org/10.1016/j.petrol.2021.109244

PANDAS. Documentação. [S. l.]: Pandas, s. d. Disponível em: https://pandas.pydata.org/docs/. Acesso em: 20 ago. 2023

PARAÍBA TOTAL. Geografia. [S. l.]: Paraíba Total, 2021. Disponível em: https://www.paraibatotal.com.br/a-paraiba/geografia/. Acesso em: 20 ago. 2023.

PYTHON SOFTWARE FOUNDATION. Documentação Python 3.x (Português do Brasil). [S. l.]: Python Software Foundation, s. d. Disponível em: https://docs.python.org/pt-br/3/. Acesso em: 20 ago. 2023.

R DEVELOPMENT CORE TEAM. R: a language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing, 2013. ISBN 3-900051-07-0, 2014.

SANTOS, C. A. G.; BRASIL NETO, R. M.; DA SILVA, R. M.; DOS SANTOS, D. C. Innovative approach for geospatial drought severity classification: a case study of Paraíba state, Brazil. Stochastic Environmental Research and Risk Assessment, v. 33, n. 2, p. 545-562, 2019. DOI: https://doi.org/10.1007/s00477-018-1619-9

SEABORN. Introdução ao tutorial. [S. l.]: SEABORN, s. d. Disponível em: https://seaborn.pydata.org/tutorial/introduction.html. Acesso em: 20 ago. 2023.

SODRÉ, G. D. A. 2040: o aquecimento global gerando impacto econômico na produção de cana no Estado de São Paulo. 2010. Tese (Doutorado) – Fundação Getulio Vargas, São Paulo, 2010.

SVOBODA, M.; HAYES, M.; WOOD, D. Standardized precipitation index: user guide. [S. l.: s. n.], 2012.

THOM, H. C. A note on the gamma distribution. Monthly weather review, v. 86, n. 4, p. 117-122, 1958. DOI: https://doi.org/10.1175/1520-0493(1958)086<0117:ANOTGD>2.0.CO;2

WANG, K. Y.; LI, Q. F.; YANG, Y.; ZENG, M.; Li, P. C.; ZHANG, J. X. Analysis of spatio-temporal evolution of droughts in Luanhe River Basin using different drought indices. Water Science and Engineering, v. 8, n. 4, p. 282-290, 2015. DOI: https://doi.org/10.1016/j.wse.2015.11.004

XARRAY. Documentação estável. [S. l.]: Xarray, s. d. Disponível em: https://docs.xarray.dev/en/stable/. Acesso em: 20 ago. 2023.

Downloads

Published

07/10/2024

How to Cite

Vieria dos Santos, J., Farias Felipe, V., Fialho Morais Xavier, E., Almeida de Oliveira, T., Silva Jale, J., & Alves Xavier Junior, S. F. (2024). A STUDY OF THE STANDARDIZED PRECIPITATION INDEX (SPI) AND MACHINE LEARNING TECHNIQUES FOR DROUGHT PREDICTION IN THE STATE OF PARAÍBA, BRAZIL. RECIMA21 - Revista Científica Multidisciplinar - ISSN 2675-6218, 5(10), e5105736. https://doi.org/10.47820/recima21.v5i10.5736