IDEAS home Printed from https://ideas.repec.org/a/wly/envmet/v33y2022i6ne2742.html
   My bibliography  Save this article

Practical strategies for generalized extreme value‐based regression models for extremes

Author

Listed:
  • Daniela Castro‐Camilo
  • Raphaël Huser
  • Håvard Rue

Abstract

The generalized extreme value (GEV) distribution is the only possible limiting distribution of properly normalized maxima of a sequence of independent and identically distributed random variables. As such, it has been widely applied to approximate the distribution of maxima over blocks. In these applications, GEV properties such as finite lower endpoint when the shape parameter ξ$$ \xi $$ is positive or the loss of moments due to the magnitude of ξ$$ \xi $$ are inherited by the finite‐sample maxima distribution. The extent to which these properties are realistic for the data at hand has been widely ignored. Motivated by these overlooked consequences in a regression setting, we here make three contributions. First, we propose a blended GEV (bGEV) distribution, which smoothly combines the left tail of a Gumbel distribution (GEV with ξ=0$$ \xi =0 $$) with the right tail of a Fréchet distribution (GEV with ξ>0$$ \xi >0 $$). Our resulting distribution has, therefore, unbounded support. Second, we proposed a principled method called property‐preserving penalized complexity (P3$$ {}^3 $$C) prior to decide on the existence of the GEV distribution first and second moments a priori. Third, we propose a reparametrization of the GEV distribution that provides a more natural interpretation of the (possibly covariate‐dependent) model parameters, which in turn helps define meaningful priors. We implement the bGEV distribution with the new parameterization and the P3$$ {}^3 $$C prior approach in the R‐INLA package to make it readily available to users. We illustrate our methods with a simulation study that reveals that the GEV and bGEV distributions are comparable when estimating the right tail under large‐sample settings. Moreover, some small‐sample settings show that the bGEV fit slightly outperforms the GEV fit. Finally, we conclude with an application to NO2$$ {}_2 $$ pollution levels in California that illustrates the suitability of the new parameterization and the P3$$ {}^3 $$C prior approach in the Bayesian framework.

Suggested Citation

  • Daniela Castro‐Camilo & Raphaël Huser & Håvard Rue, 2022. "Practical strategies for generalized extreme value‐based regression models for extremes," Environmetrics, John Wiley & Sons, Ltd., vol. 33(6), September.
  • Handle: RePEc:wly:envmet:v:33:y:2022:i:6:n:e2742
    DOI: 10.1002/env.2742
    as

    Download full text from publisher

    File URL: https://doi.org/10.1002/env.2742
    Download Restriction: no

    File URL: https://libkey.io/10.1002/env.2742?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Reynkens, Tom & Verbelen, Roel & Beirlant, Jan & Antonio, Katrien, 2017. "Modelling censored losses using splicing: A global fit strategy with mixed Erlang and extreme value distributions," Insurance: Mathematics and Economics, Elsevier, vol. 77(C), pages 65-77.
    2. Sabrina Vettori & Raphaël Huser & Marc G. Genton, 2019. "Bayesian modeling of air pollution extremes using nested multivariate max‐stable processes," Biometrics, The International Biometric Society, vol. 75(3), pages 831-841, September.
    3. Tilmann Gneiting & Fadoua Balabdaoui & Adrian E. Raftery, 2007. "Probabilistic forecasts, calibration and sharpness," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 69(2), pages 243-268, April.
    4. Stuart G. Coles & Jonathan A. Tawn, 1996. "A Bayesian Analysis of Extreme Rainfall Data," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 45(4), pages 463-478, December.
    5. Håvard Rue & Sara Martino & Nicolas Chopin, 2009. "Approximate Bayesian inference for latent Gaussian models by using integrated nested Laplace approximations," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 71(2), pages 319-392, April.
    6. Lindgren, Finn & Rue, Håvard, 2015. "Bayesian Spatial Modelling with R-INLA," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 63(i19).
    7. M. L. Stein, 2017. "Should annual maximum temperatures follow a generalized extreme value distribution?," Biometrika, Biometrika Trust, vol. 104(1), pages 1-16.
    8. Anja B. Schmiedt, 2016. "Domains of attraction of asymptotic distributions of extreme generalized order statistics," Communications in Statistics - Theory and Methods, Taylor & Francis Journals, vol. 45(7), pages 2089-2104, April.
    9. Broussard, John Paul & Booth, G. Geoffrey, 1998. "The behavior of extreme values in Germany's stock index futures: An application to intradaily margin setting," European Journal of Operational Research, Elsevier, vol. 104(3), pages 393-402, February.
    10. Mendes, Beatriz Vaz de Melo & Lopes, Hedibert Freitas, 2004. "Data driven estimates for mixtures," Computational Statistics & Data Analysis, Elsevier, vol. 47(3), pages 583-598, October.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jacqueline D. Seufert & Andre Python & Christoph Weisser & Elías Cisneros & Krisztina Kis‐Katos & Thomas Kneib, 2022. "Mapping ex ante risks of COVID‐19 in Indonesia using a Bayesian geostatistical model on airport network data," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 185(4), pages 2121-2155, October.
    2. Cho, Daegon & Hwang, Youngdeok & Park, Jongwon, 2018. "More buzz, more vibes: Impact of social media on concert distribution," Journal of Economic Behavior & Organization, Elsevier, vol. 156(C), pages 103-113.
    3. Daniel Cervone & Alex D’Amour & Luke Bornn & Kirk Goldsberry, 2016. "A Multiresolution Stochastic Process Model for Predicting Basketball Possession Outcomes," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(514), pages 585-599, April.
    4. Johnson, Blair T. & Sisti, Anthony & Bernstein, Mary & Chen, Kun & Hennessy, Emily A. & Acabchuk, Rebecca L. & Matos, Michaela, 2021. "Community-level factors and incidence of gun violence in the United States, 2014–2017," Social Science & Medicine, Elsevier, vol. 280(C).
    5. Zhang, Shen & Liu, Xin & Tang, Jinjun & Cheng, Shaowu & Qi, Yong & Wang, Yinhai, 2018. "Spatio-temporal modeling of destination choice behavior through the Bayesian hierarchical approach," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 512(C), pages 537-551.
    6. Sameh Abdulah & Yuxiao Li & Jian Cao & Hatem Ltaief & David E. Keyes & Marc G. Genton & Ying Sun, 2023. "Large‐scale environmental data science with ExaGeoStatR," Environmetrics, John Wiley & Sons, Ltd., vol. 34(1), February.
    7. André Victor Ribeiro Amaral & Elias Teixeira Krainski & Ruiman Zhong & Paula Moraga, 2024. "Model-Based Geostatistics Under Spatially Varying Preferential Sampling," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 29(4), pages 766-792, December.
    8. John M. Humphreys & Robert B. Srygley & David H. Branson, 2022. "Geographic Variation in Migratory Grasshopper Recruitment under Projected Climate Change," Geographies, MDPI, vol. 2(1), pages 1-19, January.
    9. Álvaro Briz-Redón, 2021. "Respondent Burden Effects on Item Non-Response and Careless Response Rates: An Analysis of Two Types of Surveys," Mathematics, MDPI, vol. 9(17), pages 1-16, August.
    10. Dong Liang & Genevieve Nesslage & Michael Wilberg & Thomas Miller, 2017. "Bayesian Calibration of Blue Crab (Callinectes sapidus) Abundance Indices Based on Probability Surveys," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 22(4), pages 481-497, December.
    11. Silius M. Vandeskog & Sara Martino & Daniela Castro-Camilo & Håvard Rue, 2022. "Modelling Sub-daily Precipitation Extremes with the Blended Generalised Extreme Value Distribution," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 27(4), pages 598-621, December.
    12. Humphreys, John M. & Srygley, Robert B. & Lawton, Douglas & Hudson, Amy R. & Branson, David H., 2022. "Grasshoppers exhibit asynchrony and spatial non-stationarity in response to the El Niño/Southern and Pacific Decadal Oscillations," Ecological Modelling, Elsevier, vol. 471(C).
    13. Carlos Díaz-Avalos & Pablo Juan & Somnath Chaudhuri & Marc Sáez & Laura Serra, 2020. "Association between the New COVID-19 Cases and Air Pollution with Meteorological Elements in Nine Counties of New York State," IJERPH, MDPI, vol. 17(23), pages 1-18, December.
    14. Beręsewicz Maciej, 2019. "Correlates of Representation Errors in Internet Data Sources for Real Estate Market," Journal of Official Statistics, Sciendo, vol. 35(3), pages 509-529, September.
    15. Deslatte, Aaron & Scott, Tyler A. & Carter, David P., 2019. "Specialized governance and regional land-use outcomes: A spatial analysis of Florida community development districts," Land Use Policy, Elsevier, vol. 83(C), pages 227-239.
    16. Chiranjit Dutta & Nalini Ravishanker & Sumanta Basu, 2022. "Modeling Multivariate Positive-Valued Time Series Using R-INLA," Papers 2206.05374, arXiv.org, revised Jul 2022.
    17. Ropo E. Ogunsakin & Themba G. Ginindza, 2022. "Bayesian Spatial Modeling of Diabetes and Hypertension: Results from the South Africa General Household Survey," IJERPH, MDPI, vol. 19(15), pages 1-17, July.
    18. Chen, Yewen & Chang, Xiaohui & Luo, Fangzhi & Huang, Hui, 2023. "Additive dynamic models for correcting numerical model outputs," Computational Statistics & Data Analysis, Elsevier, vol. 187(C).
    19. Jamie M. Madden & Simon More & Conor Teljeur & Justin Gleeson & Cathal Walsh & Guy McGrath, 2021. "Population Mobility Trends, Deprivation Index and the Spatio-Temporal Spread of Coronavirus Disease 2019 in Ireland," IJERPH, MDPI, vol. 18(12), pages 1-16, June.
    20. I. Gede Nyoman Mindra Jaya & Henk Folmer, 2022. "Spatiotemporal high-resolution prediction and mapping: methodology and application to dengue disease," Journal of Geographical Systems, Springer, vol. 24(4), pages 527-581, October.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:wly:envmet:v:33:y:2022:i:6:n:e2742. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www.interscience.wiley.com/jpages/1180-4009/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.