IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v12y2024i18p2899-d1479853.html
   My bibliography  Save this article

High-Resolution Spatiotemporal Forecasting with Missing Observations Including an Application to Daily Particulate Matter 2.5 Concentrations in Jakarta Province, Indonesia

Author

Listed:
  • I Gede Nyoman Mindra Jaya

    (Department of Statistics, Universitas Padjadjaran, Jl. Raya Bandung Sumedang km 21 Jatinangor, Sumedang 45363, Indonesia)

  • Henk Folmer

    (Department of Statistics, Universitas Padjadjaran, Jl. Raya Bandung Sumedang km 21 Jatinangor, Sumedang 45363, Indonesia
    Faculty of Spatial Sciences, University of Groningen, P.O. Box 800, 9700 AV Groningen, The Netherlands)

Abstract

Accurate forecasting of high-resolution particulate matter 2.5 (PM 2.5 ) levels is essential for the development of public health policy. However, datasets used for this purpose often contain missing observations. This study presents a two-stage approach to handle this problem. The first stage is a multivariate spatial time series (MSTS) model, used to generate forecasts for the sampled spatial units and to impute missing observations. The MSTS model utilizes the similarities between the temporal patterns of the time series of the spatial units to impute the missing data across space. The second stage is the high-resolution prediction model, which generates predictions that cover the entire study domain. The second stage faces the big N problem giving rise to complex memory and computational problems. As a solution to the big N problem, we propose a Gaussian Markov random field (GMRF) for innovations with the Matérn covariance matrix obtained from the corresponding Gaussian field (GF) matrix by means of the stochastic partial differential equation (SPDE) method and the finite element method (FEM). For inference, we propose Bayesian statistics and integrated nested Laplace approximation (INLA) in the R-INLA package. The above approach is demonstrated using daily data collected from 13 PM 2.5 monitoring stations in Jakarta Province, Indonesia, for 1 January–31 December 2022. The first stage of the model generates PM 2.5 forecasts for the 13 monitoring stations for the period 1–31 January 2023, imputing missing data by means of the MSTS model. To capture temporal trends in the PM 2.5 concentrations, the model applies a first-order autoregressive process and a seasonal process. The second stage involves creating a high-resolution map for the period 1–31 January 2023, for sampled and non-sampled spatiotemporal units. It uses the MSTS-generated PM 2.5 predictions for the sampled spatiotemporal units and observations of the covariate’s altitude, population density, and rainfall for sampled and non-samples spatiotemporal units. For the spatially correlated random effects, we apply a first-order random walk process. The validation of out-of-sample forecasts indicates a strong model fit with low mean squared error (0.001), mean absolute error (0.037), and mean absolute percentage error (0.041), and a high R² value (0.855). The analysis reveals that altitude and precipitation negatively impact PM 2.5 concentrations, while population density has a positive effect. Specifically, a one-meter increase in altitude is linked to a 7.8% decrease in PM 2.5 , while a one-person increase in population density leads to a 7.0% rise in PM 2.5 . Additionally, a one-millimeter increase in rainfall corresponds to a 3.9% decrease in PM 2.5 . The paper makes a valuable contribution to the field of forecasting high-resolution PM 2.5 levels, which is essential for providing detailed, accurate information for public health policy. The approach presents a new and innovative method for addressing the problem of missing data and high-resolution forecasting.

Suggested Citation

  • I Gede Nyoman Mindra Jaya & Henk Folmer, 2024. "High-Resolution Spatiotemporal Forecasting with Missing Observations Including an Application to Daily Particulate Matter 2.5 Concentrations in Jakarta Province, Indonesia," Mathematics, MDPI, vol. 12(18), pages 1-29, September.
  • Handle: RePEc:gam:jmathe:v:12:y:2024:i:18:p:2899-:d:1479853
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/12/18/2899/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/12/18/2899/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Michela Cameletti & Finn Lindgren & Daniel Simpson & Håvard Rue, 2013. "Spatio-temporal modeling of particulate matter concentration through the SPDE approach," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 97(2), pages 109-131, April.
    2. Prakash Thangavel & Duckshin Park & Young-Chul Lee, 2022. "Recent Insights into Particulate Matter (PM 2.5 )-Mediated Toxicity in Humans: An Overview," IJERPH, MDPI, vol. 19(12), pages 1-22, June.
    3. Amanda Lenzi & Ingelin Steinsland & Pierre Pinson, 2018. "Benefits of spatiotemporal modeling for short‐term wind power forecasting at both individual and aggregated levels," Environmetrics, John Wiley & Sons, Ltd., vol. 29(3), May.
    4. I. Gede Nyoman Mindra Jaya & Budhi Handoko & Yudhie Andriyana & Anna Chadidjah & Farah Kristiani & Mila Antikasari, 2023. "Multivariate Bayesian Semiparametric Regression Model for Forecasting and Mapping HIV and TB Risks in West Java, Indonesia," Mathematics, MDPI, vol. 11(17), pages 1-23, August.
    5. Widya Liadira Kusuma & Wu Chih-Da & Zeng Yu-Ting & Handayani Hepi Hapsari & Jaelani Lalu Muhamad, 2019. "PM 2.5 Pollutant in Asia—A Comparison of Metropolis Cities in Indonesia and Taiwan," IJERPH, MDPI, vol. 16(24), pages 1-12, December.
    6. Håvard Rue & Sara Martino & Nicolas Chopin, 2009. "Approximate Bayesian inference for latent Gaussian models by using integrated nested Laplace approximations," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 71(2), pages 319-392, April.
    7. David J. Spiegelhalter & Nicola G. Best & Bradley P. Carlin & Angelika Van Der Linde, 2002. "Bayesian measures of model complexity and fit," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 64(4), pages 583-639, October.
    8. Yusuf, Arief Anshory & Resosudarmo, Budy P., 2009. "Does clean air matter in developing countries' megacities? A hedonic price analysis of the Jakarta housing market, Indonesia," Ecological Economics, Elsevier, vol. 68(5), pages 1398-1407, March.
    9. Finn Lindgren & Håvard Rue & Johan Lindström, 2011. "An explicit link between Gaussian fields and Gaussian Markov random fields: the stochastic partial differential equation approach," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 73(4), pages 423-498, September.
    10. I. Gede Nyoman Mindra Jaya & Henk Folmer, 2022. "Spatiotemporal high-resolution prediction and mapping: methodology and application to dengue disease," Journal of Geographical Systems, Springer, vol. 24(4), pages 527-581, October.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Zhang, Shen & Liu, Xin & Tang, Jinjun & Cheng, Shaowu & Qi, Yong & Wang, Yinhai, 2018. "Spatio-temporal modeling of destination choice behavior through the Bayesian hierarchical approach," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 512(C), pages 537-551.
    2. Yuheng Ling, 2020. "Time, space and hedonic prediction accuracy: evidence from Corsican apartment markets," The Annals of Regional Science, Springer;Western Regional Science Association, vol. 64(2), pages 367-388, April.
    3. Carson, Stuart & Mills Flemming, Joanna, 2014. "Seal encounters at sea: A contemporary spatial approach using R-INLA," Ecological Modelling, Elsevier, vol. 291(C), pages 175-181.
    4. Zongyuan Xia & Bo Tang & Long Qin & Huiguo Zhang & Xijian Hu, 2023. "Spatially Dependent Bayesian Modeling of Geostatistics Data and Its Application for Tuberculosis (TB) in China," Mathematics, MDPI, vol. 11(19), pages 1-15, October.
    5. Mayer Alvo & Jingrui Mu, 2023. "COVID-19 Data Analysis Using Bayesian Models and Nonparametric Geostatistical Models," Mathematics, MDPI, vol. 11(6), pages 1-13, March.
    6. Daniela Silva & Raquel Menezes & Ana Moreno & Ana Teles-Machado & Susana Garrido, 2024. "Environmental Effects on the Spatiotemporal Variability of Sardine Distribution Along the Portuguese Continental Coast," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 29(3), pages 553-575, September.
    7. Braulio-Gonzalo, Marta & Bovea, María D. & Jorge-Ortiz, Andrea & Juan, Pablo, 2021. "Which is the best-fit response variable for modelling the energy consumption of households? An analysis based on survey data," Energy, Elsevier, vol. 231(C).
    8. Jacqueline D. Seufert & Andre Python & Christoph Weisser & Elías Cisneros & Krisztina Kis‐Katos & Thomas Kneib, 2022. "Mapping ex ante risks of COVID‐19 in Indonesia using a Bayesian geostatistical model on airport network data," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 185(4), pages 2121-2155, October.
    9. Zammit-Mangion, Andrew & Rougier, Jonathan, 2018. "A sparse linear algebra algorithm for fast computation of prediction variances with Gaussian Markov random fields," Computational Statistics & Data Analysis, Elsevier, vol. 123(C), pages 116-130.
    10. Márcio Poletti Laurini, 2017. "A spatial error model with continuous random effects and an application to growth convergence," Journal of Geographical Systems, Springer, vol. 19(4), pages 371-398, October.
    11. Somnath Chaudhuri & Gerard Giménez-Adsuar & Marc Saez & Maria A. Barceló, 2022. "PandemonCAT: Monitoring the COVID-19 Pandemic in Catalonia, Spain," IJERPH, MDPI, vol. 19(8), pages 1-22, April.
    12. Carlos Díaz-Avalos & Pablo Juan & Somnath Chaudhuri & Marc Sáez & Laura Serra, 2020. "Association between the New COVID-19 Cases and Air Pollution with Meteorological Elements in Nine Counties of New York State," IJERPH, MDPI, vol. 17(23), pages 1-18, December.
    13. Marc Francke & Alex Van de Minne, 2021. "Modeling unobserved heterogeneity in hedonic price models," Real Estate Economics, American Real Estate and Urban Economics Association, vol. 49(4), pages 1315-1339, December.
    14. Coll, M. & Pennino, M. Grazia & Steenbeek, J. & Sole, J. & Bellido, J.M., 2019. "Predicting marine species distributions: Complementarity of food-web and Bayesian hierarchical modelling approaches," Ecological Modelling, Elsevier, vol. 405(C), pages 86-101.
    15. Martins, Thiago G. & Simpson, Daniel & Lindgren, Finn & Rue, Håvard, 2013. "Bayesian computing with INLA: New features," Computational Statistics & Data Analysis, Elsevier, vol. 67(C), pages 68-83.
    16. I. Gede Nyoman Mindra Jaya & Henk Folmer, 2022. "Spatiotemporal high-resolution prediction and mapping: methodology and application to dengue disease," Journal of Geographical Systems, Springer, vol. 24(4), pages 527-581, October.
    17. Jorge Sicacha-Parada & Diego Pavon-Jordan & Ingelin Steinsland & Roel May & Bård Stokke & Ingar Jostein Øien, 2022. "A Spatial Modeling Framework for Monitoring Surveys with Different Sampling Protocols with a Case Study for Bird Abundance in Mid-Scandinavia," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 27(3), pages 562-591, September.
    18. Nicoletta D’Angelo & Antonino Abbruzzo & Giada Adelfio, 2021. "Spatio-Temporal Spread Pattern of COVID-19 in Italy," Mathematics, MDPI, vol. 9(19), pages 1-14, October.
    19. Laurini, Márcio Poletti, 2017. "The spatio-temporal dynamics of ethanol/gasoline price ratio in Brazil," Renewable and Sustainable Energy Reviews, Elsevier, vol. 70(C), pages 1-12.
    20. Jonathan Rougier & Aoibheann Brady & Jonathan Bamber & Stephen Chuter & Sam Royston & Bramha Dutt Vishwakarma & Richard Westaway & Yann Ziegler, 2023. "The scope of the Kalman filter for spatio‐temporal applications in environmental science," Environmetrics, John Wiley & Sons, Ltd., vol. 34(1), February.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:12:y:2024:i:18:p:2899-:d:1479853. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.