IDEAS home Printed from https://ideas.repec.org/a/bpj/ijbist/v12y2016i1p179-201n1.html
   My bibliography  Save this article

Optimal Spatial Prediction Using Ensemble Machine Learning

Author

Listed:
  • Davies Molly Margaret

    (Group in Biostatistics, University of California, Berkeley, Berkeley, CA, USA)

  • van der Laan Mark J.

    (Group in Biostatistics, University of California, Berkeley, Berkeley, CA, USA)

Abstract

Spatial prediction is an important problem in many scientific disciplines. Super Learner is an ensemble prediction approach related to stacked generalization that uses cross-validation to search for the optimal predictor amongst all convex combinations of a heterogeneous candidate set. It has been applied to non-spatial data, where theoretical results demonstrate it will perform asymptotically at least as well as the best candidate under consideration. We review these optimality properties and discuss the assumptions required in order for them to hold for spatial prediction problems. We present results of a simulation study confirming Super Learner works well in practice under a variety of sample sizes, sampling designs, and data-generating functions. We also apply Super Learner to a real world dataset.

Suggested Citation

  • Davies Molly Margaret & van der Laan Mark J., 2016. "Optimal Spatial Prediction Using Ensemble Machine Learning," The International Journal of Biostatistics, De Gruyter, vol. 12(1), pages 179-201, May.
  • Handle: RePEc:bpj:ijbist:v:12:y:2016:i:1:p:179-201:n:1
    DOI: 10.1515/ijb-2014-0060
    as

    Download full text from publisher

    File URL: https://doi.org/10.1515/ijb-2014-0060
    Download Restriction: For access to full text, subscription to the journal or payment for the individual article is required.

    File URL: https://libkey.io/10.1515/ijb-2014-0060?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Huang, Hsin-Cheng & Chen, Chun-Shu, 2007. "Optimal Geostatistical Model Selection," Journal of the American Statistical Association, American Statistical Association, vol. 102, pages 1009-1024, September.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Odunayo David Adeniyi & Alexander Brenning & Alice Bernini & Stefano Brenna & Michael Maerker, 2023. "Digital Mapping of Soil Properties Using Ensemble Machine Learning Approaches in an Agricultural Lowland Area of Lombardy, Italy," Land, MDPI, vol. 12(2), pages 1-17, February.
    2. Xiangyuan Wu & Kening Wu & Huafu Zhao & Shiheng Hao & Zhenyu Zhou, 2023. "Impact of Land Cover Changes on Soil Type Mapping in Plain Areas: Evidence from Tongzhou District of Beijing, China," Land, MDPI, vol. 12(9), pages 1-14, August.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jonathan Bradley & Noel Cressie & Tao Shi, 2015. "Comparing and selecting spatial predictors using local criteria," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 24(1), pages 1-28, March.
    2. Siddhartha Nandy & Chae Young Lim & Tapabrata Maiti, 2017. "Additive model building for spatial regression," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 79(3), pages 779-800, June.
    3. Tae Kim & Jeong Park & Gyu Song, 2010. "An asymptotic theory for the nugget estimator in spatial models," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 22(2), pages 181-195.
    4. Chen, Yin-Ping & Huang, Hsin-Cheng & Tu, I-Ping, 2010. "A new approach for selecting the number of factors," Computational Statistics & Data Analysis, Elsevier, vol. 54(12), pages 2990-2998, December.
    5. Wenning Feng & Abdhi Sarkar & Chae Young Lim & Tapabrata Maiti, 2016. "Variable selection for binary spatial regression: Penalized quasi‐likelihood approach," Biometrics, The International Biometric Society, vol. 72(4), pages 1164-1172, December.
    6. Chung‐Wei Shen & Chun‐Shu Chen, 2024. "Estimation and selection for spatial zero‐inflated count models," Environmetrics, John Wiley & Sons, Ltd., vol. 35(4), June.
    7. Yun-Huan Lee & Chun-Shu Chen, 2012. "Autoregressive model selection based on a prediction perspective," Journal of Applied Statistics, Taylor & Francis Journals, vol. 39(4), pages 913-922, October.
    8. Brian J. Reich & Montserrat Fuentes & Amy H. Herring & Kelly R. Evenson, 2010. "Bayesian Variable Selection for Multivariate Spatially Varying Coefficient Regression," Biometrics, The International Biometric Society, vol. 66(3), pages 772-782, September.
    9. Qingying Zong & Jonathan R. Bradley, 2023. "Criterion constrained Bayesian hierarchical models," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 32(1), pages 294-320, March.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bpj:ijbist:v:12:y:2016:i:1:p:179-201:n:1. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Peter Golla (email available below). General contact details of provider: https://www.degruyter.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.