IDEAS home Printed from https://ideas.repec.org/a/spr/jagbes/v25y2020i1d10.1007_s13253-019-00380-4.html
   My bibliography  Save this article

Efficient Modelling of Presence-Only Species Data via Local Background Sampling

Author

Listed:
  • Jeffrey Daniel

    (University of Guelph)

  • Julie Horrocks

    (University of Guelph)

  • Gary J. Umphrey

    (University of Guelph)

Abstract

In species distribution modelling, records of species presence are often modelled as a realization of a spatial point process whose intensity is a function of environmental covariates. One way to fit a spatial point process model is to apply logistic regression to an artificial case–control sample consisting of the observed presence records combined with a simulated pattern of background points, usually a uniform random sample from within the study’s spatial domain. In this paper we propose local background sampling as an alternative to uniform background sampling when using logistic regression to fit spatial point process models to data. Our method is similar to the local case–control sampling procedure of Fithian and Hastie (Ann Appl Stat 42:1693–1724, 2014), but differs in that background points are sampled with probability proportional to an initial intensity estimate based on a pilot point process model. We compare local background sampling with uniform background sampling in a simulation study and in an example modelling the distributions of bumble bees (genus Bombus) in Ontario, Canada. Our results show local background sampling to be more efficient than uniform background sampling in all simulated settings and across all species analysed. Supplementary materials accompanying this paper appear online.

Suggested Citation

  • Jeffrey Daniel & Julie Horrocks & Gary J. Umphrey, 2020. "Efficient Modelling of Presence-Only Species Data via Local Background Sampling," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 25(1), pages 90-111, March.
  • Handle: RePEc:spr:jagbes:v:25:y:2020:i:1:d:10.1007_s13253-019-00380-4
    DOI: 10.1007/s13253-019-00380-4
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s13253-019-00380-4
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s13253-019-00380-4?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Mark Berman & T. Rolf Turner, 1992. "Approximating Point Process Likelihoods with Glim," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 41(1), pages 31-38, March.
    2. Peter Diggle, 1985. "A Kernel Method for Smoothing Point Process Data," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 34(2), pages 138-147, June.
    3. Ian W. Renner & David I. Warton, 2013. "Equivalence of MAXENT and Poisson Point Process Models for Species Distribution Modeling in Ecology," Biometrics, The International Biometric Society, vol. 69(1), pages 274-281, March.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Matthias Eckardt & Mehdi Moradi, 2024. "Marked Spatial Point Processes: Current State and Extensions to Point Processes on Linear Networks," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 29(2), pages 346-378, June.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Giuseppe Espa & Giuseppe Arbia & Diego Giuliani, 2013. "Conditional versus unconditional industrial agglomeration: disentangling spatial dependence and spatial heterogeneity in the analysis of ICT firms’ distribution in Milan," Journal of Geographical Systems, Springer, vol. 15(1), pages 31-50, January.
    2. Leandro, Camila & Jay-Robert, Pierre & Mériguet, Bruno & Houard, Xavier & Renner, Ian W., 2020. "Is my sdm good enough? insights from a citizen science dataset in a point process modeling framework," Ecological Modelling, Elsevier, vol. 438(C).
    3. Christophe Botella & Alexis Joly & Pascal Monestiez & Pierre Bonnet & François Munoz, 2020. "Bias in presence-only niche models related to sampling effort and species niches: Lessons for background point selection," PLOS ONE, Public Library of Science, vol. 15(5), pages 1-18, May.
    4. Nicoletta D’Angelo & Giada Adelfio, 2024. "Minimum contrast for the first-order intensity estimation of spatial and spatio-temporal point processes," Statistical Papers, Springer, vol. 65(6), pages 3651-3679, August.
    5. Edith Gabriel, 2014. "Estimating Second-Order Characteristics of Inhomogeneous Spatio-Temporal Point Processes," Methodology and Computing in Applied Probability, Springer, vol. 16(2), pages 411-431, June.
    6. Wiltshire, Kathryn H & Tanner, Jason E, 2020. "Comparing maximum entropy modelling methods to inform aquaculture site selection for novel seaweed species," Ecological Modelling, Elsevier, vol. 429(C).
    7. M. N. M. Lieshout, 2020. "Infill Asymptotics and Bandwidth Selection for Kernel Estimators of Spatial Intensity Functions," Methodology and Computing in Applied Probability, Springer, vol. 22(3), pages 995-1008, September.
    8. Amanda M E D’Andrea & Vera L D Tomazella & Hassan M Aljohani & Pedro L Ramos & Marco P Almeida & Francisco Louzada & Bruna A W Verssani & Amanda B Gazon & Ahmed Z Afify, 2021. "Objective bayesian analysis for multiple repairable systems," PLOS ONE, Public Library of Science, vol. 16(11), pages 1-19, November.
    9. Steen, Bart & Broennimann, Olivier & Maiorano, Luigi & Guisan, Antoine, 2024. "How sensitive are species distribution models to different background point selection strategies? A test with species at various equilibrium levels," Ecological Modelling, Elsevier, vol. 493(C).
    10. Abdollah Jalilian, 2017. "Modelling and classification of species abundance: a case study in the Barro Colorado Island plot," Journal of Applied Statistics, Taylor & Francis Journals, vol. 44(13), pages 2401-2409, October.
    11. Mola-Yudego, Blas & Selkimäki, Mari & González-Olabarria, José Ramón, 2014. "Spatial analysis of the wood pellet production for energy in Europe," Renewable Energy, Elsevier, vol. 63(C), pages 76-83.
    12. repec:jss:jstsof:08:i16 is not listed on IDEAS
    13. Yingqi Zhao & Donglin Zeng & Amy H. Herring & Amy Ising & Anna Waller & David Richardson & Michael R. Kosorok, 2011. "Detecting Disease Outbreaks Using Local Spatiotemporal Methods," Biometrics, The International Biometric Society, vol. 67(4), pages 1508-1517, December.
    14. François Sémécurbe & Cécile Tannier & Stéphane G. Roux, 2019. "Applying two fractal methods to characterise the local and global deviations from scale invariance of built patterns throughout mainland France," Journal of Geographical Systems, Springer, vol. 21(2), pages 271-293, June.
    15. Peng Hou & Xiaojian Yi & Haiping Dong, 2020. "A Spatial Statistic Based Risk Assessment Approach to Prioritize the Pipeline Inspection of the Pipeline Network," Energies, MDPI, vol. 13(3), pages 1-16, February.
    16. Giuseppe Arbia & Patrizia Cella & Giuseppe Espa & Diego Giuliani, 2015. "A micro spatial analysis of firm demography: the case of food stores in the area of Trento (Italy)," Empirical Economics, Springer, vol. 48(3), pages 923-937, May.
    17. Holder, Anna M. & Markarian, Arev & Doyle, Jessie M. & Olson, John R., 2020. "Predicting geographic distributions of fishes in remote stream networks using maximum entropy modeling and landscape characterizations," Ecological Modelling, Elsevier, vol. 433(C).
    18. D'Angelo, Nicoletta & Adelfio, Giada & Mateu, Jorge, 2023. "Locally weighted minimum contrast estimation for spatio-temporal log-Gaussian Cox processes," Computational Statistics & Data Analysis, Elsevier, vol. 180(C).
    19. Ondřej Šedivý & Antti Penttinen, 2014. "Intensity estimation for inhomogeneous Gibbs point process with covariates-dependent chemical activity," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 68(3), pages 225-249, August.
    20. Bouezmarni, Taoufik & Rombouts, Jeroen V.K., 2010. "Nonparametric density estimation for positive time series," Computational Statistics & Data Analysis, Elsevier, vol. 54(2), pages 245-261, February.
    21. Marcon, Eric & Puech, Florence, 2017. "A typology of distance-based measures of spatial concentration," Regional Science and Urban Economics, Elsevier, vol. 62(C), pages 56-67.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:jagbes:v:25:y:2020:i:1:d:10.1007_s13253-019-00380-4. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.