IDEAS home Printed from https://ideas.repec.org/a/eee/ecomod/v481y2023ics0304380023000819.html
   My bibliography  Save this article

A comparison of machine learning and statistical species distribution models: Quantifying overfitting supports model interpretation

Author

Listed:
  • Chollet Ramampiandra, Emma
  • Scheidegger, Andreas
  • Wydler, Jonas
  • Schuwirth, Nele

Abstract

Species distribution models are commonly applied to predict species responses to environmental conditions. A wide variety of models with different properties exist that vary in complexity, which affects their predictive performance and interpretability. Machine learning algorithms are increasingly used because they are capable to capture complex relationships and are often better in prediction. However, to inform environmental management, it is important that a model predicts well for the right reasons. It remains a challenge to select a model with a reasonable level of complexity that captures the true relationship between the response and explanatory variables as good as possible rather than fitting to the noise in the data.

Suggested Citation

  • Chollet Ramampiandra, Emma & Scheidegger, Andreas & Wydler, Jonas & Schuwirth, Nele, 2023. "A comparison of machine learning and statistical species distribution models: Quantifying overfitting supports model interpretation," Ecological Modelling, Elsevier, vol. 481(C).
  • Handle: RePEc:eee:ecomod:v:481:y:2023:i:c:s0304380023000819
    DOI: 10.1016/j.ecolmodel.2023.110353
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0304380023000819
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.ecolmodel.2023.110353?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Vermeiren, Peter & Reichert, Peter & Schuwirth, Nele, 2020. "Integrating uncertain prior knowledge regarding ecological preferences into multi-species distribution models: Effects of model complexity on predictive performance," Ecological Modelling, Elsevier, vol. 420(C).
    2. Schuwirth, Nele & Borgwardt, Florian & Domisch, Sami & Friedrichs, Martin & Kattwinkel, Mira & Kneis, David & Kuemmerlen, Mathias & Langhans, Simone D. & Martínez-López, Javier & Vermeiren, Peter, 2019. "How to make ecological models useful for environmental management," Ecological Modelling, Elsevier, vol. 411(C).
    3. Austin, Mike, 2007. "Species distribution models and ecological theory: A critical assessment and some possible new approaches," Ecological Modelling, Elsevier, vol. 200(1), pages 1-19.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Caradima, Bogdan & Scheidegger, Andreas & Brodersen, Jakob & Schuwirth, Nele, 2021. "Bridging mechanistic conceptual models and statistical species distribution models of riverine fish," Ecological Modelling, Elsevier, vol. 457(C).
    2. Brias, Antoine & Munch, Stephan B., 2021. "Ecosystem based multi-species management using Empirical Dynamic Programming," Ecological Modelling, Elsevier, vol. 441(C).
    3. Václavík, Tomáš & Meentemeyer, Ross K., 2009. "Invasive species distribution modeling (iSDM): Are absence data and dispersal constraints needed to predict actual distributions?," Ecological Modelling, Elsevier, vol. 220(23), pages 3248-3258.
    4. Muñoz-Mas, Rafael & Vezza, Paolo & Alcaraz-Hernández, Juan Diego & Martínez-Capel, Francisco, 2016. "Risk of invasion predicted with support vector machines: A case study on northern pike (Esox Lucius, L.) and bleak (Alburnus alburnus, L.)," Ecological Modelling, Elsevier, vol. 342(C), pages 123-134.
    5. Mosai, Alseno K. & Tokwana, Bontle C. & Tutu, Hlanganani, 2022. "Computer simulation modelling of the simultaneous adsorption of Cd, Cu and Cr from aqueous solutions by agricultural clay soil: A PHREEQC geochemical modelling code coupled to parameter estimation (PE," Ecological Modelling, Elsevier, vol. 465(C).
    6. Meineri, Eric & Dahlberg, C. Johan & Hylander, Kristoffer, 2015. "Using Gaussian Bayesian Networks to disentangle direct and indirect associations between landscape physiography, environmental variables and species distribution," Ecological Modelling, Elsevier, vol. 313(C), pages 127-136.
    7. Marmion, Mathieu & Luoto, Miska & Heikkinen, Risto K. & Thuiller, Wilfried, 2009. "The performance of state-of-the-art modelling techniques depends on geographical distribution of species," Ecological Modelling, Elsevier, vol. 220(24), pages 3512-3520.
    8. Kaiping Wang & Weiqi Wang & Niyi Zha & Yue Feng & Chenlan Qiu & Yunlu Zhang & Jia Ma & Rui Zhang, 2022. "Spatially Heterogeneity Response of Critical Ecosystem Service Capacity to Address Regional Development Risks to Rapid Urbanization: The Case of Beijing-Tianjin-Hebei Urban Agglomeration in China," Sustainability, MDPI, vol. 14(12), pages 1-21, June.
    9. Aertsen, Wim & Kint, Vincent & van Orshoven, Jos & Özkan, Kürşad & Muys, Bart, 2010. "Comparison and ranking of different modelling techniques for prediction of site index in Mediterranean mountain forests," Ecological Modelling, Elsevier, vol. 221(8), pages 1119-1130.
    10. Di Pirro, E. & Sallustio, L. & Capotorti, G. & Marchetti, M. & Lasserre, B., 2021. "A scenario-based approach to tackle trade-offs between biodiversity conservation and land use pressure in Central Italy," Ecological Modelling, Elsevier, vol. 448(C).
    11. Rufino, Marta M. & Albouy, Camille & Brind'Amour, Anik, 2021. "Which spatial interpolators I should use? A case study applying to marine species," Ecological Modelling, Elsevier, vol. 449(C).
    12. Stoklosa, Jakub & Huang, Yih-Huei & Furlan, Elise & Hwang, Wen-Han, 2016. "On quadratic logistic regression models when predictor variables are subject to measurement error," Computational Statistics & Data Analysis, Elsevier, vol. 95(C), pages 109-121.
    13. Moreno-Amat, Elena & Mateo, Rubén G. & Nieto-Lugilde, Diego & Morueta-Holme, Naia & Svenning, Jens-Christian & García-Amorena, Ignacio, 2015. "Impact of model complexity on cross-temporal transferability in Maxent species distribution models: An assessment using paleobotanical data," Ecological Modelling, Elsevier, vol. 312(C), pages 308-317.
    14. Sahragard, H.P. & Chahouki, M.A. Zare, 2015. "An evaluation of predictive habitat models performance of plant species in Hoze soltan rangelands of Qom province," Ecological Modelling, Elsevier, vol. 309, pages 64-71.
    15. Halvorsen, Rune & Mazzoni, Sabrina & Dirksen, John Wirkola & Næsset, Erik & Gobakken, Terje & Ohlson, Mikael, 2016. "How important are choice of model selection method and spatial autocorrelation of presence data for distribution modelling by MaxEnt?," Ecological Modelling, Elsevier, vol. 328(C), pages 108-118.
    16. Zhou, Demin & Gong, Huili & Liu, Zhaoli, 2008. "Integrated ecological assessment of biophysical wetland habitat in water catchments: Linking hydro-ecological modelling with geo-information techniques," Ecological Modelling, Elsevier, vol. 214(2), pages 411-420.
    17. Haag, Fridolin & Chennu, Arjun, 2023. "Assessing whether decisions are more sensitive to preference or prediction uncertainty with a value of information approach," Omega, Elsevier, vol. 121(C).
    18. Flores, O. & Rossi, V. & Mortier, F., 2009. "Autocorrelation offsets zero-inflation in models of tropical saplings density," Ecological Modelling, Elsevier, vol. 220(15), pages 1797-1809.
    19. Pliscoff, Patricio & Luebert, Federico & Hilger, Hartmut H. & Guisan, Antoine, 2014. "Effects of alternative sets of climatic predictors on species distribution models and associated estimates of extinction risk: A test with plants in an arid environment," Ecological Modelling, Elsevier, vol. 288(C), pages 166-177.
    20. Yvonne Milker & Manuel F G Weinkauf & Jürgen Titschack & Andre Freiwald & Stefan Krüger & Frans J Jorissen & Gerhard Schmiedl, 2017. "Testing the applicability of a benthic foraminiferal-based transfer function for the reconstruction of paleowater depth changes in Rhodes (Greece) during the early Pleistocene," PLOS ONE, Public Library of Science, vol. 12(11), pages 1-30, November.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:ecomod:v:481:y:2023:i:c:s0304380023000819. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.journals.elsevier.com/ecological-modelling .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.