IDEAS home Printed from https://ideas.repec.org/a/eee/ecomod/v411y2019ics0304380019303230.html
   My bibliography  Save this article

Importance of spatial predictor variable selection in machine learning applications – Moving from data reproduction to spatial prediction

Author

Listed:
  • Meyer, Hanna
  • Reudenbach, Christoph
  • Wöllauer, Stephan
  • Nauss, Thomas

Abstract

Machine learning algorithms find frequent application in spatial prediction of biotic and abiotic environmental variables. However, the characteristics of spatial data, especially spatial autocorrelation, are widely ignored. We hypothesize that this is problematic and results in models that can reproduce training data but are unable to make spatial predictions beyond the locations of the training samples. We assume that not only spatial validation strategies but also spatial variable selection is essential for reliable spatial predictions.

Suggested Citation

  • Meyer, Hanna & Reudenbach, Christoph & Wöllauer, Stephan & Nauss, Thomas, 2019. "Importance of spatial predictor variable selection in machine learning applications – Moving from data reproduction to spatial prediction," Ecological Modelling, Elsevier, vol. 411(C).
  • Handle: RePEc:eee:ecomod:v:411:y:2019:i:c:s0304380019303230
    DOI: 10.1016/j.ecolmodel.2019.108815
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0304380019303230
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.ecolmodel.2019.108815?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Antoine Stevens & Marco Nocita & Gergely Tóth & Luca Montanarella & Bas van Wesemael, 2013. "Prediction of Soil Organic Carbon at the European Scale by Visible and Near InfraRed Reflectance Spectroscopy," PLOS ONE, Public Library of Science, vol. 8(6), pages 1-13, June.
    2. Schratz, Patrick & Muenchow, Jannes & Iturritxa, Eugenia & Richter, Jakob & Brenning, Alexander, 2019. "Hyperparameter tuning and performance assessment of statistical and machine-learning algorithms using spatial data," Ecological Modelling, Elsevier, vol. 406(C), pages 109-120.
    3. Eric S Walsh & Betty J Kreakie & Mark G Cantwell & Diane Nacci, 2017. "A Random Forest approach to predict the spatial distribution of sediment pollution in an estuarine system," PLOS ONE, Public Library of Science, vol. 12(7), pages 1-18, July.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Katarzyna Kopczewska, 2022. "Spatial machine learning: new opportunities for regional science," The Annals of Regional Science, Springer;Western Regional Science Association, vol. 68(3), pages 713-755, June.
    2. Jiang, Xiaoman & Wang, Yuntao & A., Yinglan & Wang, Guoqiang & Zhang, Xiaojing & Ma, Guangwen & Duan, Limin & Liu, Kai, 2024. "Optimizing actual evapotranspiration simulation to identify evapotranspiration partitioning variations: A fusion of physical processes and machine learning techniques," Agricultural Water Management, Elsevier, vol. 295(C).
    3. Wadoux, Alexandre M.J.-C. & Heuvelink, Gerard B.M. & de Bruin, Sytze & Brus, Dick J., 2021. "Spatial cross-validation is not the right way to evaluate map accuracy," Ecological Modelling, Elsevier, vol. 457(C).
    4. Gustavo Larrea‐Gallegos & Ian Vázquez‐Rowe, 2022. "Exploring machine learning techniques to predict deforestation to enhance the decision‐making of road construction projects," Journal of Industrial Ecology, Yale University, vol. 26(1), pages 225-239, February.
    5. Metz-Peeters, Maike, 2023. "The Effects of Mandatory Speed Limits on Crash Frequency - A Causal Machine Learning Approach," Ruhr Economic Papers 982, RWI - Leibniz-Institut für Wirtschaftsforschung, Ruhr-University Bochum, TU Dortmund University, University of Duisburg-Essen, revised 2023.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Juergen Deppner & Marcelo Cajias, 2024. "Accounting for Spatial Autocorrelation in Algorithm-Driven Hedonic Models: A Spatial Cross-Validation Approach," The Journal of Real Estate Finance and Economics, Springer, vol. 68(2), pages 235-273, February.
    2. Giacomo Crucil & Fabio Castaldi & Emilien Aldana-Jague & Bas van Wesemael & Andy Macdonald & Kristof Van Oost, 2019. "Assessing the Performance of UAS-Compatible Multispectral and Hyperspectral Sensors for Soil Organic Carbon Prediction," Sustainability, MDPI, vol. 11(7), pages 1-18, March.
    3. Jasiewicz Jarosław & Cierniewski Jerzy, 2021. "SALBEC – A Python Library and GUI Application to Calculate the Diurnal Variation of the Soil Albedo," Quaestiones Geographicae, Sciendo, vol. 40(3), pages 95-107, September.
    4. Haigang Liu & David B. Hitchcock & S. Zahra Samadi, 2020. "Spatio-temporal analysis of flood data from South Carolina," Journal of Statistical Distributions and Applications, Springer, vol. 7(1), pages 1-19, December.
    5. Tao Liu & Huan Zhang & Tiezhu Shi, 2020. "Modeling and Predictive Mapping of Soil Organic Carbon Density in a Small-Scale Area Using Geographically Weighted Regression Kriging Approach," Sustainability, MDPI, vol. 12(22), pages 1-12, November.
    6. Katarzyna Kopczewska, 2022. "Spatial machine learning: new opportunities for regional science," The Annals of Regional Science, Springer;Western Regional Science Association, vol. 68(3), pages 713-755, June.
    7. Morakot Worachairungreung & Sarawut Ninsawat & Apichon Witayangkurn & Matthew N. Dailey, 2021. "Identification of Road Traffic Injury Risk Prone Area Using Environmental Factors by Machine Learning Classification in Nonthaburi, Thailand," Sustainability, MDPI, vol. 13(7), pages 1-25, April.
    8. Baoyang Liu & Baofeng Guo & Renxiong Zhuo & Fan Dai & Haoyu Chi, 2023. "Prediction of the soil organic carbon in the LUCAS soil database based on spectral clustering," Soil and Water Research, Czech Academy of Agricultural Sciences, vol. 18(1), pages 43-54.
    9. Hobimiarantsoa Rakotonindrina & Kensuke Kawamura & Yasuhiro Tsujimoto & Tomohiro Nishigaki & Herintsitohaina Razakamanarivo & Bruce Haja Andrianary & Andry Andriamananjara, 2020. "Prediction of Soil Oxalate Phosphorus using Visible and Near-Infrared Spectroscopy in Natural and Cultivated System Soils of Madagascar," Agriculture, MDPI, vol. 10(5), pages 1-16, May.
    10. Theodora Angelopoulou & Athanasios Balafoutis & George Zalidis & Dionysis Bochtis, 2020. "From Laboratory to Proximal Sensing Spectroscopy for Soil Organic Carbon Estimation—A Review," Sustainability, MDPI, vol. 12(2), pages 1-24, January.
    11. Richter, Franziska & Jan, Pierrick & El Benni, Nadja & Lüscher, Andreas & Buchmann, Nina & Klaus, Valentin H., 2021. "A guide to assess and value ecosystem services of grasslands," Ecosystem Services, Elsevier, vol. 52(C).
    12. Zhu Liang & Wei Liu & Weiping Peng & Lingwei Chen & Changming Wang, 2022. "Improved Shallow Landslide Susceptibility Prediction Based on Statistics and Ensemble Learning," Sustainability, MDPI, vol. 14(10), pages 1-21, May.
    13. Kuntal M. Hati & Nishant K. Sinha & Monoranjan Mohanty & Pramod Jha & Sunil Londhe & Andrew Sila & Erick Towett & Ranjeet S. Chaudhary & Somasundaram Jayaraman & Mounisamy Vassanda Coumar & Jyoti K. T, 2022. "Mid-Infrared Reflectance Spectroscopy for Estimation of Soil Properties of Alfisols from Eastern India," Sustainability, MDPI, vol. 14(9), pages 1-17, April.
    14. Konstantinos Karyotis & Theodora Angelopoulou & Nikolaos Tziolas & Evgenia Palaiologou & Nikiforos Samarinas & George Zalidis, 2021. "Evaluation of a Micro-Electro Mechanical Systems Spectral Sensor for Soil Properties Estimation," Land, MDPI, vol. 10(1), pages 1-16, January.
    15. Vo Thanh, Hung & Zamanyad, Aiyoub & Safaei-Farouji, Majid & Ashraf, Umar & Hemeng, Zhang, 2022. "Application of hybrid artificial intelligent models to predict deliverability of underground natural gas storage sites," Renewable Energy, Elsevier, vol. 200(C), pages 169-184.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:ecomod:v:411:y:2019:i:c:s0304380019303230. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.journals.elsevier.com/ecological-modelling .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.