IDEAS home Printed from https://ideas.repec.org/a/eee/agisys/v187y2021ics0308521x20308775.html
   My bibliography  Save this article

Machine learning for large-scale crop yield forecasting

Author

Listed:
  • Paudel, Dilli
  • Boogaard, Hendrik
  • de Wit, Allard
  • Janssen, Sander
  • Osinga, Sjoukje
  • Pylianidis, Christos
  • Athanasiadis, Ioannis N.

Abstract

Many studies have applied machine learning to crop yield prediction with a focus on specific case studies. The data and methods they used may not be transferable to other crops and locations. On the other hand, operational large-scale systems, such as the European Commission's MARS Crop Yield Forecasting System (MCYFS), do not use machine learning. Machine learning is a promising method especially when large amounts of data are being collected and published. We combined agronomic principles of crop modeling with machine learning to build a machine learning baseline for large-scale crop yield forecasting. The baseline is a workflow emphasizing correctness, modularity and reusability. For correctness, we focused on designing explainable predictors or features (in relation to crop growth and development) and applying machine learning without information leakage. We created features using crop simulation outputs and weather, remote sensing and soil data from the MCYFS database. We emphasized a modular and reusable workflow to support different crops and countries with small configuration changes. The workflow can be used to run repeatable experiments (e.g. early season or end of season predictions) using standard input data to obtain reproducible results. The results serve as a starting point for further optimizations. In our case studies, we predicted yield at regional level for five crops (soft wheat, spring barley, sunflower, sugar beet, potatoes) and three countries (the Netherlands (NL), Germany (DE), France (FR)). We compared the performance with a simple method with no prediction skill, which either predicted a linear yield trend or the average of the training set. We also aggregated the predictions to the national level and compared with past MCYFS forecasts. The normalized RMSE (NRMSE) for early season predictions (30 days after planting) were comparable for NL (all crops), DE (all except soft wheat) and FR (soft wheat, spring barley, sunflower). For example, NRMSE was 7.87 for soft wheat (NL) (6.32 for MCYFS) and 8.21 for sugar beet (DE) (8.79 for MCYFS). In contrast, NRMSEs for soft wheat (DE), sugar beet (FR) and potatoes (FR) were twice as much compared to MCYFS. NRMSEs for end of season were still comparable to MCYFS for NL, but worse for DE and FR. The baseline can be improved by adding new data sources, designing more predictive features and evaluating different machine learning algorithms. The baseline will motivate the use of machine learning in large-scale crop yield forecasting.

Suggested Citation

  • Paudel, Dilli & Boogaard, Hendrik & de Wit, Allard & Janssen, Sander & Osinga, Sjoukje & Pylianidis, Christos & Athanasiadis, Ioannis N., 2021. "Machine learning for large-scale crop yield forecasting," Agricultural Systems, Elsevier, vol. 187(C).
  • Handle: RePEc:eee:agisys:v:187:y:2021:i:c:s0308521x20308775
    DOI: 10.1016/j.agsy.2020.103016
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0308521X20308775
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.agsy.2020.103016?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Lecerf, Rémi & Ceglar, Andrej & López-Lozano, Raúl & Van Der Velde, Marijn & Baruth, Bettina, 2019. "Assessing the information in crop model and meteorological indicators to forecast crop yield over Europe," Agricultural Systems, Elsevier, vol. 168(C), pages 191-202.
    2. Jig Han Jeong & Jonathan P Resop & Nathaniel D Mueller & David H Fleisher & Kyungdahm Yun & Ethan E Butler & Dennis J Timlin & Kyo-Moon Shim & James S Gerber & Vangimalla R Reddy & Soo-Hyung Kim, 2016. "Random Forests for Global and Regional Crop Yield Predictions," PLOS ONE, Public Library of Science, vol. 11(6), pages 1-15, June.
    3. de Wit, Allard & Boogaard, Hendrik & Fumagalli, Davide & Janssen, Sander & Knapen, Rob & van Kraalingen, Daniel & Supit, Iwan & van der Wijngaart, Raymond & van Diepen, Kees, 2019. "25 years of the WOFOST cropping systems model," Agricultural Systems, Elsevier, vol. 168(C), pages 154-167.
    4. Bussay, Attila & van der Velde, Marijn & Fumagalli, Davide & Seguini, Lorenzo, 2015. "Improving operational maize yield forecasting in Hungary," Agricultural Systems, Elsevier, vol. 141(C), pages 94-106.
    5. van der Velde, M. & Nisini, L., 2019. "Performance of the MARS-crop yield forecasting system for the European Union: Assessing accuracy, in-season, and year-to-year improvements from 1993 to 2015," Agricultural Systems, Elsevier, vol. 168(C), pages 203-212.
    6. López-Lozano, Raúl & Baruth, Bettina, 2019. "An evaluation framework to build a cost-efficient crop monitoring system. Experiences from the extension of the European crop monitoring system," Agricultural Systems, Elsevier, vol. 168(C), pages 231-246.
    7. Willcock, Simon & Martínez-López, Javier & Hooftman, Danny A.P. & Bagstad, Kenneth J. & Balbi, Stefano & Marzo, Alessia & Prato, Carlo & Sciandrello, Saverio & Signorello, Giovanni & Voigt, Brian & Vi, 2018. "Machine learning for ecosystem services," Ecosystem Services, Elsevier, vol. 33(PB), pages 165-174.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Timsina, Jagadish & Dutta, Sudarshan & Devkota, Krishna Prasad & Chakraborty, Somsubhra & Neupane, Ram Krishna & Bishta, Sudarshan & Amgain, Lal Prasad & Singh, Vinod K. & Islam, Saiful & Majumdar, Ka, 2021. "Improved nutrient management in cereals using Nutrient Expert and machine learning tools: Productivity, profitability and nutrient use efficiency," Agricultural Systems, Elsevier, vol. 192(C).
    2. Gaona, Jaime & Benito-Verdugo, Pilar & Martínez-Fernández, José & González-Zamora, Ángel & Almendra-Martín, Laura & Herrero-Jiménez, Carlos Miguel, 2023. "Predictive value of soil moisture and concurrent variables in the multivariate modelling of cereal yields in water-limited environments," Agricultural Water Management, Elsevier, vol. 282(C).
    3. Oyenike Mary Olanrewaju & Eli Adama Jiya & Faith Oluwatosin Echobu, 2024. "Intelligent Maize Yield Prediction Model Based on Plant Attributes and Machine Learning Algorithms," International Journal of Research and Scientific Innovation, International Journal of Research and Scientific Innovation (IJRSI), vol. 11(7), pages 1097-1104, July.
    4. Schmidt, Lorenz & Odening, Martin & Schlanstein, Johann & Ritter, Matthias, 2021. "Estimation of the Farm-Level Yield-Weather-Relation Using Machine Learning," 61st Annual Conference, Berlin, Germany, September 22-24, 2021 317075, German Association of Agricultural Economists (GEWISOLA).
    5. Sebastian C. Ibañez & Christopher P. Monterola, 2023. "A Global Forecasting Approach to Large-Scale Crop Production Prediction with Time Series Transformers," Agriculture, MDPI, vol. 13(9), pages 1-27, September.
    6. Kalpana Jain & Naveen Choudhary, 2022. "Comparative analysis of machine learning techniques for predicting production capability of crop yield," International Journal of System Assurance Engineering and Management, Springer;The Society for Reliability, Engineering Quality and Operations Management (SREQOM),India, and Division of Operation and Maintenance, Lulea University of Technology, Sweden, vol. 13(1), pages 583-593, March.
    7. Kouame, Anselme K.K. & Bindraban, Prem S. & Kissiedu, Isaac N. & Atakora, Williams K. & El Mejahed, Khalil, 2023. "Identifying drivers for variability in maize (Zea mays L.) yield in Ghana: A meta-regression approach," Agricultural Systems, Elsevier, vol. 209(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. van der Velde, M. & Nisini, L., 2019. "Performance of the MARS-crop yield forecasting system for the European Union: Assessing accuracy, in-season, and year-to-year improvements from 1993 to 2015," Agricultural Systems, Elsevier, vol. 168(C), pages 203-212.
    2. Tiecheng Bai & Nannan Zhang & Youqi Chen & Benoit Mercatoris, 2019. "Assessing the Performance of the WOFOST Model in Simulating Jujube Fruit Tree Growth under Different Irrigation Regimes," Sustainability, MDPI, vol. 11(5), pages 1-16, March.
    3. Agudelo, César Augusto Ruiz & Bustos, Sandra Liliana Hurtado & Moreno, Carmen Alicia Parrado, 2020. "Modeling interactions among multiple ecosystem services. A critical review," Ecological Modelling, Elsevier, vol. 429(C).
    4. Gniewko Niedbała, 2019. "Application of Artificial Neural Networks for Multi-Criteria Yield Prediction of Winter Rapeseed," Sustainability, MDPI, vol. 11(2), pages 1-13, January.
    5. Indy Man Kit Ho & Anthony Weldon & Jason Tze Ho Yong & Candy Tze Tim Lam & Jaime Sampaio, 2023. "Using Machine Learning Algorithms to Pool Data from Meta-Analysis for the Prediction of Countermovement Jump Improvement," IJERPH, MDPI, vol. 20(10), pages 1-15, May.
    6. Shen Tan & Shengchao Qiao & Han Wang & Sheng Chang, 2024. "Predicting Wheat Potential Yield in China Based on Eco-Evolutionary Optimality Principles," Agriculture, MDPI, vol. 14(11), pages 1-15, November.
    7. Signorello, Giovanni & Prato, Carlo & Marzo, Alessia & Ientile, Renzo & Cucuzza, Giuseppe & Sciandrello, Saverio & Martínez-López, Javier & Balbi, Stefano & Villa, Ferdinando, 2018. "Are protected areas covering important biodiversity sites? An assessment of the nature protection network in Sicily (Italy)," Land Use Policy, Elsevier, vol. 78(C), pages 593-602.
    8. Richards, Daniel Rex & Lavorel, Sandra, 2022. "Integrating social media data and machine learning to analyse scenarios of landscape appreciation," Ecosystem Services, Elsevier, vol. 55(C).
    9. Florian Schierhorn & Max Hofmann & Taras Gagalyuk & Igor Ostapchuk & Daniel Müller, 2021. "Machine learning reveals complex effects of climatic means and weather extremes on wheat yields during different plant developmental stages," Climatic Change, Springer, vol. 169(3), pages 1-19, December.
    10. Yan, Ling & Jin, Jiming & Wu, Pute, 2020. "Impact of parameter uncertainty and water stress parameterization on wheat growth simulations using CERES-Wheat with GLUE," Agricultural Systems, Elsevier, vol. 181(C).
    11. Puyu Feng & Bin Wang & De Li Liu & Hongtao Xing & Fei Ji & Ian Macadam & Hongyan Ruan & Qiang Yu, 2018. "Impacts of rainfall extremes on wheat yield in semi-arid cropping systems in eastern Australia," Climatic Change, Springer, vol. 147(3), pages 555-569, April.
    12. Shi, Yinfang & Wang, Zhaoyang & Hou, Cheng & Zhang, Puhan, 2022. "Yield estimation of Lycium barbarum L. based on the WOFOST model," Ecological Modelling, Elsevier, vol. 473(C).
    13. Feng, Dingrui & Li, Guangyong & Wang, Dan & Wulazibieke, Mierguli & Cai, Mingkun & Kang, Jing & Yuan, Zicheng & Xu, Houcheng, 2022. "Evaluation of AquaCrop model performance under mulched drip irrigation for maize in Northeast China," Agricultural Water Management, Elsevier, vol. 261(C).
    14. Li Fan & Shibo Fang & Jinlong Fan & Yan Wang & Linqing Zhan & Yongkun He, 2024. "Rice Yield Estimation Using Machine Learning and Feature Selection in Hilly and Mountainous Chongqing, China," Agriculture, MDPI, vol. 14(9), pages 1-18, September.
    15. Martin Kuradusenge & Eric Hitimana & Damien Hanyurwimfura & Placide Rukundo & Kambombo Mtonga & Angelique Mukasine & Claudette Uwitonze & Jackson Ngabonziza & Angelique Uwamahoro, 2023. "Crop Yield Prediction Using Machine Learning Models: Case of Irish Potato and Maize," Agriculture, MDPI, vol. 13(1), pages 1-19, January.
    16. Manley, Kyle & Nyelele, Charity & Egoh, Benis N., 2022. "A review of machine learning and big data applications in addressing ecosystem service research gaps," Ecosystem Services, Elsevier, vol. 57(C).
    17. Xu Zhang & Guangsheng Chen & Lingxiao Cai & Hongbo Jiao & Jianwen Hua & Xifang Luo & Xinliang Wei, 2021. "Impact Assessments of Typhoon Lekima on Forest Damages in Subtropical China Using Machine Learning Methods and Landsat 8 OLI Imagery," Sustainability, MDPI, vol. 13(9), pages 1-21, April.
    18. Xinchen Gu & Aihua Long & Guihua Liu & Jiawen Yu & Hao Wang & Yongmin Yang & Pei Zhang, 2021. "Changes in Ecosystem Service Value in the 1 km Lakeshore Zone of Poyang Lake from 1980 to 2020," Land, MDPI, vol. 10(9), pages 1-19, September.
    19. Bagstad, Kenneth J. & Ingram, Jane Carter & Shapiro, Carl D. & La Notte, Alessandra & Maes, Joachim & Vallecillo, Sara & Casey, C. Frank & Glynn, Pierre D. & Heris, Mehdi P. & Johnson, Justin A. & Lau, 2021. "Lessons learned from development of natural capital accounts in the United States and European Union," Ecosystem Services, Elsevier, vol. 52(C).
    20. Schmidt, Lorenz & Odening, Martin & Schlanstein, Johann & Ritter, Matthias, 2021. "Estimation of the Farm-Level Yield-Weather-Relation Using Machine Learning," 61st Annual Conference, Berlin, Germany, September 22-24, 2021 317075, German Association of Agricultural Economists (GEWISOLA).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:agisys:v:187:y:2021:i:c:s0308521x20308775. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/agsy .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.