IDEAS home Printed from https://ideas.repec.org/a/gam/jsusta/v17y2025i1p256-d1558785.html
   My bibliography  Save this article

Machine Learning Approaches for Predicting Maize Biomass Yield: Leveraging Feature Engineering and Comprehensive Data Integration

Author

Listed:
  • Maryam Abbasi

    (Applied Research Institute, Polytechnic of Coimbra, 3000 Coimbra, Portugal)

  • Paulo Váz

    (Research Center in Digital Services, Polytechnic of Viseu, 3500 Viseu, Portugal)

  • José Silva

    (Research Center in Digital Services, Polytechnic of Viseu, 3500 Viseu, Portugal)

  • Pedro Martins

    (Research Center in Digital Services, Polytechnic of Viseu, 3500 Viseu, Portugal)

Abstract

The efficient prediction of corn biomass yield is critical for optimizing crop production and addressing global challenges in sustainable agriculture and renewable energy. This study employs advanced machine learning techniques, including Gradient Boosting Machines (GBMs), Random Forests (RFs), Support Vector Machines (SVMs), and Artificial Neural Networks (ANNs), integrated with comprehensive environmental, soil, and crop management data from key agricultural regions in the United States. A novel framework combines feature engineering, such as the creation of a Soil Fertility Index (SFI) and Growing Degree Days (GDDs), and the incorporation of interaction terms to address complex non-linear relationships between input variables and biomass yield. We conduct extensive sensitivity analysis and employ SHAP (SHapley Additive exPlanations) values to enhance model interpretability, identifying SFI, GDDs, and cumulative rainfall as the most influential features driving yield outcomes. Our findings highlight significant synergies among these variables, emphasizing their critical role in rural environmental governance and precision agriculture. Furthermore, an ensemble approach combining GBMs, RFs, and ANNs outperformed individual models, achieving an RMSE of 0.80 t/ha and R 2 of 0.89. These results underscore the potential of hybrid modeling for real-world applications in sustainable farming practices. Addressing the concerns of passive farmer participation, we propose targeted incentives, education, and institutional support mechanisms to enhance stakeholder collaboration in rural environmental governance. While the models assume rational decision-making, the inclusion of cultural and political factors warrants further investigation to improve the robustness of the framework. Additionally, a map of the study region and improved visualizations of feature importance enhance the clarity and relevance of our findings. This research contributes to the growing body of knowledge on predictive modeling in agriculture, combining theoretical rigor with practical insights to support policymakers and stakeholders in optimizing resource use and addressing environmental challenges. By improving the interpretability and applicability of machine learning models, this study provides actionable strategies for enhancing crop yield predictions and advancing rural environmental governance.

Suggested Citation

  • Maryam Abbasi & Paulo Váz & José Silva & Pedro Martins, 2025. "Machine Learning Approaches for Predicting Maize Biomass Yield: Leveraging Feature Engineering and Comprehensive Data Integration," Sustainability, MDPI, vol. 17(1), pages 1-15, January.
  • Handle: RePEc:gam:jsusta:v:17:y:2025:i:1:p:256-:d:1558785
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2071-1050/17/1/256/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2071-1050/17/1/256/
    Download Restriction: no
    ---><---

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Juan Botero-Valencia & Vanessa García-Pineda & Alejandro Valencia-Arias & Jackeline Valencia & Erick Reyes-Vera & Mateo Mejia-Herrera & Ruber Hernández-García, 2025. "Machine Learning in Sustainable Agriculture: Systematic Review and Research Perspectives," Agriculture, MDPI, vol. 15(4), pages 1-37, February.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jsusta:v:17:y:2025:i:1:p:256-:d:1558785. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.