
Model for Predicting Maize Crop Yield on Small Farms Using Clusterwise Linear Regression and GRASP

Authors

Listed:
  • Germán-Homero Morán-Figueroa

    (Information Technology Research Group (GTI), Universidad del Cauca, Popayán 190001, Colombia)

  • Darwin-Fabián Muñoz-Pérez

    (Information Technology Research Group (GTI), Universidad del Cauca, Popayán 190001, Colombia)

  • José-Luis Rivera-Ibarra

    (Information Technology Research Group (GTI), Universidad del Cauca, Popayán 190001, Colombia)

  • Carlos-Alberto Cobos-Lozada

    (Information Technology Research Group (GTI), Universidad del Cauca, Popayán 190001, Colombia)

Abstract

Planting a crop involves several key steps: resource assessment, crop selection, crop rotation, planting schedules, soil preparation, planting, care, and harvesting. In this context, estimating crop productivity from available information, such as expected climatic conditions and agricultural practices, helps farmers reduce the uncertainty of their investment. Maize is the fourth most important crop in Colombia, and significant efforts are required to improve its productivity in both traditional and technified production systems. This research proposes and evaluates an approach called Clusterwise Linear Regression (CLR) to predict maize crop yield on small farms, considering data on climate, soil, fertilization, and management practices, among others. The CLR model was developed in four steps: data collection and preparation, clustering using k-means, cluster optimization with the Greedy Randomized Adaptive Search Procedure (GRASP), and performance evaluation. The cluster optimization process identifies clusters of farms with similar characteristics and generates, for each cluster, a multiple linear regression model with mixed variables that explains the yield of the farms in that cluster. The Multiple Start Simulated Annealing (MSSA) metaheuristic was also evaluated, but GRASP produced the best results. The results indicate that the proposed CLR approach is more effective than the linear and nonlinear algorithms reported in the literature, such as lasso-regularized multiple linear regression, random forests, XGBoost, and support vector machines. Those algorithms achieved an accuracy of 70%, whereas the CLR model achieved 87% on test data. Analysis of the clusters revealed key factors affecting crop yield, such as fertilization, drainage, and soil type. This transparency is an advantage over black-box models, which can be harder to interpret. This advancement can help farmers make better decisions about the management of their crops.
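
The general idea of clusterwise linear regression with a GRASP-style search can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`grasp_clr`, `construct`, `local_search`), the parameters (`alpha`, `n_starts`), the random initial partition (used here in place of k-means), and the synthetic data are all assumptions made for illustration. Each GRASP iteration builds a partition by picking, for every point, a cluster at random from a restricted candidate list of well-fitting clusters, then improves it by reassigning points to their best-fitting regression model and refitting.

```python
import numpy as np

def fit_models(Xb, y, labels, k):
    """Fit one least-squares model per cluster; rows of the result are coefficients."""
    coefs = np.zeros((k, Xb.shape[1]))
    for c in range(k):
        mask = labels == c
        if mask.sum() >= Xb.shape[1]:  # skip clusters too small to fit
            coefs[c] = np.linalg.lstsq(Xb[mask], y[mask], rcond=None)[0]
    return coefs

def sq_residuals(Xb, y, coefs):
    """(n, k) matrix: squared error of each point under each cluster's model."""
    return (y[:, None] - Xb @ coefs.T) ** 2

def sse(Xb, y, labels, coefs):
    """Total squared error when each point uses its own cluster's model."""
    return float(sq_residuals(Xb, y, coefs)[np.arange(len(y)), labels].sum())

def construct(Xb, y, k, alpha, rng):
    """Greedy randomized construction (assumed scheme, not from the paper)."""
    labels = rng.integers(0, k, size=len(y))      # random starting partition
    res = sq_residuals(Xb, y, fit_models(Xb, y, labels, k))
    for i in range(len(y)):
        lo, hi = res[i].min(), res[i].max()
        # Restricted candidate list: clusters whose error is close to the best.
        rcl = np.flatnonzero(res[i] <= lo + alpha * (hi - lo))
        labels[i] = rng.choice(rcl)
    return labels

def local_search(Xb, y, labels, k, max_iter=100):
    """Alternate reassignment and refitting while the total error decreases."""
    coefs = fit_models(Xb, y, labels, k)
    best = sse(Xb, y, labels, coefs)
    for _ in range(max_iter):
        new_labels = sq_residuals(Xb, y, coefs).argmin(axis=1)
        new_coefs = fit_models(Xb, y, new_labels, k)
        new = sse(Xb, y, new_labels, new_coefs)
        if new >= best - 1e-12:
            break
        labels, coefs, best = new_labels, new_coefs, new
    return labels, coefs, best

def grasp_clr(X, y, k=2, alpha=0.3, n_starts=20, seed=0):
    """Keep the best (labels, coefs, sse) found over several GRASP iterations."""
    rng = np.random.default_rng(seed)
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    Xb = np.column_stack([np.ones(len(X)), X])    # prepend intercept column
    best = (None, None, np.inf)
    for _ in range(n_starts):
        cand = local_search(Xb, y, construct(Xb, y, k, alpha, rng), k)
        if cand[2] < best[2]:
            best = cand
    return best

# Synthetic data (hypothetical): two groups that follow different linear trends.
data_rng = np.random.default_rng(1)
x = data_rng.uniform(0, 10, size=80)
regime = data_rng.integers(0, 2, size=80)
y = np.where(regime == 0, 2.0 * x + 1.0, -3.0 * x + 40.0)
y = y + data_rng.normal(0, 0.5, size=80)

labels, coefs, sse_clr = grasp_clr(x, y, k=2)
_, _, sse_single = grasp_clr(x, y, k=1)           # k=1 is one global regression
print("per-cluster [intercept, slope]:", coefs)
print("SSE with 2 clusters:", sse_clr, "vs. single model:", sse_single)
```

The sketch covers only the fitting stage. Predicting the yield of a new farm would also need a rule for assigning that farm to a cluster, which is omitted here.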

Suggested Citation

  • Germán-Homero Morán-Figueroa & Darwin-Fabián Muñoz-Pérez & José-Luis Rivera-Ibarra & Carlos-Alberto Cobos-Lozada, 2024. "Model for Predicting Maize Crop Yield on Small Farms Using Clusterwise Linear Regression and GRASP," Mathematics, MDPI, vol. 12(21), pages 1-34, October.
  • Handle: RePEc:gam:jmathe:v:12:y:2024:i:21:p:3356-:d:1507015

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/12/21/3356/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/12/21/3356/
    Download Restriction: no
