IDEAS home Printed from https://ideas.repec.org/a/gam/jagris/v15y2025i7p731-d1623155.html
   My bibliography  Save this article

Determination of Optimal Dataset Characteristics for Improving YOLO Performance in Agricultural Object Detection

Author

Listed:
  • Jisu Song

    (Department of Bio-Industrial Machinery Engineering, Pusan National University, Miryang 50463, Republic of Korea)

  • Dongseok Kim

    (Department of Bio-Industrial Machinery Engineering, Pusan National University, Miryang 50463, Republic of Korea)

  • Eunji Jeong

    (Department of Bio-Industrial Machinery Engineering, Pusan National University, Miryang 50463, Republic of Korea)

  • Jaesung Park

    (Department of Bio-Industrial Machinery Engineering, Pusan National University, Miryang 50463, Republic of Korea)

Abstract

Recent advances in artificial intelligence and computer vision have led to significant progress in the use of agricultural technologies for yield prediction, pest detection, and real-time monitoring of plant conditions. However, collecting large-scale, high-quality image datasets in the agriculture sector remains challenging, particularly for specialized datasets such as plant disease images. This study analyzed the effects of the image size (320–640+) and the number of labels on the performance of a YOLO-based object detection model using diverse agricultural datasets for strawberries, tomatoes, chilies, and peppers. Model performance was evaluated using the intersection over union and average precision (AP), where the AP curve was smoothed using the Savitzky–Golay filter and EEM. The results revealed that increasing the number of labels improved the model performance to a certain degree, after which the performance gradually diminished. Furthermore, while increasing the image size from 320 to 640 substantially enhanced the model performance, additional increases beyond 640 yielded only marginal improvements. However, the training time and graphics processing unit usage scaled linearly with increasing image sizes, as larger size images require greater computational resources. These findings underscore the importance of an optimal strategy for selecting the image size and label quantity under resource constraints in real-world model development.

Suggested Citation

  • Jisu Song & Dongseok Kim & Eunji Jeong & Jaesung Park, 2025. "Determination of Optimal Dataset Characteristics for Improving YOLO Performance in Agricultural Object Detection," Agriculture, MDPI, vol. 15(7), pages 1-30, March.
  • Handle: RePEc:gam:jagris:v:15:y:2025:i:7:p:731-:d:1623155
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2077-0472/15/7/731/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2077-0472/15/7/731/
    Download Restriction: no
    ---><---

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jagris:v:15:y:2025:i:7:p:731-:d:1623155. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.