IDEAS home Printed from https://ideas.repec.org/a/gam/jagris/v15y2025i6p588-d1609353.html
   My bibliography  Save this article

Dynamic Task Planning for Multi-Arm Apple-Harvesting Robots Using LSTM-PPO Reinforcement Learning Algorithm

Author

Listed:
  • Zhengwei Guo

    (School of Mechanical Engineering, Guangxi University, Nanning 530004, China
    Intelligent Equipment Research Center, Beijing Academy of Agriculture and Forestry Sciences, Beijing 100097, China
    These authors contributed equally to this work.)

  • Heng Fu

    (Intelligent Equipment Research Center, Beijing Academy of Agriculture and Forestry Sciences, Beijing 100097, China
    These authors contributed equally to this work.)

  • Jiahao Wu

    (Intelligent Equipment Research Center, Beijing Academy of Agriculture and Forestry Sciences, Beijing 100097, China)

  • Wenkai Han

    (Intelligent Equipment Research Center, Beijing Academy of Agriculture and Forestry Sciences, Beijing 100097, China)

  • Wenlei Huang

    (Intelligent Equipment Research Center, Beijing Academy of Agriculture and Forestry Sciences, Beijing 100097, China)

  • Wengang Zheng

    (Intelligent Equipment Research Center, Beijing Academy of Agriculture and Forestry Sciences, Beijing 100097, China)

  • Tao Li

    (Intelligent Equipment Research Center, Beijing Academy of Agriculture and Forestry Sciences, Beijing 100097, China)

Abstract

This paper presents a dynamic task planning approach for multi-arm apple-picking robots based on a deep reinforcement learning (DRL) framework incorporating Long Short-Term Memory (LSTM) networks and Proximal Policy Optimization (PPO). In the context of rising labor costs and labor shortages in agriculture, automated apple harvesting is becoming increasingly important. The proposed algorithm addresses key challenges such as efficient task coordination, optimal picking sequences, and real-time decision-making in complex, dynamic orchard environments. The system’s performance is validated through simulations in both static and dynamic environments, with the algorithm demonstrating significant improvements in task completion time and robot efficiency compared to existing strategies. The results show that the LSTM-PPO approach outperforms other methods, offering enhanced adaptability, fault tolerance, and task execution efficiency, particularly under changing and unpredictable conditions. This research lays the foundation for the development of more efficient, adaptable robotic systems in agricultural applications.

Suggested Citation

  • Zhengwei Guo & Heng Fu & Jiahao Wu & Wenkai Han & Wenlei Huang & Wengang Zheng & Tao Li, 2025. "Dynamic Task Planning for Multi-Arm Apple-Harvesting Robots Using LSTM-PPO Reinforcement Learning Algorithm," Agriculture, MDPI, vol. 15(6), pages 1-20, March.
  • Handle: RePEc:gam:jagris:v:15:y:2025:i:6:p:588-:d:1609353
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2077-0472/15/6/588/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2077-0472/15/6/588/
    Download Restriction: no
    ---><---

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jagris:v:15:y:2025:i:6:p:588-:d:1609353. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.