
A reinforcement learning-based hyper-heuristic for AGV task assignment and route planning in parts-to-picker warehouses

Author

Listed:
  • Li, Kunpeng
  • Liu, Tengbo
  • Ram Kumar, P.N.
  • Han, Xuefang

Abstract

Globally, e-commerce warehouses have begun implementing robotic mobile fulfillment systems (RMFS), which can improve order-picking efficiency by using automated guided vehicles (AGVs) to carry out parts-to-picker operations. AGVs depart from their initial points, move to a target rack position, and then transport the racks to picking stations. After the workers have picked the required items, the AGVs return the racks to their original positions. When all tasks are completed, the AGVs return to their starting points. In this context, the main challenge is the task assignment and route planning of multiple AGVs to minimize travel times. We formulate a mixed-integer linear programming (MILP) model with valid inequalities to solve small problem instances optimally. We introduce a reinforcement learning (RL)-based hyper-heuristic (HH) framework to solve large instances to near-optimality. A typical HH framework comprises two levels: high-level heuristics (HLH) and low-level heuristics (LLH). The framework starts from an initial solution and improves it iteratively through LLHs, while the HLH invokes a selection strategy and an acceptance criterion to generate a new solution. We propose a novel selection strategy, Co-SLMAB, based on an improved Multi-Armed Bandits algorithm, and use Exponential Monte Carlo with counters (EMCQ) as the acceptance criterion. Collision avoidance rules are then formulated for the different types of conflict to construct conflict-free traveling routes for the AGVs. Besides testing the proposed framework’s effectiveness in real-life warehouse layouts, we perform extensive computational experiments and a thorough sensitivity analysis.
The results show that (i) the proposed valid inequalities aid in obtaining better lower bounds and significantly speed up the solution process; (ii) the Co-SLMAB-HH framework is quite competitive compared to CPLEX, outperforming the other tested hyper-heuristics and the problem-specific heuristic regarding convergence and computation time; and (iii) a pool of LLHs consisting of a wide range of different operators is advantageous over a limited set of simple operators while solving problems using hyper-heuristics.
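
The two-level loop described in the abstract (LLHs propose changes, the HLH selects an LLH and decides whether to accept the result) can be illustrated with a minimal sketch. This is not the authors' method: plain UCB1 stands in for Co-SLMAB, the acceptance rule is a simplified rule only loosely inspired by EMCQ, and picking stations and collision avoidance are left out. The grid instance, the Manhattan travel cost, and all names (`hyper_heuristic`, `swap_within`, `relocate`, `reverse_segment`) are assumptions made for illustration.

```python
import math
import random

def route_cost(route, depot, tasks):
    """Manhattan distance: depot -> each task's rack in order -> back to depot."""
    pos, total = depot, 0
    for t in route:
        rack = tasks[t]
        total += abs(pos[0] - rack[0]) + abs(pos[1] - rack[1])
        pos = rack
    return total + abs(pos[0] - depot[0]) + abs(pos[1] - depot[1])

def total_cost(solution, depots, tasks):
    return sum(route_cost(r, d, tasks) for r, d in zip(solution, depots))

# --- Low-level heuristics (hypothetical operators): each returns a modified copy ---
def swap_within(sol, rng):
    new = [r[:] for r in sol]
    candidates = [r for r in new if len(r) >= 2]
    if candidates:
        r = rng.choice(candidates)
        i, j = rng.sample(range(len(r)), 2)
        r[i], r[j] = r[j], r[i]
    return new

def relocate(sol, rng):
    new = [r[:] for r in sol]
    sources = [r for r in new if r]
    if sources and len(new) >= 2:
        src = rng.choice(sources)
        task = src.pop(rng.randrange(len(src)))
        dst = rng.choice([r for r in new if r is not src])
        dst.insert(rng.randint(0, len(dst)), task)
    return new

def reverse_segment(sol, rng):
    new = [r[:] for r in sol]
    candidates = [r for r in new if len(r) >= 2]
    if candidates:
        r = rng.choice(candidates)
        i, j = sorted(rng.sample(range(len(r)), 2))
        r[i:j + 1] = r[i:j + 1][::-1]
    return new

def hyper_heuristic(tasks, depots, iterations=2000, seed=0):
    rng = random.Random(seed)
    llhs = [swap_within, relocate, reverse_segment]
    n = len(depots)
    # Initial solution: deal task indices to AGVs round-robin.
    current = [list(range(len(tasks)))[k::n] for k in range(n)]
    cur_cost = total_cost(current, depots, tasks)
    best, best_cost = current, cur_cost
    counts, rewards = [0] * len(llhs), [0.0] * len(llhs)
    stall = 0  # consecutive iterations without an improving or escaping move
    for step in range(1, iterations + 1):
        # High level, selection: UCB1 over the LLH pool (stand-in for Co-SLMAB).
        untried = [k for k, c in enumerate(counts) if c == 0]
        if untried:
            k = untried[0]
        else:
            k = max(range(len(llhs)), key=lambda a: rewards[a] / counts[a]
                    + math.sqrt(2 * math.log(step) / counts[a]))
        candidate = llhs[k](current, rng)
        cand_cost = total_cost(candidate, depots, tasks)
        delta = cand_cost - cur_cost
        counts[k] += 1
        rewards[k] += 1.0 if delta < 0 else 0.0  # reward 1 for an improvement
        # High level, acceptance: simplified rule inspired by EMCQ (assumption).
        if delta < 0:
            accept, stall = True, 0
        elif delta == 0:
            accept, stall = True, stall + 1
        else:
            theta = delta * 10.0 * step / iterations  # stricter as the search ages
            tau = 1.0 + stall                         # looser after a long stall
            accept = rng.random() < math.exp(-theta / tau)
            stall = 0 if accept else stall + 1
        if accept:
            current, cur_cost = candidate, cand_cost
            if cur_cost < best_cost:
                best, best_cost = current, cur_cost
    return best, best_cost

if __name__ == "__main__":
    racks = [(1, 5), (4, 2), (7, 7), (2, 8), (9, 1), (5, 5), (3, 3), (8, 4)]
    routes, cost = hyper_heuristic(racks, depots=[(0, 0), (9, 9)])
    print(routes, cost)
```

Because the bandit only sees a reward signal per operator, new LLHs can be added to the `llhs` list without touching the selection or acceptance logic, which is the separation between levels that the abstract describes.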

Suggested Citation

  • Li, Kunpeng & Liu, Tengbo & Ram Kumar, P.N. & Han, Xuefang, 2024. "A reinforcement learning-based hyper-heuristic for AGV task assignment and route planning in parts-to-picker warehouses," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 185(C).
  • Handle: RePEc:eee:transe:v:185:y:2024:i:c:s1366554524001091
    DOI: 10.1016/j.tre.2024.103518

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S1366554524001091
    Download Restriction: Full text for ScienceDirect subscribers only



    Citations



    Cited by:

    1. Olusola O. Ajayi & Anish M. Kurien & Karim Djouani & Lamine Dieng, 2024. "4IR Applications in the Transport Industry: Systematic Review of the State of the Art with Respect to Data Collection and Processing Mechanisms," Sustainability, MDPI, vol. 16(17), pages 1-32, August.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Zhuang, Yanling & Zhou, Yun & Hassini, Elkafi & Yuan, Yufei & Hu, Xiangpei, 2024. "Improving order picking efficiency through storage assignment optimization in robotic mobile fulfillment systems," European Journal of Operational Research, Elsevier, vol. 316(2), pages 718-732.
    2. Zhuang, Yanling & Zhou, Yun & Hassini, Elkafi & Yuan, Yufei & Hu, Xiangpei, 2022. "Rack retrieval and repositioning optimization problem in robotic mobile fulfillment systems," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 167(C).
    3. Ding, Tianrong & Zhang, Yuankai & Wang, Zheng & Hu, Xiangpei, 2024. "Velocity-based rack storage location assignment for the unidirectional robotic mobile fulfillment system," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 186(C).
    4. Kumar, Suryakant & Sheu, Jiuh-Biing & Kundu, Tanmoy, 2023. "Planning a parts-to-picker order picking system with consideration of the impact of perceived workload," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 173(C).
    5. Jiuh‐Biing Sheu & Tsan‐Ming Choi, 2023. "Can we work more safely and healthily with robot partners? A human‐friendly robot–human‐coordinated order fulfillment scheme," Production and Operations Management, Production and Operations Management Society, vol. 32(3), pages 794-812, March.
    6. Chen, Ran & Yang, Jingjing & Yu, Yugang & Guo, Xiaolong, 2023. "Retrieval request scheduling in a shuttle-based storage and retrieval system with two lifts," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 174(C).
    7. Hu, Yue & Yang, Hongbing & Huang, Yi, 2022. "Conflict-free scheduling of large-scale multi-load AGVs in material transportation network," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 158(C).
    8. Justkowiak, Jan-Erik & Pesch, Erwin, 2023. "A column generation driven heuristic for order-scheduling and rack-sequencing in robotic mobile fulfillment systems," Omega, Elsevier, vol. 120(C).
    9. Liu, Weihua & George Shanthikumar, J. & Tae-Woo Lee, Paul & Li, Xiang & Zhou, Li, 2021. "Special issue editorial: Smart supply chains and intelligent logistics services," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 147(C).
    10. Chen, Wanying & Gong, Yeming & Chen, Qi & Wang, Hongwei, 2024. "Does battery management matter? Performance evaluation and operating policies in a self-climbing robotic warehouse," European Journal of Operational Research, Elsevier, vol. 312(1), pages 164-181.
    11. Jiang, Min & Huang, George Q., 2022. "Intralogistics synchronization in robotic forward-reserve warehouses for e-commerce last-mile delivery," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 158(C).
    12. Bektaş, Tolga, 2012. "Formulations and Benders decomposition algorithms for multidepot salesmen problems with load balancing," European Journal of Operational Research, Elsevier, vol. 216(1), pages 83-93.
    13. Haluk Yapicioglu, 2018. "Multiperiod Multi Traveling Salesmen Problem Considering Time Window Constraints with an Application to a Real World Case," Networks and Spatial Economics, Springer, vol. 18(4), pages 773-801, December.
    14. Muren, & Wu, Jianjun & Zhou, Li & Du, Zhiping & Lv, Ying, 2019. "Mixed steepest descent algorithm for the traveling salesman problem and application in air logistics," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 126(C), pages 87-102.
    15. Mohammed Al-Betar & Ahamad Khader & Iyad Doush, 2014. "Memetic techniques for examination timetabling," Annals of Operations Research, Springer, vol. 218(1), pages 23-50, July.
    16. João Reis, 2023. "Exploring Applications and Practical Examples by Streamlining Material Requirements Planning (MRP) with Python," Logistics, MDPI, vol. 7(4), pages 1-19, December.
    17. He, Zhiliang & Thürer, Matthias & Zhou, Wanling, 2024. "The use of reinforcement learning for material flow control: An assessment by simulation," International Journal of Production Economics, Elsevier, vol. 274(C).
    18. Alejandro Cataldo & Juan-Carlos Ferrer & Jaime Miranda & Pablo A. Rey & Antoine Sauré, 2017. "An integer programming approach to curriculum-based examination timetabling," Annals of Operations Research, Springer, vol. 258(2), pages 369-393, November.
    19. Tamás Kalmár-Nagy & Giovanni Giardini & Bendegúz Dezső Bak, 2017. "The Multiagent Planning Problem," Complexity, Hindawi, vol. 2017, pages 1-12, February.
    20. Li, Xiaowei & Hua, Guowei & Huang, Anqiang & Sheu, Jiuh-Biing & Cheng, T.C.E. & Huang, Fengquan, 2020. "Storage assignment policy with awareness of energy consumption in the Kiva mobile fulfilment system," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 144(C).
