A reinforcement learning-based hyper-heuristic for AGV task assignment and route planning in parts-to-picker warehouses

My bibliography Save this article

A reinforcement learning-based hyper-heuristic for AGV task assignment and route planning in parts-to-picker warehouses

Author

Listed:

Li, Kunpeng
Liu, Tengbo
Ram Kumar, P.N.
Han, Xuefang

Registered:

Abstract

Globally, e-commerce warehouses have begun implementing robotic mobile fulfillment systems (RMFS), which can improve order-picking efficiency by using automated guided vehicles (AGVs) to realize operations from parts to pickers. AGVs depart from their initial points, move to a target rack position, and subsequently transport racks to picking stations. The AGVs return the racks to their original positions after the workers pick them up. When all tasks are completed, the AGVs return to their starting point. In this context, the main challenge is the task assignment and route planning of multiple AGVs to minimize travel times. We formulate a mixed-integer linear programming (MILP) model with valid inequalities to solve small problem instances optimally. We introduce a reinforcement learning (RL)-based hyper-heuristic (HH) framework to solve large instances to near-optimality. A typical HH framework comprises two levels: high-level heuristics (HLH) and low-level heuristics (LLH). The framework starts from an initial solution and improves iteratively through LLHs, while the HLH invokes a selection strategy and an acceptance criterion to generate a new solution. We propose a novel selection strategy based on the improved Multi-Armed Bandits algorithm called Co-SLMAB and Exponential Monte Carlo with counters (EMCQ) as the acceptance criterion. The corresponding collision avoidance rules are then formulated for different conflicts to construct a conflict-free traveling route for AGVs. Besides testing the proposed framework’s effectiveness in real-life warehouse layouts, we perform extensive computational experiments and a thorough sensitivity analysis. The results show that (i) the proposed valid inequalities aid in obtaining better lower bounds and significantly speed up the solution process; (ii) the Co-SLMAB-HH framework is quite competitive compared to CPLEX, outperforming the other tested hyper-heuristics and the problem-specific heuristic regarding convergence and computation time; and (iii) a pool of LLHs consisting of a wide range of different operators is advantageous over a limited set of simple operators while solving problems using hyper-heuristics.

Suggested Citation

Li, Kunpeng & Liu, Tengbo & Ram Kumar, P.N. & Han, Xuefang, 2024. "A reinforcement learning-based hyper-heuristic for AGV task assignment and route planning in parts-to-picker warehouses," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 185(C).

Handle: RePEc:eee:transe:v:185:y:2024:i:c:s1366554524001091
DOI: 10.1016/j.tre.2024.103518

Download full text from publisher

As the access to this document is restricted, you may want to search for a different version of it.

References listed on IDEAS

Hongtao Hu & Xurui Yang & Shichang Xiao & Feiyang Wang, 2023. "Anti-conflict AGV path planning in automated container terminals based on multi-agent reinforcement learning," International Journal of Production Research, Taylor & Francis Journals, vol. 61(1), pages 65-80, January.
Xing, Zheng & Liu, Haitao & Wang, Tingsong & Chew, Ek Peng & Lee, Loo Hay & Tan, Kok Choon, 2023. "Integrated automated guided vehicle dispatching and equipment scheduling with speed optimization," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 169(C).
Gharehgozli, Amir & Zaerpour, Nima, 2020. "Robot scheduling for pod retrieval in a robotic mobile fulfillment system," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 142(C).
Ana Esteso & David Peidro & Josefa Mula & Manuel Díaz-Madroñero, 2023. "Reinforcement learning applied to production planning and control," International Journal of Production Research, Taylor & Francis Journals, vol. 61(16), pages 5772-5789, August.
Hengle Qin & Jun Xiao & Dongdong Ge & Linwei Xin & Jianjun Gao & Simai He & Haodong Hu & John Gunnar Carlsson, 2022. "JD.com: Operations Research Algorithms Drive Intelligent Warehouse Robots to Work," Interfaces, INFORMS, vol. 52(1), pages 42-55, January.
Binghai Zhou & Zhaoxu He, 2023. "A novel hybrid-load AGV for JIT-based sustainable material handling scheduling with time window in mixed-model assembly line," International Journal of Production Research, Taylor & Francis Journals, vol. 61(3), pages 796-817, February.
Burger, M. & Su, Z. & De Schutter, B., 2018. "A node current-based 2-index formulation for the fixed-destination multi-depot travelling salesman problem," European Journal of Operational Research, Elsevier, vol. 265(2), pages 463-477.
Boccia, Maurizio & Masone, Adriano & Sterle, Claudio & Murino, Teresa, 2023. "The parallel AGV scheduling problem with battery constraints: A new formulation and a matheuristic approach," European Journal of Operational Research, Elsevier, vol. 307(2), pages 590-603.
Li, Xiaowei & Hua, Guowei & Huang, Anqiang & Sheu, Jiuh-Biing & Cheng, T.C.E. & Huang, Fengquan, 2020. "Storage assignment policy with awareness of energy consumption in the Kiva mobile fulfilment system," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 144(C).
Peng Yang & Guang Jin & Guofang Duan, 2022. "Modelling and analysis for multi-deep compact robotic mobile fulfilment system," International Journal of Production Research, Taylor & Francis Journals, vol. 60(15), pages 4727-4742, August.
Kara, Imdat & Bektas, Tolga, 2006. "Integer linear programming formulations of multiple salesman problems and its variations," European Journal of Operational Research, Elsevier, vol. 174(3), pages 1449-1458, November.
Zhuang, Yanling & Zhou, Yun & Hassini, Elkafi & Yuan, Yufei & Hu, Xiangpei, 2022. "Rack retrieval and repositioning optimization problem in robotic mobile fulfillment systems," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 167(C).
Jiang, Min & Leung, K.H. & Lyu, Zhongyuan & Huang, George Q., 2020. "Picking-replenishment synchronization for robotic forward-reserve warehouses," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 144(C).
Benjamin Rolf & Ilya Jackson & Marcel Müller & Sebastian Lang & Tobias Reggelin & Dmitry Ivanov, 2023. "A review on reinforcement learning algorithms and applications in supply chain management," International Journal of Production Research, Taylor & Francis Journals, vol. 61(20), pages 7151-7179, October.
Edmund Burke & Graham Kendall & Mustafa Mısır & Ender Özcan, 2012. "Monte Carlo hyper-heuristics for examination timetabling," Annals of Operations Research, Springer, vol. 196(1), pages 73-90, July.

Full references (including those not matched with items on IDEAS)

Citations

Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.

Cited by:

Olusola O. Ajayi & Anish M. Kurien & Karim Djouani & Lamine Dieng, 2024. "4IR Applications in the Transport Industry: Systematic Review of the State of the Art with Respect to Data Collection and Processing Mechanisms," Sustainability, MDPI, vol. 16(17), pages 1-32, August.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Zhuang, Yanling & Zhou, Yun & Hassini, Elkafi & Yuan, Yufei & Hu, Xiangpei, 2024. "Improving order picking efficiency through storage assignment optimization in robotic mobile fulfillment systems," European Journal of Operational Research, Elsevier, vol. 316(2), pages 718-732.
Kumar, Suryakant & Sheu, Jiuh-Biing & Kundu, Tanmoy, 2023. "Planning a parts-to-picker order picking system with consideration of the impact of perceived workload," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 173(C).
Zhuang, Yanling & Zhou, Yun & Hassini, Elkafi & Yuan, Yufei & Hu, Xiangpei, 2022. "Rack retrieval and repositioning optimization problem in robotic mobile fulfillment systems," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 167(C).
Ding, Tianrong & Zhang, Yuankai & Wang, Zheng & Hu, Xiangpei, 2024. "Velocity-based rack storage location assignment for the unidirectional robotic mobile fulfillment system," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 186(C).
Jiuh‐Biing Sheu & Tsan‐Ming Choi, 2023. "Can we work more safely and healthily with robot partners? A human‐friendly robot–human‐coordinated order fulfillment scheme," Production and Operations Management, Production and Operations Management Society, vol. 32(3), pages 794-812, March.
Justkowiak, Jan-Erik & Pesch, Erwin, 2023. "A column generation driven heuristic for order-scheduling and rack-sequencing in robotic mobile fulfillment systems," Omega, Elsevier, vol. 120(C).
Chen, Wanying & Wu, Peng & Gong, Yeming & Zhang, Zhengmin & Wang, Kun, 2025. "The role of energy consumption in robotic mobile fulfillment systems: Performance evaluation and operating policies with dynamic priority," Omega, Elsevier, vol. 130(C).
Chen, Ran & Yang, Jingjing & Yu, Yugang & Guo, Xiaolong, 2023. "Retrieval request scheduling in a shuttle-based storage and retrieval system with two lifts," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 174(C).
Hu, Yue & Yang, Hongbing & Huang, Yi, 2022. "Conflict-free scheduling of large-scale multi-load AGVs in material transportation network," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 158(C).
Vincenzo Varriale & Antonello Cammarano & Francesca Michelino & Mauro Caputo, 2025. "Critical analysis of the impact of artificial intelligence integration with cutting-edge technologies for production systems," Journal of Intelligent Manufacturing, Springer, vol. 36(1), pages 61-93, January.
Chen, Wanying & Gong, Yeming & Chen, Qi & Wang, Hongwei, 2024. "Does battery management matter? Performance evaluation and operating policies in a self-climbing robotic warehouse," European Journal of Operational Research, Elsevier, vol. 312(1), pages 164-181.
Jiang, Min & Huang, George Q., 2022. "Intralogistics synchronization in robotic forward-reserve warehouses for e-commerce last-mile delivery," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 158(C).
Bektaş, Tolga, 2012. "Formulations and Benders decomposition algorithms for multidepot salesmen problems with load balancing," European Journal of Operational Research, Elsevier, vol. 216(1), pages 83-93.
Mohammed Al-Betar & Ahamad Khader & Iyad Doush, 2014. "Memetic techniques for examination timetabling," Annals of Operations Research, Springer, vol. 218(1), pages 23-50, July.
Li, Xiaowei & Hua, Guowei & Huang, Anqiang & Sheu, Jiuh-Biing & Cheng, T.C.E. & Huang, Fengquan, 2020. "Storage assignment policy with awareness of energy consumption in the Kiva mobile fulfilment system," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 144(C).
Lu Zhen & Jingwen Wu & Haolin Li & Zheyi Tan & Yingying Yuan, 2023. "Scheduling multiple types of equipment in an automated warehouse," Annals of Operations Research, Springer, vol. 322(2), pages 1119-1141, March.
Sun, Yige & Chung, Sai-Ho & Wen, Xin & Ma, Hoi-Lam, 2021. "Novel robotic job-shop scheduling models with deadlock and robot movement considerations," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 149(C).
Cao, Jianing & Han, Yuhang & Pan, Nan & Zhang, Jingcheng & Yang, Junwei, 2024. "A data-driven approach to urban charging facility expansion based on bi-level optimization: A case study in a Chinese city," Energy, Elsevier, vol. 300(C).
Enrique Benavent & Antonio Martínez, 2013. "Multi-depot Multiple TSP: a polyhedral study and computational results," Annals of Operations Research, Springer, vol. 207(1), pages 7-25, August.
Lam, H.Y. & Ho, G.T.S. & Mo, Daniel Y. & Tang, Valerie, 2023. "Responsive pick face replenishment strategy for stock allocation to fulfil e-commerce order," International Journal of Production Economics, Elsevier, vol. 264(C).

More about this item

Keywords

Parts-to-picker picking system; Automated Guided Vehicles; Task scheduling; Reinforcement learning; Hyper-heuristic;
All these keywords.

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:transe:v:185:y:2024:i:c:s1366554524001091. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/wps/find/journaldescription.cws_home/600244/description#description .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

A reinforcement learning-based hyper-heuristic for AGV task assignment and route planning in parts-to-picker warehouses

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Citations

Most related items

More about this item

Keywords

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data