Author
Listed:
- Zhenfang Ma
(Macau Institute of System Engineering, Macau University of Science and Technology, Avenida Wai Long, Taipa 999078, Macau)
- Kaizhou Gao
(Macau Institute of System Engineering, Macau University of Science and Technology, Avenida Wai Long, Taipa 999078, Macau)
- Hui Yu
(Macau Institute of System Engineering, Macau University of Science and Technology, Avenida Wai Long, Taipa 999078, Macau)
- Naiqi Wu
(Macau Institute of System Engineering, Macau University of Science and Technology, Avenida Wai Long, Taipa 999078, Macau)
Abstract
This study focuses on the scheduling problem of heterogeneous unmanned surface vehicles (USVs) with obstacle avoidance pretreatment. The goal is to minimize the overall maximum completion time of USVs. First, we develop a mathematical model for the problem. Second, with obstacles, an A* algorithm is employed to generate a path between two points where tasks need to be performed. Third, three meta-heuristics, i.e., simulated annealing (SA), genetic algorithm (GA), and harmony search (HS), are employed and improved to solve the problems. Based on problem-specific knowledge, nine local search operators are designed to improve the performance of the proposed algorithms. In each iteration, three Q-learning strategies are used to select high-quality local search operators. We aim to improve the performance of meta-heuristics by using Q-learning-based local search operators. Finally, 13 instances with different scales are adopted to validate the effectiveness of the proposed strategies. We compare with the classical meta-heuristics and the existing meta-heuristics. The proposed meta-heuristics with Q-learning are overall better than the compared ones. The results and comparisons show that HS with the second Q-learning, HS + QL2, exhibits the strongest competitiveness (the smallest mean rank value 1.00) among 15 algorithms.
Suggested Citation
Zhenfang Ma & Kaizhou Gao & Hui Yu & Naiqi Wu, 2024.
"Solving Heterogeneous USV Scheduling Problems by Problem-Specific Knowledge Based Meta-Heuristics with Q-Learning,"
Mathematics, MDPI, vol. 12(2), pages 1-23, January.
Handle:
RePEc:gam:jmathe:v:12:y:2024:i:2:p:339-:d:1322843
Download full text from publisher
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:12:y:2024:i:2:p:339-:d:1322843. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.