Q-Sorting: An Algorithm for Reinforcement Learning Problems with Multiple Cumulative Constraints

My bibliography Save this article

Q-Sorting: An Algorithm for Reinforcement Learning Problems with Multiple Cumulative Constraints

Author

Listed:

Jianfeng Huang
(College of Engineering, Shantou University, Shantou 515063, China)
Guoqiang Lu
(College of Engineering, Shantou University, Shantou 515063, China)
Yi Li
(College of Engineering, Shantou University, Shantou 515063, China)
Jiajun Wu
(College of Engineering, Shantou University, Shantou 515063, China)

Registered:

Abstract

This paper proposes a method and an algorithm called Q-sorting for reinforcement learning (RL) problems with multiple cumulative constraints. The primary contribution is a mechanism for dynamically determining the focus of optimization among multiple cumulative constraints and the objective. Executed actions are picked through a procedure with two steps: first filter out actions potentially breaking the constraints, and second sort the remaining ones according to the Q values of the focus in descending order. The algorithm was originally developed upon the classic tabular value representation and episodic setting of RL, but the idea can be extended and applied to other methods with function approximation and discounted setting. Numerical experiments are carried out on the adapted Gridworld and the motor speed synchronization problem, both with one and two cumulative constraints. Simulation results validate the effectiveness of the proposed Q-sorting in that cumulative constraints are honored both during and after the learning process. The advantages of Q-sorting are further emphasized through comparison with the method of lumped performances (LP), which takes constraints into account through weighting parameters. Q-sorting outperforms LP in both ease of use (unnecessity of trial and error to determine values of the weighting parameters) and performance consistency (6.1920 vs. 54.2635 rad/s for the standard deviation of the cumulative performance index over 10 repeated simulation runs). It has great potential for practical engineering use.

Suggested Citation

Jianfeng Huang & Guoqiang Lu & Yi Li & Jiajun Wu, 2024. "Q-Sorting: An Algorithm for Reinforcement Learning Problems with Multiple Cumulative Constraints," Mathematics, MDPI, vol. 12(13), pages 1-20, June.

Handle: RePEc:gam:jmathe:v:12:y:2024:i:13:p:2001-:d:1424494

Download full text from publisher

More about this item

Keywords

reinforcement learning; cumulative constraint; constrained Markov decision process (CMDP);
All these keywords.

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:12:y:2024:i:13:p:2001-:d:1424494. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

We have no bibliographic references for this item. You can help adding them by using this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Q-Sorting: An Algorithm for Reinforcement Learning Problems with Multiple Cumulative Constraints

Author

Abstract

Suggested Citation

Download full text from publisher

More about this item

Keywords

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data