Author
Listed:
- Zhu Rui
(School of Computing and Artificial Intelligence, Southwest Jiaotong University, Chengdu 611756, P. R. China)
- Hu Jun
(School of Computing and Artificial Intelligence, Southwest Jiaotong University, Chengdu 611756, P. R. China†Key Laboratory of Interior Layout Optimization and Security, Institutions of Higher Education of Sichuan Province, Chengdu Normal University, Chengdu 611130, P. R. China)
- Fan Ling
(��Key Laboratory of Interior Layout Optimization and Security, Institutions of Higher Education of Sichuan Province, Chengdu Normal University, Chengdu 611130, P. R. China)
- Zhang Qi
(��School of Intelligent Manufacturing, Panzhihua University, Panzhihua 617000, P. R. China)
- Wei Juan
(��Key Laboratory of Interior Layout Optimization and Security, Institutions of Higher Education of Sichuan Province, Chengdu Normal University, Chengdu 611130, P. R. China§Key Laboratory of Multidimensional Data Sensing and Intelligent Information Processing of Dazhou Key Laboratory, Dazhou 635000, P. R. China¶Key Laboratories of Sensing and Application of Intelligent Optoelectronic System in Sichuan Provincial Universities, Dazhou 635000, P. R. China)
Abstract
This paper proposed a pedestrian evacuation model combined with reinforcement learning in order to study how to better guide pedestrians to complete evacuation in specific indoor scenes. This model introduced the way of establishing a scene in cellular automata and formulated reward rules according to the characteristics of the scene. It fitted the psychological activities of pedestrians in the actual evacuation process and trained the strategy of pedestrians at the overall level through the Q-learning algorithm from the reinforcement learning area. A speed control mechanism combined with real statistical data was introduced to simulate the speed attenuation. A simulation platform was built to compare the evacuation conditions under different scenarios and the different total numbers of pedestrians. The research showed that the model could automatically realize the exit selection function of pedestrians and part of conformity behavior. In the same evacuation scenario, this model could show adaptability for the different total numbers of pedestrians.
Suggested Citation
Zhu Rui & Hu Jun & Fan Ling & Zhang Qi & Wei Juan, 2025.
"Study on pedestrian evacuation model based on reinforcement learning,"
International Journal of Modern Physics C (IJMPC), World Scientific Publishing Co. Pte. Ltd., vol. 36(07), pages 1-18, July.
Handle:
RePEc:wsi:ijmpcx:v:36:y:2025:i:07:n:s0129183124502474
DOI: 10.1142/S0129183124502474
Download full text from publisher
As the access to this document is restricted, you may want to search for a different version of it.
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:wsi:ijmpcx:v:36:y:2025:i:07:n:s0129183124502474. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Tai Tone Lim (email available below). General contact details of provider: http://www.worldscinet.com/ijmpc/ijmpc.shtml .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.