IDEAS home Printed from https://ideas.repec.org/a/gam/jftint/v15y2022i1p12-d1016347.html
   My bibliography  Save this article

Pedestrian Simulation with Reinforcement Learning: A Curriculum-Based Approach

Author

Listed:
  • Giuseppe Vizzari

    (Department of Informatics, Systems and Communication, University of Milano-Bicocca, Viale Sarca 336/14, 20126 Milano, Italy
    These authors contributed equally to this work.)

  • Thomas Cecconello

    (Department of Informatics, Systems and Communication, University of Milano-Bicocca, Viale Sarca 336/14, 20126 Milano, Italy
    These authors contributed equally to this work.)

Abstract

Pedestrian simulation is a consolidated but still lively area of research. State of the art models mostly take an agent-based perspective, in which pedestrian decisions are made according to a manually defined model. Reinforcement learning (RL), on the other hand, is used to train an agent situated in an environment how to act so as to maximize an accumulated numerical reward signal (a feedback provided by the environment to every chosen action). We explored the possibility of applying RL to pedestrian simulation. We carefully defined a reward function combining elements related to goal orientation, basic proxemics, and basic way-finding considerations. The proposed approach employs a particular training curriculum , a set of scenarios growing in difficulty supporting an incremental acquisition of general movement competences such as orientation, walking, and pedestrian interaction. The learned pedestrian behavioral model is applicable to situations not presented to the agents in the training phase, and seems therefore reasonably general. This paper describes the basic elements of the approach, the training procedure, and an experimentation within a software framework employing Unity and ML-Agents.

Suggested Citation

  • Giuseppe Vizzari & Thomas Cecconello, 2022. "Pedestrian Simulation with Reinforcement Learning: A Curriculum-Based Approach," Future Internet, MDPI, vol. 15(1), pages 1-25, December.
  • Handle: RePEc:gam:jftint:v:15:y:2022:i:1:p:12-:d:1016347
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/1999-5903/15/1/12/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1999-5903/15/1/12/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Stefania Bandini & Sara Manzoni & Giuseppe Vizzari, 2009. "Agent Based Modeling and Simulation: An Informatics Perspective," Journal of Artificial Societies and Social Simulation, Journal of Artificial Societies and Social Simulation, vol. 12(4), pages 1-4.
    2. Zhang, J. & Seyfried, A., 2014. "Comparison of intersecting pedestrian flows based on experiments," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 405(C), pages 316-325.
    3. Steffen, B. & Seyfried, A., 2010. "Methods for measuring pedestrian density, flow, speed and direction with minimal scatter," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 389(9), pages 1902-1910.
    4. Zhao, Xuedan & Xia, Long & Zhang, Jun & Song, Weiguo, 2020. "Artificial neural network based modeling on unidirectional and bidirectional pedestrian flow at straight corridors," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 547(C).
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Korbmacher, Raphael & Dang, Huu-Tu & Tordeux, Antoine, 2024. "Predicting pedestrian trajectories at different densities: A multi-criteria empirical analysis," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 634(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Flötteröd, Gunnar & Lämmel, Gregor, 2015. "Bidirectional pedestrian fundamental diagram," Transportation Research Part B: Methodological, Elsevier, vol. 71(C), pages 194-212.
    2. Seitz, Michael J. & Dietrich, Felix & Köster, Gerta, 2015. "The effect of stepping on pedestrian trajectories," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 421(C), pages 594-604.
    3. Habtamu Tkubet Ebuy & Hind Bril El Haouzi & Riad Benelmir & Remi Pannequin, 2023. "Occupant Behavior Impact on Building Sustainability Performance: A Literature Review," Sustainability, MDPI, vol. 15(3), pages 1-23, January.
    4. Cao, Shuchao & Lian, Liping & Chen, Mingyi & Yao, Ming & Song, Weiguo & Fang, Zhiming, 2018. "Investigation of difference of fundamental diagrams in pedestrian flow," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 506(C), pages 661-670.
    5. Tao, Y.Z. & Dong, L.Y., 2017. "A Cellular Automaton model for pedestrian counterflow with swapping," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 475(C), pages 155-168.
    6. Hu, Yanghui & Zhang, Jun & Song, Weiguo, 2019. "Experimental study on the movement strategies of individuals in multidirectional flows," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 534(C).
    7. Li, Tao & Shi, Dongdong & Chen, Juan & Li, Huiwen & Ma, Jian, 2022. "Experimental study of movement characteristics for different walking postures in a narrow channel," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 608(P2).
    8. von Krüchten, Cornelia & Schadschneider, Andreas, 2017. "Empirical study on social groups in pedestrian evacuation dynamics," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 475(C), pages 129-141.
    9. Nagahama, Akihito & Wada, Takahiro & Yanagisawa, Daichi & Nishinari, Katsuhiro, 2021. "Detection of leader–follower combinations frequently observed in mixed traffic with weak lane-discipline," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 570(C).
    10. Saberi, Meead & Aghabayk, Kayvan & Sobhani, Amir, 2015. "Spatial fluctuations of pedestrian velocities in bidirectional streams: Exploring the effects of self-organization," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 434(C), pages 120-128.
    11. Fu, Libi & Zhang, Ying & Qin, Huigui & Shi, Qingxin & Chen, Qiyi & Chen, Yunqian & Shi, Yongqian, 2023. "A modified social force model for studying nonlinear dynamics of pedestrian-e-bike mixed flow at a signalized crosswalk," Chaos, Solitons & Fractals, Elsevier, vol. 174(C).
    12. Wang, Jiayue & Boltes, Maik & Seyfried, Armin & Zhang, Jun & Ziemer, Verena & Weng, Wenguo, 2018. "Linking pedestrian flow characteristics with stepping locomotion," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 500(C), pages 106-120.
    13. Fu, Zhijian & Li, Tao & Deng, Qiangqiang & Schadschneider, Andreas & Luo, Lin & Ma, Jian, 2021. "Effect of turning curvature on the single-file dynamics of pedestrian flow: An experimental study," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 563(C).
    14. Tang, Tie-Qiao & Zhang, Bo-Tao & Zhang, Jian & Wang, Tao, 2019. "Statistical analysis and modeling of pedestrian flow in university canteen during peak period," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 521(C), pages 29-40.
    15. Qingyan Ning & Maosheng Li, 2022. "Modeling Pedestrian Detour Behavior By-Passing Conflict Areas," Sustainability, MDPI, vol. 14(24), pages 1-17, December.
    16. Crick, Florence & Jenkins, Katie & Surminski, Swenja, 2018. "Strengthening insurance partnerships in the face of climate change: insights from an agent-based model of flood insurance in the UK," LSE Research Online Documents on Economics 87669, London School of Economics and Political Science, LSE Library.
    17. Rangel-Galván, Maricruz & Ballinas-Hernández, Ana L. & Rangel-Galván, Violeta, 2024. "Thermo-inspired model of self-propelled hard disk agents for heterogeneous bidirectional pedestrian flow," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 635(C).
    18. Shang, Pan & Li, Ruimin & Guo, Jifu & Xian, Kai & Zhou, Xuesong, 2019. "Integrating Lagrangian and Eulerian observations for passenger flow state estimation in an urban rail transit network: A space-time-state hyper network-based assignment approach," Transportation Research Part B: Methodological, Elsevier, vol. 121(C), pages 135-167.
    19. Farhadi, Saber & Nikoo, Mohammad Reza & Rakhshandehroo, Gholam Reza & Akhbari, Masih & Alizadeh, Mohammad Reza, 2016. "An agent-based-nash modeling framework for sustainable groundwater management: A case study," Agricultural Water Management, Elsevier, vol. 177(C), pages 348-358.
    20. Cui, Geng & Yanagisawa, Daichi & Nishinari, Katsuhiro, 2023. "Learning from experimental data to simulate pedestrian dynamics," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 623(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jftint:v:15:y:2022:i:1:p:12-:d:1016347. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.