IDEAS home Printed from https://ideas.repec.org/a/eee/phsmap/v647y2024ics0378437124002437.html
   My bibliography  Save this article

Cooperative traffic optimization with multi-agent reinforcement learning and evolutionary strategy: Bridging the gap between micro and macro traffic control

Author

Listed:
  • Feng, Jianshuai
  • Lin, Kaize
  • Shi, Tianyu
  • Wu, Yuankai
  • Wang, Yong
  • Zhang, Hailong
  • Tan, Huachun

Abstract

The emergence of connected and autonomous vehicles (CAVs) holds promise for fine-grained traffic control. However, due to the longevity of future mixed traffic scenarios, there is a need for an in-depth exploration of integrating the microscopic speed control of CAVs with the macroscopic variable speed limit (VSL) of human-driven vehicles (HDVs). This paper proposes a Cooperative Traffic Optimization with Multi-agent Reinforcement Learning and Evolutionary VSL (CTO-ME) framework, which combines microscopic CAV control with macroscopic VSL control. The framework incorporates a Graph Attention Mechanism (GATs) into the multi-agent reinforcement learning framework for intelligent decision-making by microscopic-level vehicles. Additionally, an evolutionary strategy is developed to design the VSL network architecture, enabling macroscopic level real-time speed limit adjustments based on infrastructure. A multi-objective reward function is proposed to optimize both micro and macro efficiency and safety, accounting for both vehicle behavior and traffic flow. Experiments on the designed Bottleneck traffic scenarios show that the proposed approach, CTO-ME, is able to achieve superior performance and outperforms other baselines in terms of traffic throughput, average speed, and safety. Specifically, CTO-ME enhances average velocity by 37%, increases overall throughput by 309%, and raises arrival ratio by 70% than traditional Intelligent Driver Model (IDM).

Suggested Citation

  • Feng, Jianshuai & Lin, Kaize & Shi, Tianyu & Wu, Yuankai & Wang, Yong & Zhang, Hailong & Tan, Huachun, 2024. "Cooperative traffic optimization with multi-agent reinforcement learning and evolutionary strategy: Bridging the gap between micro and macro traffic control," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 647(C).
  • Handle: RePEc:eee:phsmap:v:647:y:2024:i:c:s0378437124002437
    DOI: 10.1016/j.physa.2024.129734
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0378437124002437
    Download Restriction: Full text for ScienceDirect subscribers only. Journal offers the option of making the article available online on Science direct for a fee of $3,000

    File URL: https://libkey.io/10.1016/j.physa.2024.129734?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Tanzina Afrin & Nita Yodo, 2020. "A Survey of Road Traffic Congestion Measures towards a Sustainable and Resilient Transportation System," Sustainability, MDPI, vol. 12(11), pages 1-23, June.
    2. Ding, Heng & Zhang, Lang & Chen, Jin & Zheng, Xiaoyan & Pan, Hao & Zhang, Weihua, 2023. "MPC-based dynamic speed control of CAVs in multiple sections upstream of the bottleneck area within a mixed vehicular environment," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 613(C).
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Veronika Harantová & Ambróz Hájnik & Alica Kalašová & Tomasz Figlus, 2022. "The Effect of the COVID-19 Pandemic on Traffic Flow Characteristics, Emissions Production and Fuel Consumption at a Selected Intersection in Slovakia," Energies, MDPI, vol. 15(6), pages 1-21, March.
    2. Raffaele Mauro & Andrea Pompigna, 2022. "A Statistically Based Model for the Characterization of Vehicle Interactions and Vehicle Platoons Formation on Two-Lane Roads," Sustainability, MDPI, vol. 14(8), pages 1-22, April.
    3. Di, Yunran & Zhang, Weihua & Ding, Heng & Zheng, Xiaoyan & Ran, Bin, 2024. "Cooperative control of dynamic CAV dedicated lanes and vehicle active lane changing in expressway bottleneck areas," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 638(C).
    4. Surya Michrandi Nasution & Emir Husni & Kuspriyanto Kuspriyanto & Rahadian Yusuf & Bernardo Nugroho Yahya, 2021. "Contextual Route Recommendation System in Heterogeneous Traffic Flow," Sustainability, MDPI, vol. 13(23), pages 1-21, November.
    5. Megan M Bruwer & Simen J Andersen, 2023. "Exploiting COVID-19 related traffic changes to evaluate flow dependency of an FCD-defined congestion measure," Environment and Planning B, , vol. 50(8), pages 2220-2237, October.
    6. Wael Etaiwi & Sahar Idwan, 2025. "Traffic management systems: a survey of current solutions and emerging technologies," Journal of Computational Social Science, Springer, vol. 8(1), pages 1-24, February.
    7. Mariusz Kmiecik, 2022. "Logistics Coordination Based on Inventory Management and Transportation Planning by Third-Party Logistics (3PL)," Sustainability, MDPI, vol. 14(13), pages 1-19, July.
    8. He, Ziliang & Wang, Ling & Su, Zicheng & Ma, Wanjing, 2024. "Integrating variable speed limit and ramp metering to enhance vehicle group safety and efficiency in a mixed traffic environment," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 641(C).
    9. Joise, Topu & Goenka, Narsimha & Wangyel, Sangay & Shaturaev, Jakhongir, 2023. "Transforming Mobility Exploring the Impact and Challenges of Intelligent Transportation Systems in Asia," MPRA Paper 118994, University Library of Munich, Germany, revised 11 Sep 2023.
    10. José D. Padrón & David Soler & Carlos T. Calafate & Juan-Carlos Cano & Pietro Manzoni, 2022. "Improving Air Quality in Urban Recreational Areas through Smart Traffic Management," Sustainability, MDPI, vol. 14(6), pages 1-18, March.
    11. Toan, Trinh Dinh & Wong, Y.D., 2021. "Fuzzy logic-based methodology for quantification of traffic congestion," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 570(C).
    12. Sai Chand & Emily Moylan & S. Travis Waller & Vinayak Dixit, 2020. "Analysis of Vehicle Breakdown Frequency: A Case Study of New South Wales, Australia," Sustainability, MDPI, vol. 12(19), pages 1-14, October.
    13. Adel Mottahedi & Farhang Sereshki & Mohammad Ataei & Ali Nouri Qarahasanlou & Abbas Barabadi, 2021. "The Resilience of Critical Infrastructure Systems: A Systematic Literature Review," Energies, MDPI, vol. 14(6), pages 1-32, March.
    14. Le Zhang & Lijing Lyu & Shanshui Zheng & Li Ding & Lang Xu, 2022. "A Q-Learning-Based Approximate Solving Algorithm for Vehicular Route Game," Sustainability, MDPI, vol. 14(19), pages 1-14, September.
    15. Leo Tišljarić & Tonči Carić & Borna Abramović & Tomislav Fratrović, 2020. "Traffic State Estimation and Classification on Citywide Scale Using Speed Transition Matrices," Sustainability, MDPI, vol. 12(18), pages 1-16, September.
    16. N. P. Hariram & K. B. Mekha & Vipinraj Suganthan & K. Sudhakar, 2023. "Sustainalism: An Integrated Socio-Economic-Environmental Model to Address Sustainable Development and Sustainability," Sustainability, MDPI, vol. 15(13), pages 1-37, July.
    17. Hollbeck, Gabor B. & Pilarczyk, René & Wang, Shanshan & Schreckenberg, Michael & Guhr, Thomas, 2024. "Congestions and spectral transitions in time-lagged correlations of motorway traffic," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 649(C).
    18. Labiba Noshin Asha & Arup Dey & Nita Yodo & Lucy G. Aragon, 2022. "Optimization Approaches for Multiple Conflicting Objectives in Sustainable Green Supply Chain Management," Sustainability, MDPI, vol. 14(19), pages 1-24, October.
    19. Ghada Alturif & Wafaa Saleh, 2023. "Travel Demand Management in an Auto Dominated City: Can Travel Behaviour Be Nudged in the Kingdom of Saudi Arabia?," Sustainability, MDPI, vol. 15(11), pages 1-19, June.
    20. Suleiman Hassan Otuoze & Dexter V. L. Hunt & Ian Jefferson, 2021. "Neural Network Approach to Modelling Transport System Resilience for Major Cities: Case Studies of Lagos and Kano (Nigeria)," Sustainability, MDPI, vol. 13(3), pages 1-20, January.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:phsmap:v:647:y:2024:i:c:s0378437124002437. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.journals.elsevier.com/physica-a-statistical-mechpplications/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.