
Providing real-time en-route suggestions to CAVs for congestion mitigation: A two-way deep reinforcement learning approach

Author

Listed:
  • Ma, Xiaoyu
  • He, Xiaozheng

Abstract

This research investigates the effectiveness of information provision for congestion reduction in Connected Autonomous Vehicle (CAV) systems. The inherent advantages of CAVs, such as vehicle-to-everything communication, advanced vehicle autonomy, and reduced human involvement, make them conducive to achieving Correlated Equilibrium (CE). Leveraging these advantages, this research proposes a reinforcement learning framework involving CAVs and an information provider, where CAVs conduct real-time learning to minimize their individual travel time, while the information provider offers real-time route suggestions aiming to minimize the system’s total travel time. The en-route routing problem of the CAVs is formulated as a Markov game and the information provision problem is formulated as a single-agent Markov decision process. Then, this research develops a customized two-way deep reinforcement learning approach to solve the interrelated problems, accounting for their unique characteristics. Moreover, CE has been formulated within the proposed framework. Theoretical analysis rigorously proves the realization of CE and that the proposed framework can effectively mitigate congestion without compromising individual user optimality. Numerical results demonstrate the effectiveness of this approach. Our research contributes to the advancement of congestion reduction strategies in CAV systems with the mitigation of the conflict between system-level and individual-level goals using CE as a theoretical foundation. The results highlight the potential of information provision in fostering coordination and correlation among CAVs, thereby enhancing traffic efficiency and achieving system-level goals in smart transportation.
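To make the Correlated Equilibrium idea in the abstract concrete, the following is a minimal sketch, not the paper's formulation. It assumes a hypothetical toy setting of two vehicles and two parallel routes with linear congestion costs, and checks the standard CE condition: no vehicle can lower its expected travel time by deviating from the route it was recommended. All names and numbers (`FREE_FLOW`, `SLOPE`, the three distributions) are illustrative assumptions.

```python
from itertools import product

# Hypothetical toy network: two parallel routes with linear congestion.
ROUTES = ("A", "B")
FREE_FLOW = {"A": 10.0, "B": 10.0}  # assumed free-flow travel times
SLOPE = {"A": 5.0, "B": 5.0}        # assumed extra time per vehicle on a route


def travel_time(route, joint):
    """Travel time on `route` given the joint route choice of all vehicles."""
    load = sum(1 for r in joint if r == route)
    return FREE_FLOW[route] + SLOPE[route] * load


def cost(player, joint):
    """Travel time experienced by `player` under the joint choice."""
    return travel_time(joint[player], joint)


def is_correlated_equilibrium(dist, n_players=2, tol=1e-9):
    """Check the CE condition for a distribution over joint recommendations.

    `dist` maps joint route tuples to probabilities. For every player and
    every (recommended, deviation) pair, deviating must not reduce the
    player's expected travel time conditional on the recommendation.
    """
    for player in range(n_players):
        for rec, dev in product(ROUTES, ROUTES):
            if rec == dev:
                continue
            gain = 0.0  # expected time saved by switching from rec to dev
            for joint, p in dist.items():
                if joint[player] != rec:
                    continue
                deviated = joint[:player] + (dev,) + joint[player + 1:]
                gain += p * (cost(player, joint) - cost(player, deviated))
            if gain > tol:
                return False
    return True


def expected_total_time(dist):
    """Expected system travel time (sum over all vehicles)."""
    return sum(
        p * sum(cost(i, joint) for i in range(len(joint)))
        for joint, p in dist.items()
    )


# Correlated recommendations: the provider always splits the two vehicles.
split = {("A", "B"): 0.5, ("B", "A"): 0.5}
# Independent 50/50 mixing by each vehicle (no coordination).
independent = {joint: 0.25 for joint in product(ROUTES, repeat=2)}
# Both vehicles always sent to route A.
crowd = {("A", "A"): 1.0}

if __name__ == "__main__":
    for name, dist in [("split", split), ("independent", independent), ("crowd", crowd)]:
        print(name, is_correlated_equilibrium(dist), expected_total_time(dist))
```

In this toy example, `split` and `independent` both satisfy the CE condition, but the coordinated `split` recommendation gives an expected total travel time of 30.0 against 35.0 for independent mixing, while `crowd` fails the check because either vehicle would gain by switching. This is only meant to illustrate why correlated route suggestions can reduce system travel time without asking any vehicle to act against its own interest.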

Suggested Citation

  • Ma, Xiaoyu & He, Xiaozheng, 2024. "Providing real-time en-route suggestions to CAVs for congestion mitigation: A two-way deep reinforcement learning approach," Transportation Research Part B: Methodological, Elsevier, vol. 189(C).
  • Handle: RePEc:eee:transb:v:189:y:2024:i:c:s0191261524001383
    DOI: 10.1016/j.trb.2024.103014

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0191261524001383
    Download Restriction: Full text for ScienceDirect subscribers only


    References listed on IDEAS

    1. Aumann, Robert J, 1987. "Correlated Equilibrium as an Expression of Bayesian Rationality," Econometrica, Econometric Society, vol. 55(1), pages 1-18, January.
    2. Liu, Yixuan & Whinston, Andrew B., 2019. "Efficient real-time routing for autonomous vehicles through Bayes correlated equilibrium: An information design framework," Information Economics and Policy, Elsevier, vol. 47(C), pages 14-26.
    3. Wang, Chaojie & Peeta, Srinivas & Wang, Jian, 2021. "Incentive-based decentralized routing for connected and autonomous vehicles using information propagation," Transportation Research Part B: Methodological, Elsevier, vol. 149(C), pages 138-161.
    4. Du, Lili & Han, Lanshan & Chen, Shuwei, 2015. "Coordinated online in-vehicle routing balancing user optimality and system optimality through information perturbation," Transportation Research Part B: Methodological, Elsevier, vol. 79(C), pages 121-133.
    5. Liang Wang & Lei Zhao & Xiaojian Hu & Xinyong Zhao & Huan Wang, 2023. "A Reliability-Based Traffic Equilibrium Model with Boundedly Rational Travelers Considering Acceptable Arrival Thresholds," Sustainability, MDPI, vol. 15(8), pages 1-19, April.
    6. Zhou, Bo & Song, Qiankun & Zhao, Zhenjiang & Liu, Tangzhi, 2020. "A reinforcement learning scheme for the equilibrium of the in-vehicle route choice problem based on congestion game," Applied Mathematics and Computation, Elsevier, vol. 371(C).
    7. Ning, Yuqiang & Du, Lili, 2023. "Robust and resilient equilibrium routing mechanism for traffic congestion mitigation built upon correlated equilibrium and distributed optimization," Transportation Research Part B: Methodological, Elsevier, vol. 168(C), pages 170-205.
