Markov game for CV joint adaptive routing in stochastic traffic networks: A scalable learning approach

My bibliography Save this article

Markov game for CV joint adaptive routing in stochastic traffic networks: A scalable learning approach

Author

Listed:

Yang, Shan
Liu, Yang

Registered:

Abstract

This study proposes a learning-based approach to tackle the challenge of joint adaptive routing in stochastic traffic networks with Connected Vehicles (CVs). We introduce a Markov Routing Game (MRG) to model the adaptive routing behavior of all vehicles in such networks, thereby incorporating both competitive route choices and real-time decision-making. We establish the existence of the Nash policy (i.e., optimal joint adaptive routing policy) within the MRG that enables vehicles to adapt optimally to real-time traffic conditions online through efficient communication. To enhance scalability, we innovate with a homogeneity-based mean-field approximation method and, based on that, further develop the Homogeneity-based Mean-Field Deep Reinforcement Learning (HMF-DRL) algorithm to learn the Nash policy within the MRG. Through numerical experiments on the Nguyen–Dupuis network, we demonstrate our algorithm’s ability to efficiently converge and learn the joint adaptive routing policy that significantly enhances traffic network efficiency. Furthermore, our study provides insights into the effects of travel demand, penetration of CVs, and levels of uncertainty on the performance of the joint adaptive routing policy. This paper presents a significant step towards improving network efficiency and reducing the travel time for a majority of vehicles amid uncertain traffic conditions.

Suggested Citation

Yang, Shan & Liu, Yang, 2024. "Markov game for CV joint adaptive routing in stochastic traffic networks: A scalable learning approach," Transportation Research Part B: Methodological, Elsevier, vol. 189(C).

Handle: RePEc:eee:transb:v:189:y:2024:i:c:s0191261524001218
DOI: 10.1016/j.trb.2024.102997

Download full text from publisher

As the access to this document is restricted, you may want to search for a different version of it.

References listed on IDEAS

Xie, Jiaohong & Yang, Zhenyu & Lai, Xiongfei & Liu, Yang & Yang, Xiao Bo & Teng, Teck-Hou & Tham, Chen-Khong, 2022. "Deep reinforcement learning for dynamic incident-responsive traffic information dissemination," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 166(C).
Fu, Liping, 2001. "An adaptive routing algorithm for in-vehicle route guidance systems with real-time information," Transportation Research Part B: Methodological, Elsevier, vol. 35(8), pages 749-765, September.
Gao, Song & Chabini, Ismail, 2006. "Optimal routing policy problems in stochastic time-dependent networks," Transportation Research Part B: Methodological, Elsevier, vol. 40(2), pages 93-122, February.
Zhou, Bo & Song, Qiankun & Zhao, Zhenjiang & Liu, Tangzhi, 2020. "A reinforcement learning scheme for the equilibrium of the in-vehicle route choice problem based on congestion game," Applied Mathematics and Computation, Elsevier, vol. 371(C).
Zhao, Wenjing & Ma, Zhuanglin & Yang, Kui & Huang, Helai & Monsuur, Fredrik & Lee, Jaeyoung, 2020. "Impacts of variable message signs on en-route route choice behavior," Transportation Research Part A: Policy and Practice, Elsevier, vol. 139(C), pages 335-349.
Vincent Mak & Eyran J. Gisches & Amnon Rapoport, 2015. "Route vs. Segment: An Experiment on Real-Time Travel Information in Congestible Networks," Production and Operations Management, Production and Operations Management Society, vol. 24(6), pages 947-960, June.

Full references (including those not matched with items on IDEAS)

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Azadian, Farshid & Murat, Alper E. & Chinnam, Ratna Babu, 2012. "Dynamic routing of time-sensitive air cargo using real-time information," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 48(1), pages 355-372.
Manseur, Farida & Farhi, Nadir & Nguyen Van Phu, Cyril & Haj-Salem, Habib & Lebacque, Jean-Patrick, 2020. "Robust routing, its price, and the tradeoff between routing robustness and travel time reliability in road networks," European Journal of Operational Research, Elsevier, vol. 285(1), pages 159-171.
Chai, Huajun, 2019. "Dynamic Traffic Routing and Adaptive Signal Control in a Connected Vehicles Environment," Institute of Transportation Studies, Working Paper Series qt9ng3z8vn, Institute of Transportation Studies, UC Davis.
Yang, Lixing & Zhou, Xuesong, 2017. "Optimizing on-time arrival probability and percentile travel time for elementary path finding in time-dependent transportation networks: Linear mixed integer programming reformulations," Transportation Research Part B: Methodological, Elsevier, vol. 96(C), pages 68-91.
Liu, Siyuan & Qu, Qiang, 2016. "Dynamic collective routing using crowdsourcing data," Transportation Research Part B: Methodological, Elsevier, vol. 93(PA), pages 450-469.
Wu, Xing & (Marco) Nie, Yu, 2011. "Modeling heterogeneous risk-taking behavior in route choice: A stochastic dominance approach," Transportation Research Part A: Policy and Practice, Elsevier, vol. 45(9), pages 896-915, November.
Nie, Yu (Marco) & Wu, Xing, 2009. "Shortest path problem considering on-time arrival probability," Transportation Research Part B: Methodological, Elsevier, vol. 43(6), pages 597-613, July.
Yang, Lixing & Zhou, Xuesong, 2014. "Constraint reformulation and a Lagrangian relaxation-based solution algorithm for a least expected time path problem," Transportation Research Part B: Methodological, Elsevier, vol. 59(C), pages 22-44.
D'Acierno, Luca & Cartenì, Armando & Montella, Bruno, 2009. "Estimation of urban traffic conditions using an Automatic Vehicle Location (AVL) System," European Journal of Operational Research, Elsevier, vol. 196(2), pages 719-736, July.
Pretolani, Daniele & Nielsen, Lars Relund & Andersen, Kim Allan & Ehrgott, Matthias, 2008. "Time-adaptive versus history-adaptive strategies for multicriterion routing in stochastic time-dependent networks," CORAL Working Papers L-2008-05, University of Aarhus, Aarhus School of Business, Department of Business Studies.
Hoang, Nam H. & Vu, Hai L. & Lo, Hong K., 2018. "An informed user equilibrium dynamic traffic assignment problem in a multiple origin-destination stochastic network," Transportation Research Part B: Methodological, Elsevier, vol. 115(C), pages 207-230.
Xinran Li & Haoxuan Kan & Xuedong Hua & Wei Wang, 2020. "Simulation-Based Electric Vehicle Sustainable Routing with Time-Dependent Stochastic Information," Sustainability, MDPI, vol. 12(6), pages 1-16, March.
Pi, Xidong & Qian, Zhen (Sean), 2017. "A stochastic optimal control approach for real-time traffic routing considering demand uncertainties and travelers’ choice heterogeneity," Transportation Research Part B: Methodological, Elsevier, vol. 104(C), pages 710-732.
Wang, Qing & Zhao, Wenjing & Ma, Shoufeng & Schonfeld, Paul M. & Zheng, Yue & Xue, Dabin, 2023. "Effects of a price incentive policy on urban rail transit passengers: A case study in Nanjing, China," Transportation Research Part A: Policy and Practice, Elsevier, vol. 178(C).
Stephen Boyles & S. Waller, 2011. "Optimal Information Location for Adaptive Routing," Networks and Spatial Economics, Springer, vol. 11(2), pages 233-254, June.
Bi Chen & William Lam & Agachai Sumalee & Qingquan Li & Hu Shao & Zhixiang Fang, 2013. "Finding Reliable Shortest Paths in Road Networks Under Uncertainty," Networks and Spatial Economics, Springer, vol. 13(2), pages 123-148, June.
S. Waller & David Fajardo & Melissa Duell & Vinayak Dixit, 2013. "Linear Programming Formulation for Strategic Dynamic Traffic Assignment," Networks and Spatial Economics, Springer, vol. 13(4), pages 427-443, December.
Huang, He & Gao, Song, 2012. "Optimal paths in dynamic networks with dependent random link travel times," Transportation Research Part B: Methodological, Elsevier, vol. 46(5), pages 579-598.
Tan, Lihua & Li, Chuandong & Huang, Junjian & Huang, Tingwen, 2021. "Output feedback leader-following consensus for nonlinear stochastic multiagent systems: The event-triggered method," Applied Mathematics and Computation, Elsevier, vol. 395(C).
He Huang & Song Gao, 2018. "Trajectory-Adaptive Routing in Dynamic Networks with Dependent Random Link Travel Times," Transportation Science, INFORMS, vol. 52(1), pages 102-117, January.

More about this item

Keywords

Markov Routing Game; Connected vehicles; Joint adaptive routing; Mean-field multi-agent reinforcement learning; Stochastic traffic network;
All these keywords.

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:transb:v:189:y:2024:i:c:s0191261524001218. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/wps/find/journaldescription.cws_home/548/description#description .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Markov game for CV joint adaptive routing in stochastic traffic networks: A scalable learning approach

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Most related items

More about this item

Keywords

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data