IDEAS home Printed from https://ideas.repec.org/a/kap/transp/v48y2021i4d10.1007_s11116-020-10120-0.html
   My bibliography  Save this article

Probabilistic model for destination inference and travel pattern mining from smart card data

Author

Listed:
  • Zhanhong Cheng

    (McGill University
    Interuniversity Research Centre on Enterprise Networks, Logistics and Transportation (CIRRELT))

  • Martin Trépanier

    (Interuniversity Research Centre on Enterprise Networks, Logistics and Transportation (CIRRELT)
    Department of Mathematics and Industrial Engineering Polytechnique Montréal)

  • Lijun Sun

    (McGill University
    Interuniversity Research Centre on Enterprise Networks, Logistics and Transportation (CIRRELT))

Abstract

Inferring trip destination in smart card data with only tap-in control is an important application. Most existing methods estimate trip destinations based on the continuity of trip chains, while the destinations of isolated/unlinked trips cannot be properly handled. We address this problem with a probabilistic topic model. A three-dimensional latent dirichlet allocation model is developed to extract latent topics of departure time, origin, and destination among the population; each passenger’s travel behavior is characterized by a latent topic distribution defined on a three-dimensional simplex. Given the origin station and departure time, the most likely destination can be obtained by statistical inference. Furthermore, we propose to represent stations by their rank of visiting frequency, which transforms divergent spatial patterns into similar behavioral regularities. The proposed destination estimation framework is tested on Guangzhou Metro smart card data, in which the ground-truth is available. Compared with benchmark models, the topic model not only shows increased accuracy but also captures essential latent patterns in passengers’ travel behavior. The proposed topic model can be used to infer the destination of unlinked trips, analyze travel patterns, and passenger clustering.

Suggested Citation

  • Zhanhong Cheng & Martin Trépanier & Lijun Sun, 2021. "Probabilistic model for destination inference and travel pattern mining from smart card data," Transportation, Springer, vol. 48(4), pages 2035-2053, August.
  • Handle: RePEc:kap:transp:v:48:y:2021:i:4:d:10.1007_s11116-020-10120-0
    DOI: 10.1007/s11116-020-10120-0
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11116-020-10120-0
    File Function: Abstract
    Download Restriction: Access to full text is restricted to subscribers.

    File URL: https://libkey.io/10.1007/s11116-020-10120-0?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Sun, Lijun & Axhausen, Kay W., 2016. "Understanding urban mobility patterns with a probabilistic tensor factorization framework," Transportation Research Part B: Methodological, Elsevier, vol. 91(C), pages 511-524.
    2. Luca Pappalardo & Filippo Simini & Salvatore Rinzivillo & Dino Pedreschi & Fosca Giannotti & Albert-László Barabási, 2015. "Returners and explorers dichotomy in human mobility," Nature Communications, Nature, vol. 6(1), pages 1-8, November.
    3. Morency, Catherine & Trépanier, Martin & Agard, Bruno, 2007. "Measuring transit use variability with smart-card data," Transport Policy, Elsevier, vol. 14(3), pages 193-203, May.
    4. Marta C. González & César A. Hidalgo & Albert-László Barabási, 2009. "Understanding individual human mobility patterns," Nature, Nature, vol. 458(7235), pages 238-238, March.
    5. Behrang Assemi & Azalden Alsger & Mahboobeh Moghaddam & Mark Hickman & Mahmoud Mesbah, 2020. "Improving alighting stop inference accuracy in the trip chaining method using neural networks," Public Transport, Springer, vol. 12(1), pages 89-121, March.
    6. Bowen Du & Wenjun Zhou & Chuanren Liu & Yifeng Cui & Hui Xiong, 2019. "Transit Pattern Detection Using Tensor Factorization," INFORMS Journal on Computing, INFORMS, vol. 31(2), pages 193-206, April.
    7. Ma, Xiaolei & Liu, Congcong & Wen, Huimin & Wang, Yunpeng & Wu, Yao-Jan, 2017. "Understanding commuting patterns using transit smart card data," Journal of Transport Geography, Elsevier, vol. 58(C), pages 135-145.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Yang, Hongtai & Ping, An & Wei, Hongmin & Zhai, Guocong, 2023. "Unique in the metro system: The likelihood to re-identify a metro user with limited trajectory points," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 628(C).
    2. Chen, Ruoyu & Zhou, Jiangping, 2022. "Fare adjustment’s impacts on travel patterns and farebox revenue: An empirical study based on longitudinal smartcard data," Transportation Research Part A: Policy and Practice, Elsevier, vol. 164(C), pages 111-133.
    3. Cardell-Oliver, Rachel & Olaru, Doina, 2022. "CIAM: A data-driven approach for classifying long-term engagement of public transport riders at multiple temporal scales," Transportation Research Part A: Policy and Practice, Elsevier, vol. 165(C), pages 321-336.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Matteo Böhm & Mirco Nanni & Luca Pappalardo, 2022. "Gross polluters and vehicle emissions reduction," Nature Sustainability, Nature, vol. 5(8), pages 699-707, August.
    2. Elisa Frutos-Bernal & Ángel Martín del Rey & Irene Mariñas-Collado & María Teresa Santos-Martín, 2022. "An Analysis of Travel Patterns in Barcelona Metro Using Tucker3 Decomposition," Mathematics, MDPI, vol. 10(7), pages 1-17, March.
    3. Shi, Shuyang & Wang, Lin & Wang, Xiaofan, 2022. "Uncovering the spatiotemporal motif patterns in urban mobility networks by non-negative tensor decomposition," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 606(C).
    4. Hainan Huang & Yi Lin & Jiancheng Weng & Jian Rong & Xiaoming Liu, 2018. "Identification of Inelastic Subway Trips Based on Weekly Station Sequence Data: An Example from the Beijing Subway," Sustainability, MDPI, vol. 10(12), pages 1-15, December.
    5. Jie Huang & David Levinson & Jiaoe Wang & Haitao Jin, 2019. "Job-worker spatial dynamics in Beijing: Insights from Smart Card Data," Working Papers 2019-01, University of Minnesota: Nexus Research Group.
    6. Yang, Xiong & Zhuge, Chengxiang & Shao, Chunfu & Huang, Yuantan & Hayse Chiwing G. Tang, Justin & Sun, Mingdong & Wang, Pinxi & Wang, Shiqi, 2022. "Characterizing mobility patterns of private electric vehicle users with trajectory data," Applied Energy, Elsevier, vol. 321(C).
    7. Mofeng Yang & Yixuan Pan & Aref Darzi & Sepehr Ghader & Chenfeng Xiong & Lei Zhang, 2022. "A data-driven travel mode share estimation framework based on mobile device location data," Transportation, Springer, vol. 49(5), pages 1339-1383, October.
    8. Amaya, Margarita & Cruzat, Ramón & Munizaga, Marcela A., 2018. "Estimating the residence zone of frequent public transport users to make travel pattern and time use analysis," Journal of Transport Geography, Elsevier, vol. 66(C), pages 330-339.
    9. Fulman, Nir & Marinov, Maria & Benenson, Itzhak, 2023. "Investigating occasional travel patterns based on smartcard transactions," Transport Policy, Elsevier, vol. 141(C), pages 152-166.
    10. Jin, Meihan & Wang, Menghan & Gong, Yongxi & Liu, Yu, 2022. "Spatio-temporally constrained origin–destination inferring using public transit fare card data," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 603(C).
    11. Yanyan Chen & Zheng Zhang & Tianwen Liang, 2019. "Assessing Urban Travel Patterns: An Analysis of Traffic Analysis Zone-Based Mobility Patterns," Sustainability, MDPI, vol. 11(19), pages 1-15, October.
    12. Marie Urban & Robert Stewart & Scott Basford & Zachary Palmer & Jason Kaufman, 2023. "Estimating building occupancy: a machine learning system for day, night, and episodic events," Natural Hazards: Journal of the International Society for the Prevention and Mitigation of Natural Hazards, Springer;International Society for the Prevention and Mitigation of Natural Hazards, vol. 116(2), pages 2417-2436, March.
    13. Huang, Zhiren & Wang, Pu & Zhang, Fan & Gao, Jianxi & Schich, Maximilian, 2018. "A mobility network approach to identify and anticipate large crowd gatherings," Transportation Research Part B: Methodological, Elsevier, vol. 114(C), pages 147-170.
    14. Li, Heyang & Zeng, An, 2022. "Improving recommendation by connecting user behavior in temporal and topological dimensions," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 585(C).
    15. Jinjun Tang & Xiaolu Wang & Fang Zong & Zheng Hu, 2020. "Uncovering Spatio-temporal Travel Patterns Using a Tensor-based Model from Metro Smart Card Data in Shenzhen, China," Sustainability, MDPI, vol. 12(4), pages 1-16, February.
    16. Pieroni, Caio & Giannotti, Mariana & Alves, Bianca B. & Arbex, Renato, 2021. "Big data for big issues: Revealing travel patterns of low-income population based on smart card data mining in a global south unequal city," Journal of Transport Geography, Elsevier, vol. 96(C).
    17. Wang, Yihong & Correia, Gonçalo Homem de Almeida & de Romph, Erik & Timmermans, H.J.P., 2017. "Using metro smart card data to model location choice of after-work activities: An application to Shanghai," Journal of Transport Geography, Elsevier, vol. 63(C), pages 40-47.
    18. Dong, Bing & Liu, Yapan & Fontenot, Hannah & Ouf, Mohamed & Osman, Mohamed & Chong, Adrian & Qin, Shuxu & Salim, Flora & Xue, Hao & Yan, Da & Jin, Yuan & Han, Mengjie & Zhang, Xingxing & Azar, Elie & , 2021. "Occupant behavior modeling methods for resilient building design, operation and policy at urban scale: A review," Applied Energy, Elsevier, vol. 293(C).
    19. Xia Zhao & Mengying Cui & David Levinson, 2023. "Exploring temporal variability in travel patterns on public transit using big smart card data," Environment and Planning B, , vol. 50(1), pages 198-217, January.
    20. Clodomir Santana & Federico Botta & Hugo Barbosa & Filippo Privitera & Ronaldo Menezes & Riccardo Di Clemente, 2023. "COVID-19 is linked to changes in the time–space dimension of human mobility," Nature Human Behaviour, Nature, vol. 7(10), pages 1729-1739, October.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:kap:transp:v:48:y:2021:i:4:d:10.1007_s11116-020-10120-0. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.