Printed from https://ideas.repec.org/a/eee/phsmap/v628y2023ics0378437123007318.html

Unique in the metro system: The likelihood to re-identify a metro user with limited trajectory points

Author

Listed:
  • Yang, Hongtai
  • Ping, An
  • Wei, Hongmin
  • Zhai, Guocong

Abstract

Although collecting metro smart card data can help improve metro operations, releasing such data may raise privacy issues. Few studies have quantified the probability of re-identifying a user from smart card data using only a few trajectory points. This study investigates the question by analyzing eight days of metro smart card data from Chengdu, China. At the macro level, three random trajectory points with a temporal resolution of one minute or one hour are enough to identify over 90% and 67% of users, respectively. Even when the resolution is reduced to one day, 20% of users can still be identified from three points. At the individual level, three carefully selected points with a temporal resolution of one minute, one hour, or one day lead to a re-identification risk of at least 0.5 for 99%, 89%, and 52% of users, respectively. The effects of the number of points, the number of users, and other temporal resolutions are also evaluated. These findings highlight the serious privacy risks involved in releasing metro smart card data and call on metro operators to take proactive measures to strengthen privacy protection.
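
The macro-level measure summarized above (the share of users singled out by a few random trajectory points at a given temporal resolution) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the data layout (a dict mapping a user id to a list of `(station, timestamp_seconds)` pairs), the function names `coarsen` and `unicity`, and the toy data are all assumptions made for illustration.

```python
import random
from collections import defaultdict


def coarsen(point, resolution_s):
    """Map (station, timestamp in seconds) to (station, time bin index)."""
    station, ts = point
    return (station, ts // resolution_s)


def unicity(trajectories, n_points, resolution_s, seed=0):
    """Fraction of users uniquely matched by n_points random points.

    trajectories: dict user_id -> list of (station, timestamp_seconds).
    Users with fewer than n_points distinct coarsened points are skipped.
    """
    rng = random.Random(seed)
    # Coarsen every trajectory to the requested temporal resolution.
    binned = {
        user: {coarsen(p, resolution_s) for p in points}
        for user, points in trajectories.items()
    }
    # Inverted index: coarsened point -> users whose trajectory contains it.
    index = defaultdict(set)
    for user, points in binned.items():
        for p in points:
            index[p].add(user)

    unique = 0
    eligible = 0
    for user, points in binned.items():
        if len(points) < n_points:
            continue
        eligible += 1
        # Draw the "known" points at random from this user's trajectory.
        sample = rng.sample(sorted(points), n_points)
        # Users consistent with all known points.
        candidates = set.intersection(*(index[p] for p in sample))
        if candidates == {user}:
            unique += 1
    return unique / eligible if eligible else 0.0


# Made-up toy data (hypothetical stations and timestamps, not from the paper).
toy = {
    "a": [("S1", 0), ("S2", 3600)],
    "b": [("S1", 30), ("S3", 3600)],
    "c": [("S4", 100), ("S5", 7200)],
    "d": [("S1", 50), ("S2", 3700)],
}
print(unicity(toy, n_points=2, resolution_s=60))    # one-minute bins
print(unicity(toy, n_points=2, resolution_s=3600))  # one-hour bins
```

In the toy data, users "a" and "d" differ only at the one-minute scale, so coarsening to one-hour bins makes them indistinguishable and lowers the computed fraction. This mirrors the direction of the effect reported in the abstract, not its magnitudes.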

Suggested Citation

  • Yang, Hongtai & Ping, An & Wei, Hongmin & Zhai, Guocong, 2023. "Unique in the metro system: The likelihood to re-identify a metro user with limited trajectory points," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 628(C).
  • Handle: RePEc:eee:phsmap:v:628:y:2023:i:c:s0378437123007318
    DOI: 10.1016/j.physa.2023.129176

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0378437123007318
    Download Restriction: Full text for ScienceDirect subscribers only. Journal offers the option of making the article available online on ScienceDirect for a fee of $3,000.

    File URL: https://libkey.io/10.1016/j.physa.2023.129176?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item


    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Sun, Li & Zhao, Juanjuan & Zhang, Jun & Zhang, Fan & Ye, Kejiang & Xu, Chengzhong, 2024. "Activity-based individual travel regularity exploring with entropy-space K-means clustering using smart card data," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 636(C).
    2. Huang, Kang & Wu, Jianjun & Sun, Huijun & Yang, Xin & Gao, Ziyou & Feng, Xujie, 2022. "Timetable synchronization optimization in a subway–bus network," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 608(P1).
    3. Shi, Shuyang & Wang, Lin & Wang, Xiaofan, 2022. "Uncovering the spatiotemporal motif patterns in urban mobility networks by non-negative tensor decomposition," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 606(C).
    4. Hui Zhang & Yu Cui & Jianmin Jia, 2024. "Mining Multimodal Travel Mobilities with Big Ridership Data: Comparative Analysis of Subways and Taxis," Sustainability, MDPI, vol. 16(10), pages 1-17, May.
    5. Liao, Cong & Scheuer, Bronte, 2022. "Evaluating the performance of transit-oriented development in Beijing metro station areas: Integrating morphology and demand into the node-place model," Journal of Transport Geography, Elsevier, vol. 100(C).
    6. Chen, Ruoyu & Zhou, Jiangping, 2022. "Fare adjustment’s impacts on travel patterns and farebox revenue: An empirical study based on longitudinal smartcard data," Transportation Research Part A: Policy and Practice, Elsevier, vol. 164(C), pages 111-133.
    7. Liping Ge & Malek Sarhani & Stefan Voß & Lin Xie, 2021. "Review of Transit Data Sources: Potentials, Challenges and Complementarity," Sustainability, MDPI, vol. 13(20), pages 1-37, October.
    8. Yuxin Huang & Jingdao Fan & Zhenguo Yan & Shugang Li & Yanping Wang, 2021. "Research on Early Warning for Gas Risks at a Working Face Based on Association Rule Mining," Energies, MDPI, vol. 14(21), pages 1-19, October.
    9. Elisa Frutos-Bernal & Ángel Martín del Rey & Irene Mariñas-Collado & María Teresa Santos-Martín, 2022. "An Analysis of Travel Patterns in Barcelona Metro Using Tucker3 Decomposition," Mathematics, MDPI, vol. 10(7), pages 1-17, March.
    10. Yang, Xiping & Fang, Zhixiang & Xu, Yang & Yin, Ling & Li, Junyi & Lu, Shiwei, 2019. "Spatial heterogeneity in spatial interaction of human movements—Insights from large-scale mobile positioning data," Journal of Transport Geography, Elsevier, vol. 78(C), pages 29-40.
    11. Wang, Wenjun & Pan, Lin & Yuan, Ning & Zhang, Sen & Liu, Dong, 2015. "A comparative analysis of intra-city human mobility by taxi," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 420(C), pages 134-147.
    12. Sun, Shan & Guo, Liang & Yang, Shuo & Cao, Jason, 2024. "Exploring the contributions of Ebike ownership, transit access, and the built environment to car ownership in a developing city," Journal of Transport Geography, Elsevier, vol. 116(C).
    13. Lihua Xu & Huifeng Xu & Tianyu Wang & Wenze Yue & Jinyang Deng & Liwei Mao, 2019. "Measuring Urban Spatial Activity Structures: A Comparative Analysis," Sustainability, MDPI, vol. 11(24), pages 1-17, December.
    14. Jin, Kun & Wang, Wei & Li, Xinran & Chen, Siyuan & Qin, Shaoyang & Hua, Xuedong, 2023. "Cascading failure in urban rail transit network considering demand variation and time delay," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 630(C).
    15. Chen, Yanguang, 2015. "The distance-decay function of geographical gravity model: Power law or exponential law?," Chaos, Solitons & Fractals, Elsevier, vol. 77(C), pages 174-189.
    16. Hiroaki Nishiuchi & Yasuyuki Kobayashi & Tomoyuki Todoroki & Tomoya Kawasaki, 2018. "Impact analysis of reductions in tram services in rural areas in Japan using smart card data," Public Transport, Springer, vol. 10(2), pages 291-309, August.
    17. Jian Li & Lu Zhang & Bu Liu & Ningning Shi & Liang Li & Haodong Yin, 2023. "Travel-Energy-Based Timetable Optimization in Urban Subway Systems," Sustainability, MDPI, vol. 15(3), pages 1-21, January.
    18. Erick Yohanes Kalengkongan & Wilson Bogar & Fitri H. Mamonto, 2022. "The Quality of Vehicles' Public Service Testing in The Tomohon Transportation Department," Technium Social Sciences Journal, Technium Science, vol. 32(1), pages 62-75, June.
    19. Xiaolu Li & Peng Zhang & Guangyu Zhu, 2019. "DBSCAN Clustering Algorithms for Non-Uniform Density Data and Its Application in Urban Rail Passenger Aggregation Distribution," Energies, MDPI, vol. 12(19), pages 1-22, September.
    20. Yiduo Huang & Zuojun Max Shen, 2021. "Optimizing timetable and network reopen plans for public transportation networks during a COVID19-like pandemic," Papers 2109.03940, arXiv.org.
