IDEAS home Printed from https://ideas.repec.org/a/kap/transp/v51y2024i1d10.1007_s11116-022-10328-2.html
   My bibliography  Save this article

Maximum interpolable gap length in missing smartphone-based GPS mobility data

Author

Listed:
  • Danielle McCool

    (Utrecht University
    Statistics Netherlands)

  • Peter Lugtig

    (Utrecht University)

  • Barry Schouten

    (Utrecht University
    Statistics Netherlands)

Abstract

Passively-generated location data have the potential to augment mobility and transportation research, as demonstrated by a decade of research. A common trait of these data is a high proportion of missingness. Naïve handling, including list-wise deletion of subjects or days, or linear interpolation across time gaps, has the potential to bias summary results. On the other hand, it is unfeasible to collect mobility data at frequencies high enough to reflect all possible movements. In this paper, we describe the relationship between the temporal and spatial aspects of these data gaps, and illustrate the impact on measures of interest in the field of mobility. We propose a method to deal with missing location data that combines a so-called top-down ratio segmentation method with simple linear interpolation. The linear interpolation imputes missing data. The segmentation method transforms the set of location points to a series of lines, called segments. The method is designed for relatively short gaps, but is evaluated also for longer gaps. We study the effect of our imputation method for the duration of missing data using a completely observed subset of observations from the 2018 Statistics Netherlands travel study. We find that long gaps demonstrate greater downward bias on travel distance, movement events and radius of gyration as compared to shorter but more frequent gaps. When the missingness is unrelated to travel behavior, total sparsity can reach levels of up to 20% with gap lengths of up to 10 min while maintaining a maximum 5% downward bias in the metrics of interest. Temporal aspects can increase these limits; sparsity occurring in the evening or night hours is less biasing due to fewer travel behaviors.

Suggested Citation

  • Danielle McCool & Peter Lugtig & Barry Schouten, 2024. "Maximum interpolable gap length in missing smartphone-based GPS mobility data," Transportation, Springer, vol. 51(1), pages 297-327, February.
  • Handle: RePEc:kap:transp:v:51:y:2024:i:1:d:10.1007_s11116-022-10328-2
    DOI: 10.1007/s11116-022-10328-2
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11116-022-10328-2
    File Function: Abstract
    Download Restriction: Access to full text is restricted to subscribers.

    File URL: https://libkey.io/10.1007/s11116-022-10328-2?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Bates, Douglas & Mächler, Martin & Bolker, Ben & Walker, Steve, 2015. "Fitting Linear Mixed-Effects Models Using lme4," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 67(i01).
    2. Li Shen & Peter R. Stopher, 2014. "Review of GPS Travel Survey and GPS Data-Processing Methods," Transport Reviews, Taylor & Francis Journals, vol. 34(3), pages 316-334, May.
    3. Chen, Cynthia & Gong, Hongmian & Lawson, Catherine & Bialostozky, Evan, 2010. "Evaluating the feasibility of a passive travel survey collection in a complex urban environment: Lessons learned from the New York City case study," Transportation Research Part A: Policy and Practice, Elsevier, vol. 44(10), pages 830-840, December.
    4. Francesca Cellina & Dominik Bucher & Francesca Mangili & José Veiga Simão & Roman Rudel & Martin Raubal, 2019. "A Large Scale, App-Based Behaviour Change Experiment Persuading Sustainable Mobility Patterns: Methods, Results and Lessons Learnt," Sustainability, MDPI, vol. 11(9), pages 1-23, May.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Roy, Avipsa & Fuller, Daniel & Nelson, Trisalyn & Kedron, Peter, 2022. "Assessing the role of geographic context in transportation mode detection from GPS data," Journal of Transport Geography, Elsevier, vol. 100(C).
    2. Hong, Ye & Stüdeli, Emanuel & Raubal, Martin, 2023. "Evaluating geospatial context information for travel mode detection," Journal of Transport Geography, Elsevier, vol. 113(C).
    3. Mariem Fekih & Tom Bellemans & Zbigniew Smoreda & Patrick Bonnel & Angelo Furno & Stéphane Galland, 2021. "A data-driven approach for origin–destination matrix construction from cellular network signalling data: a case study of Lyon region (France)," Transportation, Springer, vol. 48(4), pages 1671-1702, August.
    4. Siripirote, Treerapot & Sumalee, Agachai & Ho, H.W., 2020. "Statistical estimation of freight activity analytics from Global Positioning System data of trucks," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 140(C).
    5. JANSSENS, Jochen & DE CORTE, Annelies & SÖRENSEN, Kenneth, 2016. "Water distribution network design optimisation with respect to reliability," Working Papers 2016007, University of Antwerp, Faculty of Business and Economics.
    6. Raymond Hernandez & Elizabeth A. Pyatak & Cheryl L. P. Vigen & Haomiao Jin & Stefan Schneider & Donna Spruijt-Metz & Shawn C. Roll, 2021. "Understanding Worker Well-Being Relative to High-Workload and Recovery Activities across a Whole Day: Pilot Testing an Ecological Momentary Assessment Technique," IJERPH, MDPI, vol. 18(19), pages 1-17, October.
    7. Christopher Hassall & Michael Nisbet & Evan Norcliffe & He Wang, 2024. "The Potential Health Benefits of Urban Tree Planting Suggested through Immersive Environments," Land, MDPI, vol. 13(3), pages 1-12, February.
    8. Jie Zhao & Ji Chen & Damien Beillouin & Hans Lambers & Yadong Yang & Pete Smith & Zhaohai Zeng & Jørgen E. Olesen & Huadong Zang, 2022. "Global systematic review with meta-analysis reveals yield advantage of legume-based rotations and its drivers," Nature Communications, Nature, vol. 13(1), pages 1-9, December.
    9. Elisabeth Beckmann & Lukas Olbrich & Joseph Sakshaug, 2024. "Multivariate assessment of interviewer-related errors in a cross-national economic survey (Lukas Olbrich, Elisabeth Beckmann, Joseph W. Sakshaug)," Working Papers 253, Oesterreichische Nationalbank (Austrian Central Bank).
    10. F J Heather & D Z Childs & A M Darnaude & J L Blanchard, 2018. "Using an integral projection model to assess the effect of temperature on the growth of gilthead seabream Sparus aurata," PLOS ONE, Public Library of Science, vol. 13(5), pages 1-19, May.
    11. Valentina Krenz & Arjen Alink & Tobias Sommer & Benno Roozendaal & Lars Schwabe, 2023. "Time-dependent memory transformation in hippocampus and neocortex is semantic in nature," Nature Communications, Nature, vol. 14(1), pages 1-17, December.
    12. Toşa, Cristian & Sato, Hitomi & Morikawa, Takayuki & Miwa, Tomio, 2018. "Commuting behavior in emerging urban areas: Findings of a revealed-preferences and stated-intentions survey in Cluj-Napoca, Romania," Journal of Transport Geography, Elsevier, vol. 68(C), pages 78-93.
    13. Morán-Ordóñez, Alejandra & Ameztegui, Aitor & De Cáceres, Miquel & de-Miguel, Sergio & Lefèvre, François & Brotons, Lluís & Coll, Lluís, 2020. "Future trade-offs and synergies among ecosystem services in Mediterranean forests under global change scenarios," Ecosystem Services, Elsevier, vol. 45(C).
    14. Jack McDonnell & Thomas McKenna & Kathryn A. Yurkonis & Deirdre Hennessy & Rafael Andrade Moral & Caroline Brophy, 2023. "A Mixed Model for Assessing the Effect of Numerous Plant Species Interactions on Grassland Biodiversity and Ecosystem Function Relationships," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 28(1), pages 1-19, March.
    15. Ana Pinto & Tong Yin & Marion Reichenbach & Raghavendra Bhatta & Pradeep Kumar Malik & Eva Schlecht & Sven König, 2020. "Enteric Methane Emissions of Dairy Cattle Considering Breed Composition, Pasture Management, Housing Conditions and Feeding Characteristics along a Rural-Urban Gradient in a Rising Megacity," Agriculture, MDPI, vol. 10(12), pages 1-18, December.
    16. Damian M. Herz & Manuel Bange & Gabriel Gonzalez-Escamilla & Miriam Auer & Keyoumars Ashkan & Petra Fischer & Huiling Tan & Rafal Bogacz & Muthuraman Muthuraman & Sergiu Groppa & Peter Brown, 2022. "Dynamic control of decision and movement speed in the human basal ganglia," Nature Communications, Nature, vol. 13(1), pages 1-15, December.
    17. Kathrin Stenchly & Marc Victor Hansen & Katharina Stein & Andreas Buerkert & Wilhelm Loewenstein, 2018. "Income Vulnerability of West African Farming Households to Losses in Pollination Services: A Case Study from Ouagadougou, Burkina Faso," Sustainability, MDPI, vol. 10(11), pages 1-12, November.
    18. Dongyan Liu & Chongran Zhou & John K. Keesing & Oscar Serrano & Axel Werner & Yin Fang & Yingjun Chen & Pere Masque & Janine Kinloch & Aleksey Sadekov & Yan Du, 2022. "Wildfires enhance phytoplankton production in tropical oceans," Nature Communications, Nature, vol. 13(1), pages 1-9, December.
    19. Zhaogeng Yang & Yanhui Li & Peijin Hu & Jun Ma & Yi Song, 2020. "Prevalence of Anemia and its Associated Factors among Chinese 9-, 12-, and 14-Year-Old Children: Results from 2014 Chinese National Survey on Students Constitution and Health," IJERPH, MDPI, vol. 17(5), pages 1-10, February.
    20. Marco Lopez-Cruz & Fernando M. Aguate & Jacob D. Washburn & Natalia Leon & Shawn M. Kaeppler & Dayane Cristina Lima & Ruijuan Tan & Addie Thompson & Laurence Willard Bretonne & Gustavo los Campos, 2023. "Leveraging data from the Genomes-to-Fields Initiative to investigate genotype-by-environment interactions in maize in North America," Nature Communications, Nature, vol. 14(1), pages 1-14, December.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:kap:transp:v:51:y:2024:i:1:d:10.1007_s11116-022-10328-2. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.