IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v10y2022i15p2670-d874822.html
   My bibliography  Save this article

Clustering and Forecasting Urban Bus Passenger Demand with a Combination of Time Series Models

Author

Listed:
  • Irene Mariñas-Collado

    (Department of Statistics and Operation Research and Mathematics Didactics, Universidad de Oviedo, 33007 Oviedo, Spain)

  • Ana E. Sipols

    (Department of Applied Mathematics, Materials Science and Engineering and Electronic Technology, Rey Juan Carlos University, 28933 Madrid, Spain)

  • M. Teresa Santos-Martín

    (Department of Statistics, Institute of Fundamental Physics and Mathematics, Universidad de Salamanca, 37008 Salamanca, Spain)

  • Elisa Frutos-Bernal

    (Department of Statistics, Universidad de Salamanca, 37007 Salamanca, Spain)

Abstract

The present paper focuses on the analysis of large data sets from public transport networks, more specifically, on how to predict urban bus passenger demand. A series of steps are proposed to ease the understanding of passenger demand. First, given the large number of stops in the bus network, these are divided into clusters and then different models are fitted for a representative of each of the clusters. The aim is to compare and combine the predictions associated with traditional methods, such as exponential smoothing or ARIMA, with machine learning methods, such as support vector machines or artificial neural networks. Moreover, support vector machine predictions are improved by incorporating explanatory variables with temporal structure and moving averages. Finally, through cointegration techniques, the results obtained for the representative of each group are extrapolated to the rest of the series within the same cluster. A case study in the city of Salamanca (Spain) is presented to illustrate the problem.

Suggested Citation

  • Irene Mariñas-Collado & Ana E. Sipols & M. Teresa Santos-Martín & Elisa Frutos-Bernal, 2022. "Clustering and Forecasting Urban Bus Passenger Demand with a Combination of Time Series Models," Mathematics, MDPI, vol. 10(15), pages 1-16, July.
  • Handle: RePEc:gam:jmathe:v:10:y:2022:i:15:p:2670-:d:874822
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/10/15/2670/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/10/15/2670/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Peter R. Winters, 1960. "Forecasting Sales by Exponentially Weighted Moving Averages," Management Science, INFORMS, vol. 6(3), pages 324-342, April.
    2. Caiado, Jorge & Crato, Nuno & Pena, Daniel, 2006. "A periodogram-based metric for time series classification," Computational Statistics & Data Analysis, Elsevier, vol. 50(10), pages 2668-2684, June.
    3. de Menezes, Lilian M. & W. Bunn, Derek & Taylor, James W., 2000. "Review of guidelines for the use of combined forecasts," European Journal of Operational Research, Elsevier, vol. 120(1), pages 190-204, January.
    4. Chen, Cynthia & Chen, Jason & Barry, James, 2009. "Diurnal pattern of transit ridership: a case study of the New York City subway system," Journal of Transport Geography, Elsevier, vol. 17(3), pages 176-186.
    5. Johansen, Soren, 1995. "Likelihood-Based Inference in Cointegrated Vector Autoregressive Models," OUP Catalogue, Oxford University Press, number 9780198774501.
    6. Engle, Robert & Granger, Clive, 2015. "Co-integration and error correction: Representation, estimation, and testing," Applied Econometrics, Russian Presidential Academy of National Economy and Public Administration (RANEPA), vol. 39(3), pages 106-135.
    7. Holt, Charles C., 2004. "Author's retrospective on 'Forecasting seasonals and trends by exponentially weighted moving averages'," International Journal of Forecasting, Elsevier, vol. 20(1), pages 11-13.
    8. Orhan Altuğ Karabiber & George Xydis, 2019. "Electricity Price Forecasting in the Danish Day-Ahead Market Using the TBATS, ANN and ARIMA Methods," Energies, MDPI, vol. 12(5), pages 1-29, March.
    9. Holt, Charles C., 2004. "Forecasting seasonals and trends by exponentially weighted moving averages," International Journal of Forecasting, Elsevier, vol. 20(1), pages 5-10.
    10. Galeano, Pedro, 2001. "Multivariate analysis in vector time series," DES - Working Papers. Statistics and Econometrics. WS ws012415, Universidad Carlos III de Madrid. Departamento de Estadística.
    11. Antonio Comi & Antonio Polimeni, 2020. "Bus Travel Time: Experimental Evidence and Forecasting," Forecasting, MDPI, vol. 2(3), pages 1-14, August.
    12. Johansen, Soren, 1991. "Estimation and Hypothesis Testing of Cointegration Vectors in Gaussian Vector Autoregressive Models," Econometrica, Econometric Society, vol. 59(6), pages 1551-1580, November.
    13. Elisa Frutos-Bernal & Ángel Martín del Rey & Irene Mariñas-Collado & María Teresa Santos-Martín, 2022. "An Analysis of Travel Patterns in Barcelona Metro Using Tucker3 Decomposition," Mathematics, MDPI, vol. 10(7), pages 1-17, March.
    14. Ding, Chuan & Cao, Xinyu & Liu, Chao, 2019. "How does the station-area built environment influence Metrorail ridership? Using gradient boosting decision trees to identify non-linear thresholds," Journal of Transport Geography, Elsevier, vol. 77(C), pages 70-78.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Jinshi Yu & Qi Duan & Haonan Huang & Shude He & Tao Zou, 2023. "Effective Incomplete Multi-View Clustering via Low-Rank Graph Tensor Completion," Mathematics, MDPI, vol. 11(3), pages 1-18, January.
    2. Lian, Ying & Lucas, Flavien & Sörensen, Kenneth, 2024. "Prepositioning can improve the performance of a dynamic stochastic on-demand public bus system," European Journal of Operational Research, Elsevier, vol. 312(1), pages 338-356.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Theodosiou, Marina, 2011. "Forecasting monthly and quarterly time series using STL decomposition," International Journal of Forecasting, Elsevier, vol. 27(4), pages 1178-1195, October.
    2. Petropoulos, Fotios & Apiletti, Daniele & Assimakopoulos, Vassilios & Babai, Mohamed Zied & Barrow, Devon K. & Ben Taieb, Souhaib & Bergmeir, Christoph & Bessa, Ricardo J. & Bijak, Jakub & Boylan, Joh, 2022. "Forecasting: theory and practice," International Journal of Forecasting, Elsevier, vol. 38(3), pages 705-871.
      • Fotios Petropoulos & Daniele Apiletti & Vassilios Assimakopoulos & Mohamed Zied Babai & Devon K. Barrow & Souhaib Ben Taieb & Christoph Bergmeir & Ricardo J. Bessa & Jakub Bijak & John E. Boylan & Jet, 2020. "Forecasting: theory and practice," Papers 2012.03854, arXiv.org, revised Jan 2022.
    3. Jan G. De Gooijer & Rob J. Hyndman, 2005. "25 Years of IIF Time Series Forecasting: A Selective Review," Monash Econometrics and Business Statistics Working Papers 12/05, Monash University, Department of Econometrics and Business Statistics.
    4. Alessia Naccarato & Andrea Pierini & Giovanna Ferraro, 2021. "Markowitz portfolio optimization through pairs trading cointegrated strategy in long-term investment," Annals of Operations Research, Springer, vol. 299(1), pages 81-99, April.
    5. Wilton Bernardino & João B. Amaral & Nelson L. Paes & Raydonal Ospina & José L. Távora, 2022. "A statistical investigation of a stock valuation model," SN Business & Economics, Springer, vol. 2(8), pages 1-25, August.
    6. Brittle, Shane, 2009. "Ricardian Equivalence and the Efficacy of Fiscal Policy in Australia," Economics Working Papers wp09-10, School of Economics, University of Wollongong, NSW, Australia.
    7. Jesus Gonzalo & Tae-Hwy Lee, 2000. "On the robustness of cointegration tests when series are fractionally intergrated," Journal of Applied Statistics, Taylor & Francis Journals, vol. 27(7), pages 821-827.
    8. Ling Tang & Chengyuan Zhang & Tingfei Li & Ling Li, 2021. "A novel BEMD-based method for forecasting tourist volume with search engine data," Tourism Economics, , vol. 27(5), pages 1015-1038, August.
    9. Neil R. Ericsson & James G. MacKinnon, 2002. "Distributions of error correction tests for cointegration," Econometrics Journal, Royal Economic Society, vol. 5(2), pages 285-318, June.
    10. Pär Österholm, 2005. "The Taylor Rule: A Spurious Regression?," Bulletin of Economic Research, Wiley Blackwell, vol. 57(3), pages 217-247, July.
    11. Omtzigt Pieter & Fachin Stefano, 2002. "Bootstrapping and Bartlett corrections in the cointegrated VAR model," Economics and Quantitative Methods qf0212, Department of Economics, University of Insubria.
    12. Ericsson, Neil R & Hendry, David F & Mizon, Grayham E, 1998. "Exogeneity, Cointegration, and Economic Policy Analysis," Journal of Business & Economic Statistics, American Statistical Association, vol. 16(4), pages 370-387, October.
    13. Chien-Chiang Lee & Chin-Yu Wang & Jhih-Hong Zeng, 2017. "Housing price–volume correlations and boom–bust cycles," Empirical Economics, Springer, vol. 52(4), pages 1423-1450, June.
    14. Eleni Constantinou & Avo Kazandjian & Georgios P. Kouretas & Vera Tahmazian, 2008. "Common Stochastic Trends Among The Cyprus Stock Exchange And The Ase, Lse And Nyse," Bulletin of Economic Research, Wiley Blackwell, vol. 60(4), pages 327-349, October.
    15. Phylaktis, Kate & Chen, Long, 2009. "Price discovery in foreign exchange markets: A comparison of indicative and actual transaction prices," Journal of Empirical Finance, Elsevier, vol. 16(4), pages 640-654, September.
    16. Yanhua Chen & Rosario N Mantegna & Athanasios A Pantelous & Konstantin M Zuev, 2018. "A dynamic analysis of S&P 500, FTSE 100 and EURO STOXX 50 indices under different exchange rates," PLOS ONE, Public Library of Science, vol. 13(3), pages 1-40, March.
    17. Kleibergen, Frank & Paap, Richard, 2006. "Generalized reduced rank tests using the singular value decomposition," Journal of Econometrics, Elsevier, vol. 133(1), pages 97-126, July.
    18. Bruce E. Hansen, 2018. "Johansen’s Reduced Rank Estimator Is GMM," Econometrics, MDPI, vol. 6(2), pages 1-9, May.
    19. Sulaiman, Saidu & Masih, Mansur, 2017. "Is liberalizing finance the game in town for Nigeria ?," MPRA Paper 95569, University Library of Munich, Germany.
    20. Rault, Christophe, 2005. "Further Results on Weak Exogeneity in Vector Error Correction Models," Brazilian Review of Econometrics, Sociedade Brasileira de Econometria - SBE, vol. 25(2), November.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:10:y:2022:i:15:p:2670-:d:874822. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.