IDEAS home Printed from https://ideas.repec.org/p/osf/osfxxx/ewtcf.html
   My bibliography  Save this paper

Forecasting Australian port throughput: Lessons and Pitfalls in the era of Big Data

Author

Listed:
  • banerjee, soumya

    (University of Oxford)

Abstract

Modelling and forecasting port throughput enables stakeholders to make efficient decisions ranging from management of port development, to infrastructure investments, operational restructuring and tariffs policy. Accurate forecasting of port throughput is also critical for long-term resource allocation and short-term strategic planning. In turn, efficient decision making enhances the competitiveness of a port. However, in the era of big data we are faced with the enviable dilemma of having too much information. We pose the question: is more information always better for forecasting? We suggest that more information comes at the cost of more parameters of the forecasting model that need to be estimated. We compare multiple forecasting models of varying degrees of complexity and quantify the effect of the amount of data on model forecasting accuracy. Our methodology serves as a guideline for practitioners in this field. We also enjoin caution that even in the era of big data more information may not always be better. It would be advisable for analysts to weigh the costs of adding more data: the ultimate decision would depend on the problem, amount of data and the kind of models being used.

Suggested Citation

  • banerjee, soumya, 2016. "Forecasting Australian port throughput: Lessons and Pitfalls in the era of Big Data," OSF Preprints ewtcf, Center for Open Science.
  • Handle: RePEc:osf:osfxxx:ewtcf
    DOI: 10.31219/osf.io/ewtcf
    as

    Download full text from publisher

    File URL: https://osf.io/download/5d4478dcbcd6d900198eb8ba/
    Download Restriction: no

    File URL: https://libkey.io/10.31219/osf.io/ewtcf?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Cho, Haeran & Fryzlewicz, Piotr, 2015. "Multiple-change-point detection for high dimensional time series via sparsified binary segmentation," LSE Research Online Documents on Economics 57147, London School of Economics and Political Science, LSE Library.
    2. Haeran Cho & Piotr Fryzlewicz, 2015. "Multiple-change-point detection for high dimensional time series via sparsified binary segmentation," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 77(2), pages 475-507, March.
    3. Peter Schulze & Alexander Prinz, 2009. "Forecasting container transshipment in Germany," Applied Economics, Taylor & Francis Journals, vol. 41(22), pages 2809-2815.
    4. Fung, Michael K, 2002. "Forecasting Hong Kong's Container Throughput: An Error-Correction Model," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 21(1), pages 69-80, January.
    5. Maloni, Michael & Jackson, Eric C., 2005. "North American Container Port Capacity," 46th Annual Transportation Research Forum, Washington, D.C., March 6-8, 2005 208147, Transportation Research Forum.
    6. Athanasios Pallis & Thomas Vitsounis & Peter De Langen & Theo Notteboom, 2011. "Port Economics, Policy and Management: Content Classification and Survey," Transport Reviews, Taylor & Francis Journals, vol. 31(4), pages 445-471.
    7. Mohamed M. Mostafa, 2004. "Forecasting the Suez Canal traffic: a neural network analysis," Maritime Policy & Management, Taylor & Francis Journals, vol. 31(2), pages 139-156, April.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. banerjee, soumya, 2016. "Forecasting Australian port throughput: Lessons and Pitfalls in the era of Big Data," OSF Preprints c3av2, Center for Open Science.
    2. Simon Bussy & Mokhtar Z. Alaya & Anne‐Sophie Jannot & Agathe Guilloux, 2022. "Binacox: automatic cut‐point detection in high‐dimensional Cox model with applications in genetics," Biometrics, The International Biometric Society, vol. 78(4), pages 1414-1426, December.
    3. Barigozzi, Matteo & Trapani, Lorenzo, 2020. "Sequential testing for structural stability in approximate factor models," Stochastic Processes and their Applications, Elsevier, vol. 130(8), pages 5149-5187.
    4. Steland, Ansgar, 2020. "Testing and estimating change-points in the covariance matrix of a high-dimensional time series," Journal of Multivariate Analysis, Elsevier, vol. 177(C).
    5. Qing Yang & Yu-Ning Li & Yi Zhang, 2020. "Change point detection for nonparametric regression under strongly mixing process," Statistical Papers, Springer, vol. 61(4), pages 1465-1506, August.
    6. Li, Degui, 2024. "Estimation of Large Dynamic Covariance Matrices: A Selective Review," Econometrics and Statistics, Elsevier, vol. 29(C), pages 16-30.
    7. Su-Han Woo & Stephen Pettit & Anthony Beresford & Dong-Wook Kwak, 2012. "Seaport Research: A Decadal Analysis of Trends and Themes Since the 1980s," Transport Reviews, Taylor & Francis Journals, vol. 32(3), pages 351-377, January.
    8. Brault, Vincent & Ouadah, Sarah & Sansonnet, Laure & Lévy-Leduc, Céline, 2018. "Nonparametric multiple change-point estimation for analyzing large Hi-C data matrices," Journal of Multivariate Analysis, Elsevier, vol. 165(C), pages 143-165.
    9. Yi Xiao & Shouyang Wang & Ming Xiao & Jin Xiao & Yi Hu, 2017. "The Analysis for the Cargo Volume with Hybrid Discrete Wavelet Modeling," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 16(03), pages 851-863, May.
    10. Chen, Likai & Wang, Weining & Wu, Wei Biao, 2019. "Inference of Break-Points in High-Dimensional Time Series," IRTG 1792 Discussion Papers 2019-013, Humboldt University of Berlin, International Research Training Group 1792 "High Dimensional Nonstationary Time Series".
    11. Cho, Haeran & Korkas, Karolos K., 2022. "High-dimensional GARCH process segmentation with an application to Value-at-Risk," Econometrics and Statistics, Elsevier, vol. 23(C), pages 187-203.
    12. Cho, Haeran & Kirch, Claudia, 2024. "Data segmentation algorithms: Univariate mean change and beyond," Econometrics and Statistics, Elsevier, vol. 30(C), pages 76-95.
    13. Aykroyd, Robert G. & Barber, Stuart & Miller, Luke R., 2016. "Classification of multiple time signals using localized frequency characteristics applied to industrial process monitoring," Computational Statistics & Data Analysis, Elsevier, vol. 94(C), pages 351-362.
    14. Bin Liu & Cheng Zhou & Xinsheng Zhang & Yufeng Liu, 2020. "A unified data‐adaptive framework for high dimensional change point detection," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 82(4), pages 933-963, September.
    15. Ma, Chenchen & Tu, Yundong, 2023. "Group fused Lasso for large factor models with multiple structural breaks," Journal of Econometrics, Elsevier, vol. 233(1), pages 132-154.
    16. Pang, Tianxiao & Du, Lingjie & Chong, Terence Tai-Leung, 2021. "Estimating multiple breaks in nonstationary autoregressive models," Journal of Econometrics, Elsevier, vol. 221(1), pages 277-311.
    17. Neil Hwang & Jiarui Xu & Shirshendu Chatterjee & Sharmodeep Bhattacharyya, 2022. "The Bethe Hessian and Information Theoretic Approaches for Online Change-Point Detection in Network Data," Sankhya A: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 84(1), pages 283-320, June.
    18. Aaron Paul Lowther & Rebecca Killick & Idris Arthur Eckley, 2023. "Detecting changes in mixed‐sampling rate data sequences," Environmetrics, John Wiley & Sons, Ltd., vol. 34(1), February.
    19. Mengjia Yu & Xiaohui Chen, 2021. "Finite sample change point inference and identification for high‐dimensional mean vectors," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 83(2), pages 247-270, April.
    20. Hajra Siddiqa & Sajid Ali & Ismail Shah, 2021. "Most recent changepoint detection in censored panel data," Computational Statistics, Springer, vol. 36(1), pages 515-540, March.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:osf:osfxxx:ewtcf. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: OSF (email available below). General contact details of provider: https://osf.io/preprints/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.