IDEAS home Printed from https://ideas.repec.org/a/gam/jftint/v17y2025i3p131-d1615891.html
   My bibliography  Save this article

A Distributed Machine Learning-Based Scheme for Real-Time Highway Traffic Flow Prediction in Internet of Vehicles

Author

Listed:
  • Hani Alnami

    (Electrical Engineering & Computer Science, Florida Atlantic University, 777 Glades Road, Boca Raton, FL 33431, USA
    Computer Science Department, Jazan University, Jazan 82917, Saudi Arabia)

  • Imad Mahgoub

    (Electrical Engineering & Computer Science, Florida Atlantic University, 777 Glades Road, Boca Raton, FL 33431, USA)

  • Hamzah Al-Najada

    (Electrical Engineering & Computer Science, Florida Atlantic University, 777 Glades Road, Boca Raton, FL 33431, USA)

  • Easa Alalwany

    (College of Computer Science and Engineering, Taibah University, Yanbu 46421, Saudi Arabia)

Abstract

Abnormal traffic flow prediction is crucial for reducing traffic congestion. Most recent studies utilized machine learning models in traffic flow detection systems. However, these detection systems do not support real-time analysis. Centralized machine learning methods face a number of challenges due to the sheer volume of traffic data that needs to be processed in real-time. Thus, it is not scalable and lacks fault tolerance and data privacy. This study designs and evaluates a scalable distributed machine learning-based scheme to predict highway traffic flows in real-time. The proposed system is segment-based where the vehicles in each segment form a cluster. We train and validate a local Random Forest Regression (RFR) model for each vehicle’s cluster (highway-segment) using six different hyper parameters. Due to the variance of traffic flow patterns between segments, we build a global Distributed Machine Learning Random Forest (DMLRF) regression model to improve the system performance for abnormal traffic flows. Kappa Architecture is utilized to enable real-time prediction. The proposed model is evaluated and compared to other base-line models, Linear Regression (LR), Logistic Regression (LogR), and K Nearest Neighbor (KNN) regression in terms of Mean Square Error (MSE), Root Mean Square Error (RMSE), Mean Absolute Error (MAE), R-squared (R 2 ), and Adjusted R-Squared (AR 2 ). The proposed scheme demonstrates high accuracy in predicting abnormal traffic flows while maintaining scalability and data privacy.

Suggested Citation

  • Hani Alnami & Imad Mahgoub & Hamzah Al-Najada & Easa Alalwany, 2025. "A Distributed Machine Learning-Based Scheme for Real-Time Highway Traffic Flow Prediction in Internet of Vehicles," Future Internet, MDPI, vol. 17(3), pages 1-21, March.
  • Handle: RePEc:gam:jftint:v:17:y:2025:i:3:p:131-:d:1615891
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/1999-5903/17/3/131/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1999-5903/17/3/131/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Hou, Qinzhong & Leng, Junqiang & Ma, Guosheng & Liu, Weiyi & Cheng, Yuxing, 2019. "An adaptive hybrid model for short-term urban traffic flow prediction," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 527(C).
    2. Zhi (Aaron) Cheng & Min-Seok Pang & Paul A. Pavlou, 2020. "Mitigating Traffic Congestion: The Role of Intelligent Transportation Systems," Information Systems Research, INFORMS, vol. 31(3), pages 653-674, September.
    3. Nhlakanipho Michael Mqadi & Nalindren Naicker & Timothy Adeliyi, 2021. "Solving Misclassification of the Credit Card Imbalance Problem Using Near Miss," Mathematical Problems in Engineering, Hindawi, vol. 2021, pages 1-16, July.
    4. Kursa, Miron B. & Rudnicki, Witold R., 2010. "Feature Selection with the Boruta Package," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 36(i11).
    5. Sura Mahmood Abdullah & Muthusamy Periyasamy & Nafees Ahmed Kamaludeen & S. K. Towfek & Raja Marappan & Sekar Kidambi Raju & Amal H. Alharbi & Doaa Sami Khafaga, 2023. "Optimizing Traffic Flow in Smart Cities: Soft GRU-Based Recurrent Neural Networks for Enhanced Congestion Prediction Using Deep Learning," Sustainability, MDPI, vol. 15(7), pages 1-21, March.
    6. Zhanzhong Wang & Ruijuan Chu & Minghang Zhang & Xiaochao Wang & Siliang Luan, 2020. "An Improved Hybrid Highway Traffic Flow Prediction Model Based on Machine Learning," Sustainability, MDPI, vol. 12(20), pages 1-22, October.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ismail Shah & Izhar Muhammad & Sajid Ali & Saira Ahmed & Mohammed M. A. Almazah & A. Y. Al-Rezami, 2022. "Forecasting Day-Ahead Traffic Flow Using Functional Time Series Approach," Mathematics, MDPI, vol. 10(22), pages 1-16, November.
    2. Tong, Jianfeng & Liu, Zhenxing & Zhang, Yong & Zheng, Xiujuan & Jin, Junyang, 2023. "Improved multi-gate mixture-of-experts framework for multi-step prediction of gas load," Energy, Elsevier, vol. 282(C).
    3. Asma Shaheen & Javed Iqbal, 2018. "Spatial Distribution and Mobility Assessment of Carcinogenic Heavy Metals in Soil Profiles Using Geostatistics and Random Forest, Boruta Algorithm," Sustainability, MDPI, vol. 10(3), pages 1-20, March.
    4. Ramón Ferri-García & María del Mar Rueda, 2022. "Variable selection in Propensity Score Adjustment to mitigate selection bias in online surveys," Statistical Papers, Springer, vol. 63(6), pages 1829-1881, December.
    5. Yang Zhao & Denise Gorse, 2024. "Earthquake prediction from seismic indicators using tree-based ensemble learning," Natural Hazards: Journal of the International Society for the Prevention and Mitigation of Natural Hazards, Springer;International Society for the Prevention and Mitigation of Natural Hazards, vol. 120(3), pages 2283-2309, February.
    6. Manuel J. García Rodríguez & Vicente Rodríguez Montequín & Francisco Ortega Fernández & Joaquín M. Villanueva Balsera, 2019. "Public Procurement Announcements in Spain: Regulations, Data Analysis, and Award Price Estimator Using Machine Learning," Complexity, Hindawi, vol. 2019, pages 1-20, November.
    7. Sangjin Kim & Jong-Min Kim, 2019. "Two-Stage Classification with SIS Using a New Filter Ranking Method in High Throughput Data," Mathematics, MDPI, vol. 7(6), pages 1-16, May.
    8. Baihan Wang & Alfred Pozarickij & Mohsen Mazidi & Neil Wright & Pang Yao & Saredo Said & Andri Iona & Christiana Kartsonaki & Hannah Fry & Kuang Lin & Yiping Chen & Huaidong Du & Daniel Avery & Dan Sc, 2025. "Comparative studies of 2168 plasma proteins measured by two affinity-based platforms in 4000 Chinese adults," Nature Communications, Nature, vol. 16(1), pages 1-13, December.
    9. Foutzopoulos, Giorgos & Pandis, Nikolaos & Tsagris, Michail, 2024. "Predicting full retirement attainment of NBA players," MPRA Paper 121540, University Library of Munich, Germany.
    10. Zhao-Yue Chen & Hervé Petetin & Raúl Fernando Méndez Turrubiates & Hicham Achebak & Carlos Pérez García-Pando & Joan Ballester, 2024. "Population exposure to multiple air pollutants and its compound episodes in Europe," Nature Communications, Nature, vol. 15(1), pages 1-11, December.
    11. Schrader, Silja & Graham, Sonia & Campbell, Rebecca & Height, Kaitlyn & Hawkes, Gina, 2024. "Grower attitudes and practices toward area-wide management of cropping weeds in Australia," Land Use Policy, Elsevier, vol. 137(C).
    12. Rabin K. Jana & Indranil Ghosh, 2025. "A residual driven ensemble machine learning approach for forecasting natural gas prices: analyses for pre-and during-COVID-19 phases," Annals of Operations Research, Springer, vol. 345(2), pages 757-778, February.
    13. Shao, Feng & Shao, Hu & Wang, Dongle & Lam, William H.K., 2024. "A multi-task spatio-temporal generative adversarial network for prediction of travel time reliability in peak hour periods," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 638(C).
    14. Piotr Pomorski & Denise Gorse, 2023. "Improving Portfolio Performance Using a Novel Method for Predicting Financial Regimes," Papers 2310.04536, arXiv.org.
    15. Caperna, Giulio & Colagrossi, Marco & Geraci, Andrea & Mazzarella, Gianluca, 2022. "A babel of web-searches: Googling unemployment during the pandemic," Labour Economics, Elsevier, vol. 74(C).
    16. Hakan Pabuccu & Adrian Barbu, 2023. "Feature Selection with Annealing for Forecasting Financial Time Series," Papers 2303.02223, arXiv.org, revised Feb 2024.
    17. Abolfazl Mollalo & Kiara M. Rivera & Behzad Vahedi, 2020. "Artificial Neural Network Modeling of Novel Coronavirus (COVID-19) Incidence Rates across the Continental United States," IJERPH, MDPI, vol. 17(12), pages 1-13, June.
    18. Chunyang Huang & Shaoliang Zhang, 2023. "Explainable artificial intelligence model for identifying Market Value in Professional Soccer Players," Papers 2311.04599, arXiv.org, revised Nov 2023.
    19. Elina H. Hwang & Xitong Guo & Yong Tan & Yuanyuan Dang, 2022. "Delivering Healthcare Through Teleconsultations: Implications for Offline Healthcare Disparity," Information Systems Research, INFORMS, vol. 33(2), pages 515-539, June.
    20. Faisal Alsayegh & Moh A Alkhamis & Fatima Ali & Sreeja Attur & Nicholas M Fountain-Jones & Mohammad Zubaid, 2022. "Anemia or other comorbidities? using machine learning to reveal deeper insights into the drivers of acute coronary syndromes in hospital admitted patients," PLOS ONE, Public Library of Science, vol. 17(1), pages 1-15, January.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jftint:v:17:y:2025:i:3:p:131-:d:1615891. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.