IDEAS home Printed from https://ideas.repec.org/a/hin/jnlmpe/1956865.html
   My bibliography  Save this article

Multiclass Classification by Various Machine Learning Algorithms and Interpretation of the Risk Factors of Pedestrian Accidents Using Explainable AI

Author

Listed:
  • Sanghun Lee
  • Sangyeop Kim
  • Jaehoon Kim
  • Doyun Kim
  • Dohyun Lee
  • Gwangmuk Im
  • Hyeonseop Yuk
  • Tae-Young Heo
  • Juan P. Amezquita-Sanchez

Abstract

Pedestrian injuries and fatalities due to traffic accidents remain at a high level. Therefore, the need for efforts to reduce this ratio is on the rise. Machine learning models can facilitate the exploration of the various factors that influence the occurrence of pedestrian accidents. In this study, we used data on pedestrian traffic accidents classified into three categories of injury severity: minor, severe, and fatal. To compare the performance of various types of models, logistic regression, Naïve Bayes, XGBoost, CatBoost, and LightGBM were used for analysis. Five machine learning methods were applied to the analysis, and hyperparameter tuning was performed to improve the performance of the model. The performances of the five models were 0.688, 0.577, 0.705, 0.708, and 0.707, respectively, and LightGBM showed the best classification accuracy at 0.708 in this study. Based on SHAP (Shapley additive explanation), one of the explainable artificial intelligence (XAI) techniques, we were able to obtain the variable importance of the LightGBM model, through which we identified the main factors affecting each level of injury severity. In addition, by using LIME (local interpretable model-agnostic explanation), another XAI technique, it was found that the age of the driver and pedestrian was the factor that had the most significant influence on the model’s classification prediction. Specifically, as the size of the vehicle increases, the severity of the accident increases. When the driver is older, the severity of the accident is small, and when the driver is young, the severity of the accident is high.

Suggested Citation

  • Sanghun Lee & Sangyeop Kim & Jaehoon Kim & Doyun Kim & Dohyun Lee & Gwangmuk Im & Hyeonseop Yuk & Tae-Young Heo & Juan P. Amezquita-Sanchez, 2023. "Multiclass Classification by Various Machine Learning Algorithms and Interpretation of the Risk Factors of Pedestrian Accidents Using Explainable AI," Mathematical Problems in Engineering, Hindawi, vol. 2023, pages 1-15, May.
  • Handle: RePEc:hin:jnlmpe:1956865
    DOI: 10.1155/2023/1956865
    as

    Download full text from publisher

    File URL: http://downloads.hindawi.com/journals/mpe/2023/1956865.pdf
    Download Restriction: no

    File URL: http://downloads.hindawi.com/journals/mpe/2023/1956865.xml
    Download Restriction: no

    File URL: https://libkey.io/10.1155/2023/1956865?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:hin:jnlmpe:1956865. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Mohamed Abdelhakeem (email available below). General contact details of provider: https://www.hindawi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.