IDEAS home Printed from https://ideas.repec.org/a/gam/jsusta/v16y2023i1p107-d1305056.html
   My bibliography  Save this article

Urban Traffic Accident Features Investigation to Improve Urban Transportation Infrastructure Sustainability by Integrating GIS and Data Mining Techniques

Author

Listed:
  • Khanh Giang Le

    (Faculty of Civil Engineering, University of Transport and Communications, No. 3 Cau Giay Street, Lang Thuong Ward, Dong Da District, Hanoi, Vietnam)

  • Quang Hoc Tran

    (Faculty of Civil Engineering, University of Transport and Communications, No. 3 Cau Giay Street, Lang Thuong Ward, Dong Da District, Hanoi, Vietnam)

  • Van Manh Do

    (Faculty of Civil Engineering, University of Transport and Communications, No. 3 Cau Giay Street, Lang Thuong Ward, Dong Da District, Hanoi, Vietnam)

Abstract

Urban traffic accidents pose significant challenges to the sustainability of transportation infrastructure not only in Vietnam but also all over the world. To decrease the frequency of accidents, it is crucial to analyze accident data to determine the relationship between accidents and causes, especially for serious accidents. This study suggests an integrated approach using Geographic Information System (GIS) and Data Mining methods to investigate the features of urban traffic accidents in Hanoi, Vietnam aiming to solve these challenges and enhance the safety and efficiency of urban transportation. Firstly, the dataset was segmented into homogenous clusters using the two-step cluster method. Secondly, the correlation between causes and traffic accidents was examined on the overall dataset as well as on each cluster using the association rule mining (ARM) technique. Finally, the location of accident groups and high-frequency sites of accidents (hotspots) were determined by using GIS techniques. As a result, a five-cluster model was created, which corresponded to five common accident groupings in Hanoi. Moreover, the results of the study also identified the types of accidents, the main causes, the time, and the surrounding areas corresponding to each accident group. In detail, cluster 5 depicted accidents on streets, provincial, and national roads caused by motorbikes making up the highest percentage within the groups, accounting for 29.2%. Speeding and driving in the wrong lane in the afternoon and at night were the main causes in this cluster ( C f ≥ 0.9 and L t ≥ 1.22). Next, cluster 2 had the second-highest proportion. Cluster 2 presented accidents between a truck/car and a motorbike on national and provincial roads, accounting for 27.8%. Cluster 1 presented accidents between a truck/car and a motorbike on local streets, accounting for 22%. Cluster 3 illustrated accidents between two motorbikes on the country lanes, accounting for 12.3%. Finally, cluster 4 depicted single-vehicle motorbike crashes, with the lowest rate of 8.8%. More importantly, this study also recommended using repeatability criteria for the same type of accidents or causes to determine the location of hotspots. Also, suggestions for improving traffic infrastructure sustainability were proposed. To our knowledge, this is the first time in which these three methods are applied simultaneously for analyzing traffic accidents.

Suggested Citation

  • Khanh Giang Le & Quang Hoc Tran & Van Manh Do, 2023. "Urban Traffic Accident Features Investigation to Improve Urban Transportation Infrastructure Sustainability by Integrating GIS and Data Mining Techniques," Sustainability, MDPI, vol. 16(1), pages 1-19, December.
  • Handle: RePEc:gam:jsusta:v:16:y:2023:i:1:p:107-:d:1305056
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2071-1050/16/1/107/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2071-1050/16/1/107/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Hahsler, Michael & Grün, Bettina & Hornik, Kurt, 2005. "arules - A Computational Environment for Mining Association Rules and Frequent Item Sets," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 14(i15).
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jesus Crespo Cuaresma & Bettina Grün & Paul Hofmarcher & Stefan Humer & Mathias Moser, 2015. "A Comprehensive Approach to Posterior Jointness Analysis in Bayesian Model Averaging Applications," Department of Economics Working Papers wuwp193, Vienna University of Economics and Business, Department of Economics.
    2. Yoichi Matsumoto, 2013. "Heterogeneous Combinations of Knowledge Elements: How the Knowledge Base Structure Impacts Knowledge-related Outcomes of a Firm," Discussion Paper Series DP2013-15, Research Institute for Economics & Business Administration, Kobe University.
    3. Man-, ZuyiKeunZuyi Wang & Takagi, Chifumi & Kim, Man-Keun & Chung, Anh, 2022. "Uncover Drivers Influencing Consumers' WTP Using Machine Learning: Case of Organic Coffee in Taiwan," 2022 Annual Meeting, July 31-August 2, Anaheim, California 322150, Agricultural and Applied Economics Association.
    4. Kurt Hornik & Christian Buchta & Achim Zeileis, 2009. "Open-source machine learning: R meets Weka," Computational Statistics, Springer, vol. 24(2), pages 225-232, May.
    5. Hofmarcher, Paul & Crespo Cuaresma, Jesus & Grün, Bettina & Humer, Stefan & Moser, Mathias, 2018. "Bivariate jointness measures in Bayesian Model Averaging: Solving the conundrum," Journal of Macroeconomics, Elsevier, vol. 57(C), pages 150-165.
    6. Małecka-Ziembińska Edyta & Siwiec Anna, 2020. "Searching for similarities in EU corporate income taxes for their harmonization," Economics and Business Review, Sciendo, vol. 6(4), pages 72-94, December.
    7. Nancy Awad & Jean-Francois Couchot & Bechara Al Bouna & Laurent Philippe, 2020. "Publishing Anonymized Set-Valued Data via Disassociation towards Analysis," Future Internet, MDPI, vol. 12(4), pages 1-21, April.
    8. Scholz, Michael, 2016. "R Package clickstream: Analyzing Clickstream Data with Markov Chains," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 74(i04).
    9. Jasleen Kaur & Khushdeep Dharni, 2022. "Assessing efficacy of association rules for predicting global stock indices," DECISION: Official Journal of the Indian Institute of Management Calcutta, Springer;Indian Institute of Management Calcutta, vol. 49(3), pages 329-339, September.
    10. Deszczyński, Bartosz & Beręsewicz, Maciej, 2021. "The maturity of relationship management and firm performance – A step toward relationship management middle-range theory," Journal of Business Research, Elsevier, vol. 135(C), pages 358-372.
    11. Michael Hahsler & Radoslaw Karpienko, 2017. "Visualizing association rules in hierarchical groups," Journal of Business Economics, Springer, vol. 87(3), pages 317-335, April.
    12. Ji Yeon Lee & Richa Kumari & Jae Yun Jeong & Tae-Hyun Kim & Byeong-Hee Lee, 2020. "Knowledge Discovering on Graphene Green Technology by Text Mining in National R&D Projects in South Korea," Sustainability, MDPI, vol. 12(23), pages 1-16, November.
    13. Yoonju Lee & Heejin Kim & Hyesun Jeong & Yunhwan Noh, 2020. "Patterns of Multimorbidity in Adults: An Association Rules Analysis Using the Korea Health Panel," IJERPH, MDPI, vol. 17(8), pages 1-14, April.
    14. Sun, Chenhao & Wang, Xin & Zheng, Yihui, 2020. "An ensemble system to predict the spatiotemporal distribution of energy security weaknesses in transmission networks," Applied Energy, Elsevier, vol. 258(C).
    15. Suelane Garcia Fontes & Ronaldo Gonçalves Morato & Silvio Luiz Stanzani & Pedro Luiz Pizzigatti Corrêa, 2021. "Jaguar movement behavior: using trajectories and association rule mining algorithms to unveil behavioral states and social interactions," PLOS ONE, Public Library of Science, vol. 16(2), pages 1-18, February.
    16. Mulenga, Brian P. & Raper, Kellie Curry & Peel, Derrell S., 2020. "A Market Basket Analysis of Beef Calf Management Practice Adoption," Journal of Agricultural and Resource Economics, Western Agricultural Economics Association, vol. 46(2), August.
    17. Da-Yeong Lee & Dae-Seong Lee & Young-Seuk Park, 2022. "Taxonomic and Functional Diversity of Benthic Macroinvertebrate Assemblages in Reservoirs of South Korea," IJERPH, MDPI, vol. 20(1), pages 1-17, December.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jsusta:v:16:y:2023:i:1:p:107-:d:1305056. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.