Printed from https://ideas.repec.org/a/gam/jsusta/v16y2024i14p5880-d1432443.html

Proposing Machine Learning Models Suitable for Predicting Open Data Utilization

Authors

Listed:
  • Junyoung Jeong

    (Graduate School of Management of Technology, Sungkyunkwan University, 2066 Seobu-ro, Jangan-gu, Suwon 16419, Republic of Korea)

  • Keuntae Cho

    (Graduate School of Management of Technology, Sungkyunkwan University, 2066 Seobu-ro, Jangan-gu, Suwon 16419, Republic of Korea;
    Department of Systems Management Engineering, Sungkyunkwan University, 2066 Seobu-ro, Jangan-gu, Suwon 16419, Republic of Korea)

Abstract

As digital transformation accelerates across society, open data are increasingly recognized as a key resource for digital innovation in the public sector. This study explores two research questions: (1) Can a machine learning approach be appropriately used to measure and evaluate open data utilization? (2) Should different machine learning models be applied to measure open data utilization depending on open data attributes (field and usage type)? The study used single-model (random forest, XGBoost, LightGBM, CatBoost) and multi-model (stacking ensemble) machine learning methods. A key finding is that the best-performing models differed depending on open data attributes (field and usage type). The study also confirmed that a machine learning approach can be applied to measure and evaluate open data utilization in advance. This study contributes to the utilization of open data and to the application of its intrinsic value to society.
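The comparison described in the abstract (single models versus a stacking ensemble, evaluated separately per open data attribute) can be sketched roughly as follows. This is a minimal illustration and not the authors' pipeline. It assumes utilization is a numeric target, uses synthetic data, uses scikit-learn's GradientBoostingRegressor as a stand-in for XGBoost, LightGBM, and CatBoost, and scores models by cross-validated RMSE. The segment labels "field_a" and "field_b" and all function names are hypothetical.

```python
import numpy as np
from sklearn.datasets import make_regression
from sklearn.ensemble import (
    GradientBoostingRegressor,
    RandomForestRegressor,
    StackingRegressor,
)
from sklearn.linear_model import Ridge
from sklearn.model_selection import cross_val_score


def build_candidates(seed=0):
    """Return fresh candidate models keyed by an illustrative name."""
    def rf():
        return RandomForestRegressor(n_estimators=50, random_state=seed)

    def gb():
        # Stand-in for XGBoost / LightGBM / CatBoost (an assumption of this sketch).
        return GradientBoostingRegressor(n_estimators=50, random_state=seed)

    # Multi-model method: base learners feed a simple linear meta-learner.
    stack = StackingRegressor(
        estimators=[("rf", rf()), ("gb", gb())],
        final_estimator=Ridge(),
        cv=3,
    )
    return {"random_forest": rf(), "gradient_boosting": gb(), "stacking": stack}


def best_model_per_segment(X, y, segments, seed=0):
    """For each segment label, pick the candidate with the lowest CV RMSE."""
    selection = {}
    for label in np.unique(segments):
        mask = segments == label
        rmse = {}
        for name, model in build_candidates(seed).items():
            neg_rmse = cross_val_score(
                model, X[mask], y[mask], cv=3,
                scoring="neg_root_mean_squared_error",
            )
            rmse[name] = -neg_rmse.mean()
        selection[str(label)] = min(rmse, key=rmse.get)
    return selection


# Synthetic placeholder data; real inputs would be open data attributes.
X, y = make_regression(n_samples=240, n_features=6, noise=10.0, random_state=0)
segments = np.array(["field_a", "field_b"])[np.arange(240) % 2]

selection = best_model_per_segment(X, y, segments)
print(selection)
```

The returned dictionary maps each segment to the name of its lowest-error candidate, which is one simple way to check whether the preferred model changes with the data attribute.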

Suggested Citation

  • Junyoung Jeong & Keuntae Cho, 2024. "Proposing Machine Learning Models Suitable for Predicting Open Data Utilization," Sustainability, MDPI, vol. 16(14), pages 1-23, July.
  • Handle: RePEc:gam:jsusta:v:16:y:2024:i:14:p:5880-:d:1432443

    Download full text from publisher

    File URL: https://www.mdpi.com/2071-1050/16/14/5880/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2071-1050/16/14/5880/
    Download Restriction: no


    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Gabriela Viale Pereira & Marie Anne Macadar & Edimara M. Luciano & Maurício Gregianin Testa, 2017. "Delivering public value through open government data initiatives in a Smart City context," Information Systems Frontiers, Springer, vol. 19(2), pages 213-229, April.
    2. Martin Lodge & Kai Wegrich, 2015. "Crowdsourcing and regulatory reviews: A new way of challenging red tape in British government?," Regulation & Governance, John Wiley & Sons, vol. 9(1), pages 30-46, March.
    3. Gabriela Viale Pereira & Marie Anne Macadar & Edimara M. Luciano & Maurício Gregianin Testa, 0. "Delivering public value through open government data initiatives in a Smart City context," Information Systems Frontiers, Springer, vol. 0, pages 1-17.
    4. Teresa M. Harrison & Theresa A. Pardo & Meghan Cook, 2012. "Creating Open Government Ecosystems: A Research and Development Agenda," Future Internet, MDPI, vol. 4(4), pages 1-29, October.
    5. Frédérik Lesage & Robert A. Hackett, 2014. "Between Objectivity and Openness—The Mediality of Data for Journalism," Media and Communication, Cogitatio Press, vol. 2(2), pages 42-54.
    6. Browder, Russell E. & Aldrich, Howard E. & Bradley, Steven W., 2019. "The emergence of the maker movement: Implications for entrepreneurship research," Journal of Business Venturing, Elsevier, vol. 34(3), pages 459-476.
    7. Alan Ponce & Raul Alberto Ponce Rodriguez, 2020. "An Analysis of the Supply of Open Government Data," Future Internet, MDPI, vol. 12(11), pages 1-18, October.
    8. Heimstädt, Maximilian, 2017. "Openwashing: A decoupling perspective on organizational transparency," Technological Forecasting and Social Change, Elsevier, vol. 125(C), pages 77-86.
    9. Henri A. Schildt & Markku V.J. Maula & Thomas Keil, 2005. "Explorative and Exploitative Learning from External Corporate Ventures," Entrepreneurship Theory and Practice, vol. 29(4), pages 493-515, July.
    10. Swen Nadkarni & Reinhard Prügl, 2021. "Digital transformation: a review, synthesis and opportunities for future research," Management Review Quarterly, Springer, vol. 71(2), pages 233-341, April.
    11. Kuosmanen, Natalia & Valmari, Nelli, 2023. "Renewal of Companies Through Product Switching," ETLA Working Papers 104, The Research Institute of the Finnish Economy.
    12. Zhang, Feng & Jiang, Guohua & Cantwell, John A., 2015. "Subsidiary exploration and the innovative performance of large multinational corporations," International Business Review, Elsevier, vol. 24(2), pages 224-234.
    13. Avimanyu Datta, 2016. "Antecedents To Radical Innovations: A Longitudinal Look At Firms In The Information Technology Industry By Aggregation Of Patents," International Journal of Innovation Management (ijim), World Scientific Publishing Co. Pte. Ltd., vol. 20(07), pages 1-31, October.
    14. Liu, Zhiqiang & Yan, Miao & Fan, Youqing & Chen, Liling, 2021. "Ascribed or achieved? The role of birth order on innovative behaviour in the workplace," Journal of Business Research, Elsevier, vol. 134(C), pages 480-492.
    15. Frida Thomas Pacho, 2018. "Diversified Network Effects on Innovation Performance in Tanzania: Innovation Strategy in Service Firms," Journal of Entrepreneurship and Business Innovation, Macrothink Institute, vol. 5(1), pages 1-1, December.
    16. Chen, Zhelun & O’Neill, Zheng & Wen, Jin & Pradhan, Ojas & Yang, Tao & Lu, Xing & Lin, Guanjing & Miyata, Shohei & Lee, Seungjae & Shen, Chou & Chiosa, Roberto & Piscitelli, Marco Savino & Capozzoli, , 2023. "A review of data-driven fault detection and diagnostics for building HVAC systems," Applied Energy, Elsevier, vol. 339(C).
    17. Li, Mingxiang, 2021. "Exploring novel technologies through board interlocks: Spillover vs. broad exploration," Research Policy, Elsevier, vol. 50(9).
    18. Sadovnikova, Anna & Pujari, Ashish & Mikhailitchenko, Andrey, 2016. "Radical innovation in strategic partnerships: A framework for analysis," Journal of Business Research, Elsevier, vol. 69(5), pages 1829-1833.
    19. Pieper, Nadine & Woisetschläger, David M., 2024. "Customer misbehavior in access-based mobility services: An examination of prevention strategies," Journal of Business Research, Elsevier, vol. 171(C).
    20. Kathryn Rudie Harrigan & Maria Chiara Guardo & Bo Cowgill, 2017. "Multiplicative-innovation synergies: tests in technological acquisitions," The Journal of Technology Transfer, Springer, vol. 42(5), pages 1212-1233, October.
