IDEAS home Printed from https://ideas.repec.org/a/eee/ecomod/v498y2024ics0304380024002862.html
   My bibliography  Save this article

More transparent and explainable machine learning algorithms are required to provide enhanced and sustainable dataset understanding

Author

Listed:
  • Wood, David A.

Abstract

For detailed dataset interrogation and auditing purposes the lack of dataset explainability/transparency of the majority of available machine-learning (ML) models poses limitations. There is a tendency for ML models to focus on prediction speed and accuracy at the expense of transparently revealing dataset relationships. A case is made here to broaden that focus and for ML models to offer alternative configurations tailored to provide more explanations about how individual predictions are derived. Indeed, those striving to achieve sustainable objectives should not rely on opaque ML models and seek transparency as a fundamental objective of good modelling practice (GMP). Doing so tends to boost trust and confidence in the outputs of models relating to complex socio-environmental systems (SES), particularly those being used to potentially justify controversial social, political and ethical decisions. Currently, the transparent open box algorithms (TOB) are the only ML algorithms available that are configured specifically to routinely provide detailed data record relationships for each of their predictions. This study describes the data mining benefits of the Python-coded optimized data-matching TOB algorithms generally, and when applied to environmental datasets characterized by complex non-linear relationships involving many variables.

Suggested Citation

  • Wood, David A., 2024. "More transparent and explainable machine learning algorithms are required to provide enhanced and sustainable dataset understanding," Ecological Modelling, Elsevier, vol. 498(C).
  • Handle: RePEc:eee:ecomod:v:498:y:2024:i:c:s0304380024002862
    DOI: 10.1016/j.ecolmodel.2024.110898
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0304380024002862
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.ecolmodel.2024.110898?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Cai, Jianchao & Xu, Kai & Zhu, Yanhui & Hu, Fang & Li, Liuhuan, 2020. "Prediction and analysis of net ecosystem carbon exchange based on gradient boosting regression and random forest," Applied Energy, Elsevier, vol. 262(C).
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Wang, Kai & Liu, Xing & Guo, Xin & Wang, Jianhang & Wang, Zhuang & Huang, Lianzhong, 2024. "A novel high-precision and self-adaptive prediction method for ship energy consumption based on the multi-model fusion approach," Energy, Elsevier, vol. 310(C).
    2. Santhappan, Joseph Sekhar & Boddu, Muralikrishna & Gopinath, Arun S. & Mathimani, Thangavel, 2024. "Analysis of 27 supervised machine learning models for the co-gasification assessment of peanut shell and spent tea residue in an open-core downdraft gasifier," Renewable Energy, Elsevier, vol. 235(C).
    3. Zhang, Tao & Li, Yiteng & Chen, Yin & Feng, Xiaoyu & Zhu, Xingyu & Chen, Zhangxing & Yao, Jun & Zheng, Yongchun & Cai, Jianchao & Song, Hongqing & Sun, Shuyu, 2021. "Review on space energy," Applied Energy, Elsevier, vol. 292(C).
    4. Simin Kheradmand & Nima Heidarzadeh & Seyed Hossein Kia, 2023. "Prediction of the CO2 emission across grassland and cropland using tower-based eddy covariance flux measurements: a machine learning approach," Environment, Development and Sustainability: A Multidisciplinary Approach to the Theory and Practice of Sustainable Development, Springer, vol. 25(6), pages 5495-5509, June.
    5. Juan Luis Martín-Ortega & Javier Chornet & Ioannis Sebos & Sander Akkermans & María José López Blanco, 2024. "Enhancing Transparency of Climate Efforts: MITICA’s Integrated Approach to Greenhouse Gas Mitigation," Sustainability, MDPI, vol. 16(10), pages 1-35, May.
    6. Roy, Dibyendu & Zhu, Shunmin & Wang, Ruiqi & Mondal, Pradip & Ling-Chin, Janie & Roskilly, Anthony Paul, 2024. "Techno-economic and environmental analyses of hybrid renewable energy systems for a remote location employing machine learning models," Applied Energy, Elsevier, vol. 361(C).
    7. Rao, Amar & Talan, Amogh & Abbas, Shujaat & Dev, Dhairya & Taghizadeh-Hesary, Farhad, 2023. "The role of natural resources in the management of environmental sustainability: Machine learning approach," Resources Policy, Elsevier, vol. 82(C).
    8. Xiaodong Li & Ai Ren & Qi Li, 2022. "Exploring Patterns of Transportation-Related CO 2 Emissions Using Machine Learning Methods," Sustainability, MDPI, vol. 14(8), pages 1-21, April.
    9. Jiapeng Cui & Feng Tan, 2022. "PLSR-Based Assessment of Soil Respiration Rate Changes under Aerated Irrigation in Relation to Soil Environmental Factors," Agriculture, MDPI, vol. 13(1), pages 1-15, December.
    10. Christian Mulomba Mukendi & Hyebong Choi & Suhui Jung & Yun-Seon Kim, 2024. "Determinants of Yearly CO 2 Emission Fluctuations: A Machine Learning Perspective to Unveil Dynamics," Sustainability, MDPI, vol. 16(10), pages 1-28, May.
    11. Deo, Ravinesh C. & Ahmed, A.A. Masrur & Casillas-Pérez, David & Pourmousavi, S. Ali & Segal, Gary & Yu, Yanshan & Salcedo-Sanz, Sancho, 2023. "Cloud cover bias correction in numerical weather models for solar energy monitoring and forecasting systems with kernel ridge regression," Renewable Energy, Elsevier, vol. 203(C), pages 113-130.
    12. Tan, Daniel & Suvarna, Manu & Shee Tan, Yee & Li, Jie & Wang, Xiaonan, 2021. "A three-step machine learning framework for energy profiling, activity state prediction and production estimation in smart process manufacturing," Applied Energy, Elsevier, vol. 291(C).
    13. Xiaoxu Guo & Ruibing Kou & Xiang He, 2024. "Towards Carbon Neutrality: Machine Learning Analysis of Vehicle Emissions in Canada," Sustainability, MDPI, vol. 16(23), pages 1-18, November.
    14. Yan, Peiliang & Fan, Weijun & Zhang, Rongchun, 2023. "Predicting the NOx emissions of low heat value gas rich-quench-lean combustor via three integrated learning algorithms with Bayesian optimization," Energy, Elsevier, vol. 273(C).
    15. Zifan Huang & Zexia Duan & Yichi Zhang & Tianbo Ji, 2024. "Response of Sustainable Solar Photovoltaic Power Output to Summer Heatwave Events in Northern China," Sustainability, MDPI, vol. 16(12), pages 1-28, June.
    16. Li, Jie & Suvarna, Manu & Pan, Lanjia & Zhao, Yingru & Wang, Xiaonan, 2021. "A hybrid data-driven and mechanistic modelling approach for hydrothermal gasification," Applied Energy, Elsevier, vol. 304(C).
    17. Hung-Ta Wen & Jau-Huai Lu & Deng-Siang Jhang, 2021. "Features Importance Analysis of Diesel Vehicles’ NO x and CO 2 Emission Predictions in Real Road Driving Based on Gradient Boosting Regression Model," IJERPH, MDPI, vol. 18(24), pages 1-28, December.
    18. Yuling Huang & Xiaoping Lu & Chujin Zhou & Yunlin Song, 2023. "DADE-DQN: Dual Action and Dual Environment Deep Q-Network for Enhancing Stock Trading Strategy," Mathematics, MDPI, vol. 11(17), pages 1-27, August.
    19. Zhang, Jinlai & Yang, Wenjie & Chen, Yumei & Ding, Mingkang & Huang, Huiling & Wang, Bingkun & Gao, Kai & Chen, Shuhan & Du, Ronghua, 2024. "Fast object detection of anomaly photovoltaic (PV) cells using deep neural networks," Applied Energy, Elsevier, vol. 372(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:ecomod:v:498:y:2024:i:c:s0304380024002862. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.journals.elsevier.com/ecological-modelling .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.