IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v12y2024i18p2797-d1474987.html
   My bibliography  Save this article

Integrating Fuzzy C-Means Clustering and Explainable AI for Robust Galaxy Classification

Author

Listed:
  • Gabriel Marín Díaz

    (Faculty of Statistics, Complutense University, Puerta de Hierro, 28040 Madrid, Spain)

  • Raquel Gómez Medina

    (Science and Aerospace Department, Universidad Europea de Madrid, Villaviciosa de Odón, 28670 Madrid, Spain)

  • José Alberto Aijón Jiménez

    (Science and Aerospace Department, Universidad Europea de Madrid, Villaviciosa de Odón, 28670 Madrid, Spain)

Abstract

The classification of galaxies has significantly advanced using machine learning techniques, offering deeper insights into the universe. This study focuses on the typology of galaxies using data from the Galaxy Zoo project, where classifications are based on the opinions of non-expert volunteers, introducing a degree of uncertainty. The objective of this study is to integrate Fuzzy C-Means (FCM) clustering with explainability methods to achieve a precise and interpretable model for galaxy classification. We applied FCM to manage this uncertainty and group galaxies based on their morphological characteristics. Additionally, we used explainability methods, specifically SHAP (SHapley Additive exPlanations) values and LIME (Local Interpretable Model-Agnostic Explanations), to interpret and explain the key factors influencing the classification. The results show that using FCM allows for accurate classification while managing data uncertainty, with high precision values that meet the expectations of the study. Additionally, SHAP values and LIME provide a clear understanding of the most influential features in each cluster. This method enhances our classification and understanding of galaxies and is extendable to environmental studies on Earth, offering tools for environmental management and protection. The presented methodology highlights the importance of integrating FCM and XAI techniques to address complex problems with uncertain data.

Suggested Citation

  • Gabriel Marín Díaz & Raquel Gómez Medina & José Alberto Aijón Jiménez, 2024. "Integrating Fuzzy C-Means Clustering and Explainable AI for Robust Galaxy Classification," Mathematics, MDPI, vol. 12(18), pages 1-27, September.
  • Handle: RePEc:gam:jmathe:v:12:y:2024:i:18:p:2797-:d:1474987
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/12/18/2797/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/12/18/2797/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Leticia Monje & Ramón A. Carrasco & Carlos Rosado & Manuel Sánchez-Montañés, 2022. "Deep Learning XAI for Bus Passenger Forecasting: A Use Case in Spain," Mathematics, MDPI, vol. 10(9), pages 1-20, April.
    2. Teck-Hua Ho & Young-Hoon Park & Yong-Pin Zhou, 2006. "Incorporating Satisfaction into Customer Value Analysis: Optimal Investment in Lifetime Value," Marketing Science, INFORMS, vol. 25(3), pages 260-277, 05-06.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Philipp Afèche & Mojtaba Araghi & Opher Baron, 2017. "Customer Acquisition, Retention, and Service Access Quality: Optimal Advertising, Capacity Level, and Capacity Allocation," Manufacturing & Service Operations Management, INFORMS, vol. 19(4), pages 674-691, October.
    2. Gui Liberali & Alina Ferecatu, 2022. "Morphing for Consumer Dynamics: Bandits Meet Hidden Markov Models," Marketing Science, INFORMS, vol. 41(4), pages 769-794, July.
    3. Ekinci, Yeliz & Ülengin, Füsun & Uray, Nimet & Ülengin, Burç, 2014. "Analysis of customer lifetime value and marketing expenditure decisions through a Markovian-based model," European Journal of Operational Research, Elsevier, vol. 237(1), pages 278-288.
    4. Mika Sumida & Guillermo Gallego & Paat Rusmevichientong & Huseyin Topaloglu & James Davis, 2021. "Revenue-Utility Tradeoff in Assortment Optimization Under the Multinomial Logit Model with Totally Unimodular Constraints," Management Science, INFORMS, vol. 67(5), pages 2845-2869, May.
    5. Oğuzhan Kivrak & Cüneyt Akar, 2020. "Effect of Social Media Interactions on CLV in Telecommunications," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 19(02), pages 447-468, March.
    6. Andrés Musalem & Yogesh V. Joshi, 2009. "—How Much Should You Invest in Each Customer Relationship? A Competitive Strategic Approach," Marketing Science, INFORMS, vol. 28(3), pages 555-565, 05-06.
    7. Romero, Jaime & van der Lans, Ralf & Wierenga, Berend, 2013. "A Partially Hidden Markov Model of Customer Dynamics for CLV Measurement," Journal of Interactive Marketing, Elsevier, vol. 27(3), pages 185-208.
    8. Violeta Lukic Vujadinovic & Aleksandar Damnjanovic & Aleksandar Cakic & Dragan R. Petkovic & Marijana Prelevic & Vladan Pantovic & Mirjana Stojanovic & Dejan Vidojevic & Djordje Vranjes & Istvan Bodol, 2024. "AI-Driven Approach for Enhancing Sustainability in Urban Public Transportation," Sustainability, MDPI, vol. 16(17), pages 1-18, September.
    9. Shaohui Ma & Joachim Büschken, 2011. "Counting your customers from an “always a share” perspective," Marketing Letters, Springer, vol. 22(3), pages 243-257, September.
    10. James G. Maxham, III & Richard G. Netemeyer & Donald R. Lichtenstein, 2008. "The Retail Value Chain: Linking Employee Perceptions to Employee Performance, Customer Evaluations, and Store Performance," Marketing Science, INFORMS, vol. 27(2), pages 147-167, 03-04.
    11. Sam Aflaki & Ioana Popescu, 2014. "Managing Retention in Service Relationships," Management Science, INFORMS, vol. 60(2), pages 415-433, February.
    12. Piris, Yolande & Gay, Anne-Cécile, 2021. "Customer satisfaction and natural language processing," Journal of Business Research, Elsevier, vol. 124(C), pages 264-271.
    13. Wang, Shengyou & Zhuge, Chengxiang & Shao, Chunfu & Wang, Pinxi & Yang, Xiong & Wang, Shiqi, 2023. "Short-term electric vehicle charging demand prediction: A deep learning approach," Applied Energy, Elsevier, vol. 340(C).
    14. Yeliz Ekinci & Füsun Ulengin & Nimet Uray, 2014. "Using customer lifetime value to plan optimal promotions," The Service Industries Journal, Taylor & Francis Journals, vol. 34(2), pages 103-122, January.
    15. Albert C. Bemmaor & Nicolas Glady, 2012. "Modeling Purchasing Behavior with Sudden "Death": A Flexible Customer Lifetime Model," Management Science, INFORMS, vol. 58(5), pages 1012-1021, May.
    16. Eric T. Anderson & Gavan J. Fitzsimons & Duncan Simester, 2006. "Measuring and Mitigating the Costs of Stockouts," Management Science, INFORMS, vol. 52(11), pages 1751-1763, November.
    17. Junhyun Bae & Li Chen & Shiqing Yao, 2022. "Service Capacity and Price Promotion Wars," Management Science, INFORMS, vol. 68(12), pages 8757-8772, December.
    18. Steven M. Shugan, 2007. "—It's the Findings, Stupid, Not the Assumptions," Marketing Science, INFORMS, vol. 26(4), pages 449-459, 07-08.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:12:y:2024:i:18:p:2797-:d:1474987. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.