IDEAS home Printed from https://ideas.repec.org/a/spr/alstar/v108y2024i2d10.1007_s10182-024-00502-5.html
   My bibliography  Save this article

Deducing neighborhoods of classes from a fitted model

Author

Listed:
  • Alexander Gerharz

    (TU Dortmund University)

  • Andreas Groll

    (TU Dortmund University)

  • Gunther Schauberger

    (TUM School of Medicine and Health, Chair of Epidemiology)

Abstract

In this article, a new kind of interpretable machine learning method is presented, which can help to understand the partition of the feature space into predicted classes in a classification model using quantile shifts, and this way make the underlying statistical or machine learning model more trustworthy. Basically, real data points (or specific points of interest) are used and the changes of the prediction after slightly raising or decreasing specific features are observed. By comparing the predictions before and after the shifts, under certain conditions the observed changes in the predictions can be interpreted as neighborhoods of the classes with regard to the shifted features. Chord diagrams are used to visualize the observed changes. For illustration, this quantile shift method (QSM) is applied to an artificial example with medical labels and a real data example.

Suggested Citation

  • Alexander Gerharz & Andreas Groll & Gunther Schauberger, 2024. "Deducing neighborhoods of classes from a fitted model," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 108(2), pages 395-425, June.
  • Handle: RePEc:spr:alstar:v:108:y:2024:i:2:d:10.1007_s10182-024-00502-5
    DOI: 10.1007/s10182-024-00502-5
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10182-024-00502-5
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10182-024-00502-5?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Fox, John & Hong, Jangman, 2009. "Effect Displays in R for Multinomial and Proportional-Odds Logit Models: Extensions to the effects Package," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 32(i01).
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Christian Kleiber & Achim Zeileis, 2016. "Visualizing Count Data Regressions Using Rootograms," The American Statistician, Taylor & Francis Journals, vol. 70(3), pages 296-303, July.
    2. Ulrich Matter & Alois Stutzer, 2019. "Does Public Attention Reduce The Influence Of Moneyed Interests? Policy Positions On Sopa/Pipa Before And After The Internet Blackout," Economic Inquiry, Western Economic Association International, vol. 57(4), pages 1879-1895, October.
    3. Erdogan, Murside Rabia & Camgoz, Selin Metin & Karan, Mehmet Baha & Berument, M. Hakan, 2022. "The switching behavior of large-scale electricity consumers in The Turkish electricity retail market," Energy Policy, Elsevier, vol. 160(C).
    4. Leonardo Salvatore Alaimo & Mariantonietta Fiore & Antonino Galati, 2020. "How the Covid-19 Pandemic Is Changing Online Food Shopping Human Behaviour in Italy," Sustainability, MDPI, vol. 12(22), pages 1-18, November.
    5. Lenth, Russell V., 2016. "Least-Squares Means: The R Package lsmeans," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 69(i01).
    6. Ratna K. Shrestha & Raunak Shrestha & Sara Shneiderman & Jeevan Baniya, 2023. "Beyond Reconstruction: What Leads to Satisfaction in Post-Disaster Recovery?," Journal of Happiness Studies, Springer, vol. 24(4), pages 1367-1395, April.
    7. repec:jss:jstsof:37:i04 is not listed on IDEAS
    8. Tomáš Formánek & Radek Tahal, 2020. "Socio-Demographic Aspects Affecting Individual Stances towards Electric and Hybrid Vehicles in the Czech Republic," Central European Business Review, Prague University of Economics and Business, vol. 2020(2), pages 78-93.
    9. Pilhöfer, Alexander & Unwin, Antony, 2013. "New Approaches in Visualization of Categorical Data: R Package extracat," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 53(i07).
    10. Dilyara Ibragimova, 2013. "Money management in russian families," HSE Working papers WP BRP 11/SOC/2013, National Research University Higher School of Economics.
    11. Jamie C. Moore & Gabriele B. Durrant & Peter W. F. Smith, 2021. "Do coefficients of variation of response propensities approximate non‐response biases during survey data collection?," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 184(1), pages 301-323, January.
    12. Minghui Yin & Balekouzou Augustin & Chang Shu & Tingting Qin & Ping Yin, 2016. "Probit Models to Investigate Prevalence of Total Diagnosed and Undiagnosed Diabetes among Aged 45 Years or Older Adults in China," PLOS ONE, Public Library of Science, vol. 11(10), pages 1-13, October.
    13. Formánek Tomáš & Tahal Radek, 2017. "Socio-demographic and lifestyle determinants of loyalty program participation in the Czech Republic," Management & Marketing, Sciendo, vol. 12(4), pages 524-539, December.
    14. Hong, Jinhyun, 2016. "How does the seasonality influence utilitarian walking behaviour in different urbanization settings in Scotland?," Social Science & Medicine, Elsevier, vol. 162(C), pages 143-150.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:alstar:v:108:y:2024:i:2:d:10.1007_s10182-024-00502-5. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.