IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v10y2022i10p1630-d812988.html
   My bibliography  Save this article

Measuring Variable Importance in Generalized Linear Models for Modeling Size of Loss Distributions

Author

Listed:
  • Shengkun Xie

    (Global Management Studies, Ted Rogers School of Management, Toronto Metropolitan University, Toronto, ON M5B 2K3, Canada)

  • Rebecca Luo

    (Global Management Studies, Ted Rogers School of Management, Toronto Metropolitan University, Toronto, ON M5B 2K3, Canada)

Abstract

Predictive modeling is a critical technique in many real-world applications, including auto insurance rate-making and the decision making of rate filings review for regulation purposes. It is also important in predicting financial and economic risk in business and economics. Unlike testing hypotheses in statistical inference, results obtained from predictive modeling serve as statistical evidence for the decision making of the underlying problem and discovering the functional relationship between the response variable and the predictors. As a result of this, the variable importance measures become an essential aspect of helping to better understand the contributions of predictors to the built model. In this work, we focus on the study of using generalized linear models (GLM) for the size of loss distributions. In addition, we address the problem of measuring the importance of the variables used in the GLM to further evaluate their potential impact on insurance pricing. In this regard, we propose to shift the focus from variable importance measures of factor levels to factors themselves and to develop variable importance measures for factors included in the model. Therefore, this work is exclusively for modeling with categorical variables as predictors. This work contributes to the further development of GLM modeling to make it even more practical due to this added value. This study also aims to provide benchmark estimates to allow for the regulation of insurance rates using GLM from the variable importance aspect.

Suggested Citation

  • Shengkun Xie & Rebecca Luo, 2022. "Measuring Variable Importance in Generalized Linear Models for Modeling Size of Loss Distributions," Mathematics, MDPI, vol. 10(10), pages 1-19, May.
  • Handle: RePEc:gam:jmathe:v:10:y:2022:i:10:p:1630-:d:812988
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/10/10/1630/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/10/10/1630/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. David Mihaela & Jemna Dănuţ-Vasile, 2015. "Modeling the Frequency of Auto Insurance Claims by Means of Poisson and Negative Binomial Models," Scientific Annals of Economics and Business, Sciendo, vol. 62(2), pages 151-168, July.
    2. Christopher Blier-Wong & Hélène Cossette & Luc Lamontagne & Etienne Marceau, 2020. "Machine Learning in P&C Insurance: A Review for Pricing and Reserving," Risks, MDPI, vol. 9(1), pages 1-26, December.
    3. Kevin Kuo & Daniel Lupton, 2020. "Towards Explainability of Machine Learning Models in Insurance Pricing," Papers 2003.10674, arXiv.org.
    4. Shengkun Xie, 2021. "Improving Explainability of Major Risk Factors in Artificial Neural Networks for Auto Insurance Rate Regulation," Risks, MDPI, vol. 9(7), pages 1-21, July.
    5. de Jong,Piet & Heller,Gillian Z., 2008. "Generalized Linear Models for Insurance Data," Cambridge Books, Cambridge University Press, number 9780521879149, November.
    6. Janssen, Marijn & van der Voort, Haiko & Wahyudi, Agung, 2017. "Factors influencing big data decision-making quality," Journal of Business Research, Elsevier, vol. 70(C), pages 338-345.
    7. Hyonho Chun & Sündüz Keleş, 2010. "Sparse partial least squares regression for simultaneous dimension reduction and variable selection," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 72(1), pages 3-25, January.
    8. Martin Branda, 2014. "Optimization Approaches to Multiplicative Tariff of Rates Estimation in Non-Life Insurance," Asia-Pacific Journal of Operational Research (APJOR), World Scientific Publishing Co. Pte. Ltd., vol. 31(05), pages 1-17.
    9. Crevecoeur, Jonas & Antonio, Katrien & Desmedt, Stijn & Masquelein, Alexandre, 2023. "Bridging the gap between pricing and reserving with an occurrence and development model for non-life insurance claims," ASTIN Bulletin, Cambridge University Press, vol. 53(2), pages 185-212, May.
    10. Yanyuan Ma & Liping Zhu, 2013. "A Review on Dimension Reduction," International Statistical Review, International Statistical Institute, vol. 81(1), pages 134-150, April.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Jiangbin Zhao & Mengtao Liang & Rongyu Tian & Zaoyan Zhang & Xiangang Cao, 2023. "Reliability Optimization of Hybrid Systems Driven by Constraint Importance Measure Considering Different Cost Functions," Mathematics, MDPI, vol. 11(20), pages 1-21, October.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jiří Valecký, 2016. "Modelling Claim Frequency in Vehicle Insurance," Acta Universitatis Agriculturae et Silviculturae Mendelianae Brunensis, Mendel University Press, vol. 64(2), pages 683-689.
    2. Simon CK Lee, 2020. "Delta Boosting Implementation of Negative Binomial Regression in Actuarial Pricing," Risks, MDPI, vol. 8(1), pages 1-21, February.
    3. Shengkun Xie & Anna T. Lawniczak, 2018. "Estimating Major Risk Factor Relativities in Rate Filings Using Generalized Linear Models," IJFS, MDPI, vol. 6(4), pages 1-14, October.
    4. Shengkun Xie & Kun Shi, 2023. "Generalised Additive Modelling of Auto Insurance Data with Territory Design: A Rate Regulation Perspective," Mathematics, MDPI, vol. 11(2), pages 1-24, January.
    5. Chenglong Ye & Lin Zhang & Mingxuan Han & Yanjia Yu & Bingxin Zhao & Yuhong Yang, 2022. "Combining Predictions of Auto Insurance Claims," Econometrics, MDPI, vol. 10(2), pages 1-15, April.
    6. Mohammad Ali Yamin, 2021. "Investigating the Drivers of Supply Chain Resilience in the Wake of the COVID-19 Pandemic: Empirical Evidence from an Emerging Economy," Sustainability, MDPI, vol. 13(21), pages 1-16, October.
    7. Klein, Daniel & Ludwig, Christopher A. & Nicolay, Katharina, 2020. "Internal digitalization and tax-efficient decision making," ZEW Discussion Papers 20-051, ZEW - Leibniz Centre for European Economic Research.
    8. Hung Hung & Su‐Yun Huang, 2019. "Sufficient dimension reduction via random‐partitions for the large‐p‐small‐n problem," Biometrics, The International Biometric Society, vol. 75(1), pages 245-255, March.
    9. Qimeng Pan & Lysa Porth & Hong Li, 2022. "Assessing the Effectiveness of the Actuaries Climate Index for Estimating the Impact of Extreme Weather on Crop Yield and Insurance Applications," Sustainability, MDPI, vol. 14(11), pages 1-24, June.
    10. Jiří Valecký, . "Calculation of Solvency Capital Requirements for Non-life Underwriting Risk Using Generalized Linear Models," Prague Economic Papers, University of Economics, Prague, vol. 0, pages 1-17.
    11. Pinho, Luis Gustavo B. & Nobre, Juvêncio S. & Singer, Julio M., 2015. "Cook’s distance for generalized linear mixed models," Computational Statistics & Data Analysis, Elsevier, vol. 82(C), pages 126-136.
    12. Adriana Dima & Elena Radu & Ecaterina Milica Dobrota & Adrian Otoiu & Alina Florentina Saracu, 2023. "Sustainable Development of E-commerce in the Post-COVID Times: A Mixed-Methods Analysis of Pestle Factors," The AMFITEATRU ECONOMIC journal, Academy of Economic Studies - Bucharest, Romania, vol. 25(S17), pages 1095-1095, November.
    13. Qiang Sun & Hongtu Zhu & Yufeng Liu & Joseph G. Ibrahim, 2015. "SPReM: Sparse Projection Regression Model For High-Dimensional Linear Regression," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(509), pages 289-302, March.
    14. Jingmei Gao & Zahid Sarwar, 2024. "How do firms create business value and dynamic capabilities by leveraging big data analytics management capability?," Information Technology and Management, Springer, vol. 25(3), pages 283-304, September.
    15. Cheng, Qing & Zhu, Liping, 2017. "On relative efficiency of principal Hessian directions," Statistics & Probability Letters, Elsevier, vol. 126(C), pages 108-113.
    16. Šoltés Erik & Zelinová Silvia & Bilíková Mária, 2019. "General Linear Model: An Effective Tool For Analysis Of Claim Severity In Motor Third Party Liability Insurance," Statistics in Transition New Series, Statistics Poland, vol. 20(4), pages 13-31, December.
    17. Lee Woojoo & Lee Donghwan & Lee Youngjo & Pawitan Yudi, 2011. "Sparse Canonical Covariance Analysis for High-throughput Data," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 10(1), pages 1-24, July.
    18. Yuqing Zhang & Neil Walton, 2019. "Adaptive Pricing in Insurance: Generalized Linear Models and Gaussian Process Regression Approaches," Papers 1907.05381, arXiv.org.
    19. Yu, Dengdeng & Zhang, Li & Mizera, Ivan & Jiang, Bei & Kong, Linglong, 2019. "Sparse wavelet estimation in quantile regression with multiple functional predictors," Computational Statistics & Data Analysis, Elsevier, vol. 136(C), pages 12-29.
    20. Yagli, Gokhan Mert & Yang, Dazhi & Srinivasan, Dipti, 2019. "Automatic hourly solar forecasting using machine learning models," Renewable and Sustainable Energy Reviews, Elsevier, vol. 105(C), pages 487-498.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:10:y:2022:i:10:p:1630-:d:812988. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.