IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v10y2022i10p1630-d812988.html
   My bibliography  Save this article

Measuring Variable Importance in Generalized Linear Models for Modeling Size of Loss Distributions

Author

Listed:
  • Shengkun Xie

    (Global Management Studies, Ted Rogers School of Management, Toronto Metropolitan University, Toronto, ON M5B 2K3, Canada)

  • Rebecca Luo

    (Global Management Studies, Ted Rogers School of Management, Toronto Metropolitan University, Toronto, ON M5B 2K3, Canada)

Abstract

Predictive modeling is a critical technique in many real-world applications, including auto insurance rate-making and the decision making of rate filings review for regulation purposes. It is also important in predicting financial and economic risk in business and economics. Unlike testing hypotheses in statistical inference, results obtained from predictive modeling serve as statistical evidence for the decision making of the underlying problem and discovering the functional relationship between the response variable and the predictors. As a result of this, the variable importance measures become an essential aspect of helping to better understand the contributions of predictors to the built model. In this work, we focus on the study of using generalized linear models (GLM) for the size of loss distributions. In addition, we address the problem of measuring the importance of the variables used in the GLM to further evaluate their potential impact on insurance pricing. In this regard, we propose to shift the focus from variable importance measures of factor levels to factors themselves and to develop variable importance measures for factors included in the model. Therefore, this work is exclusively for modeling with categorical variables as predictors. This work contributes to the further development of GLM modeling to make it even more practical due to this added value. This study also aims to provide benchmark estimates to allow for the regulation of insurance rates using GLM from the variable importance aspect.

Suggested Citation

  • Shengkun Xie & Rebecca Luo, 2022. "Measuring Variable Importance in Generalized Linear Models for Modeling Size of Loss Distributions," Mathematics, MDPI, vol. 10(10), pages 1-19, May.
  • Handle: RePEc:gam:jmathe:v:10:y:2022:i:10:p:1630-:d:812988
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/10/10/1630/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/10/10/1630/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. David Mihaela & Jemna Dănuţ-Vasile, 2015. "Modeling the Frequency of Auto Insurance Claims by Means of Poisson and Negative Binomial Models," Scientific Annals of Economics and Business, Sciendo, vol. 62(2), pages 151-168, July.
    2. Christopher Blier-Wong & Hélène Cossette & Luc Lamontagne & Etienne Marceau, 2020. "Machine Learning in P&C Insurance: A Review for Pricing and Reserving," Risks, MDPI, vol. 9(1), pages 1-26, December.
    3. Kevin Kuo & Daniel Lupton, 2020. "Towards Explainability of Machine Learning Models in Insurance Pricing," Papers 2003.10674, arXiv.org.
    4. Shengkun Xie, 2021. "Improving Explainability of Major Risk Factors in Artificial Neural Networks for Auto Insurance Rate Regulation," Risks, MDPI, vol. 9(7), pages 1-21, July.
    5. de Jong,Piet & Heller,Gillian Z., 2008. "Generalized Linear Models for Insurance Data," Cambridge Books, Cambridge University Press, number 9780521879149, October.
    6. Janssen, Marijn & van der Voort, Haiko & Wahyudi, Agung, 2017. "Factors influencing big data decision-making quality," Journal of Business Research, Elsevier, vol. 70(C), pages 338-345.
    7. Hyonho Chun & Sündüz Keleş, 2010. "Sparse partial least squares regression for simultaneous dimension reduction and variable selection," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 72(1), pages 3-25, January.
    8. Martin Branda, 2014. "Optimization Approaches to Multiplicative Tariff of Rates Estimation in Non-Life Insurance," Asia-Pacific Journal of Operational Research (APJOR), World Scientific Publishing Co. Pte. Ltd., vol. 31(05), pages 1-17.
    9. Crevecoeur, Jonas & Antonio, Katrien & Desmedt, Stijn & Masquelein, Alexandre, 2023. "Bridging the gap between pricing and reserving with an occurrence and development model for non-life insurance claims," ASTIN Bulletin, Cambridge University Press, vol. 53(2), pages 185-212, May.
    10. Yanyuan Ma & Liping Zhu, 2013. "A Review on Dimension Reduction," International Statistical Review, International Statistical Institute, vol. 81(1), pages 134-150, April.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Jiangbin Zhao & Mengtao Liang & Rongyu Tian & Zaoyan Zhang & Xiangang Cao, 2023. "Reliability Optimization of Hybrid Systems Driven by Constraint Importance Measure Considering Different Cost Functions," Mathematics, MDPI, vol. 11(20), pages 1-21, October.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Shengkun Xie & Anna T. Lawniczak, 2018. "Estimating Major Risk Factor Relativities in Rate Filings Using Generalized Linear Models," IJFS, MDPI, vol. 6(4), pages 1-14, October.
    2. Jiří Valecký, 2016. "Modelling Claim Frequency in Vehicle Insurance," Acta Universitatis Agriculturae et Silviculturae Mendelianae Brunensis, Mendel University Press, vol. 64(2), pages 683-689.
    3. Simon CK Lee, 2020. "Delta Boosting Implementation of Negative Binomial Regression in Actuarial Pricing," Risks, MDPI, vol. 8(1), pages 1-21, February.
    4. Shengkun Xie & Kun Shi, 2023. "Generalised Additive Modelling of Auto Insurance Data with Territory Design: A Rate Regulation Perspective," Mathematics, MDPI, vol. 11(2), pages 1-24, January.
    5. Yang Lu, 2019. "Flexible (panel) regression models for bivariate count–continuous data with an insurance application," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 182(4), pages 1503-1521, October.
    6. Avanzi, Benjamin & Taylor, Greg & Wong, Bernard & Yang, Xinda, 2021. "On the modelling of multivariate counts with Cox processes and dependent shot noise intensities," Insurance: Mathematics and Economics, Elsevier, vol. 99(C), pages 9-24.
    7. Chenglong Ye & Lin Zhang & Mingxuan Han & Yanjia Yu & Bingxin Zhao & Yuhong Yang, 2022. "Combining Predictions of Auto Insurance Claims," Econometrics, MDPI, vol. 10(2), pages 1-15, April.
    8. Mohammad Ali Yamin, 2021. "Investigating the Drivers of Supply Chain Resilience in the Wake of the COVID-19 Pandemic: Empirical Evidence from an Emerging Economy," Sustainability, MDPI, vol. 13(21), pages 1-16, October.
    9. Julieta Fuentes & Pilar Poncela & Julio Rodríguez, 2015. "Sparse Partial Least Squares in Time Series for Macroeconomic Forecasting," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 30(4), pages 576-595, June.
    10. Aivars Spilbergs & Andris Fomins & Māris Krastiņš, 2022. "Multivariate Modelling of Motor Third Party Liability Insurance Claims," European Journal of Business Science and Technology, Mendel University in Brno, Faculty of Business and Economics, vol. 8(1), pages 5-18.
    11. Deprez, Laurens & Antonio, Katrien & Boute, Robert, 2021. "Pricing service maintenance contracts using predictive analytics," European Journal of Operational Research, Elsevier, vol. 290(2), pages 530-545.
    12. Martin Branda, 2014. "Optimization Approaches to Multiplicative Tariff of Rates Estimation in Non-Life Insurance," Asia-Pacific Journal of Operational Research (APJOR), World Scientific Publishing Co. Pte. Ltd., vol. 31(05), pages 1-17.
    13. Chen, Canyi & Xu, Wangli & Zhu, Liping, 2022. "Distributed estimation in heterogeneous reduced rank regression: With application to order determination in sufficient dimension reduction," Journal of Multivariate Analysis, Elsevier, vol. 190(C).
    14. Jeonghwan Kim & Woojoo Lee, 2019. "On testing the hidden heterogeneity in negative binomial regression models," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 82(4), pages 457-470, May.
    15. Jordan Vazquez & Cécile Godé & Jean-Fabrice Lebraty, 2018. "Environnement big data et décision : l'étape de contre la montre du tour de France 2017," Post-Print halshs-02188793, HAL.
    16. Klein, Daniel & Ludwig, Christopher A. & Nicolay, Katharina, 2020. "Internal digitalization and tax-efficient decision making," ZEW Discussion Papers 20-051, ZEW - Leibniz Centre for European Economic Research.
    17. Shamim, Saqib & Zeng, Jing & Khan, Zaheer & Zia, Najam Ul, 2020. "Big data analytics capability and decision making performance in emerging market firms: The role of contractual and relational governance mechanisms," Technological Forecasting and Social Change, Elsevier, vol. 161(C).
    18. Li, Lei & Lin, Jiabao & Ouyang, Ye & Luo, Xin (Robert), 2022. "Evaluating the impact of big data analytics usage on the decision-making quality of organizations," Technological Forecasting and Social Change, Elsevier, vol. 175(C).
    19. Tommaso Proietti, 2016. "On the Selection of Common Factors for Macroeconomic Forecasting," Advances in Econometrics, in: Dynamic Factor Models, volume 35, pages 593-628, Emerald Group Publishing Limited.
    20. Hung Hung & Su‐Yun Huang, 2019. "Sufficient dimension reduction via random‐partitions for the large‐p‐small‐n problem," Biometrics, The International Biometric Society, vol. 75(1), pages 245-255, March.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:10:y:2022:i:10:p:1630-:d:812988. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.