IDEAS home Printed from https://ideas.repec.org/a/eee/csdana/v192y2024ics0167947323002219.html
   My bibliography  Save this article

Subgroup detection based on partially linear additive individualized model with missing data in response

Author

Listed:
  • Cai, Tingting
  • Li, Jianbo
  • Zhou, Qin
  • Yin, Songlou
  • Zhang, Riquan

Abstract

Based on partially linear additive individualized model, a fusion-penalized inverse probability weighted least squares method is proposed to detect the subgroup for missing data in response. Firstly, the B-spline technique is used to approximate the unknown additive individualized functions and then an inverse probability weighted quadratic loss function is established with fusion penalty on the difference of subject-wise B-spline coefficients. Secondly, minimization of such quadratic loss function leads to the estimation of linear regression parameters and individualized B spline coefficients. With a proper tuning parameter, some differences in penalty term are shrunk into zero and thus the corresponding subjects will be clustered into the same subgroup. Thirdly, a clustering method is developed to automatically determine the subgroup membership for the subjects with missing data. Fourthly, large sample properties of resulting estimates are given under some regular conditions. Finally, numerical studies are presented to illustrate the performance of the proposed subgroup detection method.

Suggested Citation

  • Cai, Tingting & Li, Jianbo & Zhou, Qin & Yin, Songlou & Zhang, Riquan, 2024. "Subgroup detection based on partially linear additive individualized model with missing data in response," Computational Statistics & Data Analysis, Elsevier, vol. 192(C).
  • Handle: RePEc:eee:csdana:v:192:y:2024:i:c:s0167947323002219
    DOI: 10.1016/j.csda.2023.107910
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167947323002219
    Download Restriction: Full text for ScienceDirect subscribers only.

    File URL: https://libkey.io/10.1016/j.csda.2023.107910?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Fan J. & Li R., 2001. "Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 1348-1360, December.
    2. Juan Shen & Xuming He, 2015. "Inference for Subgroup Analysis With a Structured Logistic-Normal Mixture Model," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(509), pages 303-312, March.
    3. Susan Wei & Michael R. Kosorok, 2013. "Latent Supervised Learning," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 108(503), pages 957-970, September.
    4. Wooldridge, Jeffrey M., 2007. "Inverse probability weighted estimation for general missing data problems," Journal of Econometrics, Elsevier, vol. 141(2), pages 1281-1301, December.
    5. Hansheng Wang & Runze Li & Chih-Ling Tsai, 2007. "Tuning parameter selectors for the smoothly clipped absolute deviation method," Biometrika, Biometrika Trust, vol. 94(3), pages 553-568.
    6. Liu, Lili & Lin, Lu, 2019. "Subgroup analysis for heterogeneous additive partially linear models and its application to car sales data," Computational Statistics & Data Analysis, Elsevier, vol. 138(C), pages 239-259.
    7. Lihui Zhao & Lu Tian & Tianxi Cai & Brian Claggett & L. J. Wei, 2013. "Effectively Selecting a Target Population for a Future Comparative Study," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 108(502), pages 527-539, June.
    8. Shujie Ma & Jian Huang, 2017. "A Concave Pairwise Fusion Approach to Subgroup Analysis," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(517), pages 410-423, January.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Shao, Lihui & Wu, Jiaqi & Zhang, Weiping & Chen, Yu, 2024. "Integrated subgroup identification from multi-source data," Computational Statistics & Data Analysis, Elsevier, vol. 193(C).
    2. Baosheng Liang & Peng Wu & Xingwei Tong & Yanping Qiu, 2020. "Regression and subgroup detection for heterogeneous samples," Computational Statistics, Springer, vol. 35(4), pages 1853-1878, December.
    3. Zhang, Xiaochen & Zhang, Qingzhao & Ma, Shuangge & Fang, Kuangnan, 2022. "Subgroup analysis for high-dimensional functional regression," Journal of Multivariate Analysis, Elsevier, vol. 192(C).
    4. Liu, Lili & Lin, Lu, 2019. "Subgroup analysis for heterogeneous additive partially linear models and its application to car sales data," Computational Statistics & Data Analysis, Elsevier, vol. 138(C), pages 239-259.
    5. Wang, Xin & Zhu, Zhengyuan & Zhang, Hao Helen, 2023. "Spatial heterogeneity automatic detection and estimation," Computational Statistics & Data Analysis, Elsevier, vol. 180(C).
    6. Shuang Zhang & Xingdong Feng, 2022. "Distributed identification of heterogeneous treatment effects," Computational Statistics, Springer, vol. 37(1), pages 57-89, March.
    7. Fang, Kuangnan & Chen, Yuanxing & Ma, Shuangge & Zhang, Qingzhao, 2022. "Biclustering analysis of functionals via penalized fusion," Journal of Multivariate Analysis, Elsevier, vol. 189(C).
    8. Weirong Li & Wensheng Zhu, 2024. "Subgroup analysis with concave pairwise fusion penalty for ordinal response," Statistical Papers, Springer, vol. 65(6), pages 3327-3355, August.
    9. Lu, Wenqi & Qin, Guoyou & Zhu, Zhongyi & Tu, Dongsheng, 2021. "Multiply robust subgroup identification for longitudinal data with dropouts via median regression," Journal of Multivariate Analysis, Elsevier, vol. 181(C).
    10. Xu Gao & Weining Shen & Jing Ning & Ziding Feng & Jianhua Hu, 2022. "Addressing patient heterogeneity in disease predictive model development," Biometrics, The International Biometric Society, vol. 78(3), pages 1045-1055, September.
    11. Huicong Yu & Jiaqi Wu & Weiping Zhang, 2024. "Simultaneous subgroup identification and variable selection for high dimensional data," Computational Statistics, Springer, vol. 39(6), pages 3181-3205, September.
    12. Mehrabani, Ali, 2023. "Estimation and identification of latent group structures in panel data," Journal of Econometrics, Elsevier, vol. 235(2), pages 1464-1482.
    13. Pei, Youquan & Peng, Heng & Xu, Jinfeng, 2024. "A latent class Cox model for heterogeneous time-to-event data," Journal of Econometrics, Elsevier, vol. 239(2).
    14. Jun Zhu & Hsin‐Cheng Huang & Perla E. Reyes, 2010. "On selection of spatial linear models for lattice data," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 72(3), pages 389-402, June.
    15. Ye, Mao & Lu, Zhao-Hua & Li, Yimei & Song, Xinyuan, 2019. "Finite mixture of varying coefficient model: Estimation and component selection," Journal of Multivariate Analysis, Elsevier, vol. 171(C), pages 452-474.
    16. Xia Chen & Liyue Mao, 2020. "Penalized empirical likelihood for partially linear errors-in-variables models," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 104(4), pages 597-623, December.
    17. Wentian Guo & Yuan Ji & Daniel V. T. Catenacci, 2017. "A subgroup cluster-based Bayesian adaptive design for precision medicine," Biometrics, The International Biometric Society, vol. 73(2), pages 367-377, June.
    18. Tizheng Li & Xiaojuan Kang, 2022. "Variable selection of higher-order partially linear spatial autoregressive model with a diverging number of parameters," Statistical Papers, Springer, vol. 63(1), pages 243-285, February.
    19. Fei Jin & Lung-fei Lee, 2018. "Lasso Maximum Likelihood Estimation of Parametric Models with Singular Information Matrices," Econometrics, MDPI, vol. 6(1), pages 1-24, February.
    20. Xinyang Wang & Dehui Wang & Kai Yang, 2021. "Integer-valued time series model order shrinkage and selection via penalized quasi-likelihood approach," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 84(5), pages 713-750, July.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:192:y:2024:i:c:s0167947323002219. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.