IDEAS home Printed from https://ideas.repec.org/a/spr/compst/v36y2021i2d10.1007_s00180-020-01052-5.html
   My bibliography  Save this article

Outlier detection under a covariate-adjusted exponential regression model with censored data

Author

Listed:
  • Yingli Pan

    (Hubei University)

  • Zhan Liu

    (Hubei University)

  • Guangyu Song

    (Hubei University)

Abstract

Exponential regression models with censored data are most widely used in practice. In the modeling process, there exist situations where the covariates are not directly observed but are observed after being contaminated by unknown functions of an observable confounder in a multiplicative manner. The problem of outlier detection is a fundamental and important problem in applied statistics. In this paper, we use a nonparametric regression method to adjust the covariates and recast the outlier detection issue into a high-dimensional regularization regression issue in the covariate-adjusted exponential regression model with censored data. We propose a smoothly clipped absolute deviation (SCAD) penalized likelihood method to detect the possible outliers, which features that the proposed method can simultaneously deal with outlier detection and estimations for the regression coefficients. The coordinate descent algorithm is employed to facilitate computation. Simulation studies are conducted to evaluate the finite-sample performance of our proposed method. An application to a German breast cancer study demonstrates the utility of the proposed method in practice.

Suggested Citation

  • Yingli Pan & Zhan Liu & Guangyu Song, 2021. "Outlier detection under a covariate-adjusted exponential regression model with censored data," Computational Statistics, Springer, vol. 36(2), pages 961-976, June.
  • Handle: RePEc:spr:compst:v:36:y:2021:i:2:d:10.1007_s00180-020-01052-5
    DOI: 10.1007/s00180-020-01052-5
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s00180-020-01052-5
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s00180-020-01052-5?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Fan J. & Li R., 2001. "Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 1348-1360, December.
    2. Zhao, Jingxin & Xie, Chuanlong, 2018. "A nonparametric test for covariate-adjusted models," Statistics & Probability Letters, Elsevier, vol. 133(C), pages 65-70.
    3. Senturk, Damla & Nguyen, Danh V., 2006. "Estimation in covariate-adjusted regression," Computational Statistics & Data Analysis, Elsevier, vol. 50(11), pages 3294-3310, July.
    4. Vanda Inácio de Carvalho & Miguel de Carvalho & Adam J. Branscum, 2017. "Nonparametric Bayesian covariate‐adjusted estimation of the Youden index," Biometrics, The International Biometric Society, vol. 73(4), pages 1279-1288, December.
    5. Zhang, Jun & Zhu, Li-Xing & Liang, Hua, 2012. "Nonlinear models with measurement errors subject to single-indexed distortion," Journal of Multivariate Analysis, Elsevier, vol. 112(C), pages 1-23.
    6. She, Yiyuan & Owen, Art B., 2011. "Outlier Detection Using Nonconvex Penalized Regression," Journal of the American Statistical Association, American Statistical Association, vol. 106(494), pages 626-639.
    7. Sung-Soo Kim & W. Krzanowski, 2007. "Detecting multiple outliers in linear regression using a cluster method combined with graphical visualization," Computational Statistics, Springer, vol. 22(1), pages 109-119, April.
    8. Pallavi Sawant & Nedret Billor & Hyejin Shin, 2012. "Functional outlier detection with robust functional principal component analysis," Computational Statistics, Springer, vol. 27(1), pages 83-102, March.
    9. Li-Chu Chien, 2013. "Multiple deletion diagnostics in beta regression models," Computational Statistics, Springer, vol. 28(4), pages 1639-1661, August.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jun Zhang & Junpeng Zhu & Yan Zhou & Xia Cui & Tao Lu, 2020. "Multiplicative regression models with distortion measurement errors," Statistical Papers, Springer, vol. 61(5), pages 2031-2057, October.
    2. Umberto Amato & Anestis Antoniadis & Italia De Feis & Irene Gijbels, 2021. "Penalised robust estimators for sparse and high-dimensional linear models," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 30(1), pages 1-48, March.
    3. Z. John Daye & Jinbo Chen & Hongzhe Li, 2012. "High-Dimensional Heteroscedastic Regression with an Application to eQTL Data Analysis," Biometrics, The International Biometric Society, vol. 68(1), pages 316-326, March.
    4. Luca Insolia & Ana Kenney & Martina Calovi & Francesca Chiaromonte, 2021. "Robust Variable Selection with Optimality Guarantees for High-Dimensional Logistic Regression," Stats, MDPI, vol. 4(3), pages 1-17, August.
    5. Alexander Robitzsch, 2022. "Comparing the Robustness of the Structural after Measurement (SAM) Approach to Structural Equation Modeling (SEM) against Local Model Misspecifications with Alternative Estimation Approaches," Stats, MDPI, vol. 5(3), pages 1-42, July.
    6. Thompson, Ryan, 2022. "Robust subset selection," Computational Statistics & Data Analysis, Elsevier, vol. 169(C).
    7. Zhang, Jing & Wang, Qin & Mays, D'Arcy, 2021. "Robust MAVE through nonconvex penalized regression," Computational Statistics & Data Analysis, Elsevier, vol. 160(C).
    8. Xie, Chuanlong & Zhu, Lixing, 2019. "A goodness-of-fit test for variable-adjusted models," Computational Statistics & Data Analysis, Elsevier, vol. 138(C), pages 27-48.
    9. She, Yiyuan, 2012. "An iterative algorithm for fitting nonconvex penalized generalized linear models with grouped predictors," Computational Statistics & Data Analysis, Elsevier, vol. 56(10), pages 2976-2990.
    10. Jun Zhang & Yiping Yang & Gaorong Li, 2020. "Logarithmic calibration for multiplicative distortion measurement errors regression models," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 74(4), pages 462-488, November.
    11. Yang Peng & Bin Luo & Xiaoli Gao, 2022. "Robust Moderately Clipped LASSO for Simultaneous Outlier Detection and Variable Selection," Sankhya B: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 84(2), pages 694-707, November.
    12. Luca Insolia & Ana Kenney & Francesca Chiaromonte & Giovanni Felici, 2022. "Simultaneous feature selection and outlier detection with optimality guarantees," Biometrics, The International Biometric Society, vol. 78(4), pages 1592-1603, December.
    13. Tianxiang Liu & Ting Kei Pong & Akiko Takeda, 2019. "A refined convergence analysis of $$\hbox {pDCA}_{e}$$ pDCA e with applications to simultaneous sparse recovery and outlier detection," Computational Optimization and Applications, Springer, vol. 73(1), pages 69-100, May.
    14. Hui Jiang & Tianyu Zhan, 2017. "Unit-Free and Robust Detection of Differential Expression from RNA-Seq Data," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 9(1), pages 178-199, June.
    15. Dai, Shuang & Huang, Zhensheng, 2020. "Nonparametric inference for covariate-adjusted model," Statistics & Probability Letters, Elsevier, vol. 162(C).
    16. Mingyang Ren & Sanguo Zhang & Qingzhao Zhang, 2021. "Robust high-dimensional regression for data with anomalous responses," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 73(4), pages 703-736, August.
    17. Tutz, Gerhard & Pößnecker, Wolfgang & Uhlmann, Lorenz, 2015. "Variable selection in general multinomial logit models," Computational Statistics & Data Analysis, Elsevier, vol. 82(C), pages 207-222.
    18. Guan, Wei & Gray, Alexander, 2013. "Sparse high-dimensional fractional-norm support vector machine via DC programming," Computational Statistics & Data Analysis, Elsevier, vol. 67(C), pages 136-148.
    19. Margherita Giuzio, 2017. "Genetic algorithm versus classical methods in sparse index tracking," Decisions in Economics and Finance, Springer;Associazione per la Matematica, vol. 40(1), pages 243-256, November.
    20. Chang, Jinyuan & Chen, Song Xi & Chen, Xiaohong, 2015. "High dimensional generalized empirical likelihood for moment restrictions with dependent data," Journal of Econometrics, Elsevier, vol. 185(1), pages 283-304.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:compst:v:36:y:2021:i:2:d:10.1007_s00180-020-01052-5. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.