IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v13y2025i3p423-d1578467.html
   My bibliography  Save this article

Fitting Penalized Estimator for Sparse Covariance Matrix with Left-Censored Data by the EM Algorithm

Author

Listed:
  • Shanyi Lin

    (School of Mathematics and Statistics, Changchun University of Technology, Changchun 130012, China)

  • Qian-Zhen Zheng

    (College of Education, Zhejiang Normal University, Jinhua 321004, China)

  • Laixu Shang

    (College of Education, Zhejiang Normal University, Jinhua 321004, China)

  • Ping-Feng Xu

    (Academy for Advanced Interdisciplinary Studies & Key Laboratory of Applied Statistics of MOE, Northeast Normal University, Changchun 130024, China
    Shanghai Zhangjiang Institute of Mathematics, Shanghai 201203, China)

  • Man-Lai Tang

    (Department of Physics, Astronomy and Mathematics, School of Physics, Engineering & Computer Science, University of Hertfordshire, Hertfordshire AL10 9AB, UK)

Abstract

Estimating the sparse covariance matrix can effectively identify important features and patterns, and traditional estimation methods require complete data vectors on all subjects. When data are left-censored due to detection limits, common strategies such as excluding censored individuals or replacing censored values with suitable constants may result in large biases. In this paper, we propose two penalized log-likelihood estimators, incorporating the L 1 penalty and SCAD penalty, for estimating the sparse covariance matrix of a multivariate normal distribution in the presence of left-censored data. However, the fitting of these penalized estimators poses challenges due to the observed log-likelihood involving high-dimensional integration over the censored variables. To address this issue, we treat censored data as a special case of incomplete data and employ the Expectation Maximization algorithm combined with the coordinate descent algorithm to efficiently fit the two penalized estimators. Through simulation studies, we demonstrate that both penalized estimators achieve greater estimation accuracy compared to methods that replace censored values with constants. Moreover, the SCAD penalized estimator generally outperforms the L 1 penalized estimator. Our method is used to analyze the proteomic datasets.

Suggested Citation

  • Shanyi Lin & Qian-Zhen Zheng & Laixu Shang & Ping-Feng Xu & Man-Lai Tang, 2025. "Fitting Penalized Estimator for Sparse Covariance Matrix with Left-Censored Data by the EM Algorithm," Mathematics, MDPI, vol. 13(3), pages 1-17, January.
  • Handle: RePEc:gam:jmathe:v:13:y:2025:i:3:p:423-:d:1578467
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/13/3/423/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/13/3/423/
    Download Restriction: no
    ---><---

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:13:y:2025:i:3:p:423-:d:1578467. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.