IDEAS home Printed from https://ideas.repec.org/p/ehl/lserod/118350.html
   My bibliography  Save this paper

Statistical inference for noisy incomplete binary matrix

Author

Listed:
  • Chen, Yunxiao
  • Li, Chengcheng
  • Ouyang, Jing
  • Xu, Gongjun

Abstract

We consider the statistical inference for noisy incomplete binary (or 1-bit) matrix. Despite the importance of uncertainty quantification to matrix completion, most of the categorical matrix completion literature focuses on point estimation and prediction. This paper moves one step further toward statistical inference for binary matrix completion. Under a popular nonlinear factor analysis model, we obtain a point estimator and derive its asymptotic normality. Moreover, our analysis adopts a flexible missing-entry design that does not require a random sampling scheme as required by most of the existing asymptotic results for matrix completion. Under reasonable conditions, the proposed estimator is statistically efficient and optimal in the sense that the Cramer-Rao lower bound is achieved asymptotically for the model parameters. Two applications are considered, including (1) linking two forms of an educational test and (2) linking the roll call voting records from multiple years in the United States Senate. The first application enables the comparison between examinees who took different test forms, and the second application allows us to compare the liberal-conservativeness of senators who did not serve in the Senate at the same time.

Suggested Citation

  • Chen, Yunxiao & Li, Chengcheng & Ouyang, Jing & Xu, Gongjun, 2023. "Statistical inference for noisy incomplete binary matrix," LSE Research Online Documents on Economics 118350, London School of Economics and Political Science, LSE Library.
  • Handle: RePEc:ehl:lserod:118350
    as

    Download full text from publisher

    File URL: http://eprints.lse.ac.uk/118350/
    File Function: Open access version.
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. He, Xuming & Shao, Qi-Man, 2000. "On Parameters of Increasing Dimensions," Journal of Multivariate Analysis, Elsevier, vol. 73(1), pages 120-135, April.
    2. Y Chen & X Li, 2022. "Determining the number of factors in high-dimensional generalized latent factor models [Eigenvalue ratio test for the number of factors]," Biometrika, Biometrika Trust, vol. 109(3), pages 769-782.
    3. Kidwell, Paul & Lebanon, Guy & Collins-Thompson, Kevyn, 2011. "Statistical Estimation of Word Acquisition With Application to Readability Prediction," Journal of the American Statistical Association, American Statistical Association, vol. 106(493), pages 21-30.
    4. Ghosh, Malay, 1995. "Inconsistent maximum likelihood estimators for the Rasch model," Statistics & Probability Letters, Elsevier, vol. 23(2), pages 165-170, May.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Alexandre Belloni & Victor Chernozhukov & Kengo Kato, 2019. "Valid Post-Selection Inference in High-Dimensional Approximately Sparse Quantile Regression Models," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 114(526), pages 749-758, April.
    2. Kato, Kengo & F. Galvao, Antonio & Montes-Rojas, Gabriel V., 2012. "Asymptotics for panel quantile regression models with individual effects," Journal of Econometrics, Elsevier, vol. 170(1), pages 76-91.
    3. Adam C. Sales & Ben B. Hansen, 2020. "Limitless Regression Discontinuity," Journal of Educational and Behavioral Statistics, , vol. 45(2), pages 143-174, April.
    4. Ji-Yeon Yang & Xuming He, 2011. "A Multistep Protein Lysate Array Quantification Method and its Statistical Properties," Biometrics, The International Biometric Society, vol. 67(4), pages 1197-1205, December.
    5. Alexandre Belloni & Victor Chernozhukov & Kengo Kato, 2013. "Uniform post selection inference for LAD regression and other z-estimation problems," CeMMAP working papers CWP74/13, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    6. Arun G. Chandrasekhar & Victor Chernozhukov & Francesca Molinari & Paul Schrimpf, 2019. "Best Linear Approximations to Set Identified Functions: With an Application to the Gender Wage Gap," NBER Working Papers 25593, National Bureau of Economic Research, Inc.
    7. Demian Pouzo, 2014. "Bootstrap Consistency for Quadratic Forms of Sample Averages with Increasing Dimension," Papers 1411.2701, arXiv.org, revised Aug 2015.
    8. Fan, Jianqing & Guo, Yongyi & Jiang, Bai, 2022. "Adaptive Huber regression on Markov-dependent data," Stochastic Processes and their Applications, Elsevier, vol. 150(C), pages 802-818.
    9. V. Chernozhukov & I. Fernández-Val & A. Galichon, 2009. "Improving point and interval estimators of monotone functions by rearrangement," Biometrika, Biometrika Trust, vol. 96(3), pages 559-575.
    10. Han, Dongxiao & Huang, Jian & Lin, Yuanyuan & Shen, Guohao, 2022. "Robust post-selection inference of high-dimensional mean regression with heavy-tailed asymmetric or heteroskedastic errors," Journal of Econometrics, Elsevier, vol. 230(2), pages 416-431.
    11. Giessing, Alexander & He, Xuming, 2019. "On the predictive risk in misspecified quantile regression," Journal of Econometrics, Elsevier, vol. 213(1), pages 235-260.
    12. Farrell, Max H., 2015. "Robust inference on average treatment effects with possibly more covariates than observations," Journal of Econometrics, Elsevier, vol. 189(1), pages 1-23.
    13. Yanqin Fan & Fang Han & Wei Li & Xiao-Hua Zhou, 2019. "On rank estimators in increasing dimensions," Papers 1908.05255, arXiv.org.
    14. Alexandre Belloni & Victor Chernozhukov & Kengo Kato, 2013. "Robust inference in high-dimensional approximately sparse quantile regression models," CeMMAP working papers 70/13, Institute for Fiscal Studies.
    15. repec:spo:wpmain:info:hdl:2441/5rkqqmvrn4tl22s9mc4b6ga2g is not listed on IDEAS
    16. Lin, Fangzheng & Tang, Yanlin & Zhu, Zhongyi, 2020. "Weighted quantile regression in varying-coefficient model with longitudinal data," Computational Statistics & Data Analysis, Elsevier, vol. 145(C).
    17. Chen, Yunxiao & Li, Xiaoou & Zhang, Siliang, 2019. "Structured latent factor analysis for large-scale data: identifiability, estimability, and their implications," LSE Research Online Documents on Economics 101122, London School of Economics and Political Science, LSE Library.
    18. Victor Chernozhukov & Iv·n Fern·ndez-Val & Alfred Galichon, 2010. "Quantile and Probability Curves Without Crossing," Econometrica, Econometric Society, vol. 78(3), pages 1093-1125, May.
    19. Wang, Chuchu & Song, Xinyuan, 2024. "Nonparametric quantile scalar-on-image regression," Computational Statistics & Data Analysis, Elsevier, vol. 191(C).
    20. Kean Ming Tan & Lan Wang & Wen‐Xin Zhou, 2022. "High‐dimensional quantile regression: Convolution smoothing and concave regularization," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 84(1), pages 205-233, February.
    21. Li, Rui & Reich, Brian J. & Bondell, Howard D., 2021. "Deep distribution regression," Computational Statistics & Data Analysis, Elsevier, vol. 159(C).

    More about this item

    Keywords

    1-bit matrix; matrix completion; binary data; asymptotic normality; non-linear latent variable model;
    All these keywords.

    JEL classification:

    • C1 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:ehl:lserod:118350. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: LSERO Manager (email available below). General contact details of provider: https://edirc.repec.org/data/lsepsuk.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.