A weak‐signal‐assisted procedure for variable selection and statistical inference with an informative subsample

My bibliography Save this article

A weak‐signal‐assisted procedure for variable selection and statistical inference with an informative subsample

Author

Listed:

Fang Fang
Jiwei Zhao
S. Ejaz Ahmed
Annie Qu

Registered:

Abstract

This paper is motivated from an HIV‐1 drug resistance study where we encounter three analytical challenges: to analyze data with an informative subsample, to take into account the weak signals, and to detect important signals and also conduct statistical inference. We start with an initial estimation method, which adopts a penalized pairwise conditional likelihood approach for variable selection. This initial estimator incorporates the informative subsample issue. To accounting for the effect of weak signals, we use a key idea of partial ridge regression. We also propose a one‐step estimation method for each of the signal coefficients and then construct confidence intervals accordingly. We apply the proposed method to the Stanford HIV‐1 drug resistance study and compare the results with existing approaches. We also conduct comprehensive simulation studies to demonstrate the superior performance of our proposed method.

Suggested Citation

Fang Fang & Jiwei Zhao & S. Ejaz Ahmed & Annie Qu, 2021. "A weak‐signal‐assisted procedure for variable selection and statistical inference with an informative subsample," Biometrics, The International Biometric Society, vol. 77(3), pages 996-1010, September.

Handle: RePEc:bla:biomet:v:77:y:2021:i:3:p:996-1010
DOI: 10.1111/biom.13346

Download full text from publisher

References listed on IDEAS

Bradley Efron, 2014. "Estimation and Accuracy After Model Selection," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 109(507), pages 991-1007, September.
Xiaoli Gao & S. E. Ahmed & Yang Feng, 2017. "Post selection shrinkage estimation for high‐dimensional data analysis," Applied Stochastic Models in Business and Industry, John Wiley & Sons, vol. 33(2), pages 97-120, March.
Fan J. & Li R., 2001. "Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 1348-1360, December.
Kung‐Yee Liang & Jing Qin, 2000. "Regression analysis under non‐standard situations: a pairwise pseudolikelihood approach," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 62(4), pages 773-786.
Cun-Hui Zhang & Stephanie S. Zhang, 2014. "Confidence intervals for low dimensional parameters in high dimensional linear models," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 76(1), pages 217-242, January.

Full references (including those not matched with items on IDEAS)

Citations

Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.

Cited by:

Liu, Yu & Zhuang, Xiaoyang, 2023. "Shrinkage estimation of semi-parametric spatial autoregressive panel data model with fixed effects," Statistics & Probability Letters, Elsevier, vol. 194(C).

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Alexandre Belloni & Victor Chernozhukov & Denis Chetverikov & Christian Hansen & Kengo Kato, 2018. "High-dimensional econometrics and regularized GMM," CeMMAP working papers CWP35/18, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
- Alexandre Belloni & Victor Chernozhukov & Denis Chetverikov & Christian Hansen & Kengo Kato, 2018. "High-Dimensional Econometrics and Regularized GMM," Papers 1806.01888, arXiv.org, revised Jun 2018.
Haixiang Zhang & Jian Huang & Liuquan Sun, 2022. "Projection‐based and cross‐validated estimation in high‐dimensional Cox model," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 49(1), pages 353-372, March.
Chenchuan (Mark) Li & Ulrich K. Müller, 2021. "Linear regression with many controls of limited explanatory power," Quantitative Economics, Econometric Society, vol. 12(2), pages 405-442, May.
Hansen, Christian & Liao, Yuan, 2019. "The Factor-Lasso And K-Step Bootstrap Approach For Inference In High-Dimensional Economic Applications," Econometric Theory, Cambridge University Press, vol. 35(3), pages 465-509, June.
- Christian Hansen & Yuan Liao, 2016. "The Factor-Lasso and K-Step Bootstrap Approach for Inference in High-Dimensional Economic Applications," Departmental Working Papers 201610, Rutgers University, Department of Economics.
- Christian Hansen & Yuan Liao, 2016. "The Factor-Lasso and K-Step Bootstrap Approach for Inference in High-Dimensional Economic Applications," Papers 1611.09420, arXiv.org, revised Dec 2016.
- Hansen, Christian & Liao, Yuan, 2016. "The Factor-Lasso and K-Step Bootstrap Approach for Inference in High-Dimensional Economic Applications," MPRA Paper 75313, University Library of Munich, Germany.
Maur,Jean-Christophe & Nedeljkovic,Milan & Von Uexkull,Jan Erik, 2022. "FDI and Trade Outcomes at the Industry Level—A Data-Driven Approach," Policy Research Working Paper Series 9901, The World Bank.
Victor Chernozhukov & Christian Hansen & Martin Spindler, 2015. "Valid Post-Selection and Post-Regularization Inference: An Elementary, General Approach," Annual Review of Economics, Annual Reviews, vol. 7(1), pages 649-688, August.
- Victor Chernozhukov & Christian Hansen & Martin Spindler, 2015. "Valid Post-Selection and Post-Regularization Inference: An Elementary, General Approach," Papers 1501.03430, arXiv.org, revised Aug 2015.
- Victor Chernozhukov & Christian Hansen & Martin Spindler, 2016. "Valid post-selection and post-regularization inference: An elementary, general approach," CeMMAP working papers CWP36/16, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
- Victor Chernozhukov & Christian Hansen & Martin Spindler, 2016. "Valid post-selection and post-regularization inference: An elementary, general approach," CeMMAP working papers 36/16, Institute for Fiscal Studies.
Lu Tang & Peter X.‐K. Song, 2021. "Poststratification fusion learning in longitudinal data analysis," Biometrics, The International Biometric Society, vol. 77(3), pages 914-928, September.
Victor Chernozhukov & Christian Hansen & Martin Spindler, 2015. "Post-Selection and Post-Regularization Inference in Linear Models with Many Controls and Instruments," American Economic Review, American Economic Association, vol. 105(5), pages 486-490, May.
- Victor Chernozhukov & Christian Hansen & Martin Spindler, 2015. "Post-selection and post-regularization inference in linear models with many controls and instruments," CeMMAP working papers 02/15, Institute for Fiscal Studies.
- Victor Chernozhukov & Christian Hansen & Martin Spindler, 2015. "Post-selection and post-regularization inference in linear models with many controls and instruments," CeMMAP working papers CWP02/15, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
- Victor Chernozhukov & Christian Hansen & Martin Spindler, 2015. "Post-Selection and Post-Regularization Inference in Linear Models with Many Controls and Instruments," Papers 1501.03185, arXiv.org.
Guo, Xu & Li, Runze & Liu, Jingyuan & Zeng, Mudong, 2023. "Statistical inference for linear mediation models with high-dimensional mediators and application to studying stock reaction to COVID-19 pandemic," Journal of Econometrics, Elsevier, vol. 235(1), pages 166-179.
Toshio Honda, 2021. "The de-biased group Lasso estimation for varying coefficient models," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 73(1), pages 3-29, February.
Lu Xia & Bin Nan & Yi Li, 2023. "Debiased lasso for generalized linear models with a diverging number of covariates," Biometrics, The International Biometric Society, vol. 79(1), pages 344-357, March.
Huang, Yuan & Li, Changcheng & Li, Runze & Yang, Songshan, 2022. "An overview of tests on high-dimensional means," Journal of Multivariate Analysis, Elsevier, vol. 188(C).
Susan M. Shortreed & Ashkan Ertefaie, 2017. "Outcome‐adaptive lasso: Variable selection for causal inference," Biometrics, The International Biometric Society, vol. 73(4), pages 1111-1122, December.
Gueuning, Thomas & Claeskens, Gerda, 2016. "Confidence intervals for high-dimensional partially linear single-index models," Journal of Multivariate Analysis, Elsevier, vol. 149(C), pages 13-29.
Jingxuan Luo & Lili Yue & Gaorong Li, 2023. "Overview of High-Dimensional Measurement Error Regression Models," Mathematics, MDPI, vol. 11(14), pages 1-22, July.
Wang, Yining & Wang, Jialei & Balakrishnan, Sivaraman & Singh, Aarti, 2019. "Rate optimal estimation and confidence intervals for high-dimensional regression with missing covariates," Journal of Multivariate Analysis, Elsevier, vol. 174(C).
Farrell, Max H., 2015. "Robust inference on average treatment effects with possibly more covariates than observations," Journal of Econometrics, Elsevier, vol. 189(1), pages 1-23.
- Max H. Farrell, 2013. "Robust Inference on Average Treatment Effects with Possibly More Covariates than Observations," Papers 1309.4686, arXiv.org, revised Feb 2018.
Donggyu Kim & Minseok Shin, 2024. "Robust High-Dimensional Time-Varying Coefficient Estimation," Working Papers 202417, University of California at Riverside, Department of Economics.
Lan, Wei & Zhong, Ping-Shou & Li, Runze & Wang, Hansheng & Tsai, Chih-Ling, 2016. "Testing a single regression coefficient in high dimensional linear models," Journal of Econometrics, Elsevier, vol. 195(1), pages 154-168.
Cai, Xizhen & Zhu, Yeying & Huang, Yuan & Ghosh, Debashis, 2022. "High-dimensional causal mediation analysis based on partial linear structural equation models," Computational Statistics & Data Analysis, Elsevier, vol. 174(C).

More about this item

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:biomet:v:77:y:2021:i:3:p:996-1010. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www.blackwellpublishing.com/journal.asp?ref=0006-341X .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

A weak‐signal‐assisted procedure for variable selection and statistical inference with an informative subsample

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Citations

Most related items

More about this item

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data