IDEAS home Printed from https://ideas.repec.org/a/bla/scjsta/v42y2015i4p1078-1091.html
   My bibliography  Save this article

Variance Estimation under Two-Phase Sampling

Author

Listed:
  • Takumi Saegusa

Abstract

type="main" xml:id="sjos12152-abs-0001"> We consider the variance estimation of the weighted likelihood estimator (WLE) under two-phase stratified sampling without replacement. Asymptotic variance of the WLE in many semiparametric models contains unknown functions or does not have a closed form. The standard method of the inverse probability weighted (IPW) sample variances of an estimated influence function is then not available in these models. To address this issue, we develop the variance estimation procedure for the WLE in a general semiparametric model. The phase I variance is estimated by taking a numerical derivative of the IPW log likelihood. The phase II variance is estimated based on the bootstrap for a stratified sample in a finite population. Despite a theoretical difficulty of dependent observations due to sampling without replacement, we establish the (bootstrap) consistency of our estimators. Finite sample properties of our method are illustrated in a simulation study.

Suggested Citation

  • Takumi Saegusa, 2015. "Variance Estimation under Two-Phase Sampling," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 42(4), pages 1078-1091, December.
  • Handle: RePEc:bla:scjsta:v:42:y:2015:i:4:p:1078-1091
    as

    Download full text from publisher

    File URL: http://hdl.handle.net/10.1111/sjos.12152
    Download Restriction: Access to full text is restricted to subscribers.
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Norman E. Breslow & Jon A. Wellner, 2007. "Weighted Likelihood for Semiparametric Models and Two‐phase Stratified Samples, with Application to Cox Regression," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 34(1), pages 86-102, March.
    2. Thomas Lumley & Pamela A. Shaw & James Y. Dai, 2011. "Connections between Survey Calibration Estimators and Semiparametric Models for Incomplete Data," International Statistical Review, International Statistical Institute, vol. 79(2), pages 200-220, August.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Göran Kauermann & Mehboob Ali, 2021. "Semi-parametric regression when some (expensive) covariates are missing by design," Statistical Papers, Springer, vol. 62(4), pages 1675-1696, August.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Gustavo Amorim & Ran Tao & Sarah Lotspeich & Pamela A. Shaw & Thomas Lumley & Bryan E. Shepherd, 2021. "Two‐phase sampling designs for data validation in settings with covariate measurement error and continuous outcome," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 184(4), pages 1368-1389, October.
    2. Patrice Bertail & Emilie Chautru & Stephan Clémençon, 2017. "Empirical Processes in Survey Sampling with (Conditional) Poisson Designs," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 44(1), pages 97-111, March.
    3. Qingning Zhou & Jianwen Cai & Haibo Zhou, 2018. "Outcome†dependent sampling with interval†censored failure time data," Biometrics, The International Biometric Society, vol. 74(1), pages 58-67, March.
    4. Jon Arni Steingrimsson & Robert L. Strawderman, 2017. "Estimation in the Semiparametric Accelerated Failure Time Model With Missing Covariates: Improving Efficiency Through Augmentation," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(519), pages 1221-1235, July.
    5. Han, Bo & Wang, Xiaoguang, 2020. "Semiparametric estimation for the non-mixture cure model in case-cohort and nested case-control studies," Computational Statistics & Data Analysis, Elsevier, vol. 144(C).
    6. Yucong Lin & Jinhua Su & Yang Liu & Jue Hou & Feifei Wang, 2024. "Implicit profiling estimation for semiparametric models with bundled parameters," Statistical Papers, Springer, vol. 65(5), pages 3203-3234, July.
    7. Matthew R. Williams & Terrance D. Savitsky, 2021. "Uncertainty Estimation for Pseudo‐Bayesian Inference Under Complex Sampling," International Statistical Review, International Statistical Institute, vol. 89(1), pages 72-107, April.
    8. Yanqing Sun & Xiyuan Qian & Qiong Shou & Peter B. Gilbert, 2017. "Analysis of two-phase sampling data with semiparametric additive hazards models," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 23(3), pages 377-399, July.
    9. Menggang Yu & Bin Nan, 2010. "Regression Calibration in Semiparametric Accelerated Failure Time Models," Biometrics, The International Biometric Society, vol. 66(2), pages 405-414, June.
    10. Soyoung Kim & Yayun Xu & Mei‐Jie Zhang & Kwang‐Woo Ahn, 2020. "Stratified proportional subdistribution hazards model with covariate‐adjusted censoring weight for case‐cohort studies," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 47(4), pages 1222-1242, December.
    11. Brady Ryan & Ananthika Nirmalkanna & Candemir Cigsar & Yildiz E. Yilmaz, 2023. "Evaluation of Designs and Estimation Methods Under Response-Dependent Two-Phase Sampling for Genetic Association Studies," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 15(2), pages 510-539, July.
    12. Chixiang Chen & Ming Wang & Shuo Chen, 2023. "An efficient data integration scheme for synthesizing information from multiple secondary datasets for the parameter inference of the main analysis," Biometrics, The International Biometric Society, vol. 79(4), pages 2947-2960, December.
    13. Jan Feifel & Madlen Gebauer & Martin Schumacher & Jan Beyersmann, 2020. "Nested exposure case-control sampling: a sampling scheme to analyze rare time-dependent exposures," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 26(1), pages 21-44, January.
    14. Yei Eun Shin & Ruth M. Pfeiffer & Barry I. Graubard & Mitchell H. Gail, 2022. "Weight calibration to improve efficiency for estimating pure risks from the additive hazards model with the nested case‐control design," Biometrics, The International Biometric Society, vol. 78(1), pages 179-191, March.
    15. Bryan E. Shepherd & Kyunghee Han & Tong Chen & Aihua Bian & Shannon Pugh & Stephany N. Duda & Thomas Lumley & William J. Heerman & Pamela A. Shaw, 2023. "Multiwave validation sampling for error‐prone electronic health records," Biometrics, The International Biometric Society, vol. 79(3), pages 2649-2663, September.
    16. Peisong Han, 2016. "Combining Inverse Probability Weighting and Multiple Imputation to Improve Robustness of Estimation," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 43(1), pages 246-260, March.
    17. Shixiao Zhang & Peisong Han & Changbao Wu, 2023. "Calibration Techniques Encompassing Survey Sampling, Missing Data Analysis and Causal Inference," International Statistical Review, International Statistical Institute, vol. 91(2), pages 165-192, August.
    18. Yei Eun Shin & Ruth M. Pfeiffer & Barry I. Graubard & Mitchell H. Gail, 2020. "Weight calibration to improve the efficiency of pure risk estimates from case‐control samples nested in a cohort," Biometrics, The International Biometric Society, vol. 76(4), pages 1087-1097, December.
    19. Rebecca Payne & Ming Yang & Yingye Zheng & Majken K. Jensen & Tianxi Cai, 2016. "Robust risk prediction with biomarkers under two‐phase stratified cohort design," Biometrics, The International Biometric Society, vol. 72(4), pages 1037-1045, December.
    20. Jing Zhang & Haibo Zhou & Yanyan Liu & Jianwen Cai, 2021. "Conditional screening for ultrahigh-dimensional survival data in case-cohort studies," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 27(4), pages 632-661, October.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:scjsta:v:42:y:2015:i:4:p:1078-1091. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www.blackwellpublishing.com/journal.asp?ref=0303-6898 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.