IDEAS home Printed from https://ideas.repec.org/a/bla/biomet/v73y2017i4p1210-1220.html
   My bibliography  Save this article

Weighted pseudolikelihood for SNP set analysis with multiple secondary outcomes in case‐control genetic association studies

Author

Listed:
  • Tamar Sofer
  • Elizabeth D. Schifano
  • David C. Christiani
  • Xihong Lin

Abstract

We propose a weighted pseudolikelihood method for analyzing the association of a SNP set, example, SNPs in a gene or a genetic pathway or network, with multiple secondary phenotypes in case‐control genetic association studies. To boost analysis power, we assume that the SNP‐specific effects are shared across all secondary phenotypes using a scaled mean model. We estimate regression parameters using Inverse Probability Weighted (IPW) estimating equations obtained from the weighted pseudolikelihood, which accounts for case‐control sampling to prevent potential ascertainment bias. To test the effect of a SNP set, we propose a weighted variance component pseudo‐score test. We also propose a penalized IPW pseudolikelihood method for selecting a subset of SNPs that are associated with the multiple secondary phenotypes. We show that the proposed variable selection procedure has the oracle properties and is robust to misspecification of the correlation structure among secondary phenotypes. We select the tuning parameter using a weighted Bayesian Information‐like Criterion (wBIC). We evaluate the finite sample performance of the proposed methods via simulations, and illustrate the methods by the analysis of the multiple secondary smoking behavior outcomes in a lung cancer case‐control genetic association study.

Suggested Citation

  • Tamar Sofer & Elizabeth D. Schifano & David C. Christiani & Xihong Lin, 2017. "Weighted pseudolikelihood for SNP set analysis with multiple secondary outcomes in case‐control genetic association studies," Biometrics, The International Biometric Society, vol. 73(4), pages 1210-1220, December.
  • Handle: RePEc:bla:biomet:v:73:y:2017:i:4:p:1210-1220
    DOI: 10.1111/biom.12680
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/biom.12680
    Download Restriction: no

    File URL: https://libkey.io/10.1111/biom.12680?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Gourieroux, Christian & Monfort, Alain & Trognon, Alain, 1984. "Pseudo Maximum Likelihood Methods: Theory," Econometrica, Econometric Society, vol. 52(3), pages 681-700, May.
    2. Gourieroux, Christian & Monfort, Alain & Trognon, Alain, 1984. "Pseudo Maximum Likelihood Methods: Applications to Poisson Models," Econometrica, Econometric Society, vol. 52(3), pages 701-720, May.
    3. Johnson, Brent A. & Lin, D.Y. & Zeng, Donglin, 2008. "Penalized Estimating Functions and Variable Selection in Semiparametric Regression Models," Journal of the American Statistical Association, American Statistical Association, vol. 103, pages 672-680, June.
    4. Robert B. Davies, 1980. "The Distribution of a Linear Combination of χ2 Random Variables," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 29(3), pages 323-333, November.
    5. Gao, Xin & Song, Peter X.-K., 2010. "Composite Likelihood Bayesian Information Criteria for Model Selection in High-Dimensional Data," Journal of the American Statistical Association, American Statistical Association, vol. 105(492), pages 1531-1540.
    6. Fan J. & Li R., 2001. "Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 1348-1360, December.
    7. Wenjiang J. Fu, 2003. "Penalized Estimating Equations," Biometrics, The International Biometric Society, vol. 59(1), pages 126-132, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Xingwei Tong & Xin He & Liuquan Sun & Jianguo Sun, 2009. "Variable Selection for Panel Count Data via Non‐Concave Penalized Estimating Function," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 36(4), pages 620-635, December.
    2. Alexander Robitzsch, 2022. "Comparing the Robustness of the Structural after Measurement (SAM) Approach to Structural Equation Modeling (SEM) against Local Model Misspecifications with Alternative Estimation Approaches," Stats, MDPI, vol. 5(3), pages 1-42, July.
    3. Blommaert, A. & Hens, N. & Beutels, Ph., 2014. "Data mining for longitudinal data under multicollinearity and time dependence using penalized generalized estimating equations," Computational Statistics & Data Analysis, Elsevier, vol. 71(C), pages 667-680.
    4. Lu Tang & Peter X.‐K. Song, 2021. "Poststratification fusion learning in longitudinal data analysis," Biometrics, The International Biometric Society, vol. 77(3), pages 914-928, September.
    5. Wang, Li & Wang, Suojin & Wang, Guannan, 2014. "Variable selection and estimation for longitudinal survey data," Journal of Multivariate Analysis, Elsevier, vol. 130(C), pages 409-424.
    6. Fan, Yali & Qin, Guoyou & Zhu, Zhongyi, 2012. "Variable selection in robust regression models for longitudinal data," Journal of Multivariate Analysis, Elsevier, vol. 109(C), pages 156-167.
    7. Fang, Jianglin, 2023. "A split-and-conquer variable selection approach for high-dimensional general semiparametric models with massive data," Journal of Multivariate Analysis, Elsevier, vol. 194(C).
    8. Lan Wang & Jianhui Zhou & Annie Qu, 2012. "Penalized Generalized Estimating Equations for High-Dimensional Longitudinal Data Analysis," Biometrics, The International Biometric Society, vol. 68(2), pages 353-360, June.
    9. Giuliani, Elisa & Martinelli, Arianna & Rabellotti, Roberta, 2016. "Is Co-Invention Expediting Technological Catch Up? A Study of Collaboration between Emerging Country Firms and EU Inventors," World Development, Elsevier, vol. 77(C), pages 192-205.
    10. Bettina Becker & Martin Theuringer, 2000. "Macroeconomic Determinants of Contingent Protection: The Case of the European Union," IWP Discussion Paper Series 02/2000, Institute for Economic Policy, Cologne, Germany.
    11. Hallin, Marc & La Vecchia, Davide, 2020. "A Simple R-estimation method for semiparametric duration models," Journal of Econometrics, Elsevier, vol. 218(2), pages 736-749.
    12. Barone-Adesi, Giovanni & Fusari, Nicola & Mira, Antonietta & Sala, Carlo, 2020. "Option market trading activity and the estimation of the pricing kernel: A Bayesian approach," Journal of Econometrics, Elsevier, vol. 216(2), pages 430-449.
    13. Silva João M. C. Santos & Tenreyro Silvana & Windmeijer Frank, 2015. "Testing Competing Models for Non-negative Data with Many Zeros," Journal of Econometric Methods, De Gruyter, vol. 4(1), pages 29-46, January.
    14. Dionne, Georges, 1998. "La mesure empirique des problèmes d’information," L'Actualité Economique, Société Canadienne de Science Economique, vol. 74(4), pages 585-606, décembre.
    15. de Rassenfosse, Gaétan & Schoen, Anja & Wastyn, Annelies, 2014. "Selection bias in innovation studies: A simple test," Technological Forecasting and Social Change, Elsevier, vol. 81(C), pages 287-299.
    16. Gary King, 1989. "A Seemingly Unrelated Poisson Regression Model," Sociological Methods & Research, , vol. 17(3), pages 235-255, February.
    17. Emilie Alberola & Julien Chevallier & Benoît Chèze, 2008. "The EU Emissions Trading Scheme : Disentangling the Effects of Industrial Production and CO2 Emissions on Carbon Prices," Working Papers hal-04140795, HAL.
    18. Czarnitzki, Dirk & Doherr, Thorsten & Hussinger, Katrin & Schliessler, Paula & Toole, Andrew A., 2016. "Knowledge Creates Markets: The influence of entrepreneurial support and patent rights on academic entrepreneurship," European Economic Review, Elsevier, vol. 86(C), pages 131-146.
    19. Alvarez, Javier & Arellano, Manuel, 2022. "Robust likelihood estimation of dynamic panel data models," Journal of Econometrics, Elsevier, vol. 226(1), pages 21-61.
    20. Blazsek, Szabolcs & Licht, Adrian, 2018. "Seasonal quasi-vector autoregressive models for macroeconomic data," UC3M Working papers. Economics 26316, Universidad Carlos III de Madrid. Departamento de Economía.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:biomet:v:73:y:2017:i:4:p:1210-1220. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www.blackwellpublishing.com/journal.asp?ref=0006-341X .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.