
A weighted empirical Bayes risk prediction model using multiple traits

Author

Listed:
  • Li Gengxin

    (Department of Mathematics and Statistics, University of Michigan Dearborn, 4901 Evergreen Rd, Dearborn, MI 48128, USA)

  • Hou Lin

    (Center for Statistical Science, Tsinghua University, 30 Shuangqing Rd, Haidian District, Beijing 100084, China)

  • Liu Xiaoyu

    (Department of Mathematics and Statistics, Wright State University, 3640 Colonel Glenn Hwy, Dayton, OH 45435, USA)

  • Wu Cen

    (Department of Statistics, Kansas State University, 1116 Mid-Campus Drive N., Manhattan, KS 66506, USA)

Abstract

With rapid advances in high-throughput sequencing technology, millions of single-nucleotide variants (SNVs) can be genotyped simultaneously in a sequencing study. SNVs residing in functional genomic regions such as exons may play a crucial role in biological processes. In particular, non-synonymous SNVs are closely related to protein sequence and function, which are important for understanding the biological mechanisms of sequence evolution. Although statistically challenging, models that incorporate such SNV annotation information can improve the estimation of genetic effects, and the use of multiple responses may further strengthen the signals of these variants in the assessment of disease risk. In this work, we develop a new weighted empirical Bayes method to integrate SNV annotation information in a multi-trait design. The performance of the proposed model is evaluated in simulations as well as on real sequencing data, where the proposed method shows improved prediction accuracy compared to other approaches.

Suggested Citation

  • Li Gengxin & Hou Lin & Liu Xiaoyu & Wu Cen, 2020. "A weighted empirical Bayes risk prediction model using multiple traits," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 19(3), pages 1-14, June.
  • Handle: RePEc:bpj:sagmbi:v:19:y:2020:i:3:p:14:n:2
    DOI: 10.1515/sagmb-2019-0056

    Download full text from publisher

    File URL: https://doi.org/10.1515/sagmb-2019-0056
    Download Restriction: For access to full text, subscription to the journal or payment for the individual article is required.


    References listed on IDEAS

    1. Efron, Bradley, 2009. "Empirical Bayes Estimates for Large-Scale Prediction Problems," Journal of the American Statistical Association, American Statistical Association, vol. 104(487), pages 1015-1028.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Shigeyuki Matsui & Hisashi Noma, 2011. "Estimating Effect Sizes of Differentially Expressed Genes for Power and Sample-Size Assessments in Microarray Experiments," Biometrics, The International Biometric Society, vol. 67(4), pages 1225-1235, December.
    2. Li Gengxin & Cui Yuehua & Zhao Hongyu, 2015. "An Empirical Bayes risk prediction model using multiple traits for sequencing data," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 14(6), pages 551-573, December.
    3. David Amar & Ron Shamir & Daniel Yekutieli, 2017. "Extracting replicable associations across multiple studies: Empirical Bayes algorithms for controlling the false discovery rate," PLOS Computational Biology, Public Library of Science, vol. 13(8), pages 1-22, August.
    4. She, Yiyuan, 2012. "An iterative algorithm for fitting nonconvex penalized generalized linear models with grouped predictors," Computational Statistics & Data Analysis, Elsevier, vol. 56(10), pages 2976-2990.
    5. Chen Xu & Jiahua Chen, 2014. "The Sparse MLE for Ultrahigh-Dimensional Feature Screening," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 109(507), pages 1257-1269, September.
    6. van Iterson Maarten & van de Wiel Mark A. & Boer Judith M. & de Menezes Renée X., 2013. "General power and sample size calculations for high-dimensional genomic data," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 12(4), pages 449-467, August.
    7. Pallavi Basu & Luella Fu & Alessio Saretto & Wenguang Sun, 2021. "Empirical Bayes Control of the False Discovery Exceedance," Working Papers 2115, Federal Reserve Bank of Dallas.
    8. Maharaj, Elizabeth Ann & Alonso, Andrés M., 2014. "Discriminant analysis of multivariate time series: Application to diagnosis based on ECG signals," Computational Statistics & Data Analysis, Elsevier, vol. 70(C), pages 67-87.
    9. Park, Junyong, 2018. "Simultaneous estimation based on empirical likelihood and general maximum likelihood estimation," Computational Statistics & Data Analysis, Elsevier, vol. 117(C), pages 19-31.
    10. Habiger, Joshua D. & Peña, Edsel A., 2014. "Compound p-value statistics for multiple testing procedures," Journal of Multivariate Analysis, Elsevier, vol. 126(C), pages 153-166.
