Printed from https://ideas.repec.org/a/bla/biomet/v75y2019i4p1051-1062.html

Detection of differentially expressed genes in discrete single‐cell RNA sequencing data using a hurdle model with correlated random effects

Authors
  • Michael Sekula
  • Jeremy Gaskins
  • Susmita Datta

Abstract

Single‐cell RNA sequencing (scRNA‐seq) technologies are revolutionary tools allowing researchers to examine gene expression at the level of a single cell. Traditionally, transcriptomic data have been analyzed from bulk samples, masking the heterogeneity now seen across individual cells. Even within the same cellular population, genes can be highly expressed in some cells but not expressed (or lowly expressed) in others. Therefore, the computational approaches used to analyze bulk RNA sequencing data are not appropriate for the analysis of scRNA‐seq data. Here, we present a novel statistical model for high dimensional and zero‐inflated scRNA‐seq count data to identify differentially expressed (DE) genes across cell types. Correlated random effects are employed based on an initial clustering of cells to capture the cell‐to‐cell variability within treatment groups. Moreover, this model is flexible and can be easily adapted to an independent random effect structure if needed. We apply our proposed methodology to both simulated and real data and compare results to other popular methods designed for detecting DE genes. Due to the hurdle model's ability to detect differences in the proportion of cells expressed and the average expression level (among the expressed cells), our methods naturally identify some genes as DE that other methods do not, and we demonstrate with real data that these uniquely detected genes are associated with similar biological processes and functions.
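
The two quantities the abstract says a hurdle model can distinguish, the proportion of cells expressing a gene and the average expression level among the expressing cells, can be illustrated with a small descriptive sketch. This is not the authors' model: it omits the correlated random effects, the initial clustering of cells, and any formal test for differential expression. The function name `hurdle_summary` and the toy counts are hypothetical, chosen only so that the two groups differ in the proportion expressed but not in the mean among expressed cells.

```python
from statistics import mean


def hurdle_summary(counts):
    """Return (proportion of cells with a nonzero count,
    mean count among the cells with a nonzero count)."""
    expressed = [c for c in counts if c > 0]
    proportion = len(expressed) / len(counts)
    # With no expressing cells the conditional mean is undefined; 0.0 is a
    # placeholder convention for this sketch only.
    average = mean(expressed) if expressed else 0.0
    return proportion, average


# Hypothetical counts for one gene in two groups of eight cells each.
group_a = [0, 0, 0, 4, 6, 0, 5, 0]
group_b = [3, 0, 5, 4, 6, 7, 5, 0]

prop_a, mean_a = hurdle_summary(group_a)
prop_b, mean_b = hurdle_summary(group_b)

print(f"group A: proportion expressed = {prop_a}, mean among expressed = {mean_a}")
print(f"group B: proportion expressed = {prop_b}, mean among expressed = {mean_b}")
```

In this toy case the mean among expressed cells is the same in both groups, so a comparison based only on that conditional mean would see no difference, while the proportion of expressing cells differs. A method that examines both components would register that difference.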

Suggested Citation

  • Michael Sekula & Jeremy Gaskins & Susmita Datta, 2019. "Detection of differentially expressed genes in discrete single‐cell RNA sequencing data using a hurdle model with correlated random effects," Biometrics, The International Biometric Society, vol. 75(4), pages 1051-1062, December.
  • Handle: RePEc:bla:biomet:v:75:y:2019:i:4:p:1051-1062
    DOI: 10.1111/biom.13074

    Download full text from publisher

    File URL: https://doi.org/10.1111/biom.13074
    Download Restriction: no

