IDEAS home Printed from https://ideas.repec.org/a/plo/pcbi00/1003696.html
   My bibliography  Save this article

Modeling Bi-modality Improves Characterization of Cell Cycle on Gene Expression in Single Cells

Author

Listed:
  • Andrew McDavid
  • Lucas Dennis
  • Patrick Danaher
  • Greg Finak
  • Michael Krouse
  • Alice Wang
  • Philippa Webster
  • Joseph Beechem
  • Raphael Gottardo

Abstract

Advances in high-throughput, single cell gene expression are allowing interrogation of cell heterogeneity. However, there is concern that the cell cycle phase of a cell might bias characterizations of gene expression at the single-cell level. We assess the effect of cell cycle phase on gene expression in single cells by measuring 333 genes in 930 cells across three phases and three cell lines. We determine each cell's phase non-invasively without chemical arrest and use it as a covariate in tests of differential expression. We observe bi-modal gene expression, a previously-described phenomenon, wherein the expression of otherwise abundant genes is either strongly positive, or undetectable within individual cells. This bi-modality is likely both biologically and technically driven. Irrespective of its source, we show that it should be modeled to draw accurate inferences from single cell expression experiments. To this end, we propose a semi-continuous modeling framework based on the generalized linear model, and use it to characterize genes with consistent cell cycle effects across three cell lines. Our new computational framework improves the detection of previously characterized cell-cycle genes compared to approaches that do not account for the bi-modality of single-cell data. We use our semi-continuous modelling framework to estimate single cell gene co-expression networks. These networks suggest that in addition to having phase-dependent shifts in expression (when averaged over many cells), some, but not all, canonical cell cycle genes tend to be co-expressed in groups in single cells. We estimate the amount of single cell expression variability attributable to the cell cycle. We find that the cell cycle explains only 5%–17% of expression variability, suggesting that the cell cycle will not tend to be a large nuisance factor in analysis of the single cell transcriptome.Author Summary: Recent technological advances have enabled the measurement of gene expression in individual cells, revealing that there is substantial variability in expression, even within a homogeneous cell population. In this paper, we develop new analytical methods that account for the intrinsic, stochastic nature of single cell expression in order to characterize the effect of cell cycle on gene expression at the single-cell level. Applying these methods to populations of asynchronously cycling cells, we are able to identify large numbers of genes with cell cycle-associated expression patterns. By measuring and adjusting for cellular-level factors, we are able to derive estimates of co-expressing gene networks that more closely reflect cellular-level processes as opposed to sample-level processes. We find that cell cycle phase only accounts for a modest amount of the overall variability of gene expression within an individual cell. The analytical methods demonstrated in this paper are universally applicable to single cell expression data and represent a promising tool to the scientific community.

Suggested Citation

  • Andrew McDavid & Lucas Dennis & Patrick Danaher & Greg Finak & Michael Krouse & Alice Wang & Philippa Webster & Joseph Beechem & Raphael Gottardo, 2014. "Modeling Bi-modality Improves Characterization of Cell Cycle on Gene Expression in Single Cells," PLOS Computational Biology, Public Library of Science, vol. 10(7), pages 1-10, July.
  • Handle: RePEc:plo:pcbi00:1003696
    DOI: 10.1371/journal.pcbi.1003696
    as

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1003696
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1003696&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pcbi.1003696?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Gregory J Boggy & Peter J Woolf, 2010. "A Mechanistic Model of PCR for Accurate Quantification of Quantitative PCR Data," PLOS ONE, Public Library of Science, vol. 5(8), pages 1-7, August.
    2. Nir Yosef & Alex K. Shalek & Jellert T. Gaublomme & Hulin Jin & Youjin Lee & Amit Awasthi & Chuan Wu & Katarzyna Karwacz & Sheng Xiao & Marsela Jorgolli & David Gennert & Rahul Satija & Arvind Shakya , 2013. "Dynamic regulatory network controlling TH17 cell differentiation," Nature, Nature, vol. 496(7446), pages 461-468, April.
    3. Duan, Naihua, et al, 1983. "A Comparison of Alternative Models for the Demand for Medical Care," Journal of Business & Economic Statistics, American Statistical Association, vol. 1(2), pages 115-126, April.
    4. Friedman, Jerome H. & Hastie, Trevor & Tibshirani, Rob, 2010. "Regularization Paths for Generalized Linear Models via Coordinate Descent," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 33(i01).
    5. Cragg, John G, 1971. "Some Statistical Models for Limited Dependent Variables with Application to the Demand for Durable Goods," Econometrica, Econometric Society, vol. 39(5), pages 829-844, September.
    6. Alex K. Shalek & Rahul Satija & Xian Adiconis & Rona S. Gertner & Jellert T. Gaublomme & Raktima Raychowdhury & Schraga Schwartz & Nir Yosef & Christine Malboeuf & Diana Lu & John J. Trombetta & Dave , 2013. "Single-cell transcriptomics reveals bimodality in expression and splicing in immune cells," Nature, Nature, vol. 498(7453), pages 236-240, June.
    7. Jones, Andrew M, 1989. "A Double-Hurdle Model of Cigarette Consumption," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 4(1), pages 23-39, Jan.-Mar..
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Yael Korem & Pablo Szekely & Yuval Hart & Hila Sheftel & Jean Hausser & Avi Mayo & Michael E Rothenberg & Tomer Kalisky & Uri Alon, 2015. "Geometry of the Gene Expression Space of Individual Cells," PLOS Computational Biology, Public Library of Science, vol. 11(7), pages 1-27, July.
    2. Brian DeVeale & Leqian Liu & Ryan Boileau & Jennifer Swindlehurst-Chan & Bryan Marsh & Jacob W. Freimer & Adam Abate & Robert Blelloch, 2022. "G1/S restriction point coordinates phasic gene expression and cell differentiation," Nature Communications, Nature, vol. 13(1), pages 1-17, December.
    3. Manikandan Narayanan & Andrew J Martins & John S Tsang, 2016. "Robust Inference of Cell-to-Cell Expression Variations from Single- and K-Cell Profiling," PLOS Computational Biology, Public Library of Science, vol. 12(7), pages 1-33, July.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Madden, David, 2008. "Sample selection versus two-part models revisited: The case of female smoking and drinking," Journal of Health Economics, Elsevier, vol. 27(2), pages 300-307, March.
    2. Theodore Eisenberg & Thomas Eisenberg & Martin T. Wells & Min Zhang, 2015. "Addressing the Zeros Problem: Regression Models for Outcomes with a Large Proportion of Zeros, with an Application to Trial Outcomes," Journal of Empirical Legal Studies, John Wiley & Sons, vol. 12(1), pages 161-186, March.
    3. Josephson, Anna Leigh & Marshall, Maria I., 2014. "The Demand and Supply for Post-Katrina Disaster Aid: A Triple-Hurdle Model of SBA Disaster Loans for Small Businesses in Mississippi," 2014 Annual Meeting, July 27-29, 2014, Minneapolis, Minnesota 170177, Agricultural and Applied Economics Association.
    4. Frank Crowley & John Eakins & Declan Jordan, 2012. "Participation,Expenditure and Regressivity in the Irish Lottery:Evidence from Irish Household Budget Survey 2004/2005," The Economic and Social Review, Economic and Social Studies, vol. 43(2), pages 199-225.
    5. Glenn W. Harrison & James P. Feehan & Alison C. Edwards & Jorge Segovia, 2003. "Cigarette Smoking and the Cost of Hospital and Physician Care," Canadian Public Policy, University of Toronto Press, vol. 29(1), pages 1-19, March.
    6. Kajal Lahiri & Chuanming Gao & Bernard Wixon, 2020. "Value of Sample Separation Information in a Sequential Probit Model," Arthaniti: Journal of Economic Theory and Practice, , vol. 19(2), pages 151-176, December.
    7. Richard Mussa, 2013. "Rural--urban differences in parental spending on children's primary education in Malawi," Development Southern Africa, Taylor & Francis Journals, vol. 30(6), pages 789-811, December.
    8. Pascal L. Ghazalian & Ali Fakih, 2017. "R&D and Innovation in Food Processing Firms in Transition Countries," Journal of Agricultural Economics, Wiley Blackwell, vol. 68(2), pages 427-450, June.
    9. Marion Kohler & Anthony Rossiter, 2005. "Property Owners in Australia: A Snapshot," RBA Research Discussion Papers rdp2005-03, Reserve Bank of Australia.
    10. Dong, Diansheng & Gould, Brian W., 1999. "A Double-Hurdle Model Of Food Demand With Endogenous Unit Values," 1999 Annual meeting, August 8-11, Nashville, TN 21635, American Agricultural Economics Association (New Name 2008: Agricultural and Applied Economics Association).
    11. Liu, Lei & Strawderman, Robert L. & Cowen, Mark E. & Shih, Ya-Chen T., 2010. "A flexible two-part random effects model for correlated medical costs," Journal of Health Economics, Elsevier, vol. 29(1), pages 110-123, January.
    12. Reneé van Eyden, 2012. "Consumer demand for alcoholic beverages and tobacco in Lesotho: A double-hurdle approach," Working Papers 315, Economic Research Southern Africa.
    13. Aedin Doris;, 1999. "The Means Testing Of Benefits And The Labour Supply Of The Wives Of Unemployed Men: Results From A Mover-Stayer Model," Economics Department Working Paper Series n940999, Department of Economics, National University of Ireland - Maynooth.
    14. Ana Cardoso & Elsa Fontainha & Chiara Monfardini, 2010. "Children’s and parents’ time use: empirical evidence on investment in human capital in France, Germany and Italy," Review of Economics of the Household, Springer, vol. 8(4), pages 479-504, December.
    15. Massimiliano Bratti & Alfonso Miranda, 2010. "Endogenous Treatment Effects for Count Data Models with Sample Selection or Endogenous Participation," DoQSS Working Papers 10-05, Quantitative Social Science - UCL Social Research Institute, University College London, revised 10 Dec 2010.
    16. Peter Z. Schochet, 2013. "A Statistical Model for Misreported Binary Outcomes in Clustered RCTs of Education Interventions," Journal of Educational and Behavioral Statistics, , vol. 38(5), pages 470-498, October.
    17. Kevin E. Staub, 2014. "A Causal Interpretation of Extensive and Intensive Margin Effects in Generalized Tobit Models," The Review of Economics and Statistics, MIT Press, vol. 96(2), pages 371-375, May.
    18. Bettin, Giulia & Lucchetti, Riccardo & Pigini, Claudia, 2018. "A dynamic double hurdle model for remittances: evidence from Germany," Economic Modelling, Elsevier, vol. 73(C), pages 365-377.
    19. Junious M Sichali & Jahangir A K Khan & Elvis M Gama & Hastings T Banda & Ireen Namakhoma & Grace Bongololo & Rachael Thomson & Berthe Stenberg & S Bertel Squire, 2019. "Direct costs of illness of patients with chronic cough in rural Malawi—Experiences from Dowa and Ntchisi districts," PLOS ONE, Public Library of Science, vol. 14(12), pages 1-12, December.
    20. Bettin, Giulia & Lucchetti, Riccardo & Zazzaro, Alberto, 2012. "Endogeneity and sample selection in a model for remittances," Journal of Development Economics, Elsevier, vol. 99(2), pages 370-384.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:1003696. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.