IDEAS home Printed from https://ideas.repec.org/a/bpj/sagmbi/v14y2015i3p227-241n1.html
   My bibliography  Save this article

A novel method to prioritize RNAseq data for post-hoc analysis based on absolute changes in transcript abundance

Author

Listed:
  • McNutt Patrick

    (US Army Medical Research Institute of Chemical Defense, 3100 Ricketts Point Road, Gunpowder, MD 21010, USA)

  • Gut Ian

    (National Biodefense Analysis and Countermeasures Center, 110 Thomas Johnson Drive, Frederick, MD 21702, USA)

  • Hubbard Kyle

    (US Army Medical Research Institute of Chemical Defense, 3100 Ricketts Point Road, Gunpowder, MD 21010, USA)

  • Beske Phil

    (US Army Medical Research Institute of Chemical Defense, 3100 Ricketts Point Road, Gunpowder, MD 21010, USA)

Abstract

The use of fold-change (FC) to prioritize differentially expressed genes (DEGs) for post-hoc characterization is a common technique in the analysis of RNA sequencing datasets. However, the use of FC can overlook certain population of DEGs, such as high copy number transcripts which undergo metabolically expensive changes in expression yet fail to exceed the ratiometric FC cut-off, thereby missing potential important biological information. Here we evaluate an alternative approach to prioritizing RNAseq data based on absolute changes in normalized transcript counts (ΔT) between control and treatment conditions. In five pairwise comparisons with a wide range of effect sizes, rank-ordering of DEGs based on the magnitude of ΔT produced a power curve-like distribution, in which 4.7–5.0% of transcripts were responsible for 36–50% of the cumulative change. Thus, differential gene expression is characterized by the high production-cost expression of a small number of genes (large ΔT genes), while the differential expression of the majority of genes involves a much smaller metabolic investment by the cell. To determine whether the large ΔT datasets are representative of coordinated changes in the transcriptional program, we evaluated large ΔT genes for enrichment of gene ontologies (GOs) and predicted protein interactions. In comparison to randomly selected DEGs, the large ΔT transcripts were significantly enriched for both GOs and predicted protein interactions. Furthermore, enrichments were were consistent with the biological context of each comparison yet distinct from those produced using equal-sized populations of large FC genes, indicating that the large ΔT genes represent an orthagonal transcriptional response. Finally, the composition of the large ΔT gene sets were unique to each pairwise comparison, indicating that they represent coherent and context-specific responses to biological conditions rather than the non-specific upregulation of a family of genes. These findings suggest that the large ΔT genes are not a product of random or stochastic phenomenon, but rather represent biologically meaningful changes in the transcriptional program. They furthermore imply that high abundance transcripts are associated with particularly cellular states, and as cells change in response to internal or external conditions, the relative distribution of the abundant transcripts changes accordingly. Thus, prioritization of DEGs based on the concept of metabolic cost is a simple yet powerful method to identify biologically important transcriptional changes and provide novel insights into cellular behaviors.

Suggested Citation

  • McNutt Patrick & Gut Ian & Hubbard Kyle & Beske Phil, 2015. "A novel method to prioritize RNAseq data for post-hoc analysis based on absolute changes in transcript abundance," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 14(3), pages 227-241, June.
  • Handle: RePEc:bpj:sagmbi:v:14:y:2015:i:3:p:227-241:n:1
    DOI: 10.1515/sagmb-2014-0018
    as

    Download full text from publisher

    File URL: https://doi.org/10.1515/sagmb-2014-0018
    Download Restriction: For access to full text, subscription to the journal or payment for the individual article is required.

    File URL: https://libkey.io/10.1515/sagmb-2014-0018?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Jeff Alstott & Ed Bullmore & Dietmar Plenz, 2014. "powerlaw: A Python Package for Analysis of Heavy-Tailed Distributions," PLOS ONE, Public Library of Science, vol. 9(1), pages 1-11, January.
    2. Björn Schwanhäusser & Dorothea Busse & Na Li & Gunnar Dittmar & Johannes Schuchhardt & Jana Wolf & Wei Chen & Matthias Selbach, 2011. "Global quantification of mammalian gene expression control," Nature, Nature, vol. 473(7347), pages 337-342, May.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ji Min Lee & Henrik M. Hammarén & Mikhail M. Savitski & Sung Hee Baek, 2023. "Control of protein stability by post-translational modifications," Nature Communications, Nature, vol. 14(1), pages 1-16, December.
    2. Mohammad Soltani & Cesar A Vargas-Garcia & Duarte Antunes & Abhyudai Singh, 2016. "Intercellular Variability in Protein Levels from Stochastic Expression and Noisy Cell Cycle Processes," PLOS Computational Biology, Public Library of Science, vol. 12(8), pages 1-23, August.
    3. Sumeet Kumar & Binxuan Huang & Ramon Alfonso Villa Cox & Kathleen M. Carley, 2021. "An anatomical comparison of fake-news and trusted-news sharing pattern on Twitter," Computational and Mathematical Organization Theory, Springer, vol. 27(2), pages 109-133, June.
    4. Rutten, Philip & Lees, Michael H. & Klous, Sander & Sloot, Peter M.A., 2021. "Intermittent and persistent movement patterns of dance event visitors in large sporting venues," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 563(C).
    5. Jasjot Singh & Hadeer Elhabashy & Pathma Muthukottiappan & Markus Stepath & Martin Eisenacher & Oliver Kohlbacher & Volkmar Gieselmann & Dominic Winter, 2022. "Cross-linking of the endolysosomal system reveals potential flotillin structures and cargo," Nature Communications, Nature, vol. 13(1), pages 1-18, December.
    6. Jovanovic, Franck & Schinckus, Christophe, 2016. "Breaking down the barriers between econophysics and financial economics," International Review of Financial Analysis, Elsevier, vol. 47(C), pages 256-266.
    7. Yuping Chen & Jo-Hsi Huang & Connie Phong & James E. Ferrell, 2024. "Viscosity-dependent control of protein synthesis and degradation," Nature Communications, Nature, vol. 15(1), pages 1-11, December.
    8. Khalilzadeh, Jalayer, 2022. "It is a small world, or is it? A look into two decades of tourism system," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 606(C).
    9. Shu Takahashi & Kento Yamamoto & Shumpei Kobayashi & Ryoma Kondo & Ryohei Hisano, 2024. "Dynamic Link and Flow Prediction in Bank Transfer Networks," Papers 2409.08718, arXiv.org, revised Oct 2024.
    10. Gábor Csárdi & Alexander Franks & David S Choi & Edoardo M Airoldi & D Allan Drummond, 2015. "Accounting for Experimental Noise Reveals That mRNA Levels, Amplified by Post-Transcriptional Processes, Largely Determine Steady-State Protein Levels in Yeast," PLOS Genetics, Public Library of Science, vol. 11(5), pages 1-32, May.
    11. Li, Heyang & Wu, Meijun & Wang, Yougui & Zeng, An, 2022. "Bibliographic coupling networks reveal the advantage of diversification in scientific projects," Journal of Informetrics, Elsevier, vol. 16(3).
    12. Jiaqi Liang & Linjing Li & Daniel Zeng, 2018. "Evolutionary dynamics of cryptocurrency transaction networks: An empirical study," PLOS ONE, Public Library of Science, vol. 13(8), pages 1-18, August.
    13. Jiaqi Liang & Linjing Li & Daniel Zeng, 2018. "Evolutionary dynamics of cryptocurrency transaction networks: An empirical study," Papers 1808.08585, arXiv.org.
    14. Yan Li & Chen Xu & Bing Wang & Fujiang Xu & Fahan Ma & Yuanyuan Qu & Dongxian Jiang & Kai Li & Jinwen Feng & Sha Tian & Xiaohui Wu & Yunzhi Wang & Yang Liu & Zhaoyu Qin & Yalan Liu & Jing Qin & Qi Son, 2022. "Proteomic characterization of gastric cancer response to chemotherapy and targeted therapy reveals potential therapeutic strategies," Nature Communications, Nature, vol. 13(1), pages 1-26, December.
    15. Katahira, Kei & Chen, Yu & Akiyama, Eizo, 2021. "Self-organized Speculation Game for the spontaneous emergence of financial stylized facts," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 582(C).
    16. Yang, Alex Jie & Wu, Linwei & Zhang, Qi & Wang, Hao & Deng, Sanhong, 2023. "The k-step h-index in citation networks at the paper, author, and institution levels," Journal of Informetrics, Elsevier, vol. 17(4).
    17. Wang, Xuhui & Wu, Jiao & Yang, Zheng & Xu, Kesheng & Wang, Zhengling & Zheng, Muhua, 2024. "The correlation between independent edge and triangle degrees promote the explosive information spreading," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 640(C).
    18. Behrouzi, Saman & Shafaeipour Sarmoor, Zahra & Hajsadeghi, Khosrow & Kavousi, Kaveh, 2020. "Predicting scientific research trends based on link prediction in keyword networks," Journal of Informetrics, Elsevier, vol. 14(4).
    19. Hadi Arbabi & Martin Mayfield & Philip McCann, 2020. "Productivity, infrastructure and urban density—an allometric comparison of three European city regions across scales," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 183(1), pages 211-228, January.
    20. Kaslik, Eva & Rădulescu, Ileana Rodica, 2022. "Stability and bifurcations in fractional-order gene regulatory networks," Applied Mathematics and Computation, Elsevier, vol. 421(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bpj:sagmbi:v:14:y:2015:i:3:p:227-241:n:1. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Peter Golla (email available below). General contact details of provider: https://www.degruyter.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.