
Identifying atypically expressed chromosome regions using RNA-Seq data

Author

Listed:
  • Vinícius Diniz Mayrink

    (ICEx Universidade Federal de Minas Gerais, Av. Antônio Carlos)

  • Flávio B. Gonçalves

    (ICEx Universidade Federal de Minas Gerais, Av. Antônio Carlos)

Abstract

The number of studies analyzing RNA-Seq data has grown rapidly in recent years, making this type of gene expression measurement a strong competitor to DNA microarrays. This paper proposes a Bayesian model to detect low- and highly expressed chromosome regions using RNA-Seq data. The methodology builds on recent work designed to detect highly expressed (overexpressed) regions in microarray data. A hidden Markov model is developed from a mixture of Gaussian distributions with ordered means, so that the first and last mixture components accommodate the underexpressed and overexpressed genes, respectively. By assuming a hierarchical Markov dependence structure, the model is flexible enough to deal efficiently with the highly irregular spacing of the data. The analysis of four cancer data sets (breast, lung, ovarian and uterus) is presented. Results indicate that the proposed model is selective in determining expression status, is robust to prior specifications, and provides tools for a global or local search of under- and overexpressed chromosome regions.
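
The idea of a hidden Markov model over a Gaussian mixture with ordered means can be illustrated with a small sketch. This is not the authors' Bayesian model: it assumes all parameters are fixed rather than inferred, it uses a single homogeneous transition matrix (so it ignores the irregular spacing between genes that the paper handles through a hierarchical dependence structure), and every name and numeric value below is hypothetical. It only shows how forward-backward smoothing assigns each position to an "under", "typical" or "over" state.

```python
import math


def normal_pdf(x, mean, sd):
    """Density of a Gaussian with the given mean and standard deviation."""
    z = (x - mean) / sd
    return math.exp(-0.5 * z * z) / (sd * math.sqrt(2.0 * math.pi))


def posterior_states(obs, means, sds, trans, init):
    """Forward-backward smoothing for a Gaussian HMM with fixed parameters.

    Returns, for each observation, the posterior probability of each state.
    State 0 is the lowest-mean component and the last state the highest.
    """
    if list(means) != sorted(means):
        raise ValueError("means must be ordered from lowest to highest")
    k, n = len(means), len(obs)
    emis = [[normal_pdf(x, means[j], sds[j]) for j in range(k)] for x in obs]

    # Forward pass, normalised at each step to avoid numerical underflow.
    alpha, scales = [], []
    for t in range(n):
        if t == 0:
            row = [init[j] * emis[0][j] for j in range(k)]
        else:
            prev = alpha[-1]
            row = [
                emis[t][j] * sum(prev[i] * trans[i][j] for i in range(k))
                for j in range(k)
            ]
        total = sum(row)
        alpha.append([v / total for v in row])
        scales.append(total)

    # Backward pass, using the same scaling constants as the forward pass.
    beta = [[1.0] * k for _ in range(n)]
    for t in range(n - 2, -1, -1):
        for i in range(k):
            beta[t][i] = sum(
                trans[i][j] * emis[t + 1][j] * beta[t + 1][j] for j in range(k)
            ) / scales[t + 1]

    # Combine both passes into smoothed state probabilities.
    post = []
    for t in range(n):
        row = [alpha[t][j] * beta[t][j] for j in range(k)]
        total = sum(row)
        post.append([v / total for v in row])
    return post


# Hypothetical standardised expression values ordered along a chromosome.
values = [0.1, -0.2, 2.9, 3.2, 3.1, 0.0, -3.0, -2.8, 0.2]
means = [-3.0, 0.0, 3.0]          # assumed: under < typical < over
sds = [1.0, 1.0, 1.0]             # assumed common spread
trans = [[0.8, 0.1, 0.1],         # assumed "sticky" transitions
         [0.1, 0.8, 0.1],
         [0.1, 0.1, 0.8]]
init = [1.0 / 3.0] * 3

post = posterior_states(values, means, sds, trans, init)
labels = [max(range(3), key=lambda j: row[j]) for row in post]
print(labels)
```

Here a label of 0 marks a position assigned to the lowest-mean (underexpressed) component and 2 marks the highest-mean (overexpressed) component; consecutive positions sharing a label suggest a candidate region. In a fully Bayesian treatment such as the one the abstract describes, the means, variances and transition probabilities would receive priors and be estimated from the data rather than fixed.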

Suggested Citation

  • Vinícius Diniz Mayrink & Flávio B. Gonçalves, 2020. "Identifying atypically expressed chromosome regions using RNA-Seq data," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 29(3), pages 619-649, September.
  • Handle: RePEc:spr:stmapp:v:29:y:2020:i:3:d:10.1007_s10260-019-00496-4
    DOI: 10.1007/s10260-019-00496-4

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10260-019-00496-4
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.


    References listed on IDEAS

    1. Kim‐Anh Do & Peter Müller & Feng Tang, 2005. "A Bayesian mixture model for differential gene expression," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 54(3), pages 627-644, June.
    2. Panagiotis Papastamoulis & Magnus Rattray, 2018. "A Bayesian model selection approach for identifying differentially expressed transcripts from RNA sequencing data," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 67(1), pages 3-23, January.
    3. Christopher A. Maher & Chandan Kumar-Sinha & Xuhong Cao & Shanker Kalyana-Sundaram & Bo Han & Xiaojun Jing & Lee Sam & Terrence Barrette & Nallasivam Palanisamy & Arul M. Chinnaiyan, 2009. "Transcriptome sequencing to detect gene fusions in cancer," Nature, Nature, vol. 458(7234), pages 97-101, March.
    4. Bivand, Roger & Piras, Gianfranco, 2015. "Comparing Implementations of Estimation Methods for Spatial Econometrics," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 63(i18).
    5. Lewin Alex & Bochkina Natalia & Richardson Sylvia, 2007. "Fully Bayesian Mixture Model for Differential Gene Expression: Simulations and Model Checks," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 6(1), pages 1-28, December.
    6. Vinícius Diniz Mayrink & Flávio Bambirra Gonçalves, 2017. "A Bayesian hidden Markov mixture model to detect overexpressed chromosome regions," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 66(2), pages 387-412, February.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Vinícius Diniz Mayrink & Flávio Bambirra Gonçalves, 2017. "A Bayesian hidden Markov mixture model to detect overexpressed chromosome regions," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 66(2), pages 387-412, February.
    2. Guohuan Su & Adam Mertel & Sébastien Brosse & Justin M. Calabrese, 2023. "Species invasiveness and community invasibility of North American freshwater fish fauna revealed via trait-based analysis," Nature Communications, Nature, vol. 14(1), pages 1-12, December.
    3. Chakir, Raja & Lungarska, Anna, 2015. "Agricultural land rents in land use models: a spatial econometric analysis," 150th Seminar, October 22-23, 2015, Edinburgh, Scotland 212641, European Association of Agricultural Economists.
    4. Meilan An & Jeffrey Vitale & Kwideok Han & John N. Ng’ombe & Inbae Ji, 2021. "Effects of Spatial Characteristics on the Spread of the Highly Pathogenic Avian Influenza (HPAI) in Korea," IJERPH, MDPI, vol. 18(8), pages 1-13, April.
    5. Demidova, Olga, 2021. "Methods of spatial econometrics and evaluation of government programs effectiveness," Applied Econometrics, Russian Presidential Academy of National Economy and Public Administration (RANEPA), vol. 64, pages 107-134.
    6. Iacopo Odoardi & Donatella Furia & Piera Cascioli, 2021. "Can social support compensate for missing family support? An examination of dropout rates in Italy," Regional Science Policy & Practice, Wiley Blackwell, vol. 13(1), pages 121-139, February.
    7. Ozgun, Burcu & Broekel, Tom, 2021. "The geography of innovation and technology news - An empirical study of the German news media," Technological Forecasting and Social Change, Elsevier, vol. 167(C).
    8. Han, Bing & Dalal, Siddhartha R., 2012. "A Bernstein-type estimator for decreasing density with application to p-value adjustments," Computational Statistics & Data Analysis, Elsevier, vol. 56(2), pages 427-437.
    9. Pinto, Allan & Griffin, Terry W., 2022. "Detecting bubbles via single time-series variable: applying spatial specification tests to farmland values," 2022 Annual Meeting, July 31-August 2, Anaheim, California 322534, Agricultural and Applied Economics Association.
    10. Gianfranco Piras & Mauricio Sarrias, 2023. "Heterogeneous spatial models in R: spatial regimes models," Journal of Spatial Econometrics, Springer, vol. 4(1), pages 1-32, December.
    11. Wang Chamont & Gevertz Jana L., 2016. "Finding causative genes from high-dimensional data: an appraisal of statistical and machine learning approaches," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 15(4), pages 321-347, August.
    12. Kandt, Jens & Leak, Alistair, 2019. "Examining inclusive mobility through smartcard data: What shall we make of senior citizens' declining bus patronage in the West Midlands?," Journal of Transport Geography, Elsevier, vol. 79(C), pages 1-1.
    13. Bivand, Roger & Piras, Gianfranco, 2015. "Comparing Implementations of Estimation Methods for Spatial Econometrics," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 63(i18).
    14. Canale, Antonio & Lijoi, Antonio & Nipoti, Bernardo & Prünster, Igor, 2023. "Inner spike and slab Bayesian nonparametric models," Econometrics and Statistics, Elsevier, vol. 27(C), pages 120-135.
    15. Paul Feichtinger & Klaus Salhofer, 2016. "The Fischler Reform of the Common Agricultural Policy and Agricultural Land Prices," Land Economics, University of Wisconsin Press, vol. 92(3), pages 411-432.
    16. Anastasiya Penska, 2015. "Determinants of Corruption in Ukrainian Regions: Spatial Analysis," Ekonomia journal, Faculty of Economic Sciences, University of Warsaw, vol. 42.
    17. Michael Lebacher & Paul W. Thurner & Göran Kauermann, 2021. "Censored regression for modelling small arms trade volumes and its ‘Forensic’ use for exploring unreported trades," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 70(4), pages 909-933, August.
    18. Gianfranco Piras, 2014. "Impact estimates for static spatial panel data models in R," Letters in Spatial and Resource Sciences, Springer, vol. 7(3), pages 213-223, October.
    19. Cipolli III, William & Hanson, Timothy & McLain, Alexander C., 2016. "Bayesian nonparametric multiple testing," Computational Statistics & Data Analysis, Elsevier, vol. 101(C), pages 64-79.
    20. Barrientos, Andrés F. & Canale, Antonio, 2021. "A Bayesian goodness-of-fit test for regression," Computational Statistics & Data Analysis, Elsevier, vol. 155(C).
