IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0233630.html
   My bibliography  Save this article

Characterization of histone modification patterns and prediction of novel promoters using functional principal component analysis

Author

Listed:
  • Mijeong Kim
  • Shili Lin

Abstract

Characterization of distinct histone methylation and acetylation binding patterns in promoters and prediction of novel regulatory regions remains an important area of genomic research, as it is hypothesized that distinct chromatin signatures may specify unique genomic functions. However, methods that have been proposed in the literature are either descriptive in nature or are fully parametric and hence more restrictive in pattern discovery. In this article, we propose a two-step non-parametric statistical inference procedure to characterize unique histone modification patterns and apply it to analyzing the binding patterns of four histone marks, H3K4me2, H3K4me3, H3K9ac, and H4K20me1, in human B-lymphoblastoid cells. In the first step, we used a functional principal component analysis method to represent the concatenated binding patterns of these four histone marks around the transcription start sites as smooth curves. In the second step, we clustered these curves to reveal several unique classes of binding patterns. These uncovered patterns were used in turn to scan the whole-genome to predict novel and alternative promoters. Our analyses show that there are three distinct promoter binding patterns of active genes. Further, 19654 regions not within known gene promoters were found to overlap with human ESTs, CpG islands, or common SNPs, indicative of their potential role in gene regulation, including being potential novel promoter regions.

Suggested Citation

  • Mijeong Kim & Shili Lin, 2020. "Characterization of histone modification patterns and prediction of novel promoters using functional principal component analysis," PLOS ONE, Public Library of Science, vol. 15(5), pages 1-16, May.
  • Handle: RePEc:plo:pone00:0233630
    DOI: 10.1371/journal.pone.0233630
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0233630
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0233630&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0233630?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Julien Jacques & Cristian Preda, 2014. "Functional data clustering: a survey," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 8(3), pages 231-255, September.
    2. Jason Ernst & Pouya Kheradpour & Tarjei S. Mikkelsen & Noam Shoresh & Lucas D. Ward & Charles B. Epstein & Xiaolan Zhang & Li Wang & Robbyn Issner & Michael Coyne & Manching Ku & Timothy Durham & Mano, 2011. "Mapping and analysis of chromatin state dynamics in nine human cell types," Nature, Nature, vol. 473(7345), pages 43-49, May.
    3. Erik Engelen & Johannes H. Brandsma & Maaike J. Moen & Luca Signorile & Dick H. W. Dekkers & Jeroen Demmers & Christel E. M. Kockx & Zehila Ozgür & Wilfred F. J. van IJcken & Debbie L. C. van den Berg, 2015. "Proteins that bind regulatory regions identified by histone modification chromatin immunoprecipitations and mass spectrometry," Nature Communications, Nature, vol. 6(1), pages 1-12, November.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Yifan Zhu & Chongzhi Di & Ying Qing Chen, 2019. "Clustering Functional Data with Application to Electronic Medication Adherence Monitoring in HIV Prevention Trials," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 11(2), pages 238-261, July.
    2. Chuyuan Lin & Ying Yu & Lucas Y. Wu & Jiguo Cao, 2023. "Unsupervised learning on U.S. weather forecast performance," Computational Statistics, Springer, vol. 38(3), pages 1193-1213, September.
    3. Michael Vogt & Oliver Linton, 2015. "Classification of nonparametric regression functions in heterogeneous panels," CeMMAP working papers CWP06/15, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    4. Golovkine, Steven & Klutchnikoff, Nicolas & Patilea, Valentin, 2022. "Clustering multivariate functional data using unsupervised binary trees," Computational Statistics & Data Analysis, Elsevier, vol. 168(C).
    5. Wang, Bingling & Li, Yingxing & Härdle, Wolfgang Karl, 2022. "K-expectiles clustering," Journal of Multivariate Analysis, Elsevier, vol. 189(C).
    6. Seungsoo Hahn & Dongsup Kim, 2015. "Identifying and Reducing Systematic Errors in Chromosome Conformation Capture Data," PLOS ONE, Public Library of Science, vol. 10(12), pages 1-17, December.
    7. Aneiros, Germán & Cao, Ricardo & Fraiman, Ricardo & Genest, Christian & Vieu, Philippe, 2019. "Recent advances in functional data analysis and high-dimensional statistics," Journal of Multivariate Analysis, Elsevier, vol. 170(C), pages 3-9.
    8. Michael Vogt & Oliver Linton, 2017. "Classification of non-parametric regression functions in longitudinal data models," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 79(1), pages 5-27, January.
    9. Ye Cai & Huifen Cao & Fang Wang & Yufei Zhang & Philipp Kapranov, 2022. "Complex genomic patterns of abasic sites in mammalian DNA revealed by a high-resolution SSiNGLe-AP method," Nature Communications, Nature, vol. 13(1), pages 1-21, December.
    10. Chirag Nepal & Jesper B. Andersen, 2023. "Alternative promoters in CpG depleted regions are prevalently associated with epigenetic misregulation of liver cancer transcriptomes," Nature Communications, Nature, vol. 14(1), pages 1-14, December.
    11. O. I. Traore & P. Cristini & N. Favretto-Cristini & L. Pantera & P. Vieu & S. Viguier-Pla, 2019. "Clustering acoustic emission signals by mixing two stages dimension reduction and nonparametric approaches," Computational Statistics, Springer, vol. 34(2), pages 631-652, June.
    12. Haoxi Chai & Harianto Tjong & Peng Li & Wei Liao & Ping Wang & Chee Hong Wong & Chew Yee Ngan & Warren J. Leonard & Chia-Lin Wei & Yijun Ruan, 2023. "ChIATAC is an efficient strategy for multi-omics mapping of 3D epigenomes from low-cell inputs," Nature Communications, Nature, vol. 14(1), pages 1-17, December.
    13. Michael Vogt & Oliver Linton, 2015. "Classification of nonparametric regression functions in heterogeneous panels," CeMMAP working papers 06/15, Institute for Fiscal Studies.
    14. Zhangyuan Pan & Yuelin Yao & Hongwei Yin & Zexi Cai & Ying Wang & Lijing Bai & Colin Kern & Michelle Halstead & Ganrea Chanthavixay & Nares Trakooljul & Klaus Wimmers & Goutam Sahana & Guosheng Su & M, 2021. "Pig genome functional annotation enhances the biological interpretation of complex traits and human disease," Nature Communications, Nature, vol. 12(1), pages 1-15, December.
    15. Maurizio Mangolini & Alba Maiques-Diaz & Stella Charalampopoulou & Elena Gerhard-Hartmann & Johannes Bloehdorn & Andrew Moore & Giorgia Giachetti & Junyan Lu & Valar Nila Roamio Franklin & Chandra Sek, 2022. "Viral transduction of primary human lymphoma B cells reveals mechanisms of NOTCH-mediated immune escape," Nature Communications, Nature, vol. 13(1), pages 1-21, December.
    16. Carlos Rivera & Hun-Goo Lee & Anna Lappala & Danni Wang & Verónica Noches & Montserrat Olivares-Costa & Marcela Sjöberg-Herrera & Jeannie T. Lee & María Estela Andrés, 2022. "Unveiling RCOR1 as a rheostat at transcriptionally permissive chromatin," Nature Communications, Nature, vol. 13(1), pages 1-15, December.
    17. Tapia, Mariela & Heinemann, Detlev & Ballari, Daniela & Zondervan, Edwin, 2022. "Spatio-temporal characterization of long-term solar resource using spatial functional data analysis: Understanding the variability and complementarity of global horizontal irradiance in Ecuador," Renewable Energy, Elsevier, vol. 189(C), pages 1176-1193.
    18. Noah Dukler & Mehreen R. Mughal & Ritika Ramani & Yi-Fei Huang & Adam Siepel, 2022. "Extreme purifying selection against point mutations in the human genome," Nature Communications, Nature, vol. 13(1), pages 1-12, December.
    19. Ting Shen & Ting Ni & Jiaxuan Chen & Haitao Chen & Xiaopin Ma & Guangwen Cao & Tianzhi Wu & Haisheng Xie & Bin Zhou & Gang Wei & Hexige Saiyin & Suqin Shen & Peng Yu & Qianyi Xiao & Hui Liu & Yuzheng , 2022. "An enhancer variant at 16q22.1 predisposes to hepatocellular carcinoma via regulating PRMT7 expression," Nature Communications, Nature, vol. 13(1), pages 1-15, December.
    20. Julia Truch & Damien J. Downes & Caroline Scott & E. Ravza Gür & Jelena M. Telenius & Emmanouela Repapi & Ron Schwessinger & Matthew Gosden & Jill M. Brown & Stephen Taylor & Pak Leng Cheong & Jim R. , 2022. "The chromatin remodeller ATRX facilitates diverse nuclear processes, in a stochastic manner, in both heterochromatin and euchromatin," Nature Communications, Nature, vol. 13(1), pages 1-16, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0233630. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.