IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v7y2016i1d10.1038_ncomms11305.html
   My bibliography  Save this article

High-dimensional genomic data bias correction and data integration using MANCIE

Author

Listed:
  • Chongzhi Zang

    (Dana-Farber Cancer Institute and Harvard T.H. Chan School of Public Health
    Center for Functional Cancer Epigenetics, Dana-Farber Cancer Institute)

  • Tao Wang

    (Quantitative Biomedical Research Center, University of Texas Southwestern Medical Center
    Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center)

  • Ke Deng

    (Center for Statistical Science, Tsinghua University)

  • Bo Li

    (Dana-Farber Cancer Institute and Harvard T.H. Chan School of Public Health
    Center for Functional Cancer Epigenetics, Dana-Farber Cancer Institute
    Harvard University)

  • Sheng’en Hu

    (School of Life Sciences, Tongji University)

  • Qian Qin

    (School of Life Sciences, Tongji University)

  • Tengfei Xiao

    (Dana-Farber Cancer Institute and Harvard T.H. Chan School of Public Health
    Center for Functional Cancer Epigenetics, Dana-Farber Cancer Institute
    Dana-Farber Cancer Institute and Harvard Medical School)

  • Shihua Zhang

    (National Center for Mathematics and Interdisciplinary Sciences, Academy of Mathematics and Systems Science, Chinese Academy of Sciences)

  • Clifford A. Meyer

    (Dana-Farber Cancer Institute and Harvard T.H. Chan School of Public Health
    Center for Functional Cancer Epigenetics, Dana-Farber Cancer Institute)

  • Housheng Hansen He

    (Dana-Farber Cancer Institute and Harvard T.H. Chan School of Public Health
    Center for Functional Cancer Epigenetics, Dana-Farber Cancer Institute
    Dana-Farber Cancer Institute and Harvard Medical School
    University of Toronto)

  • Myles Brown

    (Center for Functional Cancer Epigenetics, Dana-Farber Cancer Institute
    Dana-Farber Cancer Institute and Harvard Medical School)

  • Jun S. Liu

    (Harvard University)

  • Yang Xie

    (Quantitative Biomedical Research Center, University of Texas Southwestern Medical Center
    University of Texas Southwestern Medical Center
    Simons Comprehensive Cancer Center, University of Texas Southwestern Medical Center)

  • X. Shirley Liu

    (Dana-Farber Cancer Institute and Harvard T.H. Chan School of Public Health
    Center for Functional Cancer Epigenetics, Dana-Farber Cancer Institute)

Abstract

High-dimensional genomic data analysis is challenging due to noises and biases in high-throughput experiments. We present a computational method matrix analysis and normalization by concordant information enhancement (MANCIE) for bias correction and data integration of distinct genomic profiles on the same samples. MANCIE uses a Bayesian-supported principal component analysis-based approach to adjust the data so as to achieve better consistency between sample-wise distances in the different profiles. MANCIE can improve tissue-specific clustering in ENCODE data, prognostic prediction in Molecular Taxonomy of Breast Cancer International Consortium and The Cancer Genome Atlas data, copy number and expression agreement in Cancer Cell Line Encyclopedia data, and has broad applications in cross-platform, high-dimensional data integration.

Suggested Citation

  • Chongzhi Zang & Tao Wang & Ke Deng & Bo Li & Sheng’en Hu & Qian Qin & Tengfei Xiao & Shihua Zhang & Clifford A. Meyer & Housheng Hansen He & Myles Brown & Jun S. Liu & Yang Xie & X. Shirley Liu, 2016. "High-dimensional genomic data bias correction and data integration using MANCIE," Nature Communications, Nature, vol. 7(1), pages 1-8, September.
  • Handle: RePEc:nat:natcom:v:7:y:2016:i:1:d:10.1038_ncomms11305
    DOI: 10.1038/ncomms11305
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/ncomms11305
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/ncomms11305?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Shengen Shawn Hu & Lin Liu & Qi Li & Wenjing Ma & Michael J. Guertin & Clifford A. Meyer & Ke Deng & Tingting Zhang & Chongzhi Zang, 2022. "Intrinsic bias estimation for improved analysis of bulk and single-cell chromatin accessibility profiles using SELMA," Nature Communications, Nature, vol. 13(1), pages 1-17, December.
    2. Kaiwen Wang & Yuqiu Yang & Fangjiang Wu & Bing Song & Xinlei Wang & Tao Wang, 2023. "Comparative analysis of dimension reduction methods for cytometry by time-of-flight data," Nature Communications, Nature, vol. 14(1), pages 1-18, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:7:y:2016:i:1:d:10.1038_ncomms11305. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.