IDEAS home Printed from https://ideas.repec.org/a/plo/pgen00/1001154.html
   My bibliography  Save this article

Characterising and Predicting Haploinsufficiency in the Human Genome

Author

Listed:
  • Ni Huang
  • Insuk Lee
  • Edward M Marcotte
  • Matthew E Hurles

Abstract

Haploinsufficiency, wherein a single functional copy of a gene is insufficient to maintain normal function, is a major cause of dominant disease. Human disease studies have identified several hundred haploinsufficient (HI) genes. We have compiled a map of 1,079 haplosufficient (HS) genes by systematic identification of genes unambiguously and repeatedly compromised by copy number variation among 8,458 apparently healthy individuals and contrasted the genomic, evolutionary, functional, and network properties between these HS genes and known HI genes. We found that HI genes are typically longer and have more conserved coding sequences and promoters than HS genes. HI genes exhibit higher levels of expression during early development and greater tissue specificity. Moreover, within a probabilistic human functional interaction network HI genes have more interaction partners and greater network proximity to other known HI genes. We built a predictive model on the basis of these differences and annotated 12,443 genes with their predicted probability of being haploinsufficient. We validated these predictions of haploinsufficiency by demonstrating that genes with a high predicted probability of exhibiting haploinsufficiency are enriched among genes implicated in human dominant diseases and among genes causing abnormal phenotypes in heterozygous knockout mice. We have transformed these gene-based haploinsufficiency predictions into haploinsufficiency scores for genic deletions, which we demonstrate to better discriminate between pathogenic and benign deletions than consideration of the deletion size or numbers of genes deleted. These robust predictions of haploinsufficiency support clinical interpretation of novel loss-of-function variants and prioritization of variants and genes for follow-up studies.Author Summary: Humans, like most complex organisms, have two copies of most genes in their genome, one from the mother and one from the father. This redundancy provides a back-up copy for most genes, should one copy be lost through mutation. For a minority of genes, one functional copy is not enough to sustain normal human function, and mutations causing the loss of function of one of the copies of such genes are a major cause of childhood developmental diseases. Over the past 20 years medical geneticists have identified over 300 such genes, but it is not known how many of the 22,000 genes in our genome may also be sensitive to gene loss. By comparing these ∼300 genes known to be sensitive to gene loss with over 1,000 genes where loss of a single copy does not result in disease, we have identified some key evolutionary and functional similarities between genes sensitive to loss of a single copy. We have used these similarities to predict for most genes in the genome, whether loss of a single copy is likely to result in disease. These predictions will help in the interpretation of mutations seen in patients.

Suggested Citation

  • Ni Huang & Insuk Lee & Edward M Marcotte & Matthew E Hurles, 2010. "Characterising and Predicting Haploinsufficiency in the Human Genome," PLOS Genetics, Public Library of Science, vol. 6(10), pages 1-11, October.
  • Handle: RePEc:plo:pgen00:1001154
    DOI: 10.1371/journal.pgen.1001154
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosgenetics/article?id=10.1371/journal.pgen.1001154
    Download Restriction: no

    File URL: https://journals.plos.org/plosgenetics/article/file?id=10.1371/journal.pgen.1001154&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pgen.1001154?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Sarah B. Ng & Emily H. Turner & Peggy D. Robertson & Steven D. Flygare & Abigail W. Bigham & Choli Lee & Tristan Shaffer & Michelle Wong & Arindam Bhattacharjee & Evan E. Eichler & Michael Bamshad & D, 2009. "Targeted capture and massively parallel sequencing of 12 human exomes," Nature, Nature, vol. 461(7261), pages 272-276, September.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Yingjun Xie & Xiaofang Sun & Wei Jian & Jingsi Chen & Dunjin Chen, 2018. "Identification of Microdeletion of 7q36.1-qter in Fetal Hemivertebrae with Scoliosis," Biomedical Journal of Scientific & Technical Research, Biomedical Research Network+, LLC, vol. 9(5), pages 7458-7462, October.
    2. Shinsuke Ohnuki & Yoshikazu Ohya, 2018. "High-dimensional single-cell phenotyping reveals extensive haploinsufficiency," PLOS Biology, Public Library of Science, vol. 16(5), pages 1-23, May.
    3. István Bartha & Antonio Rausell & Paul J McLaren & Pejman Mohammadi & Manuel Tardaguila & Nimisha Chaturvedi & Jacques Fellay & Amalio Telenti, 2015. "The Characteristics of Heterozygous Protein Truncating Variants in the Human Genome," PLOS Computational Biology, Public Library of Science, vol. 11(12), pages 1-14, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Elaine T. Lim & Yingleong Chan & Pepper Dawes & Xiaoge Guo & Serkan Erdin & Derek J. C. Tai & Songlei Liu & Julia M. Reichert & Mannix J. Burns & Ying Kai Chan & Jessica J. Chiang & Katharina Meyer & , 2022. "Orgo-Seq integrates single-cell and bulk transcriptomic data to identify cell type specific-driver genes associated with autism spectrum disorder," Nature Communications, Nature, vol. 13(1), pages 1-14, December.
    2. Zura Kakushadze & Willie Yu, 2017. "Mutation Clusters from Cancer Exome," Papers 1707.08504, arXiv.org.
    3. Jason Flannick & Joshua M Korn & Pierre Fontanillas & George B Grant & Eric Banks & Mark A Depristo & David Altshuler, 2012. "Efficiency and Power as a Function of Sequence Coverage, SNP Array Density, and Imputation," PLOS Computational Biology, Public Library of Science, vol. 8(7), pages 1-13, July.
    4. Thomas J Hoffmann & Bronya J Keats & Noriko Yoshikawa & Catherine Schaefer & Neil Risch & Lawrence R Lustig, 2016. "A Large Genome-Wide Association Study of Age-Related Hearing Impairment Using Electronic Health Records," PLOS Genetics, Public Library of Science, vol. 12(10), pages 1-20, October.
    5. Miao-Xin Li & Johnny S H Kwan & Su-Ying Bao & Wanling Yang & Shu-Leong Ho & Yong-Qiang Song & Pak C Sham, 2013. "Predicting Mendelian Disease-Causing Non-Synonymous Single Nucleotide Variants in Exome Sequencing Studies," PLOS Genetics, Public Library of Science, vol. 9(1), pages 1-11, January.
    6. Degui Zhi & Rui Chen, 2012. "Statistical Guidance for Experimental Design and Data Analysis of Mutation Detection in Rare Monogenic Mendelian Diseases by Exome Sequencing," PLOS ONE, Public Library of Science, vol. 7(2), pages 1-11, February.
    7. Kirsley Chennen & Thomas Weber & Xavière Lornage & Arnaud Kress & Johann Böhm & Julie Thompson & Jocelyn Laporte & Olivier Poch, 2020. "MISTIC: A prediction tool to reveal disease-relevant deleterious missense variants," PLOS ONE, Public Library of Science, vol. 15(7), pages 1-23, July.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pgen00:1001154. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosgenetics (email available below). General contact details of provider: https://journals.plos.org/plosgenetics/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.