IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v14y2023i1d10.1038_s41467-023-42531-9.html
   My bibliography  Save this article

Systematic analysis of paralogous regions in 41,755 exomes uncovers clinically relevant variation

Author

Listed:
  • Wouter Steyaert

    (Radboud University Medical Center
    Radboud Institute for Molecular Life Sciences)

  • Lonneke Haer-Wigman

    (Radboud University Medical Center)

  • Rolph Pfundt

    (Radboud University Medical Center)

  • Debby Hellebrekers

    (Maastricht University Medical Center + , Department of Clinical Genetics)

  • Marloes Steehouwer

    (Radboud University Medical Center)

  • Juliet Hampstead

    (Radboud University Medical Center)

  • Elke Boer

    (Radboud University Medical Center
    Radboud University, Donders Institute for Brain, Cognition and Behaviour)

  • Alexander Stegmann

    (Maastricht University Medical Center + , Department of Clinical Genetics)

  • Helger Yntema

    (Radboud University Medical Center)

  • Erik-Jan Kamsteeg

    (Radboud University Medical Center)

  • Han Brunner

    (Radboud University Medical Center
    Maastricht University Medical Center + , Department of Clinical Genetics)

  • Alexander Hoischen

    (Radboud University Medical Center
    Radboud Institute for Molecular Life Sciences
    Radboud University Medical Center, Department of Internal Medicine and Radboud Center for Infectious Diseases (RCI))

  • Christian Gilissen

    (Radboud University Medical Center
    Radboud Institute for Molecular Life Sciences)

Abstract

The short lengths of short-read sequencing reads challenge the analysis of paralogous genomic regions in exome and genome sequencing data. Most genetic variants within these homologous regions therefore remain unidentified in standard analyses. Here, we present a method (Chameleolyser) that accurately identifies single nucleotide variants and small insertions/deletions (SNVs/Indels), copy number variants and ectopic gene conversion events in duplicated genomic regions using whole-exome sequencing data. Application to a cohort of 41,755 exome samples yields 20,432 rare homozygous deletions and 2,529,791 rare SNVs/Indels, of which we show that 338,084 are due to gene conversion events. None of the SNVs/Indels are detectable using regular analysis techniques. Validation by high-fidelity long-read sequencing in 20 samples confirms >88% of called variants. Focusing on variation in known disease genes leads to a direct molecular diagnosis in 25 previously undiagnosed patients. Our method can readily be applied to existing exome data.

Suggested Citation

  • Wouter Steyaert & Lonneke Haer-Wigman & Rolph Pfundt & Debby Hellebrekers & Marloes Steehouwer & Juliet Hampstead & Elke Boer & Alexander Stegmann & Helger Yntema & Erik-Jan Kamsteeg & Han Brunner & A, 2023. "Systematic analysis of paralogous regions in 41,755 exomes uncovers clinically relevant variation," Nature Communications, Nature, vol. 14(1), pages 1-13, December.
  • Handle: RePEc:nat:natcom:v:14:y:2023:i:1:d:10.1038_s41467-023-42531-9
    DOI: 10.1038/s41467-023-42531-9
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-023-42531-9
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-023-42531-9?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:14:y:2023:i:1:d:10.1038_s41467-023-42531-9. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.