IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v15y2024i1d10.1038_s41467-024-51282-0.html

VolcanoSV enables accurate and robust structural variant calling in diploid genomes from single-molecule long read sequencing

Author

Listed:
  • Can Luo

    (Vanderbilt University)

  • Yichen Henry Liu

    (Vanderbilt University)

  • Xin Maizie Zhou

    (Vanderbilt University)

Abstract

Structural variants (SVs) significantly contribute to human genome diversity and play a crucial role in precision medicine. Although advancements in single-molecule long-read sequencing offer a groundbreaking resource for SV detection, identifying SV breakpoints and sequences accurately and robustly remains challenging. We introduce VolcanoSV, an innovative hybrid SV detection pipeline that utilizes both a reference genome and local de novo assembly to generate a phased diploid assembly. VolcanoSV uses phased SNPs and unique k-mer similarity analysis, enabling precise haplotype-resolved SV discovery. VolcanoSV is adept at constructing comprehensive genetic maps encompassing SNPs, small indels, and all types of SVs, making it well-suited for human genomics studies. Our extensive experiments demonstrate that VolcanoSV surpasses state-of-the-art assembly-based tools in the detection of insertion and deletion SVs, exhibiting superior recall, precision, F1 scores, and genotype accuracy across a diverse range of datasets, including low-coverage (10x) datasets. VolcanoSV outperforms assembly-based tools in the identification of complex SVs, including translocations, duplications, and inversions, in both simulated and real cancer data. Moreover, VolcanoSV is robust to various evaluation parameters and accurately identifies breakpoints and SV sequences.
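The abstract compares tools by recall, precision, and F1 score. As a reminder of how those metrics relate, here is a minimal sketch in Python. It assumes SV calls have already been matched against a truth set and reduced to counts of true positives, false positives, and false negatives. The class name `SVBenchmarkCounts` and the example counts are hypothetical illustrations; they are not taken from the paper or from VolcanoSV itself.

```python
from dataclasses import dataclass


@dataclass
class SVBenchmarkCounts:
    """Hypothetical container for the outcome of comparing SV calls to a truth set."""
    true_positives: int   # called SVs that match a truth-set SV
    false_positives: int  # called SVs with no truth-set match
    false_negatives: int  # truth-set SVs that were not called


def precision(counts: SVBenchmarkCounts) -> float:
    """Fraction of called SVs that are correct."""
    called = counts.true_positives + counts.false_positives
    return counts.true_positives / called if called else 0.0


def recall(counts: SVBenchmarkCounts) -> float:
    """Fraction of truth-set SVs that were recovered."""
    truth = counts.true_positives + counts.false_negatives
    return counts.true_positives / truth if truth else 0.0


def f1_score(counts: SVBenchmarkCounts) -> float:
    """Harmonic mean of precision and recall."""
    p, r = precision(counts), recall(counts)
    return 2 * p * r / (p + r) if (p + r) else 0.0


if __name__ == "__main__":
    # Illustrative counts only; not results from the paper.
    example = SVBenchmarkCounts(true_positives=90, false_positives=10, false_negatives=30)
    print(f"precision={precision(example):.3f}")
    print(f"recall={recall(example):.3f}")
    print(f"F1={f1_score(example):.3f}")
```

Because F1 is a harmonic mean, a tool scores well only when precision and recall are both high, which is why it is commonly reported alongside the two individual metrics.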

Suggested Citation

  • Can Luo & Yichen Henry Liu & Xin Maizie Zhou, 2024. "VolcanoSV enables accurate and robust structural variant calling in diploid genomes from single-molecule long read sequencing," Nature Communications, Nature, vol. 15(1), pages 1-20, December.
  • Handle: RePEc:nat:natcom:v:15:y:2024:i:1:d:10.1038_s41467-024-51282-0
    DOI: 10.1038/s41467-024-51282-0

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-024-51282-0
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-024-51282-0?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item


    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Cristian Groza & Xun Chen & Travis J. Wheeler & Guillaume Bourque & Clément Goubert, 2024. "A unified framework to analyze transposable element insertion polymorphisms using graph genomes," Nature Communications, Nature, vol. 15(1), pages 1-17, December.
    2. Yichen Henry Liu & Can Luo & Staunton G. Golding & Jacob B. Ioffe & Xin Maizie Zhou, 2024. "Tradeoffs in alignment and assembly-based methods for structural variant detection with long-read sequencing data," Nature Communications, Nature, vol. 15(1), pages 1-22, December.
    3. M. Mahmoud & Y. Huang & K. Garimella & P. A. Audano & W. Wan & N. Prasad & R. E. Handsaker & S. Hall & A. Pionzio & M. C. Schatz & M. E. Talkowski & E. E. Eichler & S. E. Levy & F. J. Sedlazeck, 2024. "Utility of long-read sequencing for All of Us," Nature Communications, Nature, vol. 15(1), pages 1-13, December.
    4. Xiaoling Tong & Min-Jin Han & Kunpeng Lu & Shuaishuai Tai & Shubo Liang & Yucheng Liu & Hai Hu & Jianghong Shen & Anxing Long & Chengyu Zhan & Xin Ding & Shuo Liu & Qiang Gao & Bili Zhang & Linli Zhou, 2022. "High-resolution silkworm pan-genome provides genetic insights into artificial selection and ecological adaptation," Nature Communications, Nature, vol. 13(1), pages 1-15, December.
    5. Joanna Hui Juan Tan & Zhihui Li & Mar Gonzalez Porta & Ramesh Rajaby & Weng Khong Lim & Ye An Tan & Rodrigo Toro Jimenez & Renyi Teo & Maxime Hebrard & Jack Ling Ow & Shimin Ang & Justin Jeyakani & Ya, 2024. "A Catalogue of Structural Variation across Ancestrally Diverse Asian Genomes," Nature Communications, Nature, vol. 15(1), pages 1-15, December.
    6. Tuomas Hämälä & Christopher Moore & Laura Cowan & Matthew Carlile & David Gopaulchan & Marie K. Brandrud & Siri Birkeland & Matthew Loose & Filip Kolář & Marcus A. Koch & Levi Yant, 2024. "Impact of whole-genome duplications on structural variant evolution in Cochlearia," Nature Communications, Nature, vol. 15(1), pages 1-13, December.
    7. Cristian Groza & Carl Schwendinger-Schreck & Warren A. Cheung & Emily G. Farrow & Isabelle Thiffault & Juniper Lake & William B. Rizzo & Gilad Evrony & Tom Curran & Guillaume Bourque & Tomi Pastinen, 2024. "Pangenome graphs improve the analysis of structural variants in rare genetic diseases," Nature Communications, Nature, vol. 15(1), pages 1-12, December.
    8. Yirong Shi & Yiwei Niu & Peng Zhang & Huaxia Luo & Shuai Liu & Sijia Zhang & Jiajia Wang & Yanyan Li & Xinyue Liu & Tingrui Song & Tao Xu & Shunmin He, 2023. "Characterization of genome-wide STR variation in 6487 human genomes," Nature Communications, Nature, vol. 14(1), pages 1-18, December.
    9. Arthur S. Lee & Lauren J. Ayers & Michael Kosicki & Wai-Man Chan & Lydia N. Fozo & Brandon M. Pratt & Thomas E. Collins & Boxun Zhao & Matthew F. Rose & Alba Sanchis-Juan & Jack M. Fu & Isaac Wong & X, 2024. "A cell type-aware framework for nominating non-coding variants in Mendelian regulatory disorders," Nature Communications, Nature, vol. 15(1), pages 1-26, December.
    10. Manon Baudic & Hiroshige Murata & Fernanda M. Bosada & Uirá Souto Melo & Takanori Aizawa & Pierre Lindenbaum & Lieve E. Maarel & Amaury Guedon & Estelle Baron & Enora Fremy & Adrien Foucal & Taisuke I, 2024. "TAD boundary deletion causes PITX2-related cardiac electrical and structural defects," Nature Communications, Nature, vol. 15(1), pages 1-15, December.
    11. Marsha M. Wheeler & Adrienne M. Stilp & Shuquan Rao & Bjarni V. Halldórsson & Doruk Beyter & Jia Wen & Anna V. Mihkaylova & Caitlin P. McHugh & John Lane & Min-Zhi Jiang & Laura M. Raffield & Goo Jun , 2022. "Whole genome sequencing identifies structural variants contributing to hematologic traits in the NHLBI TOPMed program," Nature Communications, Nature, vol. 13(1), pages 1-18, December.
    12. Junho Kim & August Yue Huang & Shelby L. Johnson & Jenny Lai & Laura Isacco & Ailsa M. Jeffries & Michael B. Miller & Michael A. Lodato & Christopher A. Walsh & Eunjung Alice Lee, 2022. "Prevalence and mechanisms of somatic deletions in single human neurons during normal aging and in DNA repair disorders," Nature Communications, Nature, vol. 13(1), pages 1-13, December.
    13. Asmundur Oddsson & Patrick Sulem & Gardar Sveinbjornsson & Gudny A. Arnadottir & Valgerdur Steinthorsdottir & Gisli H. Halldorsson & Bjarni A. Atlason & Gudjon R. Oskarsson & Hannes Helgason & Henriet, 2023. "Deficit of homozygosity among 1.52 million individuals and genetic causes of recessive lethality," Nature Communications, Nature, vol. 14(1), pages 1-15, December.
    14. Vincent Michaud & Eulalie Lasseaux & David J. Green & Dave T. Gerrard & Claudio Plaisant & Tomas Fitzgerald & Ewan Birney & Benoît Arveiler & Graeme C. Black & Panagiotis I. Sergouniotis, 2022. "The contribution of common regulatory and protein-coding TYR variants to the genetic architecture of albinism," Nature Communications, Nature, vol. 13(1), pages 1-8, December.
    15. Natalie DeForest & Yuqi Wang & Zhiyi Zhu & Jacqueline S. Dron & Ryan Koesterer & Pradeep Natarajan & Jason Flannick & Tiffany Amariuta & Gina M. Peloso & Amit R. Majithia, 2024. "Genome-wide discovery and integrative genomic characterization of insulin resistance loci using serum triglycerides to HDL-cholesterol ratio as a proxy," Nature Communications, Nature, vol. 15(1), pages 1-17, December.
    16. Sean A. Misek & Aaron Fultineer & Jeremie Kalfon & Javad Noorbakhsh & Isabella Boyle & Priyanka Roy & Joshua Dempster & Lia Petronio & Katherine Huang & Alham Saadat & Thomas Green & Adam Brown & John, 2024. "Germline variation contributes to false negatives in CRISPR-based experiments with varying burden across ancestries," Nature Communications, Nature, vol. 15(1), pages 1-11, December.
    17. Laura M. Mueller & Abigail Isaacson & Heather Wilson & Anna Salowka & Isabel Tay & Maolian Gong & Nancy Samir Elbarbary & Klemens Raile & Francesca M. Spagnoli, 2024. "Heterozygous missense variant in GLI2 impairs human endocrine pancreas development," Nature Communications, Nature, vol. 15(1), pages 1-13, December.
    18. Alexendar R. Perez & Laura Sala & Richard K. Perez & Joana A. Vidigal, 2021. "CSC software corrects off-target mediated gRNA depletion in CRISPR-Cas9 essentiality screens," Nature Communications, Nature, vol. 12(1), pages 1-11, December.
    19. Kian Hong Kock & Patrick K. Kimes & Stephen S. Gisselbrecht & Sachi Inukai & Sabrina K. Phanor & James T. Anderson & Gayatri Ramakrishnan & Colin H. Lipper & Dongyuan Song & Jesse V. Kurland & Julia M, 2024. "DNA binding analysis of rare variants in homeodomains reveals homeodomain specificity-determining residues," Nature Communications, Nature, vol. 15(1), pages 1-19, December.
    20. Gaëlle Odelin & Adèle Faucherre & Damien Marchese & Amélie Pinard & Hager Jaouadi & Solena Scouarnec & Raphaël Chiarelli & Younes Achouri & Emilie Faure & Marine Herbane & Alexis Théron & Jean-Françoi, 2023. "Variations in the poly-histidine repeat motif of HOXA1 contribute to bicuspid aortic valve in mouse and zebrafish," Nature Communications, Nature, vol. 14(1), pages 1-17, December.
