IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v14y2023i1d10.1038_s41467-023-35996-1.html
   My bibliography  Save this article

Deciphering the exact breakpoints of structural variations using long sequencing reads with DeBreak

Author

Listed:
  • Yu Chen

    (University of Alabama at Birmingham
    University of Alabama at Birmingham)

  • Amy Y. Wang

    (University of Alabama at Birmingham
    University of Alabama at Birmingham)

  • Courtney A. Barkley

    (University of Alabama at Birmingham)

  • Yixin Zhang

    (University of Alabama at Birmingham)

  • Xinyang Zhao

    (University of Alabama at Birmingham)

  • Min Gao

    (University of Alabama at Birmingham
    University of Alabama at Birmingham)

  • Mick D. Edmonds

    (University of Alabama at Birmingham)

  • Zechen Chong

    (University of Alabama at Birmingham
    University of Alabama at Birmingham
    HudsonAlpha Institute for Biotechnology)

Abstract

Long-read sequencing has demonstrated great potential for characterizing all types of structural variations (SVs). However, existing algorithms have insufficient sensitivity and precision. To address these limitations, we present DeBreak, a computational method for comprehensive and accurate SV discovery. Based on alignment results, DeBreak employs a density-based approach for clustering SV candidates together with a local de novo assembly approach for reconstructing long insertions. A partial order alignment algorithm ensures precise SV breakpoints with single base-pair resolution, and a k-means clustering method can report multi-allele SV events. DeBreak outperforms existing tools on both simulated and real long-read sequencing data from both PacBio and Nanopore platforms. An important application of DeBreak is analyzing cancer genomes for potentially tumor-driving SVs. DeBreak can also be used for supplementing whole-genome assembly-based SV discovery.

Suggested Citation

  • Yu Chen & Amy Y. Wang & Courtney A. Barkley & Yixin Zhang & Xinyang Zhao & Min Gao & Mick D. Edmonds & Zechen Chong, 2023. "Deciphering the exact breakpoints of structural variations using long sequencing reads with DeBreak," Nature Communications, Nature, vol. 14(1), pages 1-12, December.
  • Handle: RePEc:nat:natcom:v:14:y:2023:i:1:d:10.1038_s41467-023-35996-1
    DOI: 10.1038/s41467-023-35996-1
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-023-35996-1
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-023-35996-1?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Wensheng Wang & Ramil Mauleon & Zhiqiang Hu & Dmytro Chebotarov & Shuaishuai Tai & Zhichao Wu & Min Li & Tianqing Zheng & Roven Rommel Fuentes & Fan Zhang & Locedie Mansueto & Dario Copetti & Millicen, 2018. "Genomic variation in 3,010 diverse accessions of Asian cultivated rice," Nature, Nature, vol. 557(7703), pages 43-49, May.
    2. Michael F. Berger & Michael S. Lawrence & Francesca Demichelis & Yotam Drier & Kristian Cibulskis & Andrey Y. Sivachenko & Andrea Sboner & Raquel Esgueva & Dorothee Pflueger & Carrie Sougnez & Robert , 2011. "The genomic complexity of primary human prostate cancer," Nature, Nature, vol. 470(7333), pages 214-220, February.
    3. Yilong Li & Nicola D. Roberts & Jeremiah A. Wala & Ofer Shapira & Steven E. Schumacher & Kiran Kumar & Ekta Khurana & Sebastian Waszak & Jan O. Korbel & James E. Haber & Marcin Imielinski & Joachim We, 2020. "Patterns of somatic structural variation in human cancer genomes," Nature, Nature, vol. 578(7793), pages 112-121, February.
    4. Peter H. Sudmant & Tobias Rausch & Eugene J. Gardner & Robert E. Handsaker & Alexej Abyzov & John Huddleston & Yan Zhang & Kai Ye & Goo Jun & Markus Hsi-Yang Fritz & Miriam K. Konkel & Ankit Malhotra , 2015. "An integrated map of structural variation in 2,504 human genomes," Nature, Nature, vol. 526(7571), pages 75-81, October.
    5. Ryan E. Mills & Klaudia Walter & Chip Stewart & Robert E. Handsaker & Ken Chen & Can Alkan & Alexej Abyzov & Seungtai Chris Yoon & Kai Ye & R. Keira Cheetham & Asif Chinwalla & Donald F. Conrad & Yuta, 2011. "Mapping copy number variation by population-scale genome sequencing," Nature, Nature, vol. 470(7332), pages 59-65, February.
    6. Richard Redon & Shumpei Ishikawa & Karen R. Fitch & Lars Feuk & George H. Perry & T. Daniel Andrews & Heike Fiegler & Michael H. Shapero & Andrew R. Carson & Wenwei Chen & Eun Kyung Cho & Stephanie Da, 2006. "Global variation in copy number in the human genome," Nature, Nature, vol. 444(7118), pages 444-454, November.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Yichen Henry Liu & Can Luo & Staunton G. Golding & Jacob B. Ioffe & Xin Maizie Zhou, 2024. "Tradeoffs in alignment and assembly-based methods for structural variant detection with long-read sequencing data," Nature Communications, Nature, vol. 15(1), pages 1-22, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Yichen Henry Liu & Can Luo & Staunton G. Golding & Jacob B. Ioffe & Xin Maizie Zhou, 2024. "Tradeoffs in alignment and assembly-based methods for structural variant detection with long-read sequencing data," Nature Communications, Nature, vol. 15(1), pages 1-22, December.
    2. Michelle Dietzen & Haoran Zhai & Olivia Lucas & Oriol Pich & Christopher Barrington & Wei-Ting Lu & Sophia Ward & Yanping Guo & Robert E. Hynds & Simone Zaccaria & Charles Swanton & Nicholas McGranaha, 2024. "Replication timing alterations are associated with mutation acquisition during breast and lung cancer evolution," Nature Communications, Nature, vol. 15(1), pages 1-23, December.
    3. Albertas Dvirnas & Callum Stewart & Vilhelm Müller & Santosh Kumar Bikkarolla & Karolin Frykholm & Linus Sandegren & Erik Kristiansson & Fredrik Westerlund & Tobias Ambjörnsson, 2021. "Detection of structural variations in densely-labelled optical DNA barcodes: A hidden Markov model approach," PLOS ONE, Public Library of Science, vol. 16(11), pages 1-15, November.
    4. Liyuan Zhou & Qiongzi Qiu & Qing Zhou & Jianwei Li & Mengqian Yu & Kezhen Li & Lingling Xu & Xiaohui Ke & Haiming Xu & Bingjian Lu & Hui Wang & Weiguo Lu & Pengyuan Liu & Yan Lu, 2022. "Long-read sequencing unveils high-resolution HPV integration and its oncogenic progression in cervical cancer," Nature Communications, Nature, vol. 13(1), pages 1-18, December.
    5. Jinhyun Kim & Sungsik Kim & Huiran Yeom & Seo Woo Song & Kyoungseob Shin & Sangwook Bae & Han Suk Ryu & Ji Young Kim & Ahyoun Choi & Sumin Lee & Taehoon Ryu & Yeongjae Choi & Hamin Kim & Okju Kim & Yu, 2023. "Barcoded multiple displacement amplification for high coverage sequencing in spatial genomics," Nature Communications, Nature, vol. 14(1), pages 1-18, December.
    6. Rujia Chen & Ning Xiao & Yue Lu & Tianyun Tao & Qianfeng Huang & Shuting Wang & Zhichao Wang & Mingli Chuan & Qing Bu & Zhou Lu & Hanyao Wang & Yanze Su & Yi Ji & Jianheng Ding & Ahmed Gharib & Huixin, 2023. "A de novo evolved gene contributes to rice grain shape difference between indica and japonica," Nature Communications, Nature, vol. 14(1), pages 1-13, December.
    7. Jae Eun Lee & Jung Hye Sung & Daniel Sarpong & Jimmy T. Efird & Paul B. Tchounwou & Elizabeth Ofili & Keith Norris, 2018. "Knowledge Management for Fostering Biostatistical Collaboration within a Research Network: The RTRN Case Study," IJERPH, MDPI, vol. 15(11), pages 1-13, November.
    8. Michael Fraser & Julie Livingstone & Jeffrey L. Wrana & Antonio Finelli & Housheng Hansen He & Theodorus van der Kwast & Alexandre R. Zlotta & Robert G. Bristow & Paul C. Boutros, 2021. "Somatic driver mutation prevalence in 1844 prostate cancers identifies ZNRF3 loss as a predictor of metastatic relapse," Nature Communications, Nature, vol. 12(1), pages 1-15, December.
    9. Alexander Martinez-Fundichely & Austin Dixon & Ekta Khurana, 2022. "Modeling tissue-specific breakpoint proximity of structural variations from whole-genomes to identify cancer drivers," Nature Communications, Nature, vol. 13(1), pages 1-15, December.
    10. Anastasiya Kishkevich & Sanjeeta Tamang & Michael O. Nguyen & Judith Oehler & Elena Bulmaga & Christos Andreadis & Carl A. Morrow & Manisha Jalan & Fekret Osman & Matthew C. Whitby, 2022. "Rad52’s DNA annealing activity drives template switching associated with restarted DNA replication," Nature Communications, Nature, vol. 13(1), pages 1-13, December.
    11. Yi-Li Feng & Qian Liu & Ruo-Dan Chen & Si-Cheng Liu & Zhi-Cheng Huang & Kun-Ming Liu & Xiao-Ying Yang & An-Yong Xie, 2022. "DNA nicks induce mutational signatures associated with BRCA1 deficiency," Nature Communications, Nature, vol. 13(1), pages 1-15, December.
    12. Yingyan Yu & Zhen Zhang & Xiaorui Dong & Ruixin Yang & Zhongqu Duan & Zhen Xiang & Jun Li & Guichao Li & Fazhe Yan & Hongzhang Xue & Du Jiao & Jinyuan Lu & Huimin Lu & Wenmin Zhang & Yangzhen Wei & Sh, 2022. "Pangenomic analysis of Chinese gastric cancer," Nature Communications, Nature, vol. 13(1), pages 1-13, December.
    13. Julie Livingstone & Yu-Jia Shiah & Takafumi N. Yamaguchi & Lawrence E. Heisler & Vincent Huang & Robert Lesurf & Tsumugi Gebo & Benjamin Carlin & Stefan Eng & Erik Drysdale & Jeffrey Green & Theodorus, 2021. "The telomere length landscape of prostate cancer," Nature Communications, Nature, vol. 12(1), pages 1-13, December.
    14. Tao Zhu & Chunjiao Xia & Ranran Yu & Xinkai Zhou & Xingbing Xu & Lin Wang & Zhanxiang Zong & Junjiao Yang & Yinmeng Liu & Luchang Ming & Yuxin You & Dijun Chen & Weibo Xie, 2024. "Comprehensive mapping and modelling of the rice regulome landscape unveils the regulatory architecture underlying complex traits," Nature Communications, Nature, vol. 15(1), pages 1-17, December.
    15. Ramesh Rajaby & Dong-Xu Liu & Chun Hang Au & Yuen-Ting Cheung & Amy Yuet Ting Lau & Qing-Yong Yang & Wing-Kin Sung, 2023. "INSurVeyor: improving insertion calling from short read sequencing data," Nature Communications, Nature, vol. 14(1), pages 1-13, December.
    16. Luan Nguyen & Arne Hoeck & Edwin Cuppen, 2022. "Machine learning-based tissue of origin classification for cancer of unknown primary diagnostics using genome-wide mutation features," Nature Communications, Nature, vol. 13(1), pages 1-12, December.
    17. Eva G. Álvarez & Jonas Demeulemeester & Paula Otero & Clemency Jolly & Daniel García-Souto & Ana Pequeño-Valtierra & Jorge Zamora & Marta Tojo & Javier Temes & Adrian Baez-Ortega & Bernardo Rodriguez-, 2021. "Aberrant integration of Hepatitis B virus DNA promotes major restructuring of human hepatocellular carcinoma genome architecture," Nature Communications, Nature, vol. 12(1), pages 1-12, December.
    18. Yoshitaka Sakamoto & Shuhei Miyake & Miho Oka & Akinori Kanai & Yosuke Kawai & Satoi Nagasawa & Yuichi Shiraishi & Katsushi Tokunaga & Takashi Kohno & Masahide Seki & Yutaka Suzuki & Ayako Suzuki, 2022. "Phasing analysis of lung cancer genomes using a long read sequencer," Nature Communications, Nature, vol. 13(1), pages 1-17, December.
    19. Yoshihisa Kobayashi & Geoffrey R. Oxnard & Elizabeth F. Cohen & Navin R. Mahadevan & Joao V. Alessi & Yin P. Hung & Arrien A. Bertram & David E. Heppner & Mauricio F. Ribeiro & Karina P. Sacardo & Rod, 2022. "Genomic and biological study of fusion genes as resistance mechanisms to EGFR inhibitors," Nature Communications, Nature, vol. 13(1), pages 1-14, December.
    20. Xue Gao & Sheng Wang & Yan-Fen Wang & Shuang Li & Shi-Xin Wu & Rong-Ge Yan & Yi-Wen Zhang & Rui-Dong Wan & Zhen He & Ren-De Song & Xin-Quan Zhao & Dong-Dong Wu & Qi-En Yang, 2022. "Long read genome assemblies complemented by single cell RNA-sequencing reveal genetic and cellular mechanisms underlying the adaptive evolution of yak," Nature Communications, Nature, vol. 13(1), pages 1-14, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:14:y:2023:i:1:d:10.1038_s41467-023-35996-1. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.