
Assembly of chromosome-scale contigs by efficiently resolving repetitive sequences with long reads

Author

Listed:
  • Huilong Du

    (State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences; University of Chinese Academy of Sciences)

  • Chengzhi Liang

    (State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences; University of Chinese Academy of Sciences)

Abstract

The abundant repetitive sequences in complex eukaryotic genomes cause fragmented assemblies, which lose value as reference genomes, often due to incomplete gene sequences and unanchored or mispositioned contigs on chromosomes. Here we report a genome assembly method HERA, which resolves repeats efficiently by constructing a connection graph from an overlap graph. We test HERA on the genomes of rice, maize, human, and Tartary buckwheat with single-molecule sequencing and mapping data. HERA correctly assembles most of the previously unassembled regions, resulting in dramatically improved, highly contiguous genome assemblies with newly assembled gene sequences. For example, the maize contig N50 size reaches 61.2 Mb and the Tartary buckwheat genome comprises only 20 contigs. HERA can also be used to fill gaps and fix errors in reference genomes. The application of HERA will greatly improve the quality of new or existing assemblies of complex genomes.
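The contig N50 quoted in the abstract is a standard contiguity metric: the length of the contig at which, going from longest to shortest, the running total first reaches half of the total assembly length. The following is a minimal sketch of that general definition, not code from HERA; the function name `n50` and the toy lengths are illustrative assumptions and are not data from the paper.

```python
def n50(contig_lengths):
    """Return the N50 of a collection of contig lengths (in bp).

    General definition of the metric; not taken from the HERA software.
    """
    lengths = sorted(contig_lengths, reverse=True)
    if not lengths:
        raise ValueError("need at least one contig length")
    half_total = sum(lengths) / 2
    running = 0
    for length in lengths:
        running += length
        # The first contig that pushes the running sum to at least
        # half of the assembly length defines the N50.
        if running >= half_total:
            return length


# Hypothetical toy assembly (arbitrary units), not data from the paper.
toy_lengths = [80, 70, 50, 40, 30, 20, 10]
print(n50(toy_lengths))
```

A higher N50 for the same total assembly length means that fewer, longer contigs cover half of the assembly, which is why the abstract reports it as evidence of improved contiguity.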

Suggested Citation

  • Huilong Du & Chengzhi Liang, 2019. "Assembly of chromosome-scale contigs by efficiently resolving repetitive sequences with long reads," Nature Communications, Nature, vol. 10(1), pages 1-10, December.
  • Handle: RePEc:nat:natcom:v:10:y:2019:i:1:d:10.1038_s41467-019-13355-3
    DOI: 10.1038/s41467-019-13355-3

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-019-13355-3
    File Function: Abstract
    Download Restriction: no


    Citations



    Cited by:

    1. Jinquan Chao & Shaohua Wu & Minjing Shi & Xia Xu & Qiang Gao & Huilong Du & Bin Gao & Dong Guo & Shuguang Yang & Shixin Zhang & Yan Li & Xiuli Fan & Chunyan Hai & Liquan Kou & Jiao Zhang & Zhiwei Wang, 2023. "Genomic insight into domestication of rubber tree," Nature Communications, Nature, vol. 14(1), pages 1-13, December.
    2. Gabriel E. Rech & Santiago Radío & Sara Guirao-Rico & Laura Aguilera & Vivien Horvath & Llewellyn Green & Hannah Lindstadt & Véronique Jamilloux & Hadi Quesneville & Josefa González, 2022. "Population-scale long-read sequencing uncovers transposable elements associated with gene expression variation and adaptive signatures in Drosophila," Nature Communications, Nature, vol. 13(1), pages 1-16, December.
    3. Jiantao Guan & Jintao Zhang & Dan Gong & Zhengquan Zhang & Yang Yu & Gaoling Luo & Prakit Somta & Zheng Hu & Suhua Wang & Xingxing Yuan & Yaowen Zhang & Yanlan Wang & Yanhua Chen & Kularb Laosatit & X, 2022. "Genomic analyses of rice bean landraces reveal adaptation and yield related loci to accelerate breeding," Nature Communications, Nature, vol. 13(1), pages 1-16, December.
