IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v12y2021i1d10.1038_s41467-021-26865-w.html
   My bibliography  Save this article

Fast alignment and preprocessing of chromatin profiles with Chromap

Author

Listed:
  • Haowen Zhang

    (Georgia Institute of Technology)

  • Li Song

    (Dana-Farber Cancer Institute
    Harvard T.H. Chan School of Public Health)

  • Xiaotao Wang

    (Northwestern University)

  • Haoyu Cheng

    (Dana-Farber Cancer Institute
    Harvard Medical School)

  • Chenfei Wang

    (Dana-Farber Cancer Institute
    Harvard T.H. Chan School of Public Health)

  • Clifford A. Meyer

    (Dana-Farber Cancer Institute
    Harvard T.H. Chan School of Public Health
    Dana-Farber Cancer Institute)

  • Tao Liu

    (Roswell Park Comprehensive Cancer Center)

  • Ming Tang

    (Dana-Farber Cancer Institute)

  • Srinivas Aluru

    (Georgia Institute of Technology
    Georgia Institute of Technology)

  • Feng Yue

    (Northwestern University
    Robert H. Lurie Comprehensive Cancer Center of Northwestern University)

  • X. Shirley Liu

    (Dana-Farber Cancer Institute
    Harvard T.H. Chan School of Public Health
    Dana-Farber Cancer Institute)

  • Heng Li

    (Dana-Farber Cancer Institute
    Harvard Medical School)

Abstract

As sequencing depth of chromatin studies continually grows deeper for sensitive profiling of regulatory elements or chromatin spatial structures, aligning and preprocessing of these sequencing data have become the bottleneck for analysis. Here we present Chromap, an ultrafast method for aligning and preprocessing high throughput chromatin profiles. Chromap is comparable to BWA-MEM and Bowtie2 in alignment accuracy and is over 10 times faster than traditional workflows on bulk ChIP-seq/Hi-C profiles and than 10x Genomics’ CellRanger v2.0.0 pipeline on single-cell ATAC-seq profiles.

Suggested Citation

  • Haowen Zhang & Li Song & Xiaotao Wang & Haoyu Cheng & Chenfei Wang & Clifford A. Meyer & Tao Liu & Ming Tang & Srinivas Aluru & Feng Yue & X. Shirley Liu & Heng Li, 2021. "Fast alignment and preprocessing of chromatin profiles with Chromap," Nature Communications, Nature, vol. 12(1), pages 1-6, December.
  • Handle: RePEc:nat:natcom:v:12:y:2021:i:1:d:10.1038_s41467-021-26865-w
    DOI: 10.1038/s41467-021-26865-w
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-021-26865-w
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-021-26865-w?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Job Dekker & Andrew S. Belmont & Mitchell Guttman & Victor O. Leshyk & John T. Lis & Stavros Lomvardas & Leonid A. Mirny & Clodagh C. O’Shea & Peter J. Park & Bing Ren & Joan C. Ritland Politz & Jay S, 2017. "The 4D nucleome project," Nature, Nature, vol. 549(7671), pages 219-226, September.
    2. Jesse R. Dixon & Siddarth Selvaraj & Feng Yue & Audrey Kim & Yan Li & Yin Shen & Ming Hu & Jun S. Liu & Bing Ren, 2012. "Topological domains in mammalian genomes identified by analysis of chromatin interactions," Nature, Nature, vol. 485(7398), pages 376-380, May.
    3. Jason D. Buenrostro & Beijing Wu & Ulrike M. Litzenburger & Dave Ruff & Michael L. Gonzales & Michael P. Snyder & Howard Y. Chang & William J. Greenleaf, 2015. "Single-cell chromatin accessibility reveals principles of regulatory variation," Nature, Nature, vol. 523(7561), pages 486-490, July.
    4. Job Dekker & Andrew S. Belmont & Mitchell Guttman & Victor O. Leshyk & John T. Lis & Stavros Lomvardas & Leonid A. Mirny & Clodagh C. O’Shea & Peter J. Park & Bing Ren & Joan C. Ritland Politz & Jay S, 2017. "Correction: Corrigendum: The 4D nucleome project," Nature, Nature, vol. 552(7684), pages 278-278, December.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Chunhong Yu & Xiaoyun Lei & Fang Chen & Song Mao & Lu Lv & Honglu Liu & Xueying Hu & Runhan Wang & Licong Shen & Na Zhang & Yang Meng & Yunfan Shen & Jiale Chen & Pishun Li & Shi Huang & Changwei Lin , 2022. "ARID1A loss derepresses a group of human endogenous retrovirus-H loci to modulate BRD4-dependent transcription," Nature Communications, Nature, vol. 13(1), pages 1-16, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Mayank N. K. Choudhary & Kara Quaid & Xiaoyun Xing & Heather Schmidt & Ting Wang, 2023. "Widespread contribution of transposable elements to the rewiring of mammalian 3D genomes," Nature Communications, Nature, vol. 14(1), pages 1-12, December.
    2. Sarah B. Reiff & Andrew J. Schroeder & Koray Kırlı & Andrea Cosolo & Clara Bakker & Luisa Mercado & Soohyun Lee & Alexander D. Veit & Alexander K. Balashov & Carl Vitzthum & William Ronchetti & Kent M, 2022. "The 4D Nucleome Data Portal as a resource for searching and visualizing curated nucleomics data," Nature Communications, Nature, vol. 13(1), pages 1-11, December.
    3. Riccardo Calandrelli & Xingzhao Wen & John Lalith Charles Richard & Zhifei Luo & Tri C. Nguyen & Chien-Ju Chen & Zhijie Qi & Shuanghong Xue & Weizhong Chen & Zhangming Yan & Weixin Wu & Kathia Zaleta-, 2023. "Genome-wide analysis of the interplay between chromatin-associated RNA and 3D genome organization in human cells," Nature Communications, Nature, vol. 14(1), pages 1-14, December.
    4. Jorine M. Eeftens & Manya Kapoor & Davide Michieletto & Clifford P. Brangwynne, 2021. "Polycomb condensates can promote epigenetic marks but are not required for sustained chromatin compaction," Nature Communications, Nature, vol. 12(1), pages 1-12, December.
    5. Ryan J. Geusz & Allen Wang & Dieter K. Lam & Nicholas K. Vinckier & Konstantinos-Dionysios Alysandratos & David A. Roberts & Jinzhao Wang & Samy Kefalopoulou & Araceli Ramirez & Yunjiang Qiu & Joshua , 2021. "Sequence logic at enhancers governs a dual mechanism of endodermal organ fate induction by FOXA pioneer factors," Nature Communications, Nature, vol. 12(1), pages 1-19, December.
    6. Andrea M. Chiariello & Alex Abraham & Simona Bianco & Andrea Esposito & Andrea Fontana & Francesca Vercellone & Mattia Conte & Mario Nicodemi, 2024. "Multiscale modelling of chromatin 4D organization in SARS-CoV-2 infected cells," Nature Communications, Nature, vol. 15(1), pages 1-13, December.
    7. Abhijit Chakraborty & Jeffrey G. Wang & Ferhat Ay, 2022. "dcHiC detects differential compartments across multiple Hi-C datasets," Nature Communications, Nature, vol. 13(1), pages 1-21, December.
    8. Hao Wang & Jiaxin Yang & Yu Zhang & Jianliang Qian & Jianrong Wang, 2022. "Reconstruct high-resolution 3D genome structures for diverse cell-types using FLAMINGO," Nature Communications, Nature, vol. 13(1), pages 1-18, December.
    9. Zhongxuan Zhang & Xiaoxiao Rong & Tianjin Xie & Zehao Li & Haozhi Song & Shujun Zhen & Haifeng Wang & Jiahui Wu & Samie R. Jaffrey & Xing Li, 2024. "Fluorogenic CRISPR for genomic DNA imaging," Nature Communications, Nature, vol. 15(1), pages 1-12, December.
    10. Hannah L. Harris & Huiya Gu & Moshe Olshansky & Ailun Wang & Irene Farabella & Yossi Eliaz & Achyuth Kalluchi & Akshay Krishna & Mozes Jacobs & Gesine Cauer & Melanie Pham & Suhas S. P. Rao & Olga Dud, 2023. "Chromatin alternates between A and B compartments at kilobase scale for subgenic organization," Nature Communications, Nature, vol. 14(1), pages 1-17, December.
    11. Zhen-Hui Wang & Xin-Feng Wang & Tianyuan Lu & Ming-Rui Li & Peng Jiang & Jing Zhao & Si-Tong Liu & Xue-Qi Fu & Jonathan F. Wendel & Yves Peer & Bao Liu & Lin-Feng Li, 2022. "Reshuffling of the ancestral core-eudicot genome shaped chromatin topology and epigenetic modification in Panax," Nature Communications, Nature, vol. 13(1), pages 1-12, December.
    12. Bhuwan Khatri & Kandice L. Tessneer & Astrid Rasmussen & Farhang Aghakhanian & Tove Ragna Reksten & Adam Adler & Ilias Alevizos & Juan-Manuel Anaya & Lara A. Aqrawi & Eva Baecklund & Johan G. Brun & S, 2022. "Genome-wide association study identifies Sjögren’s risk loci with functional implications in immune and glandular cells," Nature Communications, Nature, vol. 13(1), pages 1-17, December.
    13. Liyuan Zhou & Qiongzi Qiu & Qing Zhou & Jianwei Li & Mengqian Yu & Kezhen Li & Lingling Xu & Xiaohui Ke & Haiming Xu & Bingjian Lu & Hui Wang & Weiguo Lu & Pengyuan Liu & Yan Lu, 2022. "Long-read sequencing unveils high-resolution HPV integration and its oncogenic progression in cervical cancer," Nature Communications, Nature, vol. 13(1), pages 1-18, December.
    14. Vinícius G. Contessoto & Olga Dudchenko & Erez Lieberman Aiden & Peter G. Wolynes & José N. Onuchic & Michele Pierro, 2023. "Interphase chromosomes of the Aedes aegypti mosquito are liquid crystalline and can sense mechanical cues," Nature Communications, Nature, vol. 14(1), pages 1-12, December.
    15. Alon Diament & Tamir Tuller, 2015. "Improving 3D Genome Reconstructions Using Orthologous and Functional Constraints," PLOS Computational Biology, Public Library of Science, vol. 11(5), pages 1-22, May.
    16. Andrea Wilderman & Eva D’haene & Machteld Baetens & Tara N. Yankee & Emma Wentworth Winchester & Nicole Glidden & Ellen Roets & Jo Dorpe & Sandra Janssens & Danny E. Miller & Miranda Galey & Kari M. B, 2024. "A distant global control region is essential for normal expression of anterior HOXA genes during mouse and human craniofacial development," Nature Communications, Nature, vol. 15(1), pages 1-23, December.
    17. Brault, Vincent & Ouadah, Sarah & Sansonnet, Laure & Lévy-Leduc, Céline, 2018. "Nonparametric multiple change-point estimation for analyzing large Hi-C data matrices," Journal of Multivariate Analysis, Elsevier, vol. 165(C), pages 143-165.
    18. Aaron T L Lun & Hervé Pagès & Mike L Smith, 2018. "beachmat: A Bioconductor C++ API for accessing high-throughput biological data from a variety of R matrix types," PLOS Computational Biology, Public Library of Science, vol. 14(5), pages 1-15, May.
    19. Seungsoo Hahn & Dongsup Kim, 2015. "Identifying and Reducing Systematic Errors in Chromosome Conformation Capture Data," PLOS ONE, Public Library of Science, vol. 10(12), pages 1-17, December.
    20. Julia Minderjahn & Alexander Fischer & Konstantin Maier & Karina Mendes & Margit Nuetzel & Johanna Raithel & Hanna Stanewsky & Ute Ackermann & Robert Månsson & Claudia Gebhard & Michael Rehli, 2022. "Postmitotic differentiation of human monocytes requires cohesin-structured chromatin," Nature Communications, Nature, vol. 13(1), pages 1-19, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:12:y:2021:i:1:d:10.1038_s41467-021-26865-w. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.