IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v12y2021i1d10.1038_s41467-021-26152-8.html
   My bibliography  Save this article

BAMboozle removes genetic variation from human sequence data for open data sharing

Author

Listed:
  • Christoph Ziegenhain

    (Karolinska Institute)

  • Rickard Sandberg

    (Karolinska Institute)

Abstract

The risks associated with re-identification of human genetic data are severely limiting open data sharing in life sciences, even in studies where donor-related genetic variant information is not of primary interest. Here, we developed BAMboozle, a versatile tool to eliminate critical types of sensitive genetic information in human sequence data by reverting aligned reads to the genome reference sequence. Applying BAMboozle to functional genomics data, such as single-cell RNA-seq (scRNA-seq) and scATAC-seq datasets, confirmed the removal of donor-related single nucleotide polymorphisms (SNPs) and indels in a manner that did not disclose the altered positions. Importantly, BAMboozle only removes the genetic sequence variants of the sample (i.e., donor) while preserving other important aspects of the raw sequence data. For example, BAMboozled scRNA-seq data contained accurate cell-type associated gene expression signatures, splice kinetic information, and can be used for methods benchmarking. Altogether, BAMboozle efficiently removes genetic variation in aligned sequence data, which represents a step forward towards open data sharing in many areas of genomics where the genetic variant information is not of primary interest.

Suggested Citation

  • Christoph Ziegenhain & Rickard Sandberg, 2021. "BAMboozle removes genetic variation from human sequence data for open data sharing," Nature Communications, Nature, vol. 12(1), pages 1-10, December.
  • Handle: RePEc:nat:natcom:v:12:y:2021:i:1:d:10.1038_s41467-021-26152-8
    DOI: 10.1038/s41467-021-26152-8
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-021-26152-8
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-021-26152-8?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Orit Rozenblatt-Rosen & Michael J. T. Stubbington & Aviv Regev & Sarah A. Teichmann, 2017. "The Human Cell Atlas: from vision to reality," Nature, Nature, vol. 550(7677), pages 451-453, October.
    2. Gioele La Manno & Ruslan Soldatov & Amit Zeisel & Emelie Braun & Hannah Hochgerner & Viktor Petukhov & Katja Lidschreiber & Maria E. Kastriti & Peter Lönnerberg & Alessandro Furlan & Jean Fan & Lars E, 2018. "RNA velocity of single cells," Nature, Nature, vol. 560(7719), pages 494-498, August.
    3. Jay Shendure & Shankar Balasubramanian & George M. Church & Walter Gilbert & Jane Rogers & Jeffery A. Schloss & Robert H. Waterston, 2017. "DNA sequencing at 40: past, present and future," Nature, Nature, vol. 550(7676), pages 345-353, October.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Alexander Bernier & Hanshi Liu & Bartha Maria Knoppers, 2021. "Computational tools for genomic data de-identification: facilitating data protection law compliance," Nature Communications, Nature, vol. 12(1), pages 1-3, December.
    2. Tao Qi & Fangzhao Wu & Chuhan Wu & Liang He & Yongfeng Huang & Xing Xie, 2023. "Differentially private knowledge transfer for federated learning," Nature Communications, Nature, vol. 14(1), pages 1-9, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ziye Xu & Tianyu Zhang & Hongyu Chen & Yuyi Zhu & Yuexiao Lv & Shunji Zhang & Jiaye Chen & Haide Chen & Lili Yang & Weiqin Jiang & Shengyu Ni & Fangru Lu & Zhaolun Wang & Hao Yang & Ling Dong & Feng C, 2023. "High-throughput single nucleus total RNA sequencing of formalin-fixed paraffin-embedded tissues by snRandom-seq," Nature Communications, Nature, vol. 14(1), pages 1-12, December.
    2. Huanhuan Tan & Weixu Wang & Congjin Zhou & Yanfeng Wang & Shu Zhang & Pinglan Yang & Rui Guo & Wei Chen & Jinwen Zhang & Lan Ye & Yiqiang Cui & Ting Ni & Ke Zheng, 2023. "Single-cell RNA-seq uncovers dynamic processes orchestrated by RNA-binding protein DDX43 in chromatin remodeling during spermiogenesis," Nature Communications, Nature, vol. 14(1), pages 1-21, December.
    3. Yoshiaki Yasumizu & Naganari Ohkura & Hisashi Murata & Makoto Kinoshita & Soichiro Funaki & Satoshi Nojima & Kansuke Kido & Masaharu Kohara & Daisuke Motooka & Daisuke Okuzaki & Shuji Suganami & Eriko, 2022. "Myasthenia gravis-specific aberrant neuromuscular gene expression by medullary thymic epithelial cells in thymoma," Nature Communications, Nature, vol. 13(1), pages 1-15, December.
    4. Lichun Ma & Sophia Heinrich & Limin Wang & Friederike L. Keggenhoff & Subreen Khatib & Marshonna Forgues & Michael Kelly & Stephen M. Hewitt & Areeba Saif & Jonathan M. Hernandez & Donna Mabry & Roman, 2022. "Multiregional single-cell dissection of tumor and immune cells reveals stable lock-and-key features in liver cancer," Nature Communications, Nature, vol. 13(1), pages 1-17, December.
    5. Keyong Sun & Runda Xu & Fuhai Ma & Naixue Yang & Yang Li & Xiaofeng Sun & Peng Jin & Wenzhe Kang & Lemei Jia & Jianping Xiong & Haitao Hu & Yantao Tian & Xun Lan, 2022. "scRNA-seq of gastric tumor shows complex intercellular interaction with an alternative T cell exhaustion trajectory," Nature Communications, Nature, vol. 13(1), pages 1-19, December.
    6. Jeff Yat-Fai Chung & Philip Chiu-Tsun Tang & Max Kam-Kwan Chan & Vivian Weiwen Xue & Xiao-Ru Huang & Calvin Sze-Hang Ng & Dongmei Zhang & Kam-Tong Leung & Chun-Kwok Wong & Tin-Lap Lee & Eric W-F Lam &, 2023. "Smad3 is essential for polarization of tumor-associated neutrophils in non-small cell lung carcinoma," Nature Communications, Nature, vol. 14(1), pages 1-17, December.
    7. Fabian Peisker & Maurice Halder & James Nagai & Susanne Ziegler & Nadine Kaesler & Konrad Hoeft & Ronghui Li & Eric M. J. Bindels & Christoph Kuppe & Julia Moellmann & Michael Lehrke & Christian Stopp, 2022. "Mapping the cardiac vascular niche in heart failure," Nature Communications, Nature, vol. 13(1), pages 1-20, December.
    8. Aiko Sekita & Hiroshi Kawasaki & Ayano Fukushima-Nomura & Kiyoshi Yashiro & Keiji Tanese & Susumu Toshima & Koichi Ashizaki & Tomohiro Miyai & Junshi Yazaki & Atsuo Kobayashi & Shinichi Namba & Tatsuh, 2023. "Multifaceted analysis of cross-tissue transcriptomes reveals phenotype–endotype associations in atopic dermatitis," Nature Communications, Nature, vol. 14(1), pages 1-16, December.
    9. Yan Tang & David J. Kwiatkowski & Elizabeth P. Henske, 2022. "Midkine expression by stem-like tumor cells drives persistence to mTOR inhibition and an immune-suppressive microenvironment," Nature Communications, Nature, vol. 13(1), pages 1-22, December.
    10. Jun Dai & Shuyu Zheng & Matías M. Falco & Jie Bao & Johanna Eriksson & Sanna Pikkusaari & Sofia Forstén & Jing Jiang & Wenyu Wang & Luping Gao & Fernando Perez-Villatoro & Olli Dufva & Khalid Saeed & , 2024. "Tracing back primed resistance in cancer via sister cells," Nature Communications, Nature, vol. 15(1), pages 1-14, December.
    11. Ryuki Shimada & Yuzuru Kato & Naoki Takeda & Sayoko Fujimura & Kei-ichiro Yasunaga & Shingo Usuki & Hitoshi Niwa & Kimi Araki & Kei-ichiro Ishiguro, 2023. "STRA8–RB interaction is required for timely entry of meiosis in mouse female germ cells," Nature Communications, Nature, vol. 14(1), pages 1-18, December.
    12. Yingxin Lin & Yue Cao & Elijah Willie & Ellis Patrick & Jean Y. H. Yang, 2023. "Atlas-scale single-cell multi-sample multi-condition data integration using scMerge2," Nature Communications, Nature, vol. 14(1), pages 1-13, December.
    13. Xiaojun Ren & Jianqing Liang & Yiming Zhang & Ning Jiang & Yuhui Xu & Mengdi Qiu & Yiqin Wang & Bing Zhao & Xiaojun Chen, 2022. "Single-cell transcriptomic analysis highlights origin and pathological process of human endometrioid endometrial carcinoma," Nature Communications, Nature, vol. 13(1), pages 1-15, December.
    14. Gaofei Li & Yicong Sun & Immanuel Kwok & Liting Yang & Wanying Wen & Peixian Huang & Mei Wu & Jing Li & Zhibin Huang & Zhaoyuan Liu & Shuai He & Wan Peng & Jin-Xin Bei & Florent Ginhoux & Lai Guan Ng , 2024. "Cebp1 and Cebpβ transcriptional axis controls eosinophilopoiesis in zebrafish," Nature Communications, Nature, vol. 15(1), pages 1-15, December.
    15. Adele M. Alchahin & Shenglin Mei & Ioanna Tsea & Taghreed Hirz & Youmna Kfoury & Douglas Dahl & Chin-Lee Wu & Alexander O. Subtelny & Shulin Wu & David T. Scadden & John H. Shin & Philip J. Saylor & D, 2022. "A transcriptional metastatic signature predicts survival in clear cell renal cell carcinoma," Nature Communications, Nature, vol. 13(1), pages 1-15, December.
    16. Luisa Santus & Maria Sopena-Rios & Raquel García-Pérez & Aaron E. Lin & Gordon C. Adams & Kayla G. Barnes & Katherine J. Siddle & Shirlee Wohl & Ferran Reverter & John L. Rinn & Richard S. Bennett & L, 2023. "Single-cell profiling of lncRNA expression during Ebola virus infection in rhesus macaques," Nature Communications, Nature, vol. 14(1), pages 1-14, December.
    17. Anneke Brümmer & Sven Bergmann, 2024. "Disentangling genetic effects on transcriptional and post-transcriptional gene regulation through integrating exon and intron expression QTLs," Nature Communications, Nature, vol. 15(1), pages 1-13, December.
    18. Bibiana Costa & Jennifer Becker & Tobias Krammer & Felix Mulenge & Verónica Durán & Andreas Pavlou & Olivia Luise Gern & Xiaojing Chu & Yang Li & Luka Čičin-Šain & Britta Eiz-Vesper & Martin Messerle , 2024. "Human cytomegalovirus exploits STING signaling and counteracts IFN/ISG induction to facilitate infection of dendritic cells," Nature Communications, Nature, vol. 15(1), pages 1-16, December.
    19. Patrick Aouad & Yueyun Zhang & Fabio Martino & Céline Stibolt & Simak Ali & Giovanna Ambrosini & Sendurai A. Mani & Kelly Maggs & Hazel M. Quinn & George Sflomos & Cathrin Brisken, 2022. "Epithelial-mesenchymal plasticity determines estrogen receptor positive breast cancer dormancy and epithelial reconversion drives recurrence," Nature Communications, Nature, vol. 13(1), pages 1-17, December.
    20. Rachael G. Aubin & Emma C. Troisi & Javier Montelongo & Adam N. Alghalith & Maclean P. Nasrallah & Mariarita Santi & Pablo G. Camara, 2022. "Pro-inflammatory cytokines mediate the epithelial-to-mesenchymal-like transition of pediatric posterior fossa ependymoma," Nature Communications, Nature, vol. 13(1), pages 1-14, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:12:y:2021:i:1:d:10.1038_s41467-021-26152-8. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.