IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v15y2024i1d10.1038_s41467-024-49957-9.html
   My bibliography  Save this article

Olivar: towards automated variant aware primer design for multiplex tiled amplicon sequencing of pathogens

Author

Listed:
  • Michael X. Wang

    (Rice University)

  • Esther G. Lou

    (Rice University)

  • Nicolae Sapoval

    (Rice University)

  • Eddie Kim

    (Rice University)

  • Prashant Kalvapalle

    (Rice University)

  • Bryce Kille

    (Rice University)

  • R. A. Leo Elworth

    (Rice University)

  • Yunxi Liu

    (Rice University)

  • Yilei Fu

    (Rice University)

  • Lauren B. Stadler

    (Rice University)

  • Todd J. Treangen

    (Rice University
    Rice University)

Abstract

Tiled amplicon sequencing has served as an essential tool for tracking the spread and evolution of pathogens. Over 15 million complete SARS-CoV-2 genomes are now publicly available, most sequenced and assembled via tiled amplicon sequencing. While computational tools for tiled amplicon design exist, they require downstream manual optimization both computationally and experimentally, which is slow and costly. Here we present Olivar, a first step towards a fully automated, variant-aware design of tiled amplicons for pathogen genomes. Olivar converts each nucleotide of the target genome into a numeric risk score, capturing undesired sequence features that should be avoided. In a direct comparison with PrimalScheme, we show that Olivar has fewer mismatches overlapping with primers and predicted PCR byproducts. We also compare Olivar head-to-head with ARTIC v4.1, the most widely used primer set for SARS-CoV-2 sequencing, and show Olivar yields similar read mapping rates (~90%) and better coverage to the manually designed ARTIC v4.1 amplicons. We also evaluate Olivar on real wastewater samples and found that Olivar has up to 3-fold higher mapping rates while retaining similar coverage. In summary, Olivar automates and accelerates the generation of tiled amplicons, even in situations of high mutation frequency and/or density. Olivar is available online as a web application at https://olivar.rice.edu and can be installed locally as a command line tool with Bioconda. Source code, installation guide, and usage are available at https://github.com/treangenlab/Olivar .

Suggested Citation

  • Michael X. Wang & Esther G. Lou & Nicolae Sapoval & Eddie Kim & Prashant Kalvapalle & Bryce Kille & R. A. Leo Elworth & Yunxi Liu & Yilei Fu & Lauren B. Stadler & Todd J. Treangen, 2024. "Olivar: towards automated variant aware primer design for multiplex tiled amplicon sequencing of pathogens," Nature Communications, Nature, vol. 15(1), pages 1-12, December.
  • Handle: RePEc:nat:natcom:v:15:y:2024:i:1:d:10.1038_s41467-024-49957-9
    DOI: 10.1038/s41467-024-49957-9
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-024-49957-9
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-024-49957-9?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Nina G. Xie & Michael X. Wang & Ping Song & Shiqi Mao & Yifan Wang & Yuxia Yang & Junfeng Luo & Shengxiang Ren & David Yu Zhang, 2022. "Designing highly multiplex PCR primer sets with Simulated Annealing Design using Dimer Likelihood Estimation (SADDLE)," Nature Communications, Nature, vol. 13(1), pages 1-10, December.
    2. Jinny X. Zhang & Boyan Yordanov & Alexander Gaunt & Michael X. Wang & Peng Dai & Yuan-Jyue Chen & Kerou Zhang & John Z. Fang & Neil Dalchau & Jiaming Li & Andrew Phillips & David Yu Zhang, 2021. "A deep learning model for predicting next-generation sequencing depth from DNA sequence," Nature Communications, Nature, vol. 12(1), pages 1-10, December.
    3. Anthony P. West & Joel O. Wertheim & Jade C. Wang & Tetyana I. Vasylyeva & Jennifer L. Havens & Moinuddin A. Chowdhury & Edimarlyn Gonzalez & Courtney E. Fang & Steve S. Lonardo & Scott Hughes & Jenni, 2021. "Detection and characterization of the SARS-CoV-2 lineage B.1.526 in New York," Nature Communications, Nature, vol. 12(1), pages 1-10, December.
    4. Lucia Ruojia Wu & Peng Dai & Michael Xiangjiang Wang & Sherry Xi Chen & Evan N. Cohen & Gitanjali Jayachandran & Jinny Xuemeng Zhang & Angela V. Serrano & Nina Guanyi Xie & Naoto T. Ueno & James M. Re, 2022. "Ensemble of nucleic acid absolute quantitation modules for copy number variation detection and RNA profiling," Nature Communications, Nature, vol. 13(1), pages 1-9, December.
    5. Hayden C. Metsky & Christian B. Matranga & Shirlee Wohl & Stephen F. Schaffner & Catherine A. Freije & Sarah M. Winnicki & Kendra West & James Qu & Mary Lynn Baniecki & Adrianne Gladden-Young & Aaron , 2017. "Zika virus evolution and spread in the Americas," Nature, Nature, vol. 546(7658), pages 411-415, June.
    6. Athanasios Kousathanas & Erola Pairo-Castineira & Konrad Rawlik & Alex Stuckey & Christopher A. Odhams & Susan Walker & Clark D. Russell & Tomas Malinauskas & Yang Wu & Jonathan Millar & Xia Shen & Ka, 2022. "Whole-genome sequencing reveals host factors underlying critical COVID-19," Nature, Nature, vol. 607(7917), pages 97-103, July.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Hideki Ogura & Jin Gohda & Xiuyuan Lu & Mizuki Yamamoto & Yoshio Takesue & Aoi Son & Sadayuki Doi & Kazuyuki Matsushita & Fumitaka Isobe & Yoshihiro Fukuda & Tai-Ping Huang & Takamasa Ueno & Naomi Mam, 2022. "Dysfunctional Sars-CoV-2-M protein-specific cytotoxic T lymphocytes in patients recovering from severe COVID-19," Nature Communications, Nature, vol. 13(1), pages 1-15, December.
    2. Klymak, Margaryta, 2023. "The trade effects of information provision about forced and child labor," World Development, Elsevier, vol. 167(C).
    3. Abdur Rasool & Qiang Qu & Yang Wang & Qingshan Jiang, 2022. "Bio-Constrained Codes with Neural Network for Density-Based DNA Data Storage," Mathematics, MDPI, vol. 10(5), pages 1-21, March.
    4. Guo, Kun & Kang, Yuxin & Ma, Dandan & Lei, Lei, 2024. "How do climate risks impact the contagion in China's energy market?," Energy Economics, Elsevier, vol. 133(C).
    5. Tomoko Nakanishi & Julian Willett & Yossi Farjoun & Richard J. Allen & Beatriz Guillen-Guio & Darin Adra & Sirui Zhou & J. Brent Richards, 2023. "Alternative splicing in lung influences COVID-19 severity and respiratory diseases," Nature Communications, Nature, vol. 14(1), pages 1-12, December.
    6. Joel O. Wertheim & Jade C. Wang & Mindy Leelawong & Darren P. Martin & Jennifer L. Havens & Moinuddin A. Chowdhury & Jonathan E. Pekar & Helly Amin & Anthony Arroyo & Gordon A. Awandare & Hoi Yan Chow, 2022. "Detection of SARS-CoV-2 intra-host recombination during superinfection with Alpha and Epsilon variants in New York City," Nature Communications, Nature, vol. 13(1), pages 1-11, December.
    7. Claude Roth & Félix G. Delgado & Etienne Simon-Lorière & Anavaj Sakuntabhai, 2018. "Immune Responses to Dengue and Zika Viruses—Guidance for T Cell Vaccine Development," IJERPH, MDPI, vol. 15(2), pages 1-12, February.
    8. Yunxi Liu & Nicolae Sapoval & Pilar Gallego-García & Laura Tomás & David Posada & Todd J. Treangen & Lauren B. Stadler, 2024. "Crykey: Rapid identification of SARS-CoV-2 cryptic mutations in wastewater," Nature Communications, Nature, vol. 15(1), pages 1-13, December.
    9. Osuna Ramírez, Sergio Andrés & Veloutsou, Cleopatra & Morgan-Thomas, Anna, 2024. "On the antipodes of love and hate: The conception and measurement of brand polarization," Journal of Business Research, Elsevier, vol. 179(C).
    10. Jolyon M. Medlock & Kayleigh M. Hansford & Alexander G. C. Vaux & Ben Cull & Emma Gillingham & Steve Leach, 2018. "Assessment of the Public Health Threats Posed by Vector-Borne Disease in the United Kingdom (UK)," IJERPH, MDPI, vol. 15(10), pages 1-22, September.
    11. Junqing Xie & Yuliang Feng & Danielle Newby & Bang Zheng & Qi Feng & Albert Prats-Uribe & Chunxiao Li & Nicholas J. Wareham & R. Paredes & Daniel Prieto-Alhambra, 2023. "Genetic risk, adherence to healthy lifestyle and acute cardiovascular and thromboembolic complications following SARS-COV-2 infection," Nature Communications, Nature, vol. 14(1), pages 1-10, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:15:y:2024:i:1:d:10.1038_s41467-024-49957-9. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.