IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0238108.html
   My bibliography  Save this article

Shiny-SoSV: A web-based performance calculator for somatic structural variant detection

Author

Listed:
  • Tingting Gong
  • Vanessa M Hayes
  • Eva K F Chan

Abstract

Somatic structural variants are an important contributor to cancer development and evolution. Accurate detection of these complex variants from whole genome sequencing data is influenced by a multitude of parameters. However, there are currently no tools for guiding study design nor are there applications that could predict the performance of somatic structural variant detection. To address this gap, we developed Shiny-SoSV, a user-friendly web-based calculator for determining the impact of common variables on the sensitivity, precision and F1 score of somatic structural variant detection, including choice of variant detection tool, sequencing depth of coverage, variant allele fraction, and variant breakpoint resolution. Using simulation studies, we determined singular and combinatoric effects of these variables, modelled the results using a generalised additive model, allowing structural variant detection performance to be predicted for any combination of predictors. Shiny-SoSV provides an interactive and visual platform for users to easily compare individual and combined impact of different parameters. It predicts the performance of a proposed study design, on somatic structural variant detection, prior to the commencement of benchwork. Shiny-SoSV is freely available at https://hcpcg.shinyapps.io/Shiny-SoSV with accompanying user’s guide and example use-cases.

Suggested Citation

  • Tingting Gong & Vanessa M Hayes & Eva K F Chan, 2020. "Shiny-SoSV: A web-based performance calculator for somatic structural variant detection," PLOS ONE, Public Library of Science, vol. 15(8), pages 1-20, August.
  • Handle: RePEc:plo:pone00:0238108
    DOI: 10.1371/journal.pone.0238108
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0238108
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0238108&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0238108?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Simon N. Wood, 2011. "Fast stable restricted maximum likelihood and marginal likelihood estimation of semiparametric generalized linear models," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 73(1), pages 3-36, January.
    2. Daniel L. Cameron & Leon Stefano & Anthony T. Papenfuss, 2019. "Comprehensive evaluation and characterisation of short read general-purpose structural variant calling software," Nature Communications, Nature, vol. 10(1), pages 1-11, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Gerhard Tutz & Moritz Berger, 2018. "Tree-structured modelling of categorical predictors in generalized additive regression," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 12(3), pages 737-758, September.
    2. Tommaso Luzzati & Angela Parenti & Tommaso Rughi, 2017. "Spatial error regressions for testing the Cancer-EKC," Discussion Papers 2017/218, Dipartimento di Economia e Management (DEM), University of Pisa, Pisa, Italy.
    3. Davide Fiaschi & Andrea Mario Lavezzi & Angela Parenti, 2020. "Deep and Proximate Determinants of the World Income Distribution," Review of Income and Wealth, International Association for Research in Income and Wealth, vol. 66(3), pages 677-710, September.
    4. Longhi, Christian & Musolesi, Antonio & Baumont, Catherine, 2014. "Modeling structural change in the European metropolitan areas during the process of economic integration," Economic Modelling, Elsevier, vol. 37(C), pages 395-407.
    5. Sihvonen, Markus, 2021. "Yield curve momentum," Research Discussion Papers 15/2021, Bank of Finland.
    6. Roberto Basile & Luigi Benfratello & Davide Castellani, 2012. "Geoadditive models for regional count data: an application to industrial location," ERSA conference papers ersa12p83, European Regional Science Association.
    7. Dillon T. Fogarty & Caleb P. Roberts & Daniel R. Uden & Victoria M. Donovan & Craig R. Allen & David E. Naugle & Matthew O. Jones & Brady W. Allred & Dirac Twidwell, 2020. "Woody Plant Encroachment and the Sustainability of Priority Conservation Areas," Sustainability, MDPI, vol. 12(20), pages 1-15, October.
    8. E. Zanini & E. Eastoe & M. J. Jones & D. Randell & P. Jonathan, 2020. "Flexible covariate representations for extremes," Environmetrics, John Wiley & Sons, Ltd., vol. 31(5), August.
    9. Daniel Melser & Robert J. Hill, 2019. "Residential Real Estate, Risk, Return and Diversification: Some Empirical Evidence," The Journal of Real Estate Finance and Economics, Springer, vol. 59(1), pages 111-146, July.
    10. Maciej Berȩsewicz & Dagmara Nikulin, 2021. "Estimation of the size of informal employment based on administrative records with non‐ignorable selection mechanism," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 70(3), pages 667-690, June.
    11. Robert J. Hill & Michael Scholz, 2014. "Incorporating Geospatial Data in House Price Indexes: A Hedonic Imputation Approach with Splines," Graz Economics Papers 2014-05, University of Graz, Department of Economics.
    12. Cathrine Ulla Jensen & Toke Emil Panduro, 2016. "PanJen: A test for functional form with continuous variables," IFRO Working Paper 2016/08, University of Copenhagen, Department of Food and Resource Economics.
    13. Ronald E. Gangnon & Natasha K. Stout & Oguzhan Alagoz & John M. Hampton & Brian L. Sprague & Amy Trentham-Dietz, 2018. "Contribution of Breast Cancer to Overall Mortality for US Women," Medical Decision Making, , vol. 38(1_suppl), pages 24-31, April.
    14. Yuko Araki & Atsushi Kawaguchi & Fumio Yamashita, 2013. "Regularized logistic discrimination with basis expansions for the early detection of Alzheimer’s disease based on three-dimensional MRI data," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 7(1), pages 109-119, March.
    15. Weishampel, Anthony & Staicu, Ana-Maria & Rand, William, 2023. "Classification of social media users with generalized functional data analysis," Computational Statistics & Data Analysis, Elsevier, vol. 179(C).
    16. Megan K. Jennings & Emily Haeuser & Diane Foote & Rebecca L. Lewison & Erin Conlisk, 2020. "Planning for Dynamic Connectivity: Operationalizing Robust Decision-Making and Prioritization Across Landscapes Experiencing Climate and Land-Use Change," Land, MDPI, vol. 9(10), pages 1-18, September.
    17. Robert J. Hill & Alicia N. Rambaldi & Michael Scholz, 2021. "Higher frequency hedonic property price indices: a state-space approach," Empirical Economics, Springer, vol. 61(1), pages 417-441, July.
    18. Adam R. Pines & Bart Larsen & Zaixu Cui & Valerie J. Sydnor & Maxwell A. Bertolero & Azeez Adebimpe & Aaron F. Alexander-Bloch & Christos Davatzikos & Damien A. Fair & Ruben C. Gur & Raquel E. Gur & H, 2022. "Dissociable multi-scale patterns of development in personalized brain networks," Nature Communications, Nature, vol. 13(1), pages 1-15, December.
    19. Marra, Giampiero & Wood, Simon N., 2011. "Practical variable selection for generalized additive models," Computational Statistics & Data Analysis, Elsevier, vol. 55(7), pages 2372-2387, July.
    20. Wenyi Lin & Jingjing Zou & Chongzhi Di & Dorothy D. Sears & Cheryl L. Rock & Loki Natarajan, 2023. "Longitudinal Associations Between Timing of Physical Activity Accumulation and Health: Application of Functional Data Methods," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 15(2), pages 309-329, July.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0238108. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.