IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0110268.html
   My bibliography  Save this article

The Dawn of Open Access to Phylogenetic Data

Author

Listed:
  • Andrew F Magee
  • Michael R May
  • Brian R Moore

Abstract

The scientific enterprise depends critically on the preservation of and open access to published data. This basic tenet applies acutely to phylogenies (estimates of evolutionary relationships among species). Increasingly, phylogenies are estimated from increasingly large, genome-scale datasets using increasingly complex statistical methods that require increasing levels of expertise and computational investment. Moreover, the resulting phylogenetic data provide an explicit historical perspective that critically informs research in a vast and growing number of scientific disciplines. One such use is the study of changes in rates of lineage diversification (speciation – extinction) through time. As part of a meta-analysis in this area, we sought to collect phylogenetic data (comprising nucleotide sequence alignment and tree files) from 217 studies published in 46 journals over a 13-year period. We document our attempts to procure those data (from online archives and by direct request to corresponding authors), and report results of analyses (using Bayesian logistic regression) to assess the impact of various factors on the success of our efforts. Overall, complete phylogenetic data for of these studies are effectively lost to science. Our study indicates that phylogenetic data are more likely to be deposited in online archives and/or shared upon request when: (1) the publishing journal has a strong data-sharing policy; (2) the publishing journal has a higher impact factor, and; (3) the data are requested from faculty rather than students. Importantly, our survey spans recent policy initiatives and infrastructural changes; our analyses indicate that the positive impact of these community initiatives has been both dramatic and immediate. Although the results of our study indicate that the situation is dire, our findings also reveal tremendous recent progress in the sharing and preservation of phylogenetic data.

Suggested Citation

  • Andrew F Magee & Michael R May & Brian R Moore, 2014. "The Dawn of Open Access to Phylogenetic Data," PLOS ONE, Public Library of Science, vol. 9(10), pages 1-10, October.
  • Handle: RePEc:plo:pone00:0110268
    DOI: 10.1371/journal.pone.0110268
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0110268
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0110268&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0110268?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Jelte M Wicherts & Marjan Bakker & Dylan Molenaar, 2011. "Willingness to Share Research Data Is Related to the Strength of the Evidence and the Quality of Reporting of Statistical Results," PLOS ONE, Public Library of Science, vol. 6(11), pages 1-7, November.
    2. Cédric Notredame, 2007. "Recent Evolutions of Multiple Sequence Alignment Algorithms," PLOS Computational Biology, Public Library of Science, vol. 3(8), pages 1-4, August.
    3. Caroline J Savage & Andrew J Vickers, 2009. "Empirical Study of Data Sharing by Authors Publishing in PLoS Journals," PLOS ONE, Public Library of Science, vol. 4(9), pages 1-3, September.
    4. Alawi A Alsheikh-Ali & Waqas Qureshi & Mouaz H Al-Mallah & John P A Ioannidis, 2011. "Public Availability of Published Research Data in High-Impact Journals," PLOS ONE, Public Library of Science, vol. 6(9), pages 1-4, September.
    5. Heather A. Piwowar & Todd J. Vision & Michael C. Whitlock, 2011. "Data archiving is a good investment," Nature, Nature, vol. 473(7347), pages 285-285, May.
    6. Heather A Piwowar & Roger S Day & Douglas B Fridsma, 2007. "Sharing Detailed Research Data Is Associated with Increased Citation Rate," PLOS ONE, Public Library of Science, vol. 2(3), pages 1-5, March.
    7. Bryan T Drew & Romina Gazis & Patricia Cabezas & Kristen S Swithers & Jiabin Deng & Roseana Rodriguez & Laura A Katz & Keith A Crandall & David S Hibbett & Douglas E Soltis, 2013. "Lost Branches on the Tree of Life," PLOS Biology, Public Library of Science, vol. 11(9), pages 1-5, September.
    8. Julie D Thompson & Benjamin Linard & Odile Lecompte & Olivier Poch, 2011. "A Comprehensive Benchmark Study of Multiple Sequence Alignment Methods: Current Challenges and Future Perspectives," PLOS ONE, Public Library of Science, vol. 6(3), pages 1-14, March.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Mallory C Kidwell & Ljiljana B Lazarević & Erica Baranski & Tom E Hardwicke & Sarah Piechowski & Lina-Sophia Falkenberg & Curtis Kennett & Agnieszka Slowik & Carina Sonnleitner & Chelsey Hess-Holden &, 2016. "Badges to Acknowledge Open Practices: A Simple, Low-Cost, Effective Method for Increasing Transparency," PLOS Biology, Public Library of Science, vol. 14(5), pages 1-15, May.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Nicola Milia & Alessandra Congiu & Paolo Anagnostou & Francesco Montinaro & Marco Capocasa & Emanuele Sanna & Giovanni Destro Bisol, 2012. "Mine, Yours, Ours? Sharing Data on Human Genetic Variation," PLOS ONE, Public Library of Science, vol. 7(6), pages 1-8, June.
    2. Genevieve Pham-Kanter & Darren E Zinner & Eric G Campbell, 2014. "Codifying Collegiality: Recent Developments in Data Sharing Policy in the Life Sciences," PLOS ONE, Public Library of Science, vol. 9(9), pages 1-8, September.
    3. Zeng, Tong & Wu, Longfeng & Bratt, Sarah & Acuna, Daniel E., 2020. "Assigning credit to scientific datasets using article citation networks," Journal of Informetrics, Elsevier, vol. 14(2).
    4. John Ernest Kratz & Carly Strasser, 2015. "Researcher Perspectives on Publication and Peer Review of Data," PLOS ONE, Public Library of Science, vol. 10(2), pages 1-21, February.
    5. Christopher W Belter, 2014. "Measuring the Value of Research Data: A Citation Analysis of Oceanographic Data Sets," PLOS ONE, Public Library of Science, vol. 9(3), pages 1-9, March.
    6. Dominique G Roche & Loeske E B Kruuk & Robert Lanfear & Sandra A Binning, 2015. "Public Data Archiving in Ecology and Evolution: How Well Are We Doing?," PLOS Biology, Public Library of Science, vol. 13(11), pages 1-12, November.
    7. Jennifer C Molloy, 2012. "The Open Knowledge Foundation: Open Data Means Better Science," Working Papers id:4686, eSocialSciences.
    8. Isabella Peters & Peter Kraker & Elisabeth Lex & Christian Gumpenberger & Juan Gorraiz, 2016. "Research data explored: an extended analysis of citations and altmetrics," Scientometrics, Springer;Akadémiai Kiadó, vol. 107(2), pages 723-744, May.
    9. Bryan T Drew & Romina Gazis & Patricia Cabezas & Kristen S Swithers & Jiabin Deng & Roseana Rodriguez & Laura A Katz & Keith A Crandall & David S Hibbett & Douglas E Soltis, 2013. "Lost Branches on the Tree of Life," PLOS Biology, Public Library of Science, vol. 11(9), pages 1-5, September.
    10. Jennifer C Molloy, 2011. "The Open Knowledge Foundation: Open Data Means Better Science," PLOS Biology, Public Library of Science, vol. 9(12), pages 1-4, December.
    11. Vanessa V Sochat & Cameron J Prybol & Gregory M Kurtzer, 2017. "Enhancing reproducibility in scientific computing: Metrics and registry for Singularity containers," PLOS ONE, Public Library of Science, vol. 12(11), pages 1-24, November.
    12. Garret Christensen & Allan Dafoe & Edward Miguel & Don A Moore & Andrew K Rose, 2019. "A study of the impact of data sharing on article citations using journal policies as a natural experiment," PLOS ONE, Public Library of Science, vol. 14(12), pages 1-13, December.
    13. Andreoli-Versbach, Patrick & Mueller-Langer, Frank, 2014. "Open access to data: An ideal professed but not practised," Research Policy, Elsevier, vol. 43(9), pages 1621-1633.
    14. Benedikt Fecher & Sascha Friesike & Marcel Hebing, 2014. "What Drives Academic Data Sharing?," SOEPpapers on Multidisciplinary Panel Data Research 655, DIW Berlin, The German Socio-Economic Panel (SOEP).
    15. Javier Martínez-Vega & David Rodríguez-Rodríguez, 2022. "Protected Area Effectiveness in the Scientific Literature: A Decade-Long Bibliometric Analysis," Land, MDPI, vol. 11(6), pages 1-14, June.
    16. Ya-Mei Ding & Xiao-Xu Pang & Yu Cao & Wei-Ping Zhang & Susanne S. Renner & Da-Yong Zhang & Wei-Ning Bai, 2023. "Genome structure-based Juglandaceae phylogenies contradict alignment-based phylogenies and substitution rates vary with DNA repair genes," Nature Communications, Nature, vol. 14(1), pages 1-13, December.
    17. Mark J. McCabe & Frank Mueller-Langer, 2019. "Does Data Disclosure Increase Citations? Empirical Evidence from a Natural Experiment in Leading Economics Journals," JRC Working Papers on Digital Economy 2019-02, Joint Research Centre.
    18. Stephanie B Linek & Benedikt Fecher & Sascha Friesike & Marcel Hebing, 2017. "Data sharing as social dilemma: Influence of the researcher’s personality," PLOS ONE, Public Library of Science, vol. 12(8), pages 1-24, August.
    19. Coosje L S Veldkamp & Michèle B Nuijten & Linda Dominguez-Alvarez & Marcel A L M van Assen & Jelte M Wicherts, 2014. "Statistical Reporting Errors and Collaboration on Statistical Analyses in Psychological Science," PLOS ONE, Public Library of Science, vol. 9(12), pages 1-19, December.
    20. Saeedeh Akbari Rokn Abadi & Negin Hashemi Dijujin & Somayyeh Koohi, 2021. "Optical pattern generator for efficient bio-data encoding in a photonic sequence comparison architecture," PLOS ONE, Public Library of Science, vol. 16(1), pages 1-27, January.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0110268. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.