IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0104798.html
   My bibliography  Save this article

How Do Astronomers Share Data? Reliability and Persistence of Datasets Linked in AAS Publications and a Qualitative Study of Data Practices among US Astronomers

Author

Listed:
  • Alberto Pepe
  • Alyssa Goodman
  • August Muench
  • Merce Crosas
  • Christopher Erdmann

Abstract

We analyze data sharing practices of astronomers over the past fifteen years. An analysis of URL links embedded in papers published by the American Astronomical Society reveals that the total number of links included in the literature rose dramatically from 1997 until 2005, when it leveled off at around 1500 per year. The analysis also shows that the availability of linked material decays with time: in 2011, 44% of links published a decade earlier, in 2001, were broken. A rough analysis of link types reveals that links to data hosted on astronomers' personal websites become unreachable much faster than links to datasets on curated institutional sites. To gauge astronomers' current data sharing practices and preferences further, we performed in-depth interviews with 12 scientists and online surveys with 173 scientists, all at a large astrophysical research institute in the United States: the Harvard-Smithsonian Center for Astrophysics, in Cambridge, MA. Both the in-depth interviews and the online survey indicate that, in principle, there is no philosophical objection to data-sharing among astronomers at this institution. Key reasons that more data are not presently shared more efficiently in astronomy include: the difficulty of sharing large data sets; over reliance on non-robust, non-reproducible mechanisms for sharing data (e.g. emailing it); unfamiliarity with options that make data-sharing easier (faster) and/or more robust; and, lastly, a sense that other researchers would not want the data to be shared. We conclude with a short discussion of a new effort to implement an easy-to-use, robust, system for data sharing in astronomy, at theastrodata.org, and we analyze the uptake of that system to-date.

Suggested Citation

  • Alberto Pepe & Alyssa Goodman & August Muench & Merce Crosas & Christopher Erdmann, 2014. "How Do Astronomers Share Data? Reliability and Persistence of Datasets Linked in AAS Publications and a Qualitative Study of Data Practices among US Astronomers," PLOS ONE, Public Library of Science, vol. 9(8), pages 1-11, August.
  • Handle: RePEc:plo:pone00:0104798
    DOI: 10.1371/journal.pone.0104798
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0104798
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0104798&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0104798?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Christine L. Borgman, 2012. "The conundrum of sharing research data," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 63(6), pages 1059-1078, June.
    2. Alyssa Goodman & Alberto Pepe & Alexander W Blocker & Christine L Borgman & Kyle Cranmer & Merce Crosas & Rosanne Di Stefano & Yolanda Gil & Paul Groth & Margaret Hedstrom & David W Hogg & Vinay Kashy, 2014. "Ten Simple Rules for the Care and Feeding of Scientific Data," PLOS Computational Biology, Public Library of Science, vol. 10(4), pages 1-5, April.
    3. Christine L. Borgman, 2012. "The conundrum of sharing research data," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 63(6), pages 1059-1078, June.
    4. Gary King, 2007. "An Introduction to the Dataverse Network as an Infrastructure for Data Sharing," Sociological Methods & Research, , vol. 36(2), pages 173-199, November.
    5. Michael J. Kurtz & Guenther Eichhorn & Alberto Accomazzi & Carolyn Grant & Markus Demleitner & Stephen S. Murray, 2005. "Worldwide use and impact of the NASA Astrophysics Data System digital library," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 56(1), pages 36-45, January.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ana Trisovic & Katherine Mika & Ceilyn Boyd & Sebastian Feger & Mercè Crosas, 2021. "Repository Approaches to Improving the Quality of Shared Data and Code," Data, MDPI, vol. 6(2), pages 1-12, February.
    2. Vikas Jaiman & Leonard Pernice & Visara Urovi, 2022. "User incentives for blockchain-based data sharing platforms," PLOS ONE, Public Library of Science, vol. 17(4), pages 1-22, April.
    3. Benedikt Fecher & Sascha Friesike & Marcel Hebing, 2014. "What Drives Academic Data Sharing?," SOEPpapers on Multidisciplinary Panel Data Research 655, DIW Berlin, The German Socio-Economic Panel (SOEP).
    4. Mike Thelwall, 2020. "Data in Brief: Can a mega-journal for data be useful?," Scientometrics, Springer;Akadémiai Kiadó, vol. 124(1), pages 697-709, July.
    5. Carol Tenopir & Elizabeth D Dalton & Suzie Allard & Mike Frame & Ivanka Pjesivac & Ben Birch & Danielle Pollock & Kristina Dorsett, 2015. "Changes in Data Sharing and Data Reuse Practices and Perceptions among Scientists Worldwide," PLOS ONE, Public Library of Science, vol. 10(8), pages 1-24, August.
    6. Federica Cugnata & Chiara Brombin & Chiara Maria Poli & Roberto Buccione & Clelia Serio, 2024. "Modelling perception and resilience factors to data sharing in clinical and basic research: an observational study," Scientometrics, Springer;Akadémiai Kiadó, vol. 129(6), pages 3169-3192, June.
    7. Andrea K. Thomer, 2022. "Integrative data reuse at scientifically significant sites: Case studies at Yellowstone National Park and the La Brea Tar Pits," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 73(8), pages 1155-1170, August.
    8. Christopher W Belter, 2014. "Measuring the Value of Research Data: A Citation Analysis of Oceanographic Data Sets," PLOS ONE, Public Library of Science, vol. 9(3), pages 1-9, March.
    9. Koenraad De Smedt & Dimitris Koureas & Peter Wittenburg, 2020. "FAIR Digital Objects for Science: From Data Pieces to Actionable Knowledge Units," Publications, MDPI, vol. 8(2), pages 1-17, April.
    10. Plantin, Jean-Christophe, 2021. "The data archive as factory: alienation and resistance of data processors," LSE Research Online Documents on Economics 109692, London School of Economics and Political Science, LSE Library.
    11. Keren Weinshall & Lee Epstein, 2020. "Developing High‐Quality Data Infrastructure for Legal Analytics: Introducing the Israeli Supreme Court Database," Journal of Empirical Legal Studies, John Wiley & Sons, vol. 17(2), pages 416-434, June.
    12. Jenny Bossaller & Anthony J. Million, 2023. "The research data life cycle, legacy data, and dilemmas in research data management," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 74(6), pages 701-706, June.
    13. Guillaume Cabanac & Thomas Preuss, 2013. "Capitalizing on order effects in the bids of peer-reviewed conferences to secure reviews by expert referees," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 64(2), pages 405-415, February.
    14. Liwei Zhang & Liang Ma, 2021. "Does open data boost journal impact: evidence from Chinese economics," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(4), pages 3393-3419, April.
    15. Shibayama, Sotaro & Lawson, Cornelia, 2021. "The use of rewards in the sharing of research resources," Research Policy, Elsevier, vol. 50(7).
    16. Koutroumpis, Pantelis & Leiponen, Aija & Thomas, Llewellyn D W, 2017. "The (Unfulfilled) Potential of Data Marketplaces," ETLA Working Papers 53, The Research Institute of the Finnish Economy.
    17. Jillian C Wallis & Elizabeth Rolando & Christine L Borgman, 2013. "If We Share Data, Will Anyone Use Them? Data Sharing and Reuse in the Long Tail of Science and Technology," PLOS ONE, Public Library of Science, vol. 8(7), pages 1-17, July.
    18. Paolo Anagnostou & Marco Capocasa & Nicola Milia & Emanuele Sanna & Cinzia Battaggia & Daniela Luzi & Giovanni Destro Bisol, 2015. "When Data Sharing Gets Close to 100%: What Human Paleogenetics Can Teach the Open Science Movement," PLOS ONE, Public Library of Science, vol. 10(3), pages 1-14, March.
    19. Gary A. Hoover & Christian Hopp, 2017. "What Crisis? Taking Stock of Management Researchers' Experiences with and Views of Scholarly Misconduct," CESifo Working Paper Series 6611, CESifo.
    20. Ryan P Womack, 2015. "Research Data in Core Journals in Biology, Chemistry, Mathematics, and Physics," PLOS ONE, Public Library of Science, vol. 10(12), pages 1-22, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0104798. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.