IDEAS home Printed from https://ideas.repec.org/a/gam/jdataj/v6y2021i8p83-d607890.html
   My bibliography  Save this article

A Global Book Reading Dataset

Author

Listed:
  • Nazanin Sabri

    (School of Electrical and Computer Engineering, University of Tehran, Tehran 1439957131, Iran)

  • Ingmar Weber

    (Qatar Computing Research Institute, Doha P.O. Box 34110, Qatar)

Abstract

The choice of what to read is both influenced by and indicative of such factors as a person’s beliefs, culture, gender, and socioeconomic status. However, obtaining data including such personal attributes, as well as detailed reading habits and activities of individuals is difficult and would usually require either (i) data from e-readers, such as the Amazon Kindle, or from library checkouts, both of which are hard to obtain, or (ii) distributing questionnaires and conducting interviews, which can be expensive and suffers from recall bias. In this study, we present a dataset of over 40 million reading instances of 1,872,677 unique individuals collected from Goodreads. Goodreads is a book-cataloging social media platform with millions of users, where users share comments on the books they have read, while creating and maintaining social connections. We enrich the dataset with gender and location information. The dataset presented in this study can be used to perform cross-national and cross-gender analyses of reading behavior among book enthusiasts.

Suggested Citation

  • Nazanin Sabri & Ingmar Weber, 2021. "A Global Book Reading Dataset," Data, MDPI, vol. 6(8), pages 1-11, August.
  • Handle: RePEc:gam:jdataj:v:6:y:2021:i:8:p:83-:d:607890
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2306-5729/6/8/83/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2306-5729/6/8/83/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Mike Thelwall & Kayvan Kousha, 2017. "Goodreads: A social network site for book readers," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 68(4), pages 972-983, April.
    2. Kayvan Kousha & Mike Thelwall & Mahshid Abdoli, 2017. "Goodreads reviews to assess the wider impacts of books," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 68(8), pages 2004-2016, August.
    3. Ahmad Mohammad Alghamdi & Hisham Ihshaish, 2021. "The use and impact of Goodreads rating and reviews, for readers of Arabic books," International Journal of Business Information Systems, Inderscience Enterprises Ltd, vol. 37(4), pages 442-466.
    4. Kayvan Kousha & Mike Thelwall, 2016. "Can Amazon.com reviews help to assess the wider impacts of books?," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 67(3), pages 566-581, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Wang, Kai & Liu, Xiaojuan & Han, Yutong, 2019. "Exploring Goodreads reviews for book impact assessment," Journal of Informetrics, Elsevier, vol. 13(3), pages 874-886.
    2. Mohammadamin Erfanmanesh & A. Noorhidawati & A. Abrizah, 2019. "What can Bookmetrix tell us about the impact of Springer Nature’s books," Scientometrics, Springer;Akadémiai Kiadó, vol. 121(1), pages 521-536, October.
    3. Siluo Yang & Xin Xing & Fan Qi & Maria Cláudia Cabrini Grácio, 2021. "Comparison of academic book impact from a disciplinary perspective: an analysis of citations and altmetric indicators," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(2), pages 1101-1123, February.
    4. Ashraf Maleki, 2022. "OCLC library holdings: assessing availability of academic books in libraries in print and electronic compared to citations and altmetrics," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(2), pages 991-1020, February.
    5. Ashraf Maleki, 2022. "Why does library holding format really matter for book impact assessment?: Modelling the relationship between citations and altmetrics with print and electronic holdings," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(2), pages 1129-1160, February.
    6. Maja Jokić & Andrea Mervar & Stjepan Mateljan, 2019. "Comparative analysis of book citations in social science journals by Central and Eastern European authors," Scientometrics, Springer;Akadémiai Kiadó, vol. 120(3), pages 1005-1029, September.
    7. Eleonora Dagienė, 2024. "Mapping scholarly books: library metadata and research assessment," Scientometrics, Springer;Akadémiai Kiadó, vol. 129(9), pages 5689-5714, September.
    8. Yajie Wang & Alesia Zuccala, 2021. "Scholarly book publishers as publicity agents for SSH titles on Twitter," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(6), pages 4817-4840, June.
    9. Zhou, Qingqing & Zhang, Chengzhi, 2021. "Impacts towards a comprehensive assessment of the book impact by integrating multiple evaluation sources," Journal of Informetrics, Elsevier, vol. 15(3).
    10. Qingqing Zhou & Chengzhi Zhang, 2020. "Evaluating wider impacts of books via fine-grained mining on citation literatures," Scientometrics, Springer;Akadémiai Kiadó, vol. 125(3), pages 1923-1948, December.
    11. Daniel Torres-Salinas & Nicolás Robinson-Garcia & Juan Gorraiz, 2017. "Filling the citation gap: measuring the multidimensional impact of the academic book at institutional level with PlumX," Scientometrics, Springer;Akadémiai Kiadó, vol. 113(3), pages 1371-1384, December.
    12. Mingkun Wei & Abdolreza Noroozi Chakoli, 2020. "Evaluating the relationship between the academic and social impact of open access books based on citation behaviors and social media attention," Scientometrics, Springer;Akadémiai Kiadó, vol. 125(3), pages 2401-2420, December.
    13. Anton Oleinik, 2024. "A Bayesian index of association: comparison with other measures and performance," Quality & Quantity: International Journal of Methodology, Springer, vol. 58(1), pages 277-305, February.
    14. Mojisola Erdt & Aarthy Nagarajan & Sei-Ching Joanna Sin & Yin-Leng Theng, 2016. "Altmetrics: an analysis of the state-of-the-art in measuring research impact on social media," Scientometrics, Springer;Akadémiai Kiadó, vol. 109(2), pages 1117-1166, November.
    15. Zhang, Chengzhi & Zhou, Qingqing, 2020. "Assessing books’ depth and breadth via multi-level mining on tables of contents," Journal of Informetrics, Elsevier, vol. 14(2).
    16. Anton Oleinik, 2022. "Relevance in Web search: between content, authority and popularity," Quality & Quantity: International Journal of Methodology, Springer, vol. 56(1), pages 173-194, February.
    17. Qingqing Zhou & Chengzhi Zhang & Star X. Zhao & Bikun Chen, 2016. "Measuring book impact based on the multi-granularity online review mining," Scientometrics, Springer;Akadémiai Kiadó, vol. 107(3), pages 1435-1455, June.
    18. Daniel Torres-Salinas & Wenceslao Arroyo-Machado & Mike Thelwall, 2021. "Exploring WorldCat identities as an altmetric information source: a library catalog analysis experiment in the field of Scientometrics," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(2), pages 1725-1743, February.

    More about this item

    Keywords

    reading; dataset; Goodreads;
    All these keywords.

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jdataj:v:6:y:2021:i:8:p:83-:d:607890. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.