
Exploratory search of academic publication and citation data using interactive tag cloud visualizations

Authors

  • Marcel Dunaiski (Stellenbosch University)
  • Gillian J. Greene (Stellenbosch University; CAIR, CSIR Meraka)
  • Bernd Fischer (Stellenbosch University; CAIR, CSIR Meraka)

Abstract

Acquiring an overview of an unfamiliar discipline and exploring relevant papers and journals is often a laborious task for researchers. In this paper we show how exploratory search can be supported on a large collection of academic papers to allow users to answer complex scientometric questions which traditional retrieval approaches do not support optimally. We use our ConceptCloud browser, which makes use of a combination of concept lattices and tag clouds, to visually present academic publication data (specifically, the ACM Digital Library) in a browsable format that facilitates exploratory search. We augment this dataset with semantic categories, obtained through automatic keyphrase extraction from papers’ titles and abstracts, in order to provide the user with uniform keyphrases of the underlying data collection. We use the citations and references of papers to provide additional mechanisms for exploring relevant research by presenting aggregated reference and citation data not only for a single paper but also across topics, authors and journals, which is novel in our approach. We conduct a user study to evaluate our approach in which we asked 34 participants, from different academic backgrounds with varying degrees of research experience, to answer a variety of scientometric questions using our ConceptCloud browser. Participants were able to answer complex scientometric questions using our ConceptCloud browser with a mean correctness of 73%, with the user’s prior research experience having no statistically significant effect on the results.

Suggested Citation

  • Marcel Dunaiski & Gillian J. Greene & Bernd Fischer, 2017. "Exploratory search of academic publication and citation data using interactive tag cloud visualizations," Scientometrics, Springer;Akadémiai Kiadó, vol. 110(3), pages 1539-1571, March.
  • Handle: RePEc:spr:scient:v:110:y:2017:i:3:d:10.1007_s11192-016-2236-3
    DOI: 10.1007/s11192-016-2236-3

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11192-016-2236-3
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    References listed on IDEAS

    1. Cody Dunne & Ben Shneiderman & Robert Gove & Judith Klavans & Bonnie Dorr, 2012. "Rapid understanding of scientific paper collections: Integrating statistics, text analytics, and visualization," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 63(12), pages 2351-2369, December.
    2. Helmut A. Abt, 2007. "The future of single-authored papers," Scientometrics, Springer;Akadémiai Kiadó, vol. 73(3), pages 353-358, December.
    3. Cody Dunne & Ben Shneiderman & Robert Gove & Judith Klavans & Bonnie Dorr, 2012. "Rapid understanding of scientific paper collections: Integrating statistics, text analytics, and visualization," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 63(12), pages 2351-2369, December.
    4. Chen, P. & Xie, H. & Maslov, S. & Redner, S., 2007. "Finding scientific gems with Google’s PageRank algorithm," Journal of Informetrics, Elsevier, vol. 1(1), pages 8-15.
    5. Ping Liu & Qiong Wu & Xiangming Mu & Kaipeng Yu & Yiting Guo, 2015. "Detecting the intellectual structure of library and information science based on formal concept analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 104(3), pages 737-762, September.
    6. Isidro F. Aguillo & Judit Bar-Ilan & Mark Levene & José Luis Ortega, 2010. "Comparing university rankings," Scientometrics, Springer;Akadémiai Kiadó, vol. 85(1), pages 243-256, October.
    7. Jevin D. West & Michael C. Jensen & Ralph J. Dandrea & Gregory J. Gordon & Carl T. Bergstrom, 2013. "Author‐level Eigenfactor metrics: Evaluating the influence of authors, institutions, and countries within the social science research network community," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 64(4), pages 787-801, April.
    8. Juan Zhang & Qi Yu & Fashan Zheng & Chao Long & Zuxun Lu & Zhiguang Duan, 2016. "Comparing keywords plus of WOS and author keywords: A case study of patient adherence research," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 67(4), pages 967-972, April.
    9. Dunaiski, Marcel & Visser, Willem & Geldenhuys, Jaco, 2016. "Evaluating paper and author ranking algorithms using impact and contribution awards," Journal of Informetrics, Elsevier, vol. 10(2), pages 392-407.
    10. Martin Rosvall & Carl T Bergstrom, 2010. "Mapping Change in Large Networks," PLOS ONE, Public Library of Science, vol. 5(1), pages 1-7, January.
    11. Parolo, Pietro Della Briotta & Pan, Raj Kumar & Ghosh, Rumi & Huberman, Bernardo A. & Kaski, Kimmo & Fortunato, Santo, 2015. "Attention decay in science," Journal of Informetrics, Elsevier, vol. 9(4), pages 734-745.

    Citations

    Citations are extracted by the CitEc Project.

    Cited by:

    1. Chunxiu Qin & Yaxi Liu & Xubu Ma & Jiangping Chen & Huigang Liang, 2022. "Designing for serendipity in online knowledge communities: An investigation of tag presentation formats and openness to experience," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 73(10), pages 1401-1417, October.
    2. Shiwei Fan & Lan Xue & Jianhua Xu, 2018. "What Drives Policy Attention to Climate Change in China? An Empirical Analysis through the Lens of People’s Daily," Sustainability, MDPI, vol. 10(9), pages 1-20, August.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Xu, Shuqi & Mariani, Manuel Sebastian & Lü, Linyuan & Medo, Matúš, 2020. "Unbiased evaluation of ranking metrics reveals consistent performance in science and technology citation data," Journal of Informetrics, Elsevier, vol. 14(1).
    2. Vaccario, Giacomo & Medo, Matúš & Wider, Nicolas & Mariani, Manuel Sebastian, 2017. "Quantifying and suppressing ranking bias in a large citation network," Journal of Informetrics, Elsevier, vol. 11(3), pages 766-782.
    3. Dunaiski, Marcel & Geldenhuys, Jaco & Visser, Willem, 2019. "On the interplay between normalisation, bias, and performance of paper impact metrics," Journal of Informetrics, Elsevier, vol. 13(1), pages 270-290.
    4. Dunaiski, Marcel & Geldenhuys, Jaco & Visser, Willem, 2019. "Globalised vs averaged: Bias and ranking performance on the author level," Journal of Informetrics, Elsevier, vol. 13(1), pages 299-313.
    5. Mariani, Manuel Sebastian & Medo, Matúš & Zhang, Yi-Cheng, 2016. "Identification of milestone papers through time-balanced network centrality," Journal of Informetrics, Elsevier, vol. 10(4), pages 1207-1223.
    6. Dunaiski, Marcel & Geldenhuys, Jaco & Visser, Willem, 2018. "Author ranking evaluation at scale," Journal of Informetrics, Elsevier, vol. 12(3), pages 679-702.
    7. Dejian Yu & Wanru Wang & Shuai Zhang & Wenyu Zhang & Rongyu Liu, 2017. "A multiple-link, mutually reinforced journal-ranking model to measure the prestige of journals," Scientometrics, Springer;Akadémiai Kiadó, vol. 111(1), pages 521-542, April.
    8. Yu Zhang & Min Wang & Morteza Saberi & Elizabeth Chang, 2022. "Analysing academic paper ranking algorithms using test data and benchmarks: an investigation," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(7), pages 4045-4074, July.
    9. Xipeng Liu & Xinmiao Li, 2024. "Unbiased evaluation of ranking algorithms applied to the Chinese green patents citation network," Scientometrics, Springer;Akadémiai Kiadó, vol. 129(6), pages 2999-3021, June.
    10. Holly N. Wolcott & Matthew J. Fouch & Elizabeth R. Hsu & Leo G. DiJoseph & Catherine A. Bernaciak & James G. Corrigan & Duane E. Williams, 2016. "Modeling time-dependent and -independent indicators to facilitate identification of breakthrough research papers," Scientometrics, Springer;Akadémiai Kiadó, vol. 107(2), pages 807-817, May.
    11. Massucci, Francesco Alessandro & Docampo, Domingo, 2019. "Measuring the academic reputation through citation networks via PageRank," Journal of Informetrics, Elsevier, vol. 13(1), pages 185-201.
    12. Alan Peter Matthews, 2012. "South African universities in world rankings," Scientometrics, Springer;Akadémiai Kiadó, vol. 92(3), pages 675-695, September.
    13. Nykl, Michal & Campr, Michal & Ježek, Karel, 2015. "Author ranking based on personalized PageRank," Journal of Informetrics, Elsevier, vol. 9(4), pages 777-799.
    14. Yanbo Zhou & Xin-Li Xu & Xu-Hua Yang & Qu Li, 2022. "The influence of disruption on evaluating the scientific significance of papers," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(10), pages 5931-5945, October.
    15. Zhou, Yanbo & Li, Qu & Yang, Xuhua & Cheng, Hongbing, 2021. "Predicting the popularity of scientific publications by an age-based diffusion model," Journal of Informetrics, Elsevier, vol. 15(4).
    16. Dunaiski, Marcel & Visser, Willem & Geldenhuys, Jaco, 2016. "Evaluating paper and author ranking algorithms using impact and contribution awards," Journal of Informetrics, Elsevier, vol. 10(2), pages 392-407.
    17. Fiala, Dalibor & Šubelj, Lovro & Žitnik, Slavko & Bajec, Marko, 2015. "Do PageRank-based author rankings outperform simple citation counts?," Journal of Informetrics, Elsevier, vol. 9(2), pages 334-348.
    18. Jun Zhang & Zhaolong Ning & Xiaomei Bai & Xiangjie Kong & Jinmeng Zhou & Feng Xia, 2017. "Exploring time factors in measuring the scientific impact of scholars," Scientometrics, Springer;Akadémiai Kiadó, vol. 112(3), pages 1301-1321, September.
    19. Jiang, Xiaorui & Zhuge, Hai, 2019. "Forward search path count as an alternative indirect citation impact indicator," Journal of Informetrics, Elsevier, vol. 13(4).
    20. Qing Ping & Chaomei Chen, 2018. "LitStoryTeller+: an interactive system for multi-level scientific paper visual storytelling with a supportive text mining toolbox," Scientometrics, Springer;Akadémiai Kiadó, vol. 116(3), pages 1887-1944, September.
