Printed from https://ideas.repec.org/a/bla/jinfst/v74y2023i4p388-401.html

Predicting coauthorship using bibliographic network embedding

Author

Listed:
  • Yongjun Zhu
  • Lihong Quan
  • Pei‐Ying Chen
  • Meen Chul Kim
  • Chao Che

Abstract

Coauthorship prediction applies predictive analytics to bibliographic data to identify authors who are highly likely to be coauthors. In this study, we propose an approach to coauthorship prediction based on bibliographic network embedding, built on a graph‐based bibliographic data model that can represent common bibliographic data, including papers, terms, sources, authors, departments, research interests, universities, and countries. A real‐world dataset released by AMiner, comprising more than 2 million papers, 8 million citations, and 1.7 million authors, was integrated into a large bibliographic network using the proposed data model. Translation‐based methods were applied to the entities and relationships to generate low‐dimensional embeddings that preserve their connectivity information in the original bibliographic network. We applied machine learning algorithms to embeddings that represent the coauthorship relationship between two authors and achieved high prediction performance. The reference model, which combines a network embedding size of 100, the most basic translation‐based method, and a gradient boosting method, achieved an F1 score of 0.9, and even higher scores are obtainable with different embedding sizes and more advanced embedding methods. Thus, the strength of the proposed approach lies in its customizable components under a unified framework.
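The translation-based embedding step in the abstract can be illustrated with a small sketch. It assumes a TransE-style model, in which a triple (head, relation, tail) is plausible when head + relation lies close to tail; the abstract names only "the most basic translation-based method", so TransE is an assumption here. The toy triples, the entity names, the embedding size of 8, and the absolute-difference pair feature are all hypothetical and are not taken from the article.

```python
import math
import random


def transe_score(h, r, t):
    """L2 distance ||h + r - t||; a lower score means a more plausible triple."""
    return math.sqrt(sum((hi + ri - ti) ** 2 for hi, ri, ti in zip(h, r, t)))


def margin_loss(pos, neg, margin=1.0):
    """Hinge loss: a true triple should score at least `margin` below a corrupted one."""
    return max(0.0, margin + pos - neg)


def pair_features(a, b):
    """Hypothetical author-pair representation: element-wise absolute difference."""
    return [abs(x - y) for x, y in zip(a, b)]


def train(triples, dim=8, epochs=200, lr=0.05, margin=1.0, seed=0):
    """Plain SGD on the margin loss with tail corruption (no embedding normalization)."""
    rng = random.Random(seed)
    entities = sorted({x for h, _, t in triples for x in (h, t)})
    relations = sorted({r for _, r, _ in triples})
    ent = {e: [rng.uniform(-0.5, 0.5) for _ in range(dim)] for e in entities}
    rel = {r: [rng.uniform(-0.5, 0.5) for _ in range(dim)] for r in relations}
    for _ in range(epochs):
        for h, r, t in triples:
            # Corrupt the tail with a random other entity (may occasionally hit a true triple).
            t_neg = rng.choice([e for e in entities if e not in (h, t)])
            pos = transe_score(ent[h], rel[r], ent[t])
            neg = transe_score(ent[h], rel[r], ent[t_neg])
            if margin_loss(pos, neg, margin) == 0.0:
                continue  # margin already satisfied, no update needed
            for i in range(dim):
                # Per-dimension gradients of the two distances (guard against division by zero).
                g_pos = (ent[h][i] + rel[r][i] - ent[t][i]) / (pos or 1.0)
                g_neg = (ent[h][i] + rel[r][i] - ent[t_neg][i]) / (neg or 1.0)
                ent[h][i] -= lr * (g_pos - g_neg)
                rel[r][i] -= lr * (g_pos - g_neg)
                ent[t][i] += lr * g_pos
                ent[t_neg][i] -= lr * g_neg
    return ent, rel


# Hypothetical toy network: two authors linked through a shared paper and university.
TRIPLES = [
    ("author:A", "writes", "paper:P1"),
    ("author:B", "writes", "paper:P1"),
    ("author:A", "affiliated_with", "univ:U"),
    ("author:B", "affiliated_with", "univ:U"),
    ("author:C", "writes", "paper:P2"),
]
entity_vecs, relation_vecs = train(TRIPLES)
features_ab = pair_features(entity_vecs["author:A"], entity_vecs["author:B"])
```

In a pipeline like the one the abstract describes, a vector such as `features_ab` would be passed, together with a coauthor/non-coauthor label, to a classifier such as a gradient boosting model; that step is omitted here, and how the article actually combines the two author embeddings is not stated in the abstract.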

Suggested Citation

  • Yongjun Zhu & Lihong Quan & Pei‐Ying Chen & Meen Chul Kim & Chao Che, 2023. "Predicting coauthorship using bibliographic network embedding," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 74(4), pages 388-401, April.
  • Handle: RePEc:bla:jinfst:v:74:y:2023:i:4:p:388-401
    DOI: 10.1002/asi.24711

    Download full text from publisher

    File URL: https://doi.org/10.1002/asi.24711
    Download Restriction: no



    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Alberto Baccini & Eugenio Petrovich, 2022. "Normative versus strategic accounts of acknowledgment data: The case of the top-five journals of economics," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(1), pages 603-635, January.
    2. Ding, Ying, 2011. "Scientific collaboration and endorsement: Network analysis of coauthorship and citation networks," Journal of Informetrics, Elsevier, vol. 5(1), pages 187-203.
    3. Nadine Desrochers & Adèle Paul‐Hus & Jen Pecoskie, 2017. "Five decades of gratitude: A meta‐synthesis of acknowledgments research," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 68(12), pages 2821-2833, December.
    4. Zhu, Yongjun & Kim, Donghun & Jiang, Ting & Zhao, Yi & He, Jiangen & Chen, Xinyi & Lou, Wen, 2024. "Dependency, reciprocity, and informal mentorship in predicting long-term research collaboration: A co-authorship matrix-based multivariate time series analysis," Journal of Informetrics, Elsevier, vol. 18(1).
    5. Franceschet, Massimo & Costantini, Antonio, 2010. "The effect of scholar collaboration on impact and quality of academic papers," Journal of Informetrics, Elsevier, vol. 4(4), pages 540-553.
    6. Jordi Ardanuy, 2012. "Scientific collaboration in Library and Information Science viewed through the Web of Knowledge: the Spanish case," Scientometrics, Springer;Akadémiai Kiadó, vol. 90(3), pages 877-890, March.
    7. Thelwall, Mike & Sud, Pardeep, 2014. "No citation advantage for monograph-based collaborations?," Journal of Informetrics, Elsevier, vol. 8(1), pages 276-283.
    8. Michaël Bikard & Fiona Murray & Joshua S. Gans, 2015. "Exploring Trade-offs in the Organization of Scientific Work: Collaboration and Scientific Reward," Management Science, INFORMS, vol. 61(7), pages 1473-1495, July.
    9. Elizabeth S. Vieira, 2023. "The influence of research collaboration on citation impact: the countries in the European Innovation Scoreboard," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(6), pages 3555-3579, June.
    10. Dorte Henriksen, 2016. "The rise in co-authorship in the social sciences (1980–2013)," Scientometrics, Springer;Akadémiai Kiadó, vol. 107(2), pages 455-476, May.
    11. Sameer Kumar, 2018. "Ethical Concerns in the Rise of Co-Authorship and Its Role as a Proxy of Research Collaborations," Publications, MDPI, vol. 6(3), pages 1-9, August.
    12. Chen, Kaihua & Zhang, Yi & Fu, Xiaolan, 2019. "International research collaboration: An emerging domain of innovation studies?," Research Policy, Elsevier, vol. 48(1), pages 149-168.
    13. Chin-Chang Tsai & Elizabeth A. Corley & Barry Bozeman, 2016. "Collaboration experiences across scientific disciplines and cohorts," Scientometrics, Springer;Akadémiai Kiadó, vol. 108(2), pages 505-529, August.
    14. Hajdeja Iglič & Patrick Doreian & Luka Kronegger & Anuška Ferligoj, 2017. "With whom do researchers collaborate and why?," Scientometrics, Springer;Akadémiai Kiadó, vol. 112(1), pages 153-174, July.
    15. Mehmet Ali Koseoglu, 2016. "Mapping the institutional collaboration network of strategic management research: 1980–2014," Scientometrics, Springer;Akadémiai Kiadó, vol. 109(1), pages 203-226, October.
    16. Hajibabaei, Anahita & Schiffauerova, Andrea & Ebadi, Ashkan, 2022. "Gender-specific patterns in the artificial intelligence scientific ecosystem," Journal of Informetrics, Elsevier, vol. 16(2).
    17. Jarno Hoekman & Koen Frenken, 2013. "Proximity and Stratification in European Scientific Research Collaboration Networks: A Policy Perspective," Advances in Spatial Science, in: Thomas Scherngell (ed.), The Geography of Networks and R&D Collaborations, edition 127, chapter 0, pages 263-277, Springer.
    18. Rafols, Ismael & Leydesdorff, Loet & O’Hare, Alice & Nightingale, Paul & Stirling, Andy, 2012. "How journal rankings can suppress interdisciplinary research: A comparison between Innovation Studies and Business & Management," Research Policy, Elsevier, vol. 41(7), pages 1262-1282.
    19. Jo Royle & Louisa Coles & Dorothy Williams & Paul Evans, 2007. "Publishing in international journals," Scientometrics, Springer;Akadémiai Kiadó, vol. 71(1), pages 59-86, April.
    20. Kazuki Nakajima & Kazuyuki Shudo & Naoki Masuda, 2023. "Higher-order rich-club phenomenon in collaborative research grant networks," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(4), pages 2429-2446, April.
