IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0111795.html
   My bibliography  Save this article

Computational Approaches for Predicting Biomedical Research Collaborations

Author

Listed:
  • Qing Zhang
  • Hong Yu

Abstract

Biomedical research is increasingly collaborative, and successful collaborations often produce high impact work. Computational approaches can be developed for automatically predicting biomedical research collaborations. Previous works of collaboration prediction mainly explored the topological structures of research collaboration networks, leaving out rich semantic information from the publications themselves. In this paper, we propose supervised machine learning approaches to predict research collaborations in the biomedical field. We explored both the semantic features extracted from author research interest profile and the author network topological features. We found that the most informative semantic features for author collaborations are related to research interest, including similarity of out-citing citations, similarity of abstracts. Of the four supervised machine learning models (naïve Bayes, naïve Bayes multinomial, SVMs, and logistic regression), the best performing model is logistic regression with an ROC ranging from 0.766 to 0.980 on different datasets. To our knowledge we are the first to study in depth how research interest and productivities can be used for collaboration prediction. Our approach is computationally efficient, scalable and yet simple to implement. The datasets of this study are available at https://github.com/qingzhanggithub/medline-collaboration-datasets.

Suggested Citation

  • Qing Zhang & Hong Yu, 2014. "Computational Approaches for Predicting Biomedical Research Collaborations," PLOS ONE, Public Library of Science, vol. 9(11), pages 1-13, November.
  • Handle: RePEc:plo:pone00:0111795
    DOI: 10.1371/journal.pone.0111795
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0111795
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0111795&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0111795?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Kyungjoon Lee & John S Brownstein & Richard G Mills & Isaac S Kohane, 2010. "Does Collocation Inform the Impact of Collaboration?," PLOS ONE, Public Library of Science, vol. 5(12), pages 1-6, December.
    2. Jingyuan Luo & Jesse M Flynn & Rachel E Solnick & Elaine Howard Ecklund & Kirstin R W Matthews, 2011. "International Stem Cell Collaboration: How Disparate Policies between the United States and the United Kingdom Impact Research," PLOS ONE, Public Library of Science, vol. 6(3), pages 1-7, March.
    3. Ding, Ying, 2011. "Scientific collaboration and endorsement: Network analysis of coauthorship and citation networks," Journal of Informetrics, Elsevier, vol. 5(1), pages 187-203.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jeong, Yujin & Park, Inchae & Yoon, Byungun, 2019. "Identifying emerging Research and Business Development (R&BD) areas based on topic modeling and visualization with intellectual property right data," Technological Forecasting and Social Change, Elsevier, vol. 146(C), pages 655-672.
    2. Marian-Gabriel Hâncean & Matjaž Perc & Jürgen Lerner, 2021. "The coauthorship networks of the most productive European researchers," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(1), pages 201-224, January.
    3. Yi Bu & Binglu Wang & Win-bin Huang & Shangkun Che & Yong Huang, 2018. "Using the appearance of citations in full text on author co-citation analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 116(1), pages 275-289, July.
    4. Luís Filipe Miranda Grochocki & Andrea Felippe Cabello, 2023. "Research collaboration networks in maturing academic environments," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(4), pages 2535-2556, April.
    5. Jun-Ping Qiu & Ke Dong & Hou-Qiang Yu, 2014. "Comparative study on structure and correlation among author co-occurrence networks in bibliometrics," Scientometrics, Springer;Akadémiai Kiadó, vol. 101(2), pages 1345-1360, November.
    6. Maria Cristiana Martini & Elvira Pelle & Francesco Poggi & Andrea Sciandra, 2022. "The role of citation networks to explain academic promotions: an empirical analysis of the Italian national scientific qualification," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(10), pages 5633-5659, October.
    7. Maxim Kotsemir & Tatiana Kuznetsova & Elena Nasybulina & Anna Pikalova, 2015. "Identifying Directions for Russia’s Science and Technology Cooperation," Foresight-Russia Форсайт, CyberLeninka;Федеральное государственное автономное образовательное учреждение высшего образования «Национальный исследовательский университет «Высшая школа экономики», vol. 9(4 (eng)), pages 54-72.
    8. Cheng, Yu & Huang, Lucheng & Ramlogan, Ronnie & Li, Xin, 2017. "Forecasting of potential impacts of disruptive technology in promising technological areas: Elaborating the SIRS epidemic model in RFID technology," Technological Forecasting and Social Change, Elsevier, vol. 117(C), pages 170-183.
    9. Corinne Cortese & Claire Wright, 2018. "Developing a Community of Practice: Michael Gaffikin and Critical Accounting Research," Abacus, Accounting Foundation, University of Sydney, vol. 54(3), pages 247-276, September.
    10. Jakub Rybacki & Dobromił Serwa, 2021. "What Makes a Successful Scientist in a Central Bank? Evidence From the RePEc Database," Central European Journal of Economic Modelling and Econometrics, Central European Journal of Economic Modelling and Econometrics, vol. 13(3), pages 331-357, September.
    11. Jeeyoung Lim & Joseph J. Kim & Sunkuk Kim, 2021. "A Holistic Review of Building Energy Efficiency and Reduction Based on Big Data," Sustainability, MDPI, vol. 13(4), pages 1-18, February.
    12. Chakresh Kumar Singh & Demival Vasques Filho & Shivakumar Jolad & Dion R. J. O’Neale, 2020. "Evolution of interdependent co-authorship and citation networks," Scientometrics, Springer;Akadémiai Kiadó, vol. 125(1), pages 385-404, October.
    13. Fan Jiang & Niancai Liu, 2018. "The hierarchical status of international academic awards in social sciences," Scientometrics, Springer;Akadémiai Kiadó, vol. 117(3), pages 2091-2115, December.
    14. Jia-Yen Huang & Rong-Chang Chen, 2019. "Exploring the intellectual structure of cloud patents using non-exhaustive overlaps," Scientometrics, Springer;Akadémiai Kiadó, vol. 121(2), pages 739-769, November.
    15. Anuradha IDDAGODA & Hiranya DISSANYAKE & Rohita ABEYSINGHE, 2022. "Green Human Resource Management: A Bibliometric Analysis," Romanian Journal of Economics, Institute of National Economy, vol. 55(2(64)), pages 147-159, December.
    16. Sandeep Soni & Kristina Lerman & Jacob Eisenstein, 2021. "Follow the leader: Documents on the leading edge of semantic change get more citations," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 72(4), pages 478-492, April.
    17. Qingnan Xie & Richard B. Freeman, 2020. "The Contribution of Chinese Diaspora Researchers to Global Science and China's Catching Up in Scientific Research," NBER Working Papers 27169, National Bureau of Economic Research, Inc.
    18. Maki Kato & Asao Ando, 2013. "The relationship between research performance and international collaboration in chemistry," Scientometrics, Springer;Akadémiai Kiadó, vol. 97(3), pages 535-553, December.
    19. Maxim N. Kotsemir & Tatiana E. Kuznetsova & Elena G. Nasybulina & Anna G. Pikalova, 2015. "Empirical Analysis of Multinational S&T Collaboration Priorities –The Case of Russia," HSE Working papers WP BRP 53/STI/2015, National Research University Higher School of Economics.
    20. Li, Eldon Y. & Liao, Chien Hsiang & Yen, Hsiuju Rebecca, 2013. "Co-authorship networks and research impact: A social capital perspective," Research Policy, Elsevier, vol. 42(9), pages 1515-1530.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0111795. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.