IDEAS home Printed from https://ideas.repec.org/a/spr/scient/v124y2020i3d10.1007_s11192-020-03561-y.html
   My bibliography  Save this article

A context-aware citation recommendation model with BERT and graph convolutional networks

Author

Listed:
  • Chanwoo Jeong

    (Gachon University)

  • Sion Jang

    (Gachon University)

  • Eunjeong Park

    (Papago, NAVER)

  • Sungchul Choi

    (Gachon University)

Abstract

With the tremendous growth in the number of scientific papers being published, searching for references while writing a scientific paper is a time-consuming process. A technique that could add a reference citation at the appropriate place in a sentence will be beneficial. In this perspective, the context-aware citation recommendation has been researched for around two decades. Many researchers have utilized the text data called the context sentence, which surrounds the citation tag, and the metadata of the target paper to find the appropriate cited research. However, the lack of well-organized benchmarking datasets, and no model that can attain high performance has made the research difficult. In this paper, we propose a deep learning-based model and well-organized dataset for context-aware paper citation recommendation. Our model comprises a document encoder and a context encoder. For this, we use graph convolutional networks layer, and bidirectional encoder representations from transformers, a pre-trained model of textual data. By modifying the related PeerRead dataset, we propose a new dataset called FullTextPeerRead containing context sentences to cited references and paper metadata. To the best of our knowledge, this dataset is the first well-organized dataset for a context-aware paper recommendation. The results indicate that the proposed model with the proposed datasets can attain state-of-the-art performance and achieve a more than 28% improvement in mean average precision and recall@k.

Suggested Citation

  • Chanwoo Jeong & Sion Jang & Eunjeong Park & Sungchul Choi, 2020. "A context-aware citation recommendation model with BERT and graph convolutional networks," Scientometrics, Springer;Akadémiai Kiadó, vol. 124(3), pages 1907-1922, September.
  • Handle: RePEc:spr:scient:v:124:y:2020:i:3:d:10.1007_s11192-020-03561-y
    DOI: 10.1007/s11192-020-03561-y
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11192-020-03561-y
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11192-020-03561-y?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Meen Chul Kim & Chaomei Chen, 2015. "A scientometric review of emerging trends and new developments in recommendation systems," Scientometrics, Springer;Akadémiai Kiadó, vol. 104(1), pages 239-263, July.
    2. Moed, Henk F., 2010. "Measuring contextual citation impact of scientific journals," Journal of Informetrics, Elsevier, vol. 4(3), pages 265-277.
    3. Bai, Xiaomei & Zhang, Fuli & Lee, Ivan, 2019. "Predicting the citations of scholarly paper," Journal of Informetrics, Elsevier, vol. 13(1), pages 407-418.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Xiaojuan Zhang & Shuqi Song & Yuping Xiong, 2024. "Personalized global citation recommendation with diversification awareness," Scientometrics, Springer;Akadémiai Kiadó, vol. 129(7), pages 3625-3657, July.
    2. Antonina Dattolo & Marco Corbatto, 2022. "Assisting researchers in bibliographic tasks: A new usable, real‐time tool for analyzing bibliographies," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 73(6), pages 757-776, June.
    3. Khalid Haruna & Maizatul Akmar Ismail & Atika Qazi & Habeebah Adamu Kakudi & Mohammed Hassan & Sanah Abdullahi Muaz & Haruna Chiroma, 2020. "Research paper recommender system based on public contextual metadata," Scientometrics, Springer;Akadémiai Kiadó, vol. 125(1), pages 101-114, October.
    4. Choi, Seokkyu & Lee, Hyeonju & Park, Eunjeong & Choi, Sungchul, 2022. "Deep learning for patent landscaping using transformer and graph embedding," Technological Forecasting and Social Change, Elsevier, vol. 175(C).
    5. Kaiwen Shi & Kan Liu & Xinyan He, 2024. "Heterogeneous hypergraph learning for literature retrieval based on citation intents," Scientometrics, Springer;Akadémiai Kiadó, vol. 129(7), pages 4167-4188, July.
    6. Chaker Jebari & Enrique Herrera-Viedma & Manuel Jesus Cobo, 2023. "Context-aware citation recommendation of scientific papers: comparative study, gaps and trends," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(8), pages 4243-4268, August.
    7. Diego Kozlowski & Jennifer Dusdal & Jun Pang & Andreas Zilian, 2021. "Semantic and relational spaces in science of science: deep learning models for article vectorisation," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(7), pages 5881-5910, July.
    8. Chanathip Pornprasit & Xin Liu & Pattararat Kiattipadungkul & Natthawut Kertkeidkachorn & Kyoung-Sook Kim & Thanapon Noraset & Saeed-Ul Hassan & Suppawong Tuarob, 2022. "Enhancing citation recommendation using citation network embedding," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(1), pages 233-264, January.
    9. Jaewoong Choi & Jiho Lee & Janghyeok Yoon & Sion Jang & Jaeyoung Kim & Sungchul Choi, 2022. "A two-stage deep learning-based system for patent citation recommendation," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(11), pages 6615-6636, November.
    10. Yonghe Lu & Meilu Yuan & Jiaxin Liu & Minghong Chen, 2023. "Research on semantic representation and citation recommendation of scientific papers with multiple semantics fusion," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(2), pages 1367-1393, February.
    11. Lu Huang & Xiang Chen & Yi Zhang & Changtian Wang & Xiaoli Cao & Jiarun Liu, 2022. "Identification of topic evolution: network analytics with piecewise linear representation and word embedding," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(9), pages 5353-5383, September.
    12. Tianshuang Qiu & Chuanming Yu & Yunci Zhong & Lu An & Gang Li, 2021. "A scientific citation recommendation model integrating network and text representations," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(11), pages 9199-9221, November.
    13. Moreno La Quatra & Luca Cagliero & Elena Baralis, 2021. "Leveraging full-text article exploration for citation analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(10), pages 8275-8293, October.
    14. Hei-Chia Wang & Jen-Wei Cheng & Che-Tsung Yang, 2022. "SentCite: a sentence-level citation recommender based on the salient similarity among multiple segments," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(5), pages 2521-2546, May.
    15. Jialiang Lin & Yao Yu & Jiaxin Song & Xiaodong Shi, 2022. "Detecting and analyzing missing citations to published scientific entities," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(5), pages 2395-2412, May.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Carlos Olmeda-Gómez & Maria-Antonia Ovalle-Perandones & Antonio Perianes-Rodríguez, 2017. "Co-word analysis and thematic landscapes in Spanish information science literature, 1985–2014," Scientometrics, Springer;Akadémiai Kiadó, vol. 113(1), pages 195-217, October.
    2. Saarela, Mirka & Kärkkäinen, Tommi, 2020. "Can we automate expert-based journal rankings? Analysis of the Finnish publication indicator," Journal of Informetrics, Elsevier, vol. 14(2).
    3. Wang, Xing & Zhang, Zhihui, 2020. "Improving the reliability of short-term citation impact indicators by taking into account the correlation between short- and long-term citation impact," Journal of Informetrics, Elsevier, vol. 14(2).
    4. Stephen Carley & Alan L. Porter, 2012. "A forward diversity index," Scientometrics, Springer;Akadémiai Kiadó, vol. 90(2), pages 407-427, February.
    5. Aurelia Magdalena Pisoschi & Claudia Gabriela Pisoschi, 2016. "Is open access the solution to increase the impact of scientific journals?," Scientometrics, Springer;Akadémiai Kiadó, vol. 109(2), pages 1075-1095, November.
    6. Kaur, Jasleen & Radicchi, Filippo & Menczer, Filippo, 2013. "Universality of scholarly impact metrics," Journal of Informetrics, Elsevier, vol. 7(4), pages 924-932.
    7. Lu, Wei & Ren, Yan & Huang, Yong & Bu, Yi & Zhang, Yuehan, 2021. "Scientific collaboration and career stages: An ego-centric perspective," Journal of Informetrics, Elsevier, vol. 15(4).
    8. Solomon, David J. & Laakso, Mikael & Björk, Bo-Christer, 2013. "A longitudinal comparison of citation rates and growth among open access journals," Journal of Informetrics, Elsevier, vol. 7(3), pages 642-650.
    9. Gao, Qiang & Liang, Zhentao & Wang, Ping & Hou, Jingrui & Chen, Xiuxiu & Liu, Manman, 2021. "Potential index: Revealing the future impact of research topics based on current knowledge networks," Journal of Informetrics, Elsevier, vol. 15(3).
    10. Oleh Pasko & Mykola Hordiyenko & Fuli Chen & Yarmila Tkal & Yulia Abraham, 2021. "Mapping Global Research on International Financial Reporting Standards: A Scientometric Review," International Journal of Financial Research, International Journal of Financial Research, Sciedu Press, vol. 12(3), pages 116-134, May.
    11. Fiorenzo Franceschini & Maurizio Galetto & Domenico Maisano & Luca Mastrogiacomo, 2012. "The success-index: an alternative approach to the h-index for evaluating an individual’s research output," Scientometrics, Springer;Akadémiai Kiadó, vol. 92(3), pages 621-641, September.
    12. Mohammad Rabiei & Seyyed-Mahdi Hosseini-Motlagh & Abdorrahman Haeri, 2017. "Using text mining techniques for identifying research gaps and priorities: a case study of the environmental science in Iran," Scientometrics, Springer;Akadémiai Kiadó, vol. 110(2), pages 815-842, February.
    13. Walters, William H., 2017. "Do subjective journal ratings represent whole journals or typical articles? Unweighted or weighted citation impact?," Journal of Informetrics, Elsevier, vol. 11(3), pages 730-744.
    14. Dejian Yu & Wanru Wang & Shuai Zhang & Wenyu Zhang & Rongyu Liu, 2017. "A multiple-link, mutually reinforced journal-ranking model to measure the prestige of journals," Scientometrics, Springer;Akadémiai Kiadó, vol. 111(1), pages 521-542, April.
    15. Bornmann, Lutz & Leydesdorff, Loet, 2012. "Which are the best performing regions in information science in terms of highly cited papers? Some improvements of our previous mapping approaches," Journal of Informetrics, Elsevier, vol. 6(2), pages 336-345.
    16. Kong, Ling & Wang, Dongbo, 2020. "Comparison of citations and attention of cover and non-cover papers," Journal of Informetrics, Elsevier, vol. 14(4).
    17. Antonio Fernández-Cano & Manuel Torralbo & Mónica Vallejo, 2012. "Time series of scientific growth in Spanish doctoral theses (1848–2009)," Scientometrics, Springer;Akadémiai Kiadó, vol. 91(1), pages 15-36, April.
    18. Shuo Xu & Liyuan Hao & Xin An & Hongshen Pang & Ting Li, 2020. "Review on emerging research topics with key-route main path analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 122(1), pages 607-624, January.
    19. Alona Zharova & Janine Tellinger-Rice & Wolfgang Karl Härdle, 2018. "How to measure the performance of a Collaborative Research Center," Scientometrics, Springer;Akadémiai Kiadó, vol. 117(2), pages 1023-1040, November.
    20. Yu, Dejian & Xu, Chao, 2017. "Mapping research on carbon emissions trading: a co-citation analysis," Renewable and Sustainable Energy Reviews, Elsevier, vol. 74(C), pages 1314-1322.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:scient:v:124:y:2020:i:3:d:10.1007_s11192-020-03561-y. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.