Does deep learning help topic extraction? A kernel k-means clustering method with word embedding
Author
Abstract
Suggested Citation
DOI: 10.1016/j.joi.2018.09.004
Download full text from publisher
As the access to this document is restricted, you may want to search for a different version of it.
References listed on IDEAS
- Nieminen, Paavo & Pölönen, Ilkka & Sipola, Tuomo, 2013. "Research literature clustering using diffusion maps," Journal of Informetrics, Elsevier, vol. 7(4), pages 874-886.
- Dangzhi Zhao & Andreas Strotmann, 2014. "The knowledge base and research front of information science 2006–2010: An author cocitation and bibliographic coupling analysis," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 65(5), pages 995-1006, May.
- Zhang, Yi & Shang, Lining & Huang, Lu & Porter, Alan L. & Zhang, Guangquan & Lu, Jie & Zhu, Donghua, 2016. "A hybrid similarity measure method for patent portfolio analysis," Journal of Informetrics, Elsevier, vol. 10(4), pages 1108-1130.
- Jianhua Hou & Xiucai Yang & Chaomei Chen, 2018. "Emerging trends and new developments in information science: a document co-citation analysis (2009–2016)," Scientometrics, Springer;Akadémiai Kiadó, vol. 115(2), pages 869-892, May.
- Wanying Ding & Chaomei Chen, 2014. "Dynamic topic detection and tracking: A comparison of HDP, C-word, and cocitation methods," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 65(10), pages 2084-2097, October.
- Richard Klavans & Kevin W. Boyack, 2017. "Which Type of Citation Analysis Generates the Most Accurate Taxonomy of Scientific and Technical Knowledge?," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 68(4), pages 984-998, April.
- Li, Guan-Cheng & Lai, Ronald & D’Amour, Alexander & Doolin, David M. & Sun, Ye & Torvik, Vetle I. & Yu, Amy Z. & Fleming, Lee, 2014. "Disambiguation and co-authorship networks of the U.S. patent inventor database (1975–2010)," Research Policy, Elsevier, vol. 43(6), pages 941-955.
- Zhang, Yi & Zhang, Guangquan & Chen, Hongshu & Porter, Alan L. & Zhu, Donghua & Lu, Jie, 2016. "Topic analysis and forecasting for science, technology and innovation: Methodology with a case study focusing on big data research," Technological Forecasting and Social Change, Elsevier, vol. 105(C), pages 179-191.
- Loet Leydesdorff, 2008. "On the normalization and visualization of author co‐citation data: Salton's Cosine versus the Jaccard index," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 59(1), pages 77-85, January.
- Colavizza, Giovanni & Franceschet, Massimo, 2016. "Clustering citation histories in the Physical Review," Journal of Informetrics, Elsevier, vol. 10(4), pages 1037-1051.
- Peters, H. P. F. & van Raan, A. F. J., 1993. "Co-word-based science maps of chemical engineering. Part I: Representations by direct multidimensional scaling," Research Policy, Elsevier, vol. 22(1), pages 23-45, February.
- Zhang, Yi & Porter, Alan L. & Hu, Zhengyin & Guo, Ying & Newman, Nils C., 2014. "“Term clumping” for technical intelligence: A case study on dye-sensitized solar cells," Technological Forecasting and Social Change, Elsevier, vol. 85(C), pages 26-39.
- Arho Suominen & Hannes Toivanen, 2016. "Map of science with topic modeling: Comparison of unsupervised learning and human-assigned subject classification," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 67(10), pages 2464-2476, October.
- Theresa Velden & Kevin W. Boyack & Jochen Gläser & Rob Koopman & Andrea Scharnhorst & Shenghui Wang, 2017. "Comparison of topic extraction approaches and their results," Scientometrics, Springer;Akadémiai Kiadó, vol. 111(2), pages 1169-1221, May.
- Yi Zhang & Guangquan Zhang & Donghua Zhu & Jie Lu, 2017. "Scientific evolutionary pathways: Identifying and visualizing relationships for scientific topics," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 68(8), pages 1925-1939, August.
Most related items
These are the items that most often cite the same works as this one and are cited by the same works as this one.- Zhang, Yi & Wu, Mengjia & Miao, Wen & Huang, Lu & Lu, Jie, 2021. "Bi-layer network analytics: A methodology for characterizing emerging general-purpose technologies," Journal of Informetrics, Elsevier, vol. 15(4).
- Lu Huang & Xiang Chen & Yi Zhang & Changtian Wang & Xiaoli Cao & Jiarun Liu, 2022. "Identification of topic evolution: network analytics with piecewise linear representation and word embedding," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(9), pages 5353-5383, September.
- Yi Zhang & Xiaojing Cai & Caroline V. Fry & Mengjia Wu & Caroline S. Wagner, 2021. "Topic evolution, disruption and resilience in early COVID-19 research," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(5), pages 4225-4253, May.
- Carlos Olmeda-Gómez & Carlos Romá-Mateo & Maria-Antonia Ovalle-Perandones, 2019. "Overview of trends in global epigenetic research (2009–2017)," Scientometrics, Springer;Akadémiai Kiadó, vol. 119(3), pages 1545-1574, June.
- Zhang, Yi & Huang, Ying & Porter, Alan L. & Zhang, Guangquan & Lu, Jie, 2019. "Discovering and forecasting interactions in big data research: A learning-enhanced bibliometric study," Technological Forecasting and Social Change, Elsevier, vol. 146(C), pages 795-807.
- Kajikawa, Yuya & Mejia, Cristian & Wu, Mengjia & Zhang, Yi, 2022. "Academic landscape of Technological Forecasting and Social Change through citation network and topic analyses," Technological Forecasting and Social Change, Elsevier, vol. 182(C).
- Huang, Lu & Chen, Xiang & Ni, Xingxing & Liu, Jiarun & Cao, Xiaoli & Wang, Changtian, 2021. "Tracking the dynamics of co-word networks for emerging topic identification," Technological Forecasting and Social Change, Elsevier, vol. 170(C).
- Samira Ranaei & Arho Suominen & Alan Porter & Stephen Carley, 2020. "Evaluating technological emergence using text analytics: two case technologies and three approaches," Scientometrics, Springer;Akadémiai Kiadó, vol. 122(1), pages 215-247, January.
- Lu Huang & Xiang Chen & Yi Zhang & Yihe Zhu & Suyi Li & Xingxing Ni, 2021. "Dynamic network analytics for recommending scientific collaborators," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(11), pages 8789-8814, November.
- Li, Xin & Xie, Qianqian & Daim, Tugrul & Huang, Lucheng, 2019. "Forecasting technology trends using text mining of the gaps between science and technology: The case of perovskite solar cell technology," Technological Forecasting and Social Change, Elsevier, vol. 146(C), pages 432-449.
- Peter Sjögårde & Per Ahlgren & Ludo Waltman, 2021. "Algorithmic labeling in hierarchical classifications of publications: Evaluation of bibliographic fields and term weighting approaches," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 72(7), pages 853-869, July.
- Xiao Zhou & Lu Huang & Yi Zhang & Miaomiao Yu, 2019. "A hybrid approach to detecting technological recombination based on text mining and patent network analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 121(2), pages 699-737, November.
- Tingcan Ma & Ruinan Li & Guiyan Ou & Mingliang Yue, 2018. "Topic based research competitiveness evaluation," Scientometrics, Springer;Akadémiai Kiadó, vol. 117(2), pages 789-803, November.
- Paul Donner, 2021. "Validation of the Astro dataset clustering solutions with external data," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(2), pages 1619-1645, February.
- Carlos Olmeda-Gómez & Maria-Antonia Ovalle-Perandones & Antonio Perianes-Rodríguez, 2017. "Co-word analysis and thematic landscapes in Spanish information science literature, 1985–2014," Scientometrics, Springer;Akadémiai Kiadó, vol. 113(1), pages 195-217, October.
- Lu Huang & Yijie Cai & Erdong Zhao & Shengting Zhang & Yue Shu & Jiao Fan, 2022. "Measuring the interdisciplinarity of Information and Library Science interactions using citation analysis and semantic analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(11), pages 6733-6761, November.
- Jong Hwan Suh, 2019. "SocialTERM-Extractor: Identifying and Predicting Social-Problem-Specific Key Noun Terms from a Large Number of Online News Articles Using Text Mining and Machine Learning Techniques," Sustainability, MDPI, vol. 11(1), pages 1-44, January.
- Lee, Changyong, 2021. "A review of data analytics in technological forecasting," Technological Forecasting and Social Change, Elsevier, vol. 166(C).
- Matthias Held & Grit Laudel & Jochen Gläser, 2021. "Challenges to the validity of topic reconstruction," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(5), pages 4511-4536, May.
- Ballester, Omar & Penner, Orion, 2022. "Robustness, replicability and scalability in topic modelling," Journal of Informetrics, Elsevier, vol. 16(1).
More about this item
Keywords
Bibliometrics; Topic analysis; Cluster analysis; Text mining;All these keywords.
Statistics
Access and download statisticsCorrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:infome:v:12:y:2018:i:4:p:1099-1117. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/joi .
Please note that corrections may take a couple of weeks to filter through the various RePEc services.