IDEAS home Printed from https://ideas.repec.org/a/eee/techno/v112y2022ics0166497221001887.html
   My bibliography  Save this article

Screening ideas in the early stages of technology development: A word2vec and convolutional neural network approach

Author

Listed:
  • Hong, Suckwon
  • Kim, Juram
  • Woo, Han-Gyun
  • Kim, Young-Choon
  • Lee, Changyong

Abstract

Previous patent-based methods for assessing the value of technological ideas face challenges in screening ideas in the early stages of technology development because they require information available at the time or after a patent is granted. Given that the technical descriptions of ideas are usually available in the early stages, we propose an analytical framework for screening ideas by associating the technical descriptions of ideas implied in patents with the number of patent forward citations as a proxy for the technological value of the ideas. Accordingly, word2vec is used to examine the semantic relationships among words and construct matrices representing the technical content of ideas implied in patents. A convolutional neural network is used to model the nonlinear relationships between the matrices and the number of patent forward citations. Once trained, the proposed analytical framework can screen early-stage ideas using only the technical descriptions of the ideas. We explore the varying performance of our framework across different analysis contexts and discuss the research implications for theory and practice. A case study covering 35,376 patents in pharmaceutical technology confirms that the proposed analytical framework identifies most ideas with little technological value and outperforms existing models in terms of accuracy and reliability.

Suggested Citation

  • Hong, Suckwon & Kim, Juram & Woo, Han-Gyun & Kim, Young-Choon & Lee, Changyong, 2022. "Screening ideas in the early stages of technology development: A word2vec and convolutional neural network approach," Technovation, Elsevier, vol. 112(C).
  • Handle: RePEc:eee:techno:v:112:y:2022:i:c:s0166497221001887
    DOI: 10.1016/j.technovation.2021.102407
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0166497221001887
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.technovation.2021.102407?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Manuel Trajtenberg, 1990. "A Penny for Your Quotes: Patent Citations and the Value of Innovations," RAND Journal of Economics, The RAND Corporation, vol. 21(1), pages 172-187, Spring.
    2. Bronwyn H. Hall & Adam Jaffe & Manuel Trajtenberg, 2005. "Market Value and Patent Citations," RAND Journal of Economics, The RAND Corporation, vol. 36(1), pages 16-38, Spring.
    3. Arts, Sam & Hou, Jianan & Gomez, Juan Carlos, 2021. "Natural language processing to identify the creation and impact of new technologies in patent text: Code, data, and new measures," Research Policy, Elsevier, vol. 50(2).
    4. Kim, Juram & Kim, Seungho & Lee, Changyong, 2019. "Anticipating technological convergence: Link prediction using Wikipedia hyperlinks," Technovation, Elsevier, vol. 79(C), pages 25-34.
    5. Sam Arts & Bruno Cassiman & Juan Carlos Gomez, 2018. "Text matching to measure patent similarity," Strategic Management Journal, Wiley Blackwell, vol. 39(1), pages 62-84, January.
    6. Lee, Changyong & Kwon, Ohjin & Kim, Myeongjung & Kwon, Daeil, 2018. "Early identification of emerging technologies: A machine learning approach using multiple patent indicators," Technological Forecasting and Social Change, Elsevier, vol. 127(C), pages 291-303.
    7. Dietmar Harhoff & Francis Narin & F. M. Scherer & Katrin Vopel, 1999. "Citation Frequency And The Value Of Patented Inventions," The Review of Economics and Statistics, MIT Press, vol. 81(3), pages 511-515, August.
    8. Lee, Changyong & Cho, Yangrae & Seol, Hyeonju & Park, Yongtae, 2012. "A stochastic patent citation analysis approach to assessing future technological impacts," Technological Forecasting and Social Change, Elsevier, vol. 79(1), pages 16-29.
    9. Lee, Changyong, 2021. "A review of data analytics in technological forecasting," Technological Forecasting and Social Change, Elsevier, vol. 166(C).
    10. Lee, Changyong & Kang, Bokyoung & Shin, Juneseuk, 2015. "Novelty-focused patent mapping for technology opportunity analysis," Technological Forecasting and Social Change, Elsevier, vol. 90(PB), pages 355-365.
    11. Briggs, Kristie, 2015. "Co-owner relationships conducive to high quality joint patents," Research Policy, Elsevier, vol. 44(8), pages 1566-1573.
    12. Dziallas, Marisa, 2020. "How to evaluate innovative ideas and concepts at the front-end?," Journal of Business Research, Elsevier, vol. 110(C), pages 502-518.
    13. Jang, Hyun Jin & Woo, Han-Gyun & Lee, Changyong, 2017. "Hawkes process-based technology impact analysis," Journal of Informetrics, Elsevier, vol. 11(2), pages 511-529.
    14. Sam Arts & Reinhilde Veugelers, 2015. "Technology familiarity, recombinant novelty, and breakthrough invention," Industrial and Corporate Change, Oxford University Press and the Associazione ICC, vol. 24(6), pages 1215-1246.
    15. Lee, Changyong & Jeon, Daeseong & Ahn, Joon Mo & Kwon, Ohjin, 2020. "Navigating a product landscape for technology opportunity analysis: A word2vec approach using an integrated patent-product database," Technovation, Elsevier, vol. 96.
    16. DiMasi, Joseph A. & Hansen, Ronald W. & Grabowski, Henry G., 2003. "The price of innovation: new estimates of drug development costs," Journal of Health Economics, Elsevier, vol. 22(2), pages 151-185, March.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Liu, Zhenfeng & Feng, Jian & Uden, Lorna, 2023. "Technology opportunity analysis using hierarchical semantic networks and dual link prediction," Technovation, Elsevier, vol. 128(C).
    2. Wang, Zongrun & Zhang, Taiyu & Ren, Xiaohang & Shi, Yukun, 2024. "AI adoption rate and corporate green innovation efficiency: Evidence from Chinese energy companies," Energy Economics, Elsevier, vol. 132(C).
    3. Kim, Juram & Lee, Gyumin & Lee, Seungbin & Lee, Changyong, 2022. "Towards expert–machine collaborations for technology valuation: An interpretable machine learning approach," Technological Forecasting and Social Change, Elsevier, vol. 183(C).
    4. Just, Julian, 2024. "Natural language processing for innovation search – Reviewing an emerging non-human innovation intermediary," Technovation, Elsevier, vol. 129(C).
    5. Chiarello, Filippo & Giordano, Vito & Spada, Irene & Barandoni, Simone & Fantoni, Gualtiero, 2024. "Future applications of generative large language models: A data-driven case study on ChatGPT," Technovation, Elsevier, vol. 133(C).
    6. Zhanfeng Wang & Lisha Yao & Xiaoyu Shao & Honghai Wang, 2023. "RETRACTED ARTICLE: A combination of TEXTCNN model and Bayesian classifier for microblog sentiment analysis," Journal of Combinatorial Optimization, Springer, vol. 45(4), pages 1-22, May.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Changyong Lee & Gyumin Lee, 2019. "Technology opportunity analysis based on recombinant search: patent landscape analysis for idea generation," Scientometrics, Springer;Akadémiai Kiadó, vol. 121(2), pages 603-632, November.
    2. Lee, Changyong, 2021. "A review of data analytics in technological forecasting," Technological Forecasting and Social Change, Elsevier, vol. 166(C).
    3. Chung, Park & Sohn, So Young, 2020. "Early detection of valuable patents using a deep learning model: Case of semiconductor industry," Technological Forecasting and Social Change, Elsevier, vol. 158(C).
    4. Changyong Lee & Suckwon Hong & Juram Kim, 2021. "Anticipating multi-technology convergence: a machine learning approach using patent information," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(3), pages 1867-1896, March.
    5. Kim, Juram & Lee, Gyumin & Lee, Seungbin & Lee, Changyong, 2022. "Towards expert–machine collaborations for technology valuation: An interpretable machine learning approach," Technological Forecasting and Social Change, Elsevier, vol. 183(C).
    6. Jeon, Daeseong & Ahn, Joon Mo & Kim, Juram & Lee, Changyong, 2022. "A doc2vec and local outlier factor approach to measuring the novelty of patents," Technological Forecasting and Social Change, Elsevier, vol. 174(C).
    7. Serkan Altuntas & Zulfiye Erdogan & Turkay Dereli, 2020. "A clustering-based approach for the evaluation of candidate emerging technologies," Scientometrics, Springer;Akadémiai Kiadó, vol. 124(2), pages 1157-1177, August.
    8. Manuel Acosta & Daniel Coronado & Esther Ferrándiz & Manuel Jiménez, 2022. "Effects of knowledge spillovers between competitors on patent quality: what patent citations reveal about a global duopoly," The Journal of Technology Transfer, Springer, vol. 47(5), pages 1451-1487, October.
    9. Sam Arts & Lee Fleming, 2018. "Paradise of Novelty—Or Loss of Human Capital? Exploring New Fields and Inventive Output," Organization Science, INFORMS, vol. 29(6), pages 1074-1092, December.
    10. Ren, Haiying & Zhao, Yuhui, 2021. "Technology opportunity discovery based on constructing, evaluating, and searching knowledge networks," Technovation, Elsevier, vol. 101(C).
    11. Liu, Zhenfeng & Feng, Jian & Uden, Lorna, 2023. "Technology opportunity analysis using hierarchical semantic networks and dual link prediction," Technovation, Elsevier, vol. 128(C).
    12. Kim, Juram & Hong, Suckwon & Kang, Yubin & Lee, Changyong, 2023. "Domain-specific valuation of university technologies using bibliometrics, Jonckheere–Terpstra tests, and data envelopment analysis," Technovation, Elsevier, vol. 122(C).
    13. Antonio Messeni Petruzzelli & Daniele Rotolo & Vito Albino, 2014. "Determinants of Patent Citations in Biotechnology: An Analysis of Patent Influence Across the Industrial and Organizational Boundaries," SPRU Working Paper Series 2014-05, SPRU - Science Policy Research Unit, University of Sussex Business School.
    14. Ron Boschma & Ernest Miguelez & Rosina Moreno & Diego B. Ocampo-Corrales, 2021. "Technological breakthroughs in European regions: the role of related and unrelated combinations," Papers in Evolutionary Economic Geography (PEEG) 2118, Utrecht University, Department of Human Geography and Spatial Planning, Group Economic Geography, revised Jun 2021.
    15. Yuan Zhou & Fang Dong & Yufei Liu & Liang Ran, 2021. "A deep learning framework to early identify emerging technologies in large-scale outlier patents: an empirical study of CNC machine tool," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(2), pages 969-994, February.
    16. Jeon, Daeseong & Lee, Junyoup & Ahn, Joon Mo & Lee, Changyong, 2023. "Measuring the novelty of scientific publications: A fastText and local outlier factor approach," Journal of Informetrics, Elsevier, vol. 17(4).
    17. Neil Gandal & Michal Shur-Ofry & Michael Crystal & Royee Shilony, 2021. "Out of sight: patents that have never been cited," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(4), pages 2903-2929, April.
    18. Wei Du & Yibo Wang & Wei Xu & Jian Ma, 2021. "A personalized recommendation system for high-quality patent trading by leveraging hybrid patent analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(12), pages 9369-9391, December.
    19. Youngjae Choi & Sanghyun Park & Sungjoo Lee, 2021. "Identifying emerging technologies to envision a future innovation ecosystem: A machine learning approach to patent data," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(7), pages 5431-5476, July.
    20. Lee, Changyong & Kwon, Ohjin & Kim, Myeongjung & Kwon, Daeil, 2018. "Early identification of emerging technologies: A machine learning approach using multiple patent indicators," Technological Forecasting and Social Change, Elsevier, vol. 127(C), pages 291-303.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:techno:v:112:y:2022:i:c:s0166497221001887. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.sciencedirect.com/science/journal/01664972 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.