
A complement to lexical query’s search-term selection for emerging technologies: the case of “big data”

Author

Listed:
  • Santiago Ruiz-Navas

    (Tokyo Institute of Technology)

  • Kumiko Miyazaki

    (Tokyo Institute of Technology)

Abstract

Obtaining document sets to study emerging technologies is challenging. Researchers studying emerging technologies address this challenge with lexical queries, e.g., core, expanded and evolutionary queries. Creating lexical queries requires the selection of search-terms, which can be done with manual, automatic or semi-automatic techniques. The processes currently reported for selecting search-terms can be complemented by addressing two issues. One is the lack of a systematic process for selecting search-terms from previous literature, and the second is the lack of an evaluation of the candidate search-terms’ document retrieval interdependence. We propose two steps to complement the process of selecting search-terms to create lexical queries to study emerging technologies. The first step is a process to systematically select search-terms from previous literature. The second is an evaluation of the search-terms’ document retrieval interdependence, for which we propose the Significance of Interception Ratio (SIR). We tested the proposed steps using as a reference the big-data lexical query proposed by Huang et al. (Scientometrics 105:2005–2022, 2015). The test results show that the proposed steps can complement the current automatic methods for selecting search-terms. The first step increased the recall of the reference lexical query by around 24%. This increase was possible because of the addition of 37 search-terms and the elimination of three search-terms from the reference lexical query. In the second step (application of the SIR), five search-terms from the reference lexical query were optimized, showing a slight complementary ability when selecting search-terms.
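The two quantities the abstract relies on, the recall of a lexical query and the degree to which search-terms retrieve the same documents, can be illustrated with a minimal Python sketch. It is not the authors' method: the abstract does not define the SIR, so `overlap_share` below is a hypothetical stand-in measure, and the function names, document identifiers and toy data are all assumptions made for illustration.

```python
from typing import Dict, Set


def query_result(term_docs: Dict[str, Set[str]]) -> Set[str]:
    """Documents retrieved by an OR-query over all search-terms."""
    result: Set[str] = set()
    for docs in term_docs.values():
        result |= docs
    return result


def recall(retrieved: Set[str], relevant: Set[str]) -> float:
    """Share of the relevant documents that the query retrieved."""
    if not relevant:
        return 0.0
    return len(retrieved & relevant) / len(relevant)


def overlap_share(term: str, term_docs: Dict[str, Set[str]]) -> float:
    """Hypothetical interdependence measure (NOT the paper's SIR):
    share of a term's documents that other terms also retrieve."""
    own = term_docs[term]
    if not own:
        return 0.0
    others: Set[str] = set()
    for other, docs in term_docs.items():
        if other != term:
            others |= docs
    return len(own & others) / len(own)


# Toy data (assumed): documents retrieved by each search-term.
term_docs = {
    "big data": {"d1", "d2", "d3"},
    "hadoop": {"d2", "d3"},
    "mapreduce": {"d4"},
}
relevant = {"d1", "d2", "d3", "d4", "d5"}

print(recall(query_result(term_docs), relevant))  # 0.8
print(overlap_share("hadoop", term_docs))         # 1.0
print(overlap_share("mapreduce", term_docs))      # 0.0
```

In this toy data, "hadoop" retrieves nothing that the other terms do not already retrieve, so removing it leaves recall unchanged, whereas "mapreduce" contributes a document no other term finds. A term's retrieval interdependence is meant here in that sense only; the paper's own ratio may be defined differently.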

Suggested Citation

  • Santiago Ruiz-Navas & Kumiko Miyazaki, 2018. "A complement to lexical query’s search-term selection for emerging technologies: the case of “big data”," Scientometrics, Springer;Akadémiai Kiadó, vol. 117(1), pages 141-162, October.
  • Handle: RePEc:spr:scient:v:117:y:2018:i:1:d:10.1007_s11192-018-2857-9
    DOI: 10.1007/s11192-018-2857-9

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11192-018-2857-9
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.


    References listed on IDEAS

    1. Daniele Rotolo & Ismael Rafols & Michael M. Hopkins & Loet Leydesdorff, 2017. "Strategic intelligence on emerging technologies: Scientometric overlay mapping," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 68(1), pages 214-233, January.
    2. Park, Han Woo & Leydesdorff, Loet, 2013. "Decomposing social and semantic networks in emerging “big data” research," Journal of Informetrics, Elsevier, vol. 7(3), pages 756-765.
    3. Ying Huang & Jannik Schuehle & Alan L. Porter & Jan Youtie, 2015. "A systematic method to create search strategies for emerging technologies based on the Web of Science: illustrated for ‘Big Data’," Scientometrics, Springer;Akadémiai Kiadó, vol. 105(3), pages 2005-2022, December.
    4. Sanjay K. Arora & Alan L. Porter & Jan Youtie & Philip Shapira, 2013. "Capturing new developments in an emerging technology: an updated search strategy for identifying nanotechnology research outputs," Scientometrics, Springer;Akadémiai Kiadó, vol. 95(1), pages 351-370, April.
    5. Gustavo Cattelan Nobre & Elaine Tavares, 2017. "Scientific literature analysis on big data and internet of things applications on circular economy: a bibliometric study," Scientometrics, Springer;Akadémiai Kiadó, vol. 111(1), pages 463-492, April.
    6. Can Huang & Ad Notten & Nico Rasters, 2011. "Nanoscience and technology publications and patents: a review of social science studies and search strategies," The Journal of Technology Transfer, Springer, vol. 36(2), pages 145-172, April.
    7. Rotolo, Daniele & Hicks, Diana & Martin, Ben R., 2015. "What is an emerging technology?," Research Policy, Elsevier, vol. 44(10), pages 1827-1843.
    8. Morteza Maghrebi & Ali Abbasi & Saeid Amiri & Reza Monsefi & Ahad Harati, 2011. "A collective and abridged lexical query for delineation of nanotechnology publications," Scientometrics, Springer;Akadémiai Kiadó, vol. 86(1), pages 15-25, January.
    9. Mogoutov, Andrei & Kahane, Bernard, 2007. "Data search strategy for science and technology emergence: A scalable and evolutionary query for nanotechnology tracking," Research Policy, Elsevier, vol. 36(6), pages 893-903, July.
    10. Patricia Laurens & Michel Zitt & Elise Bassecoulard, 2010. "Delineation of the genomics field by hybrid citation-lexical methods: interaction with experts and validation process," Scientometrics, Springer;Akadémiai Kiadó, vol. 82(3), pages 647-662, March.
    11. Moghadasi, Shiva Imani & Ravana, Sri Devi & Raman, Sudharshan N., 2013. "Low-cost evaluation techniques for information retrieval systems: A review," Journal of Informetrics, Elsevier, vol. 7(2), pages 301-312.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Na Liu & Philip Shapira & Xiaoxu Yue, 2021. "Tracking developments in artificial intelligence research: constructing and applying a new search strategy," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(4), pages 3153-3192, April.
    2. Muñoz-Écija, Teresa & Vargas-Quesada, Benjamín & Chinchilla Rodríguez, Zaida, 2019. "Coping with methods for delineating emerging fields: Nanoscience and nanotechnology as a case study," Journal of Informetrics, Elsevier, vol. 13(4).
    3. Zhang, Yi & Huang, Ying & Porter, Alan L. & Zhang, Guangquan & Lu, Jie, 2019. "Discovering and forecasting interactions in big data research: A learning-enhanced bibliometric study," Technological Forecasting and Social Change, Elsevier, vol. 146(C), pages 795-807.
    4. Kwon, Seokbeom & Liu, Xiaoyu & Porter, Alan L. & Youtie, Jan, 2019. "Research addressing emerging technological ideas has greater scientific impact," Research Policy, Elsevier, vol. 48(9), pages 1-1.
    5. Ying Huang & Jannik Schuehle & Alan L. Porter & Jan Youtie, 2015. "A systematic method to create search strategies for emerging technologies based on the Web of Science: illustrated for ‘Big Data’," Scientometrics, Springer;Akadémiai Kiadó, vol. 105(3), pages 2005-2022, December.
    6. Philip Shapira & Seokbeom Kwon & Jan Youtie, 2017. "Tracking the emergence of synthetic biology," Scientometrics, Springer;Akadémiai Kiadó, vol. 112(3), pages 1439-1469, September.
    7. Porter, Alan L. & Garner, Jon & Carley, Stephen F. & Newman, Nils C., 2019. "Emergence scoring to identify frontier R&D topics and key players," Technological Forecasting and Social Change, Elsevier, vol. 146(C), pages 628-643.
    8. Coccia, Mario & Wang, Lili, 2015. "Path-breaking directions of nanotechnology-based chemotherapy and molecular cancer therapy," Technological Forecasting and Social Change, Elsevier, vol. 94(C), pages 155-169.
    9. Porter, Alan L. & Chiavetta, Denise & Newman, Nils C., 2020. "Measuring tech emergence: A contest," Technological Forecasting and Social Change, Elsevier, vol. 159(C).
    10. Sabatier, Mareva & Chollet, Barthélemy, 2017. "Is there a first mover advantage in science? Pioneering behavior and scientific production in nanotechnology," Research Policy, Elsevier, vol. 46(2), pages 522-533.
    11. Ahmad Barirani & Bruno Agard & Catherine Beaudry, 2013. "Discovering and assessing fields of expertise in nanomedicine: a patent co-citation network perspective," Scientometrics, Springer;Akadémiai Kiadó, vol. 94(3), pages 1111-1136, March.
    12. T. Gorjiara & C. Baldock, 2014. "Nanoscience and nanotechnology research publications: a comparison between Australia and the rest of the world," Scientometrics, Springer;Akadémiai Kiadó, vol. 100(1), pages 121-148, July.
    13. Wang, Zhinan & Porter, Alan L. & Wang, Xuefeng & Carley, Stephen, 2019. "An approach to identify emergent topics of technological convergence: A case study for 3D printing," Technological Forecasting and Social Change, Elsevier, vol. 146(C), pages 723-732.
    14. Tomaz Bartol & Karmen Stopar, 2015. "Nano language and distribution of article title terms according to power laws," Scientometrics, Springer;Akadémiai Kiadó, vol. 103(2), pages 435-451, May.
    15. Petersen, Alexander M. & Rotolo, Daniele & Leydesdorff, Loet, 2016. "A triple helix model of medical innovation: Supply, demand, and technological capabilities in terms of Medical Subject Headings," Research Policy, Elsevier, vol. 45(3), pages 666-681.
    16. Patrick Herron & Aashish Mehta & Cong Cao & Timothy Lenoir, 2016. "Research diversification and impact: the case of national nanoscience development," Scientometrics, Springer;Akadémiai Kiadó, vol. 109(2), pages 629-659, November.
    17. Burmaoglu, Serhat & Sartenaer, Olivier & Porter, Alan, 2019. "Conceptual definition of technology emergence: A long journey from philosophy of science to science policy," Technology in Society, Elsevier, vol. 59(C).
    18. Ying Huang & Wolfgang Glänzel & Lin Zhang, 2021. "Tracing the development of mapping knowledge domains," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(7), pages 6201-6224, July.
    19. Yun Liu & Zhe Yan & Yijie Cheng & Xuanting Ye, 2018. "Exploring the Technological Collaboration Characteristics of the Global Integrated Circuit Manufacturing Industry," Sustainability, MDPI, vol. 10(1), pages 1-23, January.
    20. Muhammad Omar & Arif Mehmood & Gyu Sang Choi & Han Woo Park, 2017. "Global mapping of artificial intelligence in Google and Google Scholar," Scientometrics, Springer;Akadémiai Kiadó, vol. 113(3), pages 1269-1305, December.

    More about this item

    Keywords

    Big-data; Emerging technologies; Science reproducibility; Lexical query expansion; Search-terms selection;

    JEL classification:

    • C60 - Mathematical and Quantitative Methods - - Mathematical Methods; Programming Models; Mathematical and Simulation Modeling - - - General
    • Q55 - Agricultural and Natural Resource Economics; Environmental and Ecological Economics - - Environmental Economics - - - Environmental Economics: Technological Innovation
    • C80 - Mathematical and Quantitative Methods - - Data Collection and Data Estimation Methodology; Computer Programs - - - General
    • O32 - Economic Development, Innovation, Technological Change, and Growth - - Innovation; Research and Development; Technological Change; Intellectual Property Rights - - - Management of Technological Innovation and R&D
