IDEAS home Printed from https://ideas.repec.org/a/gam/jftint/v14y2022i1p25-d720733.html
   My bibliography  Save this article

Mobility in Unsupervised Word Embeddings for Knowledge Extraction—The Scholars’ Trajectories across Research Topics

Author

Listed:
  • Gianfranco Lombardo

    (Department of Engineering and Architecture (DIA), University of Parma, 43100 Parma, Italy
    These authors contributed equally to this work.)

  • Michele Tomaiuolo

    (Department of Engineering and Architecture (DIA), University of Parma, 43100 Parma, Italy
    These authors contributed equally to this work.)

  • Monica Mordonini

    (Department of Engineering and Architecture (DIA), University of Parma, 43100 Parma, Italy
    These authors contributed equally to this work.)

  • Gaia Codeluppi

    (Department of Engineering and Architecture (DIA), University of Parma, 43100 Parma, Italy
    These authors contributed equally to this work.)

  • Agostino Poggi

    (Department of Engineering and Architecture (DIA), University of Parma, 43100 Parma, Italy)

Abstract

In the knowledge discovery field of the Big Data domain the analysis of geographic positioning and mobility information plays a key role. At the same time, in the Natural Language Processing (NLP) domain pre-trained models such as BERT and word embedding algorithms such as Word2Vec enabled a rich encoding of words that allows mapping textual data into points of an arbitrary multi-dimensional space, in which the notion of proximity reflects an association among terms or topics. The main contribution of this paper is to show how analytical tools, traditionally adopted to deal with geographic data to measure the mobility of an agent in a time interval, can also be effectively applied to extract knowledge in a semantic realm, such as a semantic space of words and topics, looking for latent trajectories that can benefit the properties of neural network latent representations. As a case study, the Scopus database was queried about works of highly cited researchers in recent years. On this basis, we performed a dynamic analysis, for measuring the Radius of Gyration as an index of the mobility of researchers across scientific topics. The semantic space is built from the automatic analysis of the paper abstracts of each author. In particular, we evaluated two different methodologies to build the semantic space and we found that Word2Vec embeddings perform better than the BERT ones for this task. Finally, The scholars’ trajectories show some latent properties of this model, which also represent new scientific contributions of this work. These properties include ( i ) the correlation between the scientific mobility and the achievement of scientific results, measured through the H-index; ( ii ) differences in the behavior of researchers working in different countries and subjects; and ( iii ) some interesting similarities between mobility patterns in this semantic realm and those typically observed in the case of human mobility.

Suggested Citation

  • Gianfranco Lombardo & Michele Tomaiuolo & Monica Mordonini & Gaia Codeluppi & Agostino Poggi, 2022. "Mobility in Unsupervised Word Embeddings for Knowledge Extraction—The Scholars’ Trajectories across Research Topics," Future Internet, MDPI, vol. 14(1), pages 1-21, January.
  • Handle: RePEc:gam:jftint:v:14:y:2022:i:1:p:25-:d:720733
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/1999-5903/14/1/25/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1999-5903/14/1/25/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Marta C. González & César A. Hidalgo & Albert-László Barabási, 2009. "Understanding individual human mobility patterns," Nature, Nature, vol. 458(7235), pages 238-238, March.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Esra Gündoğan & Mehmet Kaya & Ali Daud, 2023. "Deep learning for journal recommendation system of research papers," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(1), pages 461-481, January.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jeong-Hui Park & Eunhye Yoo & Youngdeok Kim & Jung-Min Lee, 2021. "What Happened Pre- and during COVID-19 in South Korea? Comparing Physical Activity, Sleep Time, and Body Weight Status," IJERPH, MDPI, vol. 18(11), pages 1-13, May.
    2. Matteo Böhm & Mirco Nanni & Luca Pappalardo, 2022. "Gross polluters and vehicle emissions reduction," Nature Sustainability, Nature, vol. 5(8), pages 699-707, August.
    3. David Kofoed Wind & Piotr Sapiezynski & Magdalena Anna Furman & Sune Lehmann, 2016. "Inferring Stop-Locations from WiFi," PLOS ONE, Public Library of Science, vol. 11(2), pages 1-15, February.
    4. Zhou, Xingang & Yeh, Anthony G.O. & Yue, Yang, 2018. "Spatial variation of self-containment and jobs-housing balance in Shenzhen using cellphone big data," Journal of Transport Geography, Elsevier, vol. 68(C), pages 102-108.
    5. Zheng Yan & Wenqian Robertson & Yaosheng Lou & Tom W. Robertson & Sung Yong Park, 2021. "Finding leading scholars in mobile phone behavior: a mixed-method analysis of an emerging interdisciplinary field," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(12), pages 9499-9517, December.
    6. Duan, Zhengyu & Zhao, Haoran & Li, Zhenming, 2023. "Non-linear effects of built environment and socio-demographics on activity space," Journal of Transport Geography, Elsevier, vol. 111(C).
    7. Elisa Frutos-Bernal & Ángel Martín del Rey & Irene Mariñas-Collado & María Teresa Santos-Martín, 2022. "An Analysis of Travel Patterns in Barcelona Metro Using Tucker3 Decomposition," Mathematics, MDPI, vol. 10(7), pages 1-17, March.
    8. Zhai, Wei & Bai, Xueyin & Peng, Zhong-ren & Gu, Chaolin, 2019. "From edit distance to augmented space-time-weighted edit distance: Detecting and clustering patterns of human activities in Puget Sound region," Journal of Transport Geography, Elsevier, vol. 78(C), pages 41-55.
    9. Khajehnejad, Moein, 2019. "Efficiency of long-range navigation on Treelike fractals," Chaos, Solitons & Fractals, Elsevier, vol. 122(C), pages 102-110.
    10. Chaogui Kang & Yu Liu & Diansheng Guo & Kun Qin, 2015. "A Generalized Radiation Model for Human Mobility: Spatial Scale, Searching Direction and Trip Constraint," PLOS ONE, Public Library of Science, vol. 10(11), pages 1-11, November.
    11. Yifeng Liu & Yuan Lai, 2024. "Analyzing jogging activity patterns and adaptation to public health regulation," Environment and Planning B, , vol. 51(3), pages 670-688, March.
    12. Li, Ze-Tao & Nie, Wei-Peng & Cai, Shi-Min & Zhao, Zhi-Dan & Zhou, Tao, 2023. "Exploring the topological characteristics of urban trip networks based on taxi trajectory data," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 609(C).
    13. Claudio Gariazzo & Armando Pelliccioni & Maria Paola Bogliolo, 2019. "Spatiotemporal Analysis of Urban Mobility Using Aggregate Mobile Phone Derived Presence and Demographic Data: A Case Study in the City of Rome, Italy," Data, MDPI, vol. 4(1), pages 1-25, January.
    14. Han Wang & Damien Fay & Kenneth N. Brown & Liam Kilmartin, 2016. "Modelling revenue generation in a dynamically priced mobile telephony service," Telecommunication Systems: Modelling, Analysis, Design and Management, Springer, vol. 62(4), pages 711-734, August.
    15. Toru Nakamura & Toru Takumi & Atsuko Takano & Fumiyuki Hatanaka & Yoshiharu Yamamoto, 2013. "Characterization and Modeling of Intermittent Locomotor Dynamics in Clock Gene-Deficient Mice," PLOS ONE, Public Library of Science, vol. 8(3), pages 1-8, March.
    16. Fangye Du & Jiaoe Wang & Liang Mao & Jian Kang, 2024. "Daily rhythm of urban space usage: insights from the nexus of urban functions and human mobility," Palgrave Communications, Palgrave Macmillan, vol. 11(1), pages 1-10, December.
    17. Torsten Thalheim & Tyll Krüger & Jörg Galle, 2022. "Indirect Virus Transmission via Fomites Can Counteract Lock-Down Effectiveness," IJERPH, MDPI, vol. 19(21), pages 1-14, October.
    18. D. Woods & A. Cunningham & C. E. Utazi & M. Bondarenko & L. Shengjie & G. E. Rogers & P. Koper & C. W. Ruktanonchai & E. zu Erbach-Schoenberg & A. J. Tatem & J. Steele & A. Sorichetta, 2022. "Exploring methods for mapping seasonal population changes using mobile phone data," Palgrave Communications, Palgrave Macmillan, vol. 9(1), pages 1-17, December.
    19. Olle Järv & Kerli Müürisepp & Rein Ahas & Ben Derudder & Frank Witlox, 2015. "Ethnic differences in activity spaces as a characteristic of segregation: A study based on mobile phone usage in Tallinn, Estonia," Urban Studies, Urban Studies Journal Limited, vol. 52(14), pages 2680-2698, November.
    20. Francis Rathinam & Sayak Khatua & Zeba Siddiqui & Manya Malik & Pallavi Duggal & Samantha Watson & Xavier Vollenweider, 2021. "Using big data for evaluating development outcomes: A systematic map," Campbell Systematic Reviews, John Wiley & Sons, vol. 17(3), September.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jftint:v:14:y:2022:i:1:p:25-:d:720733. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.