IDEAS home Printed from https://ideas.repec.org/a/spr/annopr/v288y2020i1d10.1007_s10479-019-03508-4.html
   My bibliography  Save this article

New node anomaly detection algorithm based on nonnegative matrix factorization for directed citation networks

Author

Listed:
  • Ali Tosyali

    (University of Delaware)

  • Jinho Kim

    (Rutgers University)

  • Jeongsub Choi

    (Rutgers University)

  • Yunyi Kang

    (Arizona State University)

  • Myong K. Jeong

    (Rutgers University)

Abstract

Outlier detection is a crucial task for network data analysis, which identifies abnormal entities that deviate from the rest of the dataset. Ranking in outlierness is often used for identifying abnormal nodes in directed citation networks containing citation relationship among nodes. A challenging issue in outlier ranking is how to leverage the rich graph data of complex citation networks. In this paper, we propose a cluster-based outlier score function to identify outliers in citation networks based on nonnegative matrix factorization (NMF). We first represent the citation data as a directed graph, and cluster the directed graph into logical groupings of nodes using NMF. Based on the clustering results, we obtain the outlier score and ranking for each node using the proposed outlier scoring function. The proposed method leverages the direct and indirect citation links between nodes to measure the graph-based outlierness. We validate the proposed outlier ranking method using small artificial dataset and the real-world U.S. patent data.

Suggested Citation

  • Ali Tosyali & Jinho Kim & Jeongsub Choi & Yunyi Kang & Myong K. Jeong, 2020. "New node anomaly detection algorithm based on nonnegative matrix factorization for directed citation networks," Annals of Operations Research, Springer, vol. 288(1), pages 457-474, May.
  • Handle: RePEc:spr:annopr:v:288:y:2020:i:1:d:10.1007_s10479-019-03508-4
    DOI: 10.1007/s10479-019-03508-4
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10479-019-03508-4
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10479-019-03508-4?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Jacques Michel & Bernd Bettels, 2001. "Patent citation analysis.A closer look at the basic input data from patent search reports," Scientometrics, Springer;Akadémiai Kiadó, vol. 51(1), pages 185-201, April.
    2. Rajiv D. Banker & Hsihui Chang & Zhiqiang Zheng, 2017. "On the use of super-efficiency procedures for ranking efficient units and identifying outliers," Annals of Operations Research, Springer, vol. 250(1), pages 21-35, March.
    3. Lian Duan & Lida Xu & Ying Liu & Jun Lee, 2009. "Cluster-based outlier detection," Annals of Operations Research, Springer, vol. 168(1), pages 151-168, April.
    4. Sepideh Kaffash & Marianna Marra, 2017. "Data envelopment analysis in financial services: a citations network analysis of banks, insurance companies and money market funds," Annals of Operations Research, Springer, vol. 253(1), pages 307-344, June.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Henri A. Schildt & Markku V.J. Maula & Thomas Keil, 2005. "Explorative and Exploitative Learning from External Corporate Ventures," Entrepreneurship Theory and Practice, , vol. 29(4), pages 493-515, July.
    2. Helmi Hammami & Thanh Ngo & David Tripe & Dinh-Tri Vo, 2022. "Ranking with a Euclidean common set of weights in data envelopment analysis: with application to the Eurozone banking sector," Annals of Operations Research, Springer, vol. 311(2), pages 675-694, April.
    3. Su, Hsin-Ning & Moaniba, Igam M., 2017. "Investigating the dynamics of interdisciplinary evolution in technology developments," Technological Forecasting and Social Change, Elsevier, vol. 122(C), pages 12-23.
    4. Emanuele Bacchiocchi & Fabio Montobbio, 2010. "International Knowledge Diffusion and Home‐bias Effect: Do USPTO and EPO Patent Citations Tell the Same Story?," Scandinavian Journal of Economics, Wiley Blackwell, vol. 112(3), pages 441-470, September.
    5. Gaétan de Rassenfosse & Adam B. Jaffe, 2018. "Are patent fees effective at weeding out low‐quality patents?," Journal of Economics & Management Strategy, Wiley Blackwell, vol. 27(1), pages 134-148, March.
    6. Inchae Park & Yujin Jeong & Byungun Yoon, 2017. "Analyzing the value of technology based on the differences of patent citations between applicants and examiners," Scientometrics, Springer;Akadémiai Kiadó, vol. 111(2), pages 665-691, May.
    7. Zervopoulos, Panagiotis & Emrouznejad, Ali & Sklavos, Sokratis, 2019. "A Bayesian approach for correcting bias of data envelopment analysis estimators," MPRA Paper 91886, University Library of Munich, Germany.
    8. Katja Rost, 2006. "Der Einfluss von Erfindernetzwerken auf die Relevanz von Patenten," Schmalenbach Journal of Business Research, Springer, vol. 58(3), pages 363-389, May.
    9. Marek Śmieja & Magdalena Wiercioch, 2017. "Constrained clustering with a complex cluster structure," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 11(3), pages 493-518, September.
    10. Harhoff, Dietmar & Wagner, Stefan, 2005. "Modelling the duration of patent examination at the European Patent Office," CEPR Discussion Papers 5283, C.E.P.R. Discussion Papers.
    11. Kaffash, Sepideh & Azizi, Roza & Huang, Ying & Zhu, Joe, 2020. "A survey of data envelopment analysis applications in the insurance industry 1993–2018," European Journal of Operational Research, Elsevier, vol. 284(3), pages 801-813.
    12. Goto, Akira & Motohashi, Kazuyuki, 2007. "Construction of a Japanese Patent Database and a first look at Japanese patenting activities," Research Policy, Elsevier, vol. 36(9), pages 1431-1442, November.
    13. Julie Callaert & Maikel Pellens & Bart Looy, 2014. "Sources of inspiration? Making sense of scientific references in patents," Scientometrics, Springer;Akadémiai Kiadó, vol. 98(3), pages 1617-1629, March.
    14. Dietmar Harhoff & Georg von Graevenitz & Stefan Wagner, 2016. "Conflict Resolution, Public Goods, and Patent Thickets," Management Science, INFORMS, vol. 62(3), pages 704-721, March.
    15. Wagner, Marcus, 2011. "To explore or to exploit? An empirical investigation of acquisitions by large incumbents," Research Policy, Elsevier, vol. 40(9), pages 1217-1225.
    16. Rakin Abrar & Showmitra Kumar Sarkar & Kashfia Tasnim Nishtha & Swapan Talukdar & Shahfahad & Atiqur Rahman & Abu Reza Md Towfiqul Islam & Amir Mosavi, 2022. "Assessing the Spatial Mapping of Heat Vulnerability under Urban Heat Island (UHI) Effect in the Dhaka Metropolitan Area," Sustainability, MDPI, vol. 14(9), pages 1-24, April.
    17. Jing-Yuan Chio & Laura Magazzini & Fabio Pammolli & Massimo Riccaboni, 2012. "The Value of Failures in Pharmaceutical R&D," Working Papers 1/2012, IMT School for Advanced Studies Lucca, revised Jan 2012.
    18. Stefan Wagner, 2008. "Business Method Patents In Europe And Their Strategic Use—Evidence From Franking Device Manufacturers," Economics of Innovation and New Technology, Taylor & Francis Journals, vol. 17(3), pages 173-194.
    19. Alfons Palangkaraya, 2010. "Patent Application Databases," Australian Economic Review, The University of Melbourne, Melbourne Institute of Applied Economic and Social Research, vol. 43(1), pages 77-87, March.
    20. Lazaridis, George & van Pottelsberghe de la Potterie, Bruno, 2007. "The rigour of EPO's patentability criteria: An insight into the "induced withdrawals"," World Patent Information, Elsevier, vol. 29(4), pages 317-326, December.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:annopr:v:288:y:2020:i:1:d:10.1007_s10479-019-03508-4. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.