IDEAS home Printed from https://ideas.repec.org/a/eee/csdana/v168y2022ics0167947321002061.html
   My bibliography  Save this article

Collaboration mechanisms and community detection of statisticians based on ERGMs and kNN-walktrap

Author

Listed:
  • Liu, Jie
  • Ge, Huilin

Abstract

Comprehensive information on coauthorship from 2014 to 2018 was gathered from four top statistical journals and subsequently cleaned to provide a review in the field from the perspective of a co-authorship network analysis. Data on productivity and trends, as well as a skew analysis of publications and collaborations, was provided by the analysis. The coauthorship network was analyzed for both global and individual properties. Exponential random graph models (ERGMs) were also used to explore the formation mechanisms of collaboration while simultaneously considering exogenous covariate effects and endogenous network structure processes. It was discovered that homophily (authors from the same universities and countries) and transitivity (the tendency to collaborate with a coauthor's coauthor) have a significant positive effect on the production of collaborative studies. Finally, the kNN-walktrap was proposed, which combines the structures of the network and the homophily features of authors to detect network communities. In this method, the cosine similarity calculated by the homophily features of the nodes is utilized to build a kNN (k Nearest Neighbor) network and apply walktrap to detect communities. Thus, more detailed and comprehensive community structures can be detected than when using the walktrap method. These results have practical significance for researching collaboration models and guiding future collaboration.

Suggested Citation

  • Liu, Jie & Ge, Huilin, 2022. "Collaboration mechanisms and community detection of statisticians based on ERGMs and kNN-walktrap," Computational Statistics & Data Analysis, Elsevier, vol. 168(C).
  • Handle: RePEc:eee:csdana:v:168:y:2022:i:c:s0167947321002061
    DOI: 10.1016/j.csda.2021.107372
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167947321002061
    Download Restriction: Full text for ScienceDirect subscribers only.

    File URL: https://libkey.io/10.1016/j.csda.2021.107372?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. John P A Ioannidis, 2008. "Measuring Co-Authorship and Networking-Adjusted Scientific Impact," PLOS ONE, Public Library of Science, vol. 3(7), pages 1-8, July.
    2. Garry Robins & Philippa Pattison & Stanley Wasserman, 1999. "Logit models and logistic regressions for social networks: III. Valued relations," Psychometrika, Springer;The Psychometric Society, vol. 64(3), pages 371-394, September.
    3. Hildrun Kretschmer, 2004. "Author productivity and geodesic distance in bibliographic co-authorship networks, and visibility on the Web," Scientometrics, Springer;Akadémiai Kiadó, vol. 60(3), pages 409-420, August.
    4. Erjia Yan & Ying Ding & Qinghua Zhu, 2010. "Mapping library and information science in China: a coauthorship network analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 83(1), pages 115-131, April.
    5. Cao, Jie & Bu, Zhan & Gao, Guangliang & Tao, Haicheng, 2016. "Weighted modularity optimization for crisp and fuzzy community detection in large-scale networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 462(C), pages 386-395.
    6. Stanley Wasserman & Philippa Pattison, 1996. "Logit models and logistic regressions for social networks: I. An introduction to Markov graphs andp," Psychometrika, Springer;The Psychometric Society, vol. 61(3), pages 401-425, September.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Manuel E. Sosa & Steven D. Eppinger & Craig M. Rowles, 2004. "The Misalignment of Product Architecture and Organizational Structure in Complex Product Development," Management Science, INFORMS, vol. 50(12), pages 1674-1689, December.
    2. M. Ausloos, 2013. "A scientometrics law about co-authors and their ranking: the co-author core," Scientometrics, Springer;Akadémiai Kiadó, vol. 95(3), pages 895-909, June.
    3. Krivitsky, Pavel N., 2017. "Using contrastive divergence to seed Monte Carlo MLE for exponential-family random graph models," Computational Statistics & Data Analysis, Elsevier, vol. 107(C), pages 149-161.
    4. Alessandro Lomi & Philippa Pattison, 2006. "Manufacturing Relations: An Empirical Study of the Organization of Production Across Multiple Networks," Organization Science, INFORMS, vol. 17(3), pages 313-332, June.
    5. Chu-Shore, Jesse, 2010. "Homogenization and Specialization Effects of International Trade: Are Cultural Goods Exceptional?," World Development, Elsevier, vol. 38(1), pages 37-47, January.
    6. Slobodan Kacanski & Dean Lusher, 2017. "The Application of Social Network Analysis to Accounting and Auditing," International Journal of Academic Research in Accounting, Finance and Management Sciences, Human Resource Management Academic Research Society, International Journal of Academic Research in Accounting, Finance and Management Sciences, vol. 7(3), pages 182-197, July.
    7. Vögtle, Eva Maria & Windzio, Michael, 2015. "The network of international student mobility: Enlargement and consolidation of the European transnational education space?," TranState Working Papers 190, University of Bremen, Collaborative Research Center 597: Transformations of the State.
    8. Tom Broekel & Pierre-Alexandre Balland & Martijn Burger & Frank Oort, 2014. "Modeling knowledge networks in economic geography: a discussion of four methods," The Annals of Regional Science, Springer;Western Regional Science Association, vol. 53(2), pages 423-452, September.
    9. Sebastian Spaeth & Sven Niederhöfer, 2022. "Compatibility promotion between platforms: The role of open technology standards and giant platforms," Electronic Markets, Springer;IIM University of St. Gallen, vol. 32(4), pages 1891-1915, December.
    10. Ivan Cucco, 2014. "Network-based policies and innovation networks in two Italian regions: a comparison through a social selection model," STUDI ECONOMICI, FrancoAngeli Editore, vol. 2014(114), pages 78-96.
    11. Bruce A Desmarais & Skyler J Cranmer, 2012. "Statistical Inference for Valued-Edge Networks: The Generalized Exponential Random Graph Model," PLOS ONE, Public Library of Science, vol. 7(1), pages 1-12, January.
    12. Johannes Pol, 2019. "Introduction to Network Modeling Using Exponential Random Graph Models (ERGM): Theory and an Application Using R-Project," Computational Economics, Springer;Society for Computational Economics, vol. 54(3), pages 845-875, October.
    13. He, Xi-jun & Dong, Yan-bo & Wu, Yu-ying & Jiang, Guo-rui & Zheng, Yao, 2019. "Factors affecting evolution of the interprovincial technology patent trade networks in China based on exponential random graph models," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 514(C), pages 443-457.
    14. Johan Koskinen & Peng Wang & Garry Robins & Philippa Pattison, 2018. "Outliers and Influential Observations in Exponential Random Graph Models," Psychometrika, Springer;The Psychometric Society, vol. 83(4), pages 809-830, December.
    15. Johannes VAN DER POL, 2016. "The modelling of networks using Exponential Random Graph Models: an introduction," Cahiers du GREThA (2007-2019) 2016-22, Groupe de Recherche en Economie Théorique et Appliquée (GREThA).
    16. Ashish Arora & Michelle Gittelman & Sarah Kaplan & John Lynch & Will Mitchell & Nicolaj Siggelkow & Ji Youn (Rose) Kim & Michael Howard & Emily Cox Pahnke & Warren Boeker, 2016. "Understanding network formation in strategy research: Exponential random graph models," Strategic Management Journal, Wiley Blackwell, vol. 37(1), pages 22-44, January.
    17. Olaf N. Rank & Garry L. Robins & Philippa E. Pattison, 2010. "Structural Logic of Intraorganizational Networks," Organization Science, INFORMS, vol. 21(3), pages 745-764, June.
    18. Johannes van Der Pol, 2017. "Introduction to network modeling using Exponential Random Graph models (ERGM)," Working Papers hal-01284994, HAL.
    19. Marian-Gabriel Hâncean & Matjaž Perc & Lazăr Vlăsceanu, 2014. "Fragmented Romanian Sociology: Growth and Structure of the Collaboration Network," PLOS ONE, Public Library of Science, vol. 9(11), pages 1-9, November.
    20. Ke Hu & Ju Xiang & Yun-Xia Yu & Liang Tang & Qin Xiang & Jian-Ming Li & Yong-Hong Tang & Yong-Jun Chen & Yan Zhang, 2020. "Significance-based multi-scale method for network community detection and its application in disease-gene prediction," PLOS ONE, Public Library of Science, vol. 15(3), pages 1-24, March.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:168:y:2022:i:c:s0167947321002061. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.