IDEAS home Printed from https://ideas.repec.org/a/oup/indcch/v29y2020i1p61-80..html
   My bibliography  Save this article

Explaining and predicting the impact of authors within a community: an assessment of the bibliometric literature and application of machine learning

Author

Listed:
  • Sen Chai
  • Alexander D’Amour
  • Lee Fleming

Abstract

Following widespread availability of computerized databases, much research has correlated bibliometric measures from papers or patents to subsequent success, typically measured as the number of publications or citations. Building on this large body of work, we ask the following questions: given available bibliometric information in one year, along with the combined theories on sources of creative breakthroughs from the literatures on creativity and innovation, how accurately can we explain the impact of authors in a given research community in the following year? In particular, who is most likely to publish, publish highly cited work, and even publish a highly cited outlier? And, how accurately can these existing theories predict breakthroughs using only contemporaneous data? After reviewing and synthesizing (often competing) theories from the literatures, we simultaneously model the collective hypotheses based on available data in the year before RNA interference was discovered. We operationalize author impact using publication count, forward citations, and the more stringent definition of being in the top decile of the citation distribution. Explanatory power of current theories altogether ranges from less than 9% for being top cited to 24% for productivity. Machine learning (ML) methods yield similar findings as the explanatory linear models, and tangible improvement only for non-linear Support Vector Machine models. We also perform predictions using only existing data until 1997, and find lower predictability than using explanatory models. We conclude with an agenda for future progress in the bibliometric study of creativity and look forward to ML research that can explain its models.

Suggested Citation

  • Sen Chai & Alexander D’Amour & Lee Fleming, 2020. "Explaining and predicting the impact of authors within a community: an assessment of the bibliometric literature and application of machine learning," Industrial and Corporate Change, Oxford University Press and the Associazione ICC, vol. 29(1), pages 61-80.
  • Handle: RePEc:oup:indcch:v:29:y:2020:i:1:p:61-80.
    as

    Download full text from publisher

    File URL: http://hdl.handle.net/10.1093/icc/dtz042
    Download Restriction: Access to full text is restricted to subscribers.
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Zhou, Sifan & Chai, Sen & Freeman, Richard B., 2024. "Gender homophily: In-group citation preferences and the gender disadvantage," Research Policy, Elsevier, vol. 53(1).

    More about this item

    JEL classification:

    • O31 - Economic Development, Innovation, Technological Change, and Growth - - Innovation; Research and Development; Technological Change; Intellectual Property Rights - - - Innovation and Invention: Processes and Incentives
    • O40 - Economic Development, Innovation, Technological Change, and Growth - - Economic Growth and Aggregate Productivity - - - General

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:oup:indcch:v:29:y:2020:i:1:p:61-80.. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Oxford University Press (email available below). General contact details of provider: https://academic.oup.com/icc .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.