Statistical tests for ‘related records’ search results

Authors

  • Charles H. Smith

    (Western Kentucky University)

  • Patrick Georges

    (University of Ottawa)

  • Ngoc Nguyen

    (Western Kentucky University)

Abstract

Related records searching, now a common option within bibliographic databases, is applied to an individual result record as a secondary way of refining the retrieval set obtained from the primary subject search operation. In one approach, an individual result record is linked to other article records on the basis of the number of references cited they share in common, the theory being that two articles that cite many of the same sources are likely to be highly similar in subject content. Results of the secondary search are usually displayed in the order of each item’s actual number of commonly-shared references. In the present paper we suggest an improved way of ranking the results, employing statistical significance tests. We suggest two approaches, one involving a statistical test previously unknown in bibliometric circles, the binomial index of dispersion, and the other employing the more familiar centralized cosine measure; these turn out to produce nearly identical results. An example demonstrating the application of these measures, and contrasting such with the use of raw totals, is provided. In the example the results rankings are found to be only modestly (positively) correlated, suggesting that much information is lost to the user when raw totals alone are made the basis for ordering results.
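The contrast the abstract draws between ranking by raw shared-reference totals and ranking by a centralized cosine can be illustrated with a minimal sketch. It treats each article's reference list as a binary vector over a common pool of references and computes the mean-centered cosine of two such vectors, which for binary data equals the Pearson (phi) correlation. The abstract gives no formulas, so this is not necessarily the authors' exact formulation: the function names, the toy reference sets, and the pool size `POOL_SIZE` are all assumptions made for illustration. The binomial index of dispersion is not reproduced here because the abstract does not define it.

```python
from math import sqrt


def shared_count(seed, other):
    """Raw total: number of cited references the two records have in common."""
    return len(seed & other)


def centered_cosine(seed, other, pool_size):
    """Mean-centered cosine of two binary reference vectors of length pool_size.

    For binary vectors this equals the phi (Pearson) coefficient:
    (N*a - n1*n2) / sqrt(n1*(N - n1)*n2*(N - n2)).
    """
    a = len(seed & other)
    n1, n2 = len(seed), len(other)
    denom = sqrt(n1 * (pool_size - n1) * n2 * (pool_size - n2))
    if denom == 0:
        return 0.0  # undefined when a vector is all zeros or all ones
    return (pool_size * a - n1 * n2) / denom


# Hypothetical data: references are integer IDs drawn from a pool of 200.
POOL_SIZE = 200  # assumed size of the reference universe
seed = set(range(10))  # the seed record cites references 0..9
candidates = {
    "A": set(range(5)) | set(range(100, 135)),  # 40 references, 5 shared
    "B": set(range(4)) | {150, 151},            # 6 references, 4 shared
}

by_raw = sorted(
    candidates, key=lambda k: shared_count(seed, candidates[k]), reverse=True
)
by_cosine = sorted(
    candidates,
    key=lambda k: centered_cosine(seed, candidates[k], POOL_SIZE),
    reverse=True,
)

print("raw totals:      ", by_raw)
print("centered cosine: ", by_cosine)
```

In this toy case the record with the long reference list (A) ranks first on raw totals, while the short, tightly overlapping record (B) ranks first once list lengths are taken into account. The result depends on the assumed pool size, which in a real database would have to be chosen deliberately.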

Suggested Citation

  • Charles H. Smith & Patrick Georges & Ngoc Nguyen, 2015. "Statistical tests for ‘related records’ search results," Scientometrics, Springer;Akadémiai Kiadó, vol. 105(3), pages 1665-1677, December.
  • Handle: RePEc:spr:scient:v:105:y:2015:i:3:d:10.1007_s11192-015-1610-x
    DOI: 10.1007/s11192-015-1610-x

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11192-015-1610-x
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.



    Citations

    Cited by:

    1. Patrick Georges, 2017. "Western classical music development: a statistical analysis of composers similarity, differentiation and evolution," Scientometrics, Springer;Akadémiai Kiadó, vol. 112(1), pages 21-53, July.
    2. Patrick Georges & Ngoc Nguyen, 2019. "Visualizing music similarity: clustering and mapping 500 classical music composers," Scientometrics, Springer;Akadémiai Kiadó, vol. 120(3), pages 975-1003, September.
    3. Müge Akbulut & Yaşar Tonta & Howard D. White, 2020. "Related records retrieval and pennant retrieval: an exploratory case study," Scientometrics, Springer;Akadémiai Kiadó, vol. 122(2), pages 957-987, February.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Dangzhi Zhao & Andreas Strotmann, 2020. "Telescopic and panoramic views of library and information science research 2011–2018: a comparison of four weighting schemes for author co-citation analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 124(1), pages 255-270, July.
    2. Tahamtan, Iman & Bornmann, Lutz, 2018. "Core elements in the process of citing publications: Conceptual overview of the literature," Journal of Informetrics, Elsevier, vol. 12(1), pages 203-216.
    3. Marc Bertin & Iana Atanassova & Cassidy R. Sugimoto & Vincent Lariviere, 2016. "The linguistic patterns and rhetorical structure of citation context: an approach using n-grams," Scientometrics, Springer;Akadémiai Kiadó, vol. 109(3), pages 1417-1434, December.
    4. Toluwase Victor Asubiaro & Isola Ajiferuke, 2022. "Semantic similarity-based credit attribution on citation paths: a method for allocating residual citation to and investigating depth of influence of scientific communications," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(11), pages 6257-6277, November.
