IDEAS home Printed from https://ideas.repec.org/a/bla/jamest/v33y1982i5p285-293.html
   My bibliography  Save this article

The Theoretical Foundation of Zipf's Law and Its Application to the Bibliographic Database Environment

Author

Listed:
  • Jane Fedorowicz

Abstract

What does the frequency of occurrence of different words in an article have to do with the number of times an article is cited? Or, for that matter, with the number of publications an author has? All of these—word frequency, citation frequency, and publication frequency‐obey an ubiquitous distribution called Zipf's law. Zipf's law applies as well to such diverse subjects as income distribution, firm size, and biological genera and species. Zipf in 1949 described a hyperbolic rank‐frequency word distribution, which he fitted to a number of texts. He stated that if all unique words in a text are arranged (or ranked) in order of decreasing frequency of occurrence, the product of frequency times rank yields a constant which is approximately equal for all words in a text. The law has been shown to encompass many natural phenomena, and is equivalent to the distributions of Yule, Lotka, Pareto, Bradford, and Price. An ubiquitous empirical regularity suggests some universal principal. This article examines a number of theoretical derivations of the law, in order to show the relationship between the many attempts at ascertaining a theoretical justification for the phenomenon. We then briefly examine some of the ramifications of applying the law to the bibliographic database environment.

Suggested Citation

  • Jane Fedorowicz, 1982. "The Theoretical Foundation of Zipf's Law and Its Application to the Bibliographic Database Environment," Journal of the American Society for Information Science, Association for Information Science & Technology, vol. 33(5), pages 285-293, September.
  • Handle: RePEc:bla:jamest:v:33:y:1982:i:5:p:285-293
    DOI: 10.1002/asi.4630330507
    as

    Download full text from publisher

    File URL: https://doi.org/10.1002/asi.4630330507
    Download Restriction: no

    File URL: https://libkey.io/10.1002/asi.4630330507?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Agouzal, Abdellatif & Lafouge, Thierry & Bertin, Marc, 2024. "Relationship between the principle of least effort and the average cost of information in a zipfian context," Journal of Informetrics, Elsevier, vol. 18(1).
    2. Zahed Bigdeli & Morteza Kokabi & Gholam Reza Rajabi & Ali Gazni, 2013. "Patterns of authors’ information scattering: towards a causal explanation of information scattering from a scholarly information-seeking behavior perspective," Scientometrics, Springer;Akadémiai Kiadó, vol. 96(1), pages 103-131, July.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:jamest:v:33:y:1982:i:5:p:285-293. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www.asis.org .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.