IDEAS home Printed from https://ideas.repec.org/h/wsi/wschap/9789812707482_0017.html
   My bibliography  Save this book chapter

From A Property Of The Average Of Fractions To A Text-Processing Interface

In: Creating Collaborative Advantage Through Knowledge And Innovation

Author

Listed:
  • Guillermo Oyarce

    (Texas Center for Digital Knowledge, School of Library and Information Sciences, University of North Texas, P.O. Box 311068, Denton, TX 76203, USA)

Abstract

The average of two natural numbers always falls between those two numbers. Partitioning a document set into two non overlapping subsets, the words in the set will appear only in one subset or on both. These properties can be used to present users with choices that can allow them to build a phrase where chosen terms have context. The average frequency of a term can be used to study relevance by comparing it to the same term's average in the relevant and the non-relevant subsets. A coefficient of variability (VAR) is defined as the normalized distance between these two values. The vast majority of words seem not to be significant because VAR as their relative comparative value is minimal. But a few words show very high values. This could be exploited by a system to find strong word instances that represent relevant concepts. It is possible to imagine an iterative procedure based on these properties through which a user identifies significant words with high VAR values. Such procedure would be desirable for a diversity of text-related computer-based tasks such as content analysis, thesauri construction, data mining, computer-based indexing and feature selection. A software instance to help users build context has been developed as a prototype to show the concept. Knowledge is always related to a given context and requires a support structure which has some cognitive elements such as other knowledge, data, concepts, information, etc. Using objective and subjective measures, users derive conceptual relationships. Users gauge and build topical relevance by engaging the system, which can then offer more suggestions. There is great advantage in reducing cognitive load and extraneous information. Users deal directly with easier to identify words, phrases and their combinations to form information capsules.

Suggested Citation

  • Guillermo Oyarce, 2007. "From A Property Of The Average Of Fractions To A Text-Processing Interface," World Scientific Book Chapters, in: Suliman Hawamdeh (ed.), Creating Collaborative Advantage Through Knowledge And Innovation, chapter 17, pages 263-277, World Scientific Publishing Co. Pte. Ltd..
  • Handle: RePEc:wsi:wschap:9789812707482_0017
    as

    Download full text from publisher

    File URL: https://www.worldscientific.com/doi/pdf/10.1142/9789812707482_0017
    Download Restriction: Ebook Access is available upon purchase.

    File URL: https://www.worldscientific.com/doi/abs/10.1142/9789812707482_0017
    Download Restriction: Ebook Access is available upon purchase.
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:wsi:wschap:9789812707482_0017. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Tai Tone Lim (email available below). General contact details of provider: http://www.worldscientific.com/page/worldscibooks .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.