IDEAS home Printed from https://ideas.repec.org/a/igg/jsds00/v6y2015i3p1-17.html
   My bibliography  Save this article

Effective and Efficient Classification of Topically-Enriched Domain-Specific Text Snippets: The TETSC Method

Author

Listed:
  • Marco Spruit

    (Department of Information and Computer Sciences, Utrecht University, Utrecht, The Netherlands)

  • Bas Vlug

    (Department of Information and Computer Sciences, Utrecht University, Utrecht, The Netherlands)

Abstract

Due to the explosive growth in the amount of text snippets over the past few years and their sparsity of text, organizations are unable to effectively and efficiently classify them, missing out on business opportunities. This paper presents TETSC: the Topically-Enriched Text Snippet Classification method. TETSC aims to solve the classification problem for text snippets in any domain. TETSC recognizes that there are different types of text snippets and, therefore, allows for stop word removal, named-entity recognition, and topical enrichment for the different types of text snippets. TETSC has been implemented in the production systems of a personal finance organization, which resulted in a classification error reduction of over 21%. Highlights: The authors create the TETSC method for classifying topically-enriched text snippets; the authors differentiate between different types of text snippets; the authors show a successful application of Named-Entity Recognition to text snippets; using multiple enrichment strategies appears to reduce effectivity.

Suggested Citation

  • Marco Spruit & Bas Vlug, 2015. "Effective and Efficient Classification of Topically-Enriched Domain-Specific Text Snippets: The TETSC Method," International Journal of Strategic Decision Sciences (IJSDS), IGI Global, vol. 6(3), pages 1-17, July.
  • Handle: RePEc:igg:jsds00:v:6:y:2015:i:3:p:1-17
    as

    Download full text from publisher

    File URL: http://services.igi-global.com/resolvedoi/resolve.aspx?doi=10.4018/IJSDS.2015070101
    Download Restriction: no
    ---><---

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:igg:jsds00:v:6:y:2015:i:3:p:1-17. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Journal Editor (email available below). General contact details of provider: https://www.igi-global.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.