IDEAS home Printed from https://ideas.repec.org/a/ids/injdan/v9y2017i3p207-221.html
   My bibliography  Save this article

A feature-based selection technique for reduction of large scale data

Author

Listed:
  • Ritu Chauhan
  • Harleen Kaur

Abstract

The inflated development in public healthcare domain has forced numerous organisations to construct and maintain large scale databases or data warehouses. However, the prediction of knowledge should be an automated process to discover hidden information from large scale databases. The elaborated studies in the past suggest that minimum interesting variables can determine qualified information while preserving information among the data. In addition, it is determined that large scale databases usually comprise of redundant and irrelevant features which have proven to be a major setback for efficient and effective analysis of data. This paper intends to provide an integrated approach by utilising machine learning technique and other convention statistical techniques for extraction of information from large scale databases. In the formulated approach, we have potentially exploited two approaches where the first approach emphasises on retrieval of feature subsets using MODTree filtering technique from discretised datasets with relative application domain on real datasets of Substance Abuse and Mental Health Data Archive (SAMHDA) collected from different states of USA. The second phase of study exploits statistical techniques on potential targets for discovery of interesting information from reduced datasets. We present a novel perspective using feature selection and statistical techniques for determination of knowledge from large scale databases.

Suggested Citation

  • Ritu Chauhan & Harleen Kaur, 2017. "A feature-based selection technique for reduction of large scale data," International Journal of Data Analysis Techniques and Strategies, Inderscience Enterprises Ltd, vol. 9(3), pages 207-221.
  • Handle: RePEc:ids:injdan:v:9:y:2017:i:3:p:207-221
    as

    Download full text from publisher

    File URL: http://www.inderscience.com/link.php?id=86630
    Download Restriction: Access to full text is restricted to subscribers.
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:ids:injdan:v:9:y:2017:i:3:p:207-221. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sarah Parker (email available below). General contact details of provider: http://www.inderscience.com/browse/index.php?journalID=282 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.