IDEAS home Printed from https://ideas.repec.org/a/spr/compst/v29y2014i1p81-95.html
   My bibliography  Save this article

Identifying predictive hubs to condense the training set of $$k$$ -nearest neighbour classifiers

Author

Listed:
  • Ludwig Lausser
  • Christoph Müssel
  • Alexander Melkozerov
  • Hans Kestler

Abstract

The $$k$$ -Nearest Neighbour classifier is widely used and popular due to its inherent simplicity and the avoidance of model assumptions. Although the approach has been shown to yield a near-optimal classification performance for an infinite number of samples, a selection of the most decisive data points can improve the classification accuracy considerably in real settings with a limited number of samples. At the same time, a selection of a subset of representative training samples reduces the required amount of storage and computational resources. We devised a new approach that selects a representative training subset on the basis of an evolutionary optimization procedure. This method chooses those training samples that have a strong influence on the correct prediction of other training samples, in particular those that have uncertain labels. The performance of the algorithm is evaluated on different data sets. Additionally, we provide graphical examples of the selection procedure. Copyright Springer-Verlag Berlin Heidelberg 2014

Suggested Citation

  • Ludwig Lausser & Christoph Müssel & Alexander Melkozerov & Hans Kestler, 2014. "Identifying predictive hubs to condense the training set of $$k$$ -nearest neighbour classifiers," Computational Statistics, Springer, vol. 29(1), pages 81-95, February.
  • Handle: RePEc:spr:compst:v:29:y:2014:i:1:p:81-95
    DOI: 10.1007/s00180-012-0379-0
    as

    Download full text from publisher

    File URL: http://hdl.handle.net/10.1007/s00180-012-0379-0
    Download Restriction: Access to full text is restricted to subscribers.

    File URL: https://libkey.io/10.1007/s00180-012-0379-0?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Harald Binder & Hans Kestler & Matthias Schmid, 2014. "Proceedings of Reisensburg 2011," Computational Statistics, Springer, vol. 29(1), pages 1-2, February.
    2. Asma Gul & Aris Perperoglou & Zardad Khan & Osama Mahmoud & Miftahuddin Miftahuddin & Werner Adler & Berthold Lausen, 2018. "Ensemble of a subset of kNN classifiers," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 12(4), pages 827-840, December.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:compst:v:29:y:2014:i:1:p:81-95. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.