IDEAS home Printed from https://ideas.repec.org/a/hin/complx/1803525.html
   My bibliography  Save this article

An Incremental Kernel Density Estimator for Data Stream Computation

Author

Listed:
  • Yulin He
  • Jie Jiang
  • Dexin Dai
  • Klohoun Fabrice

Abstract

Probability density function ( p.d.f. ) estimation plays a very important role in the field of data mining. Kernel density estimator (KDE) is the mostly used technology to estimate the unknown p.d.f. for the given dataset. The existing KDEs are usually inefficient when handling the p.d.f. estimation problem for stream data because a bran-new KDE has to be retrained based on the combination of current data and newly coming data. This process increases the training time and wastes the computation resource. This article proposes an incremental kernel density estimator (I-KDE) which deals with the p.d.f. estimation problem in the way of data stream computation. The I-KDE updates the current KDE dynamically and gradually with the newly coming data rather than retraining the bran-new KDE with the combination of current data and newly coming data. The theoretical analysis proves the convergence of the I-KDE only if the estimated p.d.f. of newly coming data is convergent to its true p.d.f . In order to guarantee the convergence of the I-KDE, a new multivariate fixed-point iteration algorithm based on the unbiased cross validation (UCV) method is developed to determine the optimal bandwidth of the KDE. The experimental results on 10 univariate and 4 multivariate probability distributions demonstrate the feasibility and effectiveness of the I-KDE.

Suggested Citation

  • Yulin He & Jie Jiang & Dexin Dai & Klohoun Fabrice, 2020. "An Incremental Kernel Density Estimator for Data Stream Computation," Complexity, Hindawi, vol. 2020, pages 1-17, February.
  • Handle: RePEc:hin:complx:1803525
    DOI: 10.1155/2020/1803525
    as

    Download full text from publisher

    File URL: http://downloads.hindawi.com/journals/8503/2020/1803525.pdf
    Download Restriction: no

    File URL: http://downloads.hindawi.com/journals/8503/2020/1803525.xml
    Download Restriction: no

    File URL: https://libkey.io/10.1155/2020/1803525?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Tulaci Bhakti & Fernanda Rossi & Pedro Oliveira Mafia & Eduardo Franco Almeida & Maria Augusta Gonçalves Fujaco & Cristiano Schetini Azevedo, 2021. "Preservation of historical heritage increases bird biodiversity in urban centers," Environment, Development and Sustainability: A Multidisciplinary Approach to the Theory and Practice of Sustainable Development, Springer, vol. 23(6), pages 8758-8773, June.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:hin:complx:1803525. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Mohamed Abdelhakeem (email available below). General contact details of provider: https://www.hindawi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.