IDEAS home Printed from https://ideas.repec.org/a/eee/csdana/v54y2010i6p1484-1504.html
   My bibliography  Save this article

Optimized fixed-size kernel models for large data sets

Author

Listed:
  • De Brabanter, K.
  • De Brabanter, J.
  • Suykens, J.A.K.
  • De Moor, B.

Abstract

A modified active subset selection method based on quadratic Rényi entropy and a fast cross-validation for fixed-size least squares support vector machines is proposed for classification and regression with optimized tuning process. The kernel bandwidth of the entropy based selection criterion is optimally determined according to the solve-the-equation plug-in method. Also a fast cross-validation method based on a simple updating scheme is developed. The combination of these two techniques is suitable for handling large scale data sets on standard personal computers. Finally, the performance on test data and computational time of this fixed-size method are compared to those for standard support vector machines and [nu]-support vector machines resulting in sparser models with lower computational cost and comparable accuracy.

Suggested Citation

  • De Brabanter, K. & De Brabanter, J. & Suykens, J.A.K. & De Moor, B., 2010. "Optimized fixed-size kernel models for large data sets," Computational Statistics & Data Analysis, Elsevier, vol. 54(6), pages 1484-1504, June.
  • Handle: RePEc:eee:csdana:v:54:y:2010:i:6:p:1484-1504
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167-9473(10)00039-3
    Download Restriction: Full text for ScienceDirect subscribers only.
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Hall, Peter & Wolff, Rodney C. L. & Yao, Qiwei, 1999. "Methods for estimating a conditional distribution function," LSE Research Online Documents on Economics 6631, London School of Economics and Political Science, LSE Library.
    2. Fan, Jianqing & Yao, Qiwei & Tong, Howell, 1996. "Estimation of conditional densities and sensitivity measures in nonlinear dynamical systems," LSE Research Online Documents on Economics 6704, London School of Economics and Political Science, LSE Library.
    3. Liu, Yufeng & Helen Zhang, Hao & Park, Cheolwoo & Ahn, Jeongyoun, 2007. "Support vector machines with adaptive Lq penalty," Computational Statistics & Data Analysis, Elsevier, vol. 51(12), pages 6380-6394, August.
    4. Qi Li & Jeffrey Scott Racine, 2006. "Nonparametric Econometrics: Theory and Practice," Economics Books, Princeton University Press, edition 1, volume 1, number 8355.
    5. Huang, Chien-Ming & Lee, Yuh-Jye & Lin, Dennis K.J. & Huang, Su-Yun, 2007. "Model selection for support vector machines via uniform design," Computational Statistics & Data Analysis, Elsevier, vol. 52(1), pages 335-346, September.
    6. Sheather, Simon J., 1986. "An improved data-based algorithm for choosing the window width when estimating the density at a point," Computational Statistics & Data Analysis, Elsevier, vol. 4(1), pages 61-65, June.
    7. L. Ingber, 1989. "Very fast simulated re-annealing," Lester Ingber Papers 89vf, Lester Ingber.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. De Brabanter, Kris & Suykens, Johan & De Moor, Bart, 2013. "Nonparametric Regression via StatLSSVM," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 55(i02).
    2. Luts, Jan & Molenberghs, Geert & Verbeke, Geert & Van Huffel, Sabine & Suykens, Johan A.K., 2012. "A mixed effects least squares support vector machine model for classification of longitudinal data," Computational Statistics & Data Analysis, Elsevier, vol. 56(3), pages 611-628.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Patrick Saart & Jiti Gao & Nam Hyun Kim, 2014. "Semiparametric methods in nonlinear time series analysis: a selective review," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 26(1), pages 141-169, March.
    2. Dingshi Tian & Zongwu Cai & Ying Fang, 2018. "Econometric Modeling of Risk Measures: A Selective Review of the Recent Literature," WORKING PAPERS SERIES IN THEORETICAL AND APPLIED ECONOMICS 201807, University of Kansas, Department of Economics, revised Oct 2018.
    3. Frandsen, Brigham R. & Frölich, Markus & Melly, Blaise, 2012. "Quantile treatment effects in the regression discontinuity design," Journal of Econometrics, Elsevier, vol. 168(2), pages 382-395.
    4. Luke Taylor & Taisuke Otsu, 2019. "Estimation of nonseparable models with censored dependent variables and endogenous regressors," Econometric Reviews, Taylor & Francis Journals, vol. 38(1), pages 4-24, January.
    5. Ruiz-Castillo, Javier, 2012. "From the “European Paradox” to a European Drama in citation impact," UC3M Working papers. Economics we1211, Universidad Carlos III de Madrid. Departamento de Economía.
    6. Bhattacharya, Debopam & Dupas, Pascaline, 2012. "Inferring welfare maximizing treatment assignment under budget constraints," Journal of Econometrics, Elsevier, vol. 167(1), pages 168-196.
    7. Dette, Holger & Volgushev, Stanislav, 2007. "Non-crossing nonparametric estimates of quantile curves," Technical Reports 2007,18, Technische Universität Dortmund, Sonderforschungsbereich 475: Komplexitätsreduktion in multivariaten Datenstrukturen.
    8. Jianqing Fan, 2004. "A selective overview of nonparametric methods in financial econometrics," Papers math/0411034, arXiv.org.
    9. Roberto Basile, 2010. "Intra-distribution dynamics of regional per-capita income in Europe: evidence from alternative conditional density estimators," Statistica, Department of Statistics, University of Bologna, vol. 70(1), pages 3-22.
    10. Xu, Ke-Li & Phillips, Peter C. B., 2011. "Tilted Nonparametric Estimation of Volatility Functions With Empirical Applications," Journal of Business & Economic Statistics, American Statistical Association, vol. 29(4), pages 518-528.
    11. François Gerard & Miikka Rokkanen & Christoph Rothe, 2020. "Bounds on treatment effects in regression discontinuity designs with a manipulated running variable," Quantitative Economics, Econometric Society, vol. 11(3), pages 839-870, July.
    12. Valentina Corradi & Norman Swanson & Walter Distaso, 2006. "Predictive Inference for Integrated Volatility," Departmental Working Papers 200616, Rutgers University, Department of Economics.
    13. Abdelaati Daouia & Byeong U. Park, 2013. "On Projection-type Estimators of Multivariate Isotonic Functions," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 40(2), pages 363-386, June.
    14. repec:wyi:journl:002095 is not listed on IDEAS
    15. Bashtannyk, David M. & Hyndman, Rob J., 2001. "Bandwidth selection for kernel conditional density estimation," Computational Statistics & Data Analysis, Elsevier, vol. 36(3), pages 279-298, May.
    16. Gerard, François & Rothe, Christoph & Rokkanen, Miikka, 2016. "Bounds on Treatment Effects in Regression Discontinuity Designs under Manipulation of the Running Variable, with an Application," CEPR Discussion Papers 11668, C.E.P.R. Discussion Papers.
    17. Taoufik Bouezmarni & Abderrahim Taamouti, 2014. "Nonparametric tests for conditional independence using conditional distributions," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 26(4), pages 697-719, December.
    18. Roberto Basile, 2009. "Productivity Polarization across Regions in Europe," International Regional Science Review, , vol. 32(1), pages 92-115, January.
    19. Manfred Fischer & Peter Stumpner, 2008. "Income distribution dynamics and cross-region convergence in Europe," Journal of Geographical Systems, Springer, vol. 10(2), pages 109-139, June.
    20. Aït-Sahalia, Yacine & Fan, Jianqing & Peng, Heng, 2009. "Nonparametric Transition-Based Tests for Jump Diffusions," Journal of the American Statistical Association, American Statistical Association, vol. 104(487), pages 1102-1116.
    21. Song, Song & Ritov, Ya’acov & Härdle, Wolfgang K., 2012. "Bootstrap confidence bands and partial linear quantile regression," Journal of Multivariate Analysis, Elsevier, vol. 107(C), pages 244-262.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:54:y:2010:i:6:p:1484-1504. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.