IDEAS home Printed from https://ideas.repec.org/a/eee/csdana/v54y2010i6p1484-1504.html
   My bibliography  Save this article

Optimized fixed-size kernel models for large data sets

Author

Listed:
  • De Brabanter, K.
  • De Brabanter, J.
  • Suykens, J.A.K.
  • De Moor, B.

Abstract

A modified active subset selection method based on quadratic Rényi entropy and a fast cross-validation for fixed-size least squares support vector machines is proposed for classification and regression with optimized tuning process. The kernel bandwidth of the entropy based selection criterion is optimally determined according to the solve-the-equation plug-in method. Also a fast cross-validation method based on a simple updating scheme is developed. The combination of these two techniques is suitable for handling large scale data sets on standard personal computers. Finally, the performance on test data and computational time of this fixed-size method are compared to those for standard support vector machines and [nu]-support vector machines resulting in sparser models with lower computational cost and comparable accuracy.

Suggested Citation

  • De Brabanter, K. & De Brabanter, J. & Suykens, J.A.K. & De Moor, B., 2010. "Optimized fixed-size kernel models for large data sets," Computational Statistics & Data Analysis, Elsevier, vol. 54(6), pages 1484-1504, June.
  • Handle: RePEc:eee:csdana:v:54:y:2010:i:6:p:1484-1504
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167-9473(10)00039-3
    Download Restriction: Full text for ScienceDirect subscribers only.
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Liu, Yufeng & Helen Zhang, Hao & Park, Cheolwoo & Ahn, Jeongyoun, 2007. "Support vector machines with adaptive Lq penalty," Computational Statistics & Data Analysis, Elsevier, vol. 51(12), pages 6380-6394, August.
    2. Sheather, Simon J., 1986. "An improved data-based algorithm for choosing the window width when estimating the density at a point," Computational Statistics & Data Analysis, Elsevier, vol. 4(1), pages 61-65, June.
    3. L. Ingber, 1989. "Very fast simulated re-annealing," Lester Ingber Papers 89vf, Lester Ingber.
    4. Hall, Peter & Wolff, Rodney C. L. & Yao, Qiwei, 1999. "Methods for estimating a conditional distribution function," LSE Research Online Documents on Economics 6631, London School of Economics and Political Science, LSE Library.
    5. Fan, Jianqing & Yao, Qiwei & Tong, Howell, 1996. "Estimation of conditional densities and sensitivity measures in nonlinear dynamical systems," LSE Research Online Documents on Economics 6704, London School of Economics and Political Science, LSE Library.
    6. Qi Li & Jeffrey Scott Racine, 2006. "Nonparametric Econometrics: Theory and Practice," Economics Books, Princeton University Press, edition 1, volume 1, number 8355.
    7. Huang, Chien-Ming & Lee, Yuh-Jye & Lin, Dennis K.J. & Huang, Su-Yun, 2007. "Model selection for support vector machines via uniform design," Computational Statistics & Data Analysis, Elsevier, vol. 52(1), pages 335-346, September.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Luts, Jan & Molenberghs, Geert & Verbeke, Geert & Van Huffel, Sabine & Suykens, Johan A.K., 2012. "A mixed effects least squares support vector machine model for classification of longitudinal data," Computational Statistics & Data Analysis, Elsevier, vol. 56(3), pages 611-628.
    2. De Brabanter, Kris & Suykens, Johan & De Moor, Bart, 2013. "Nonparametric Regression via StatLSSVM," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 55(i02).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Patrick Saart & Jiti Gao & Nam Hyun Kim, 2014. "Semiparametric methods in nonlinear time series analysis: a selective review," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 26(1), pages 141-169, March.
    2. Dingshi Tian & Zongwu Cai & Ying Fang, 2018. "Econometric Modeling of Risk Measures: A Selective Review of the Recent Literature," WORKING PAPERS SERIES IN THEORETICAL AND APPLIED ECONOMICS 201807, University of Kansas, Department of Economics, revised Oct 2018.
    3. Frandsen, Brigham R. & Frölich, Markus & Melly, Blaise, 2012. "Quantile treatment effects in the regression discontinuity design," Journal of Econometrics, Elsevier, vol. 168(2), pages 382-395.
    4. Bhattacharya, Debopam & Dupas, Pascaline, 2012. "Inferring welfare maximizing treatment assignment under budget constraints," Journal of Econometrics, Elsevier, vol. 167(1), pages 168-196.
    5. Dette, Holger & Volgushev, Stanislav, 2007. "Non-crossing nonparametric estimates of quantile curves," Technical Reports 2007,18, Technische Universität Dortmund, Sonderforschungsbereich 475: Komplexitätsreduktion in multivariaten Datenstrukturen.
    6. François Gerard & Miikka Rokkanen & Christoph Rothe, 2020. "Bounds on treatment effects in regression discontinuity designs with a manipulated running variable," Quantitative Economics, Econometric Society, vol. 11(3), pages 839-870, July.
    7. Valentina Corradi & Norman Swanson & Walter Distaso, 2006. "Predictive Inference for Integrated Volatility," Departmental Working Papers 200616, Rutgers University, Department of Economics.
    8. Abdelaati Daouia & Byeong U. Park, 2013. "On Projection-type Estimators of Multivariate Isotonic Functions," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 40(2), pages 363-386, June.
    9. Gerard, François & Rothe, Christoph & Rokkanen, Miikka, 2016. "Bounds on Treatment Effects in Regression Discontinuity Designs under Manipulation of the Running Variable, with an Application," CEPR Discussion Papers 11668, C.E.P.R. Discussion Papers.
    10. Luke Taylor & Taisuke Otsu, 2019. "Estimation of nonseparable models with censored dependent variables and endogenous regressors," Econometric Reviews, Taylor & Francis Journals, vol. 38(1), pages 4-24, January.
    11. Roberto Basile, 2010. "Intra-distribution dynamics of regional per-capita income in Europe: evidence from alternative conditional density estimators," Statistica, Department of Statistics, University of Bologna, vol. 70(1), pages 3-22.
    12. Roberto Basile, 2009. "Productivity Polarization across Regions in Europe," International Regional Science Review, , vol. 32(1), pages 92-115, January.
    13. Manfred Fischer & Peter Stumpner, 2008. "Income distribution dynamics and cross-region convergence in Europe," Journal of Geographical Systems, Springer, vol. 10(2), pages 109-139, June.
    14. Ann-Kathrin Bott & Michael Kohler, 2016. "Adaptive Estimation of a Conditional Density," International Statistical Review, International Statistical Institute, vol. 84(2), pages 291-316, August.
    15. Cai, Zongwu & Wang, Xian, 2008. "Nonparametric estimation of conditional VaR and expected shortfall," Journal of Econometrics, Elsevier, vol. 147(1), pages 120-130, November.
    16. Taoufik Bouezmarni & Abderrahim Taamouti, 2014. "Nonparametric tests for conditional independence using conditional distributions," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 26(4), pages 697-719, December.
    17. Hyndman, R.J. & Yao, Q., 1998. "Nonparametric Estimation and Symmetry Tests for Conditional Density Functions," Monash Econometrics and Business Statistics Working Papers 17/98, Monash University, Department of Econometrics and Business Statistics.
    18. Han-Ying Liang & Elias Ould Saïd, 2018. "A weighted estimator of conditional hazard rate with left-truncated and dependent data," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 70(1), pages 155-189, February.
    19. Yan Shang & David Dunson & Jing-Sheng Song, 2017. "Exploiting Big Data in Logistics Risk Assessment via Bayesian Nonparametrics," Operations Research, INFORMS, vol. 65(6), pages 1574-1588, December.
    20. Ye Chen & Liangjun Su & Aman Ullah, 2009. "Functional Coefficient Estimation with Both Categorical and Continuous Data," Working Papers 200909, University of California at Riverside, Department of Economics, revised Jun 2009.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:54:y:2010:i:6:p:1484-1504. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.