IDEAS home Printed from https://ideas.repec.org/a/erh/journl/v5y2013i1p20-42.html
   My bibliography  Save this article

A Review of Kernel Density Estimation with Applications to Econometrics

Author

Listed:
  • Adriano Z. Zambom

    (Universidade Estadual de Campinas)

  • Ronaldo Dias

    (Universidade Estadual de Campinas)

Abstract

Nonparametric density estimation is of great importance when econometricians want to model the probabilistic or stochastic structure of a data set. This comprehensive review summarizes the most important theoretical aspects of kernel density estimation and provides an extensive description of classical and modern data analytic methods to compute the smoothing parameter. Throughout the text, several references can be found to the most up-to-date and cut point research approaches in this area, while econometric data sets are analyzed as examples. Lastly, we present SIZer, a new approach introduced by Chaudhuri and Marron (2000), whose objective is to analyze the visible features representing important underlying structures for different bandwidths.

Suggested Citation

  • Adriano Z. Zambom & Ronaldo Dias, 2013. "A Review of Kernel Density Estimation with Applications to Econometrics," International Econometric Review (IER), Econometric Research Association, vol. 5(1), pages 20-42, April.
  • Handle: RePEc:erh:journl:v:5:y:2013:i:1:p:20-42
    as

    Download full text from publisher

    File URL: http://www.era.org.tr/makaleler/13120083.pdf
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. M. Pawlak & U. Stadtmuller, 1999. "Kernel Density Estimation with Generalized Binning," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 26(4), pages 539-561, December.
    2. Qiang Cai & Gerard Rushton & Budhendra Bhaduri, 2012. "Validation tests of an improved kernel density estimation method for identifying disease clusters," Journal of Geographical Systems, Springer, vol. 14(3), pages 243-264, July.
    3. Savchuk, Olga Y. & Hart, Jeffrey D. & Sheather, Simon J., 2010. "Indirect Cross-Validation for Density Estimation," Journal of the American Statistical Association, American Statistical Association, vol. 105(489), pages 415-423.
    4. Golyandina, Nina & Pepelyshev, Andrey & Steland, Ansgar, 2012. "New approaches to nonparametric density estimation and selection of smoothing parameters," Computational Statistics & Data Analysis, Elsevier, vol. 56(7), pages 2206-2218.
    5. Holmström, Lasse, 2000. "The Accuracy and the Computational Complexity of a Multivariate Binned Kernel Density Estimator," Journal of Multivariate Analysis, Elsevier, vol. 72(2), pages 264-309, February.
    6. Song Chen, 2000. "Probability Density Function Estimation Using Gamma Kernels," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 52(3), pages 471-480, September.
    7. B. W. Silverman, 1982. "Kernel Density Estimation Using the Fast Fourier Transform," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 31(1), pages 93-99, March.
    8. Marron, J. S., 1989. "Comments on a data based bandwidth selector," Computational Statistics & Data Analysis, Elsevier, vol. 8(2), pages 155-170, July.
    9. Wu, Tiee-Jian & Chen, Ching-Fu & Chen, Huang-Yu, 2007. "A variable bandwidth selector in multivariate kernel density estimation," Statistics & Probability Letters, Elsevier, vol. 77(4), pages 462-467, February.
    10. Peter Hall & Michael C. Minnotte, 2002. "High order data sharpening for density estimation," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 64(1), pages 141-157, January.
    11. Ziegler Klaus, 2006. "On local bootstrap bandwidth choice in kernel density estimation," Statistics & Risk Modeling, De Gruyter, vol. 24(2), pages 291-301, December.
    12. Hirukawa, Masayuki, 2010. "Nonparametric multiplicative bias correction for kernel-type density estimation on the unit interval," Computational Statistics & Data Analysis, Elsevier, vol. 54(2), pages 473-495, February.
    13. Mnatsakanov, Robert & Sarkisian, Khachatur, 2012. "Varying kernel density estimation on R+," Statistics & Probability Letters, Elsevier, vol. 82(7), pages 1337-1345.
    14. Chan, Ngai-Hang & Lee, Thomas C.M. & Peng, Liang, 2010. "On nonparametric local inference for density estimation," Computational Statistics & Data Analysis, Elsevier, vol. 54(2), pages 509-515, February.
    15. A. Delaigle & I. Gijbels, 2004. "Bootstrap bandwidth selection in kernel density estimation from a contaminated sample," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 56(1), pages 19-47, March.
    16. Evarist Giné & Hailin Sang, 2010. "Uniform asymptotics for kernel density estimators with variable bandwidths," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 22(6), pages 773-795.
    17. Hall, Peter, 1990. "Using the bootstrap to estimate mean squared error and select smoothing parameter in nonparametric problems," Journal of Multivariate Analysis, Elsevier, vol. 32(2), pages 177-203, February.
    18. Hall, Peter & Marron, J. S., 1987. "Estimation of integrated squared density derivatives," Statistics & Probability Letters, Elsevier, vol. 6(2), pages 109-115, November.
    19. J. Liao & Yujun Wu & Yong Lin, 2010. "Improving Sheather and Jones’ bandwidth selector for difficult densities in kernel density estimation," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 22(1), pages 105-114.
    20. Hall, Peter & Wand, M. P., 1996. "On the Accuracy of Binned Kernel Density Estimators," Journal of Multivariate Analysis, Elsevier, vol. 56(2), pages 165-184, February.
    21. Cao, Ricardo & Cuevas, Antonio & Gonzalez Manteiga, Wensceslao, 1994. "A comparative study of several smoothing methods in density estimation," Computational Statistics & Data Analysis, Elsevier, vol. 17(2), pages 153-176, February.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Lei Zhang & Lun Xie & Qinkai Han & Zhiliang Wang & Chen Huang, 2020. "Probability Density Forecasting of Wind Speed Based on Quantile Regression and Kernel Density Estimation," Energies, MDPI, vol. 13(22), pages 1-24, November.
    2. Peng Bin, 2016. "Dynamic Development of Regional Disparity in Mainland China: An Experimental Study Based on a Multidimensional Index," Sustainability, MDPI, vol. 8(12), pages 1-28, December.
    3. Diaa Al Mohamad, 2018. "Towards a better understanding of the dual representation of phi divergences," Statistical Papers, Springer, vol. 59(3), pages 1205-1253, September.
    4. Han, Qinkai & Wang, Tianyang & Chu, Fulei, 2022. "Nonparametric copula modeling of wind speed-wind shear for the assessment of height-dependent wind energy in China," Renewable and Sustainable Energy Reviews, Elsevier, vol. 161(C).
    5. Laurini, Márcio Poletti & Mauad, Roberto Baltieri, 2012. "Non-Parametric Pricing of Interest Rates Options," Brazilian Review of Econometrics, Sociedade Brasileira de Econometria - SBE, vol. 32(2), April.
    6. Xu, Bing & Costa-Climent, Ricardo & Wang, Yanyan & Xiao, Yuan, 2020. "Financial support for micro and small enterprises: Economic benefit or social responsibility?," Journal of Business Research, Elsevier, vol. 115(C), pages 266-271.
    7. Miśkiewicz, Janusz, 2016. "Improving quality of sample entropy estimation for continuous distribution probability functions," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 450(C), pages 473-485.
    8. Han, Qinkai & Ma, Sai & Wang, Tianyang & Chu, Fulei, 2019. "Kernel density estimation model for wind speed probability distribution with applicability to wind energy assessment in China," Renewable and Sustainable Energy Reviews, Elsevier, vol. 115(C).
    9. William Cipolli & Timothy Hanson, 2019. "Supervised learning via smoothed Polya trees," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 13(4), pages 877-904, December.
    10. Bin, Peng, 2015. "Regional Disparity and Dynamic Development of China: a Multidimensional Index," MPRA Paper 61849, University Library of Munich, Germany.
    11. Han, Qinkai & Chu, Fulei, 2021. "Directional wind energy assessment of China based on nonparametric copula models," Renewable Energy, Elsevier, vol. 164(C), pages 1334-1349.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Gonzalez-Manteiga, W. & Sanchez-Sellero, C. & Wand, M. P., 1996. "Accuracy of binned kernel functional approximations," Computational Statistics & Data Analysis, Elsevier, vol. 22(1), pages 1-16, June.
    2. Golyandina, Nina & Pepelyshev, Andrey & Steland, Ansgar, 2012. "New approaches to nonparametric density estimation and selection of smoothing parameters," Computational Statistics & Data Analysis, Elsevier, vol. 56(7), pages 2206-2218.
    3. Ouimet, Frédéric & Tolosana-Delgado, Raimon, 2022. "Asymptotic properties of Dirichlet kernel density estimators," Journal of Multivariate Analysis, Elsevier, vol. 187(C).
    4. Chan, Ngai-Hang & Lee, Thomas C.M. & Peng, Liang, 2010. "On nonparametric local inference for density estimation," Computational Statistics & Data Analysis, Elsevier, vol. 54(2), pages 509-515, February.
    5. Gaku Igarashi, 2018. "Multivariate Density Estimation Using a Multivariate Weighted Log-Normal Kernel," Sankhya A: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 80(2), pages 247-266, August.
    6. Berwin A. TURLACH, "undated". "Bandwidth selection in kernel density estimation: a rewiew," Statistic und Oekonometrie 9307, Humboldt Universitaet Berlin.
    7. Nils-Bastian Heidenreich & Anja Schindler & Stefan Sperlich, 2013. "Bandwidth selection for kernel density estimation: a review of fully automatic selectors," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 97(4), pages 403-433, October.
    8. Duc Devroye & J. Beirlant & R. Cao & R. Fraiman & P. Hall & M. Jones & Gábor Lugosi & E. Mammen & J. Marron & C. Sánchez-Sellero & J. Uña & F. Udina & L. Devroye, 1997. "Universal smoothing factor selection in density estimation: theory and practice," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 6(2), pages 223-320, December.
    9. Sreevani, & Murthy, C.A., 2016. "On bandwidth selection using minimal spanning tree for kernel density estimation," Computational Statistics & Data Analysis, Elsevier, vol. 102(C), pages 67-84.
    10. Ichimura, Hidehiko & Todd, Petra E., 2007. "Implementing Nonparametric and Semiparametric Estimators," Handbook of Econometrics, in: J.J. Heckman & E.E. Leamer (ed.), Handbook of Econometrics, edition 1, volume 6, chapter 74, Elsevier.
    11. Catalina Bolance & Montserrat Guillen & David Pitt, 2014. "Non-parametric Models for Univariate Claim Severity Distributions - an approach using R," Working Papers 2014-01, Universitat de Barcelona, UB Riskcenter.
    12. Holmström, Lasse, 2000. "The Accuracy and the Computational Complexity of a Multivariate Binned Kernel Density Estimator," Journal of Multivariate Analysis, Elsevier, vol. 72(2), pages 264-309, February.
    13. Funke, Benedikt & Hirukawa, Masayuki, 2019. "Nonparametric estimation and testing on discontinuity of positive supported densities: a kernel truncation approach," Econometrics and Statistics, Elsevier, vol. 9(C), pages 156-170.
    14. Igarashi, Gaku & Kakizawa, Yoshihide, 2014. "Re-formulation of inverse Gaussian, reciprocal inverse Gaussian, and Birnbaum–Saunders kernel estimators," Statistics & Probability Letters, Elsevier, vol. 84(C), pages 235-246.
    15. Malec, Peter & Schienle, Melanie, 2014. "Nonparametric kernel density estimation near the boundary," Computational Statistics & Data Analysis, Elsevier, vol. 72(C), pages 57-76.
    16. David Pitt & Montserrat Guillen & Catalina Bolancé, 2011. "Estimation of Parametric and Nonparametric Models for Univariate Claim Severity Distributions - an approach using R," Working Papers XREAP2011-06, Xarxa de Referència en Economia Aplicada (XREAP), revised Jun 2011.
    17. Karim M Abadir & Michel Lubrano, 2024. "Explicit solutions for the asymptotically optimal bandwidth in cross-validation," Post-Print hal-04678541, HAL.
    18. Meintanis, S. & Ushakov, N. G., 2004. "Binned goodness-of-fit tests based on the empirical characteristic function," Statistics & Probability Letters, Elsevier, vol. 69(3), pages 305-314, September.
    19. Funke, Benedikt & Hirukawa, Masayuki, 2021. "Bias correction for local linear regression estimation using asymmetric kernels via the skewing method," Econometrics and Statistics, Elsevier, vol. 20(C), pages 109-130.
    20. Tang, Qingguo & Karunamuni, Rohana J., 2016. "Fast and accurate computation for kernel estimators," Computational Statistics & Data Analysis, Elsevier, vol. 94(C), pages 49-62.

    More about this item

    Keywords

    Nonparametric Density Estimation; SiZer; Plug-In Bandwidth Selectors; Cross- Validation; Smoothing Parameter.;
    All these keywords.

    JEL classification:

    • C14 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Semiparametric and Nonparametric Methods: General

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:erh:journl:v:5:y:2013:i:1:p:20-42. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: M. F. Cosar (email available below). General contact details of provider: https://edirc.repec.org/data/eratrea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.