IDEAS home Printed from https://ideas.repec.org/p/zbw/fubsbe/201527.html
   My bibliography  Save this paper

Kernel density estimation for heaped data

Author

Listed:
  • Groß, Marcus
  • Rendtel, Ulrich

Abstract

In self-reported data usually a phenomenon called 'heaping' occurs, i.e. survey participants round the values of their income, weight or height to some degree. Additionally, respondents may be more prone to round off or up due to social desirability. By ignoring the heaping process a severe bias in terms of spikes and bumps is introduced when applying kernel density methods naively to the rounded data. A generalized Stochastic Expectation Maximization (SEM) approach accounting for heaping with potentially asymmetric rounding behaviour in univariate kernel density estimation is presented in this work. The introduced methods are applied to survey data of the German Socio-Economic Panel and exhibit very good performance simulations.

Suggested Citation

  • Groß, Marcus & Rendtel, Ulrich, 2015. "Kernel density estimation for heaped data," Discussion Papers 2015/27, Free University Berlin, School of Business & Economics.
  • Handle: RePEc:zbw:fubsbe:201527
    as

    Download full text from publisher

    File URL: https://www.econstor.eu/bitstream/10419/117333/1/834585685.pdf
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Pudney, Stephen, 2008. "Heaping and leaping: survey response behaviour and the dynamics of self-reported consumption expenditure," ISER Working Paper Series 2008-09, Institute for Social and Economic Research.
    2. Erich Battistin & Raffaele Miniaci & Guglielmo Weber, 2003. "What Do We Learn from Recall Consumption Data?," Journal of Human Resources, University of Wisconsin Press, vol. 38(2).
    3. Groß, Marcus & Rendtel, Ulrich & Schmid, Timo & Schmon, Sebastian & Tzavidis, Nikos, 2015. "Estimating the density of ethnic minorities and aged people in Berlin: Multivariate kernel density estimation applied to sensitive geo-referenced administrative data protected via measurement error," Discussion Papers 2015/7, Free University Berlin, School of Business & Economics.
    4. Zhang, Xibin & King, Maxwell L. & Hyndman, Rob J., 2006. "A Bayesian approach to bandwidth selection for multivariate kernel density estimation," Computational Statistics & Data Analysis, Elsevier, vol. 50(11), pages 3009-3031, July.
    5. Steffen Otterbach & Alfonso Sousa-Poza, 2010. "How Accurate are German Work-time Data? A Comparison of Time-diary Reports and Stylized Estimates," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 97(3), pages 325-339, July.
    6. repec:mpr:mprres:6195 is not listed on IDEAS
    7. Gert G. Wagner & Joachim R. Frick & Jürgen Schupp, 2007. "The German Socio-Economic Panel Study (SOEP) – Scope, Evolution and Enhancements," Schmollers Jahrbuch : Journal of Applied Social Science Studies / Zeitschrift für Wirtschafts- und Sozialwissenschaften, Duncker & Humblot, Berlin, vol. 127(1), pages 139-169.
    8. Jan Marcus & Rainer Siegers & Markus M. Grabka, 2013. "Preparation of Data from the New SOEP Consumption Module: Editing, Imputation, and Smoothing," Data Documentation 70, DIW Berlin, German Institute for Economic Research.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Michael C. Knaus & Steffen Otterbach, 2019. "Work Hour Mismatch And Job Mobility: Adjustment Channels And Resolution Rates," Economic Inquiry, Western Economic Association International, vol. 57(1), pages 227-242, January.
    2. Stefan Bach & Martin Beznoska & Viktor Steiner, 2016. "An Integrated Micro Data Base for Tax Analysis in Germany," Data Documentation 86, DIW Berlin, German Institute for Economic Research.
    3. Thomas F. Crossley & Joachim K. Winter, 2014. "Asking Households about Expenditures: What Have We Learned?," NBER Chapters, in: Improving the Measurement of Consumer Expenditures, pages 23-50, National Bureau of Economic Research, Inc.
    4. Nico Pestel & Eric Sommer, 2017. "Shifting Taxes from Labor to Consumption: More Employment and more Inequality?," Review of Income and Wealth, International Association for Research in Income and Wealth, vol. 63(3), pages 542-563, September.
    5. Bettina Sonnenberg & Michaela Riediger & Cornelia Wrzus & Gert G. Wagner, 2011. "Measuring Time Use in Surveys: How Valid Are Time Use Questions in Surveys? Concordance of Survey and Experience Sampling Measures," SOEPpapers on Multidisciplinary Panel Data Research 390, DIW Berlin, The German Socio-Economic Panel (SOEP).
    6. Byung-hill Jun & Hosin Song, 2019. "Tests for Detecting Probability Mass Points," Korean Economic Review, Korean Economic Association, vol. 35, pages 205-248.
    7. Löffler, Max & Peichl, Andreas & Pestel, Nico & Siegloch, Sebastian & Sommer, Eric, 2014. "Documentation IZA?MOD v3.0: The IZA Policy Simulation Model," IZA Discussion Papers 8553, Institute of Labor Economics (IZA).
    8. Marcus Groß & Ulrich Rendtel & Timo Schmid & Sebastian Schmon & Nikos Tzavidis, 2017. "Estimating the density of ethnic minorities and aged people in Berlin: multivariate kernel density estimation applied to sensitive georeferenced administrative data protected via measurement error," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 180(1), pages 161-183, January.
    9. Heineck, Guido & Süssmuth, Bernd, 2013. "A different look at Lenin’s legacy: Social capital and risk taking in the Two Germanies," Journal of Comparative Economics, Elsevier, vol. 41(3), pages 789-803.
    10. Kemptner, Daniel & Tolan, Songül, 2018. "The role of time preferences in educational decision making," Economics of Education Review, Elsevier, vol. 67(C), pages 25-39.
    11. Falk, Armin & Abeler, Johannes & Kosse, Fabian, 2021. "Malleability of preferences for honesty," CEPR Discussion Papers 16164, C.E.P.R. Discussion Papers.
    12. Fossen, Frank M. & König, Johannes, 2015. "Public health insurance and entry into self-employment," VfS Annual Conference 2015 (Muenster): Economic Development - Theory and Policy 112934, Verein für Socialpolitik / German Economic Association.
    13. Stefanie Sperlich & Frauke-Marie Adler & Johannes Beller & Batoul Safieddine & Juliane Tetzlaff & Fabian Tetzlaff & Siegfried Geyer, 2022. "Getting Better or Getting Worse? A Population-Based Study on Trends in Self-Rated Health among Single Mothers in Germany between 1994 and 2018," IJERPH, MDPI, vol. 19(5), pages 1-13, February.
    14. Sarah Flèche & Richard Layard, 2017. "Do More of Those in Misery Suffer from Poverty, Unemployment or Mental Illness?," Kyklos, Wiley Blackwell, vol. 70(1), pages 27-41, February.
    15. Catalina Bolance & Montserrat Guillen & David Pitt, 2014. "Non-parametric Models for Univariate Claim Severity Distributions - an approach using R," Working Papers 2014-01, Universitat de Barcelona, UB Riskcenter.
    16. Klos, Alexander & Rottke, Simon, 2013. "Saving and Consumption When Children Move Out," VfS Annual Conference 2013 (Duesseldorf): Competition Policy and Regulation in a Global Economic Order 79786, Verein für Socialpolitik / German Economic Association.
    17. Matthias Lühr & Maria K. Pavlova & Maike Luhmann, 2022. "They are Doing Well, but is it by Doing Good? Pathways from Nonpolitical and Political Volunteering to Subjective Well-Being in Age Comparison," Journal of Happiness Studies, Springer, vol. 23(5), pages 1969-1989, June.
    18. Billari, Francesco C. & Giuntella, Osea & Stella, Luca, 2018. "Broadband internet, digital temptations, and sleep," Journal of Economic Behavior & Organization, Elsevier, vol. 153(C), pages 58-76.
    19. Fabian Kosse & Thomas Deckers & Pia Pinger & Hannah Schildberg-Hörisch & Armin Falk, 2020. "The Formation of Prosociality: Causal Evidence on the Role of Social Environment," Journal of Political Economy, University of Chicago Press, vol. 128(2), pages 434-467.
    20. Schupp, Jürgen, 2009. "25 Jahre Sozio-oekonomisches Panel - ein Infrastrukturprojekt der empirischen Sozial- und Wirtschaftsforschung in Deutschland," EconStor Open Access Articles and Book Chapters, ZBW - Leibniz Information Centre for Economics, vol. 38(5), pages 350-357.

    More about this item

    Keywords

    Heaping; Survey Data; Measurement error; Self-reported data; Kernel density estimation; Rounded data;
    All these keywords.

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:zbw:fubsbe:201527. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ZBW - Leibniz Information Centre for Economics (email available below). General contact details of provider: https://edirc.repec.org/data/fwfubde.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.