
Tuning Parameter-Free Nonparametric Density Estimation from Tabulated Summary Data

Authors
  • Ji Hyung Lee
  • Yuya Sasaki
  • Alexis Akira Toda
  • Yulong Wang

Abstract

Administrative data are often easier to access as tabulated summaries than in the original format due to confidentiality concerns. Motivated by this practical feature, we propose a novel nonparametric density estimation method from tabulated summary data based on maximum entropy and prove its strong uniform consistency. Unlike existing kernel-based estimators, our estimator is free from tuning parameters and admits a closed-form density that is convenient for post-estimation analysis. We apply the proposed method to the tabulated summary data of the U.S. tax returns to estimate the income distribution.
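As a rough illustration of the maximum entropy idea mentioned in the abstract, the sketch below handles a single bounded bin. It assumes (this is not stated in the abstract) that the tabulated summary reports the bin endpoints `a`, `b` and the average value inside the bin. Under that assumption, the entropy-maximizing density on the bin is a truncated exponential whose rate is pinned down by the mean. The function names `scaled_mean`, `solve_rate`, and `bin_density` are hypothetical, and this is not presented as the authors' estimator.

```python
import math


def scaled_mean(t):
    """Mean of a truncated exponential on [0, 1] with rate t (decreasing in t)."""
    if abs(t) < 1e-6:
        return 0.5 - t / 12.0          # series expansion near t = 0 (uniform limit)
    if t > 500.0:
        return 1.0 / t                 # avoids overflow in expm1 for large t
    return 1.0 / t - 1.0 / math.expm1(t)


def solve_rate(a, b, mean):
    """Rate of the maximum entropy density on [a, b] with the given mean.

    Assumption for this sketch: the summary table reports the bin mean.
    The rate is found by plain bisection on the rescaled problem.
    """
    if not a < mean < b:
        raise ValueError("bin mean must lie strictly inside (a, b)")
    width = b - a
    target = (mean - a) / width
    lo, hi = -1.0, 1.0
    while scaled_mean(lo) < target:    # expand bracket to the left
        lo *= 2.0
    while scaled_mean(hi) > target:    # expand bracket to the right
        hi *= 2.0
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if scaled_mean(mid) > target:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi) / width


def bin_density(x, a, b, rate):
    """Truncated exponential density on [a, b]; zero outside the bin."""
    if not a <= x <= b:
        return 0.0
    t = rate * (b - a)
    if t == 0.0:
        return 1.0 / (b - a)           # uniform density when the rate is zero
    return rate * math.exp(-rate * (x - a)) / (-math.expm1(-t))


if __name__ == "__main__":
    rate = solve_rate(0.0, 1.0, 0.3)
    print(rate, bin_density(0.0, 0.0, 1.0, rate))
```

A bin mean below the midpoint gives a positive rate (a density that declines across the bin), a mean above the midpoint gives a negative rate, and a mean exactly at the midpoint recovers the uniform density. A full density over several bins would weight each such piece by the bin's population share, which is again an assumption of this sketch rather than a description of the paper's procedure.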

Suggested Citation

  • Ji Hyung Lee & Yuya Sasaki & Alexis Akira Toda & Yulong Wang, 2022. "Tuning Parameter-Free Nonparametric Density Estimation from Tabulated Summary Data," Papers 2204.05480, arXiv.org, revised May 2023.
  • Handle: RePEc:arx:papers:2204.05480

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2204.05480
    File Function: Latest version
    Download Restriction: no

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Tanaka, Ken'ichiro & Toda, Alexis Akira, 2015. "Discretizing Distributions with Exact Moments: Error Estimate and Convergence Analysis," University of California at San Diego, Economics Working Paper Series qt7g23r5kh, Department of Economics, UC San Diego.
    2. Alexis Akira Toda & Yulong Wang, 2021. "Efficient minimum distance estimation of Pareto exponent from top income shares," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 36(2), pages 228-243, March.
    3. Jangho Yang, 2018. "Information Theoretic Approaches In Economics," Journal of Economic Surveys, Wiley Blackwell, vol. 32(3), pages 940-960, July.
    4. Tobias Eckernkemper & Bastian Gribisch, 2021. "Classical and Bayesian Inference for Income Distributions using Grouped Data," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 83(1), pages 32-65, February.
    5. Vanesa Jorda & José María Sarabia & Markus Jäntti, 2021. "Inequality measurement with grouped data: Parametric and non‐parametric methods," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 184(3), pages 964-984, July.
    6. Gholamreza Hajargasht & William E. Griffiths, 2016. "Inference for Lorenz Curves," Department of Economics - Working Papers Series 2022, The University of Melbourne.
    7. Chotikapanich, Duangkamon & Griffiths, William E. & Rao, D.S. Prasada & Karunarathne, Wasana, 2014. "Income Distributions, Inequality, and Poverty in Asia, 1992–2010," ADBI Working Papers 468, Asian Development Bank Institute.
    8. Vladimir Hlasny, 2021. "Parametric representation of the top of income distributions: Options, historical evidence, and model selection," Journal of Economic Surveys, Wiley Blackwell, vol. 35(4), pages 1217-1256, September.
    9. Duangkamon Chotikapanich & William Griffiths & Wasana Karunarathne & D.S. Prasada Rao, 2013. "Calculating Poverty Measures from the Generalised Beta Income Distribution," The Economic Record, The Economic Society of Australia, vol. 89, pages 48-66, June.
    10. Lee, Jongchul, 2013. "A provincial perspective on income inequality in urban China and the role of property and business income," China Economic Review, Elsevier, vol. 26(C), pages 140-150.
    11. Guanghua Wan, 2012. "Towards Greater Equality in China: The Economic Growth Dividend," Working Papers 2012/33, Maastricht School of Management.
    12. Jangho Yang, 2023. "Information‐theoretic model of induced technical change: Theory and empirics," Metroeconomica, Wiley Blackwell, vol. 74(1), pages 2-39, February.
    13. Thomas Blanchet & Juliette Fournier & Thomas Piketty, 2022. "Generalized Pareto Curves: Theory and Applications," Review of Income and Wealth, International Association for Research in Income and Wealth, vol. 68(1), pages 263-288, March.
    14. Camelia Minoiu & Sanjay Reddy, 2014. "Kernel density estimation on grouped data: the case of poverty assessment," The Journal of Economic Inequality, Springer;Society for the Study of Economic Inequality, vol. 12(2), pages 163-189, June.
    15. Ji Hyung Lee & Yuya Sasaki & Alexis Akira Toda & Yulong Wang, 2022. "Capital and Labor Income Pareto Exponents in the United States, 1916-2019," Papers 2206.04257, arXiv.org.
    16. Felix Koenig, 2023. "Technical Change and Superstar Effects: Evidence from the Rollout of Television," American Economic Review: Insights, American Economic Association, vol. 5(2), pages 207-223, June.
    17. Griffiths, William & Hajargasht, Gholamreza, 2015. "On GMM estimation of distributions from grouped data," Economics Letters, Elsevier, vol. 126(C), pages 122-126.
    18. Tu, Teng-Tsai, 1998. "An entropic approach to equity market integration and consumption-based capital asset pricing models," ISU General Staff Papers 1998010108000012895, Iowa State University, Department of Economics.
    19. Enora Belz, 2019. "Estimating Inequality Measures from Quantile Data," Economics Working Paper Archive (University of Rennes 1 & University of Caen) 2019-09, Center for Research in Economics and Management (CREM), University of Rennes 1, University of Caen and CNRS.
    20. Alexis Akira Toda, 2021. "Data-Based Automatic Discretization of Nonparametric Distributions," Computational Economics, Springer;Society for Computational Economics, vol. 57(4), pages 1217-1235, April.

    More about this item

    JEL classification:

    • C14 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Semiparametric and Nonparametric Methods: General
    • D31 - Microeconomics - - Distribution - - - Personal Income and Wealth Distribution
