IDEAS home Printed from https://ideas.repec.org/a/eee/econom/v238y2024i1s0304407623002841.html
   My bibliography  Save this article

Tuning parameter-free nonparametric density estimation from tabulated summary data

Author

Listed:
  • Lee, Ji Hyung
  • Sasaki, Yuya
  • Toda, Alexis Akira
  • Wang, Yulong

Abstract

Administrative data are often easier to access as tabulated summaries than in the original format due to confidentiality concerns. Motivated by this practical feature, we propose a novel nonparametric density estimation method from tabulated summary data based on maximum entropy and prove its strong uniform consistency. Unlike existing kernel-based estimators, our estimator is free from tuning parameters and admits a closed-form density that is convenient for post-estimation analysis. We apply the proposed method to the tabulated summary data of the U.S. tax returns to estimate the income distribution.

Suggested Citation

  • Lee, Ji Hyung & Sasaki, Yuya & Toda, Alexis Akira & Wang, Yulong, 2024. "Tuning parameter-free nonparametric density estimation from tabulated summary data," Journal of Econometrics, Elsevier, vol. 238(1).
  • Handle: RePEc:eee:econom:v:238:y:2024:i:1:s0304407623002841
    DOI: 10.1016/j.jeconom.2023.105568
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0304407623002841
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.jeconom.2023.105568?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to look for a different version below or search for a different version of it.

    Other versions of this item:

    References listed on IDEAS

    as
    1. Alexis Toda, 2015. "Bayesian general equilibrium," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 58(2), pages 375-411, February.
    2. Villasenor, JoseA. & Arnold, Barry C., 1989. "Elliptical Lorenz curves," Journal of Econometrics, Elsevier, vol. 40(2), pages 327-338, February.
    3. James B. McDonald, 2008. "Some Generalized Functions for the Size Distribution of Income," Economic Studies in Inequality, Social Exclusion, and Well-Being, in: Duangkamon Chotikapanich (ed.), Modeling Income Distributions and Lorenz Curves, chapter 3, pages 37-55, Springer.
    4. Foley Duncan K., 1994. "A Statistical Equilibrium Theory of Markets," Journal of Economic Theory, Elsevier, vol. 62(2), pages 321-345, April.
    5. Chotikapanich, Duangkamon & Griffiths, William E. & Rao, D. S. Prasada, 2007. "Estimating and Combining National Income Distributions Using Limited Data," Journal of Business & Economic Statistics, American Statistical Association, vol. 25, pages 97-109, January.
    6. Tjeerd de Vries & Alexis Akira Toda, 2022. "Capital and Labor Income Pareto Exponents Across Time and Space," Review of Income and Wealth, International Association for Research in Income and Wealth, vol. 68(4), pages 1058-1078, December.
    7. Thomas Piketty & Emmanuel Saez, 2003. "Income Inequality in the United States, 1913–1998," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 118(1), pages 1-41.
    8. Thomas Blanchet & Juliette Fournier & Thomas Piketty, 2022. "Generalized Pareto Curves: Theory and Applications," Review of Income and Wealth, International Association for Research in Income and Wealth, vol. 68(1), pages 263-288, March.
    9. Tanaka, Ken'ichiro & Toda, Alexis Akira, 2015. "Discretizing Distributions with Exact Moments: Error Estimate and Convergence Analysis," University of California at San Diego, Economics Working Paper Series qt7g23r5kh, Department of Economics, UC San Diego.
    10. Yuichi Kitamura & Michael Stutzer, 1997. "An Information-Theoretic Alternative to Generalized Method of Moments Estimation," Econometrica, Econometric Society, vol. 65(4), pages 861-874, July.
    11. Daniel R. Feenberg & James M. Poterba, 1993. "Income Inequality and the Incomes of Very High-Income Taxpayers: Evidence from Tax Returns," NBER Chapters, in: Tax Policy and the Economy, Volume 7, pages 145-177, National Bureau of Economic Research, Inc.
    12. Thomas Piketty & Emmanuel Saez, 2001. "Income Inequality in the United States, 1913-1998 (series updated to 2000 available)," NBER Working Papers 8467, National Bureau of Economic Research, Inc.
    13. Yi-Ting Chen, 2018. "A Unified Approach to Estimating and Testing Income Distributions With Grouped Data," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 36(3), pages 438-455, July.
    14. Miguel Reyes & Mario Francisco-Fernández & Ricardo Cao, 2016. "Nonparametric kernel density estimation for general grouped data," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 28(2), pages 235-249, June.
    15. Stutzer, Michael, 1996. "A Simple Nonparametric Approach to Derivative Security Valuation," Journal of Finance, American Finance Association, vol. 51(5), pages 1633-1652, December.
    16. Wu, Ximing, 2003. "Calculation of maximum entropy densities with application to income distribution," Journal of Econometrics, Elsevier, vol. 115(2), pages 347-354, August.
    17. Kakwani, Nanak C & Podder, N, 1976. "Efficient Estimation of the Lorenz Curve and Associated Inequality Measures from Grouped Observations," Econometrica, Econometric Society, vol. 44(1), pages 137-148, January.
    18. Alexis Toda, 2010. "Existence of a statistical equilibrium for an economy with endogenous offer sets," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 45(3), pages 379-415, December.
    19. Ji Hyung Lee & Yuya Sasaki & Alexis Akira Toda & Yulong Wang, 2022. "Capital and Labor Income Pareto Exponents in the United States, 1916-2019," Papers 2206.04257, arXiv.org.
    20. Leland E. Farmer & Alexis Akira Toda, 2017. "Discretizing nonlinear, non‐Gaussian Markov processes with exact conditional moments," Quantitative Economics, Econometric Society, vol. 8(2), pages 651-683, July.
    21. Toda, Alexis Akira, 2012. "The double power law in income distribution: Explanations and evidence," Journal of Economic Behavior & Organization, Elsevier, vol. 84(1), pages 364-381.
    22. Tanaka, Ken’ichiro & Toda, Alexis Akira, 2013. "Discrete approximations of continuous distributions by maximum entropy," Economics Letters, Elsevier, vol. 118(3), pages 445-450.
    23. Gholamreza Hajargasht & William E. Griffiths & Joseph Brice & D.S. Prasada Rao & Duangkamon Chotikapanich, 2012. "Inference for Income Distributions Using Grouped Data," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 30(4), pages 563-575, May.
    24. Frank A. Cowell & Fatemeh Mehta, 1982. "The Estimation and Interpolation of Inequality Measures," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 49(2), pages 273-290.
    25. Vanesa Jorda & José María Sarabia & Markus Jäntti, 2021. "Inequality measurement with grouped data: Parametric and non‐parametric methods," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 184(3), pages 964-984, July.
    26. Stutzer, Michael, 1995. "A Bayesian approach to diagnosis of asset pricing models," Journal of Econometrics, Elsevier, vol. 68(2), pages 367-397, August.
    27. Gholamreza Hajargasht & William E. Griffiths, 2020. "Minimum distance estimation of parametric Lorenz curves based on grouped data," Econometric Reviews, Taylor & Francis Journals, vol. 39(4), pages 344-361, April.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Tanaka, Ken'ichiro & Toda, Alexis Akira, 2015. "Discretizing Distributions with Exact Moments: Error Estimate and Convergence Analysis," University of California at San Diego, Economics Working Paper Series qt7g23r5kh, Department of Economics, UC San Diego.
    2. Alexis Akira Toda & Yulong Wang, 2021. "Efficient minimum distance estimation of Pareto exponent from top income shares," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 36(2), pages 228-243, March.
    3. Jangho Yang, 2018. "Information Theoretic Approaches In Economics," Journal of Economic Surveys, Wiley Blackwell, vol. 32(3), pages 940-960, July.
    4. Tobias Eckernkemper & Bastian Gribisch, 2021. "Classical and Bayesian Inference for Income Distributions using Grouped Data," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 83(1), pages 32-65, February.
    5. Vanesa Jorda & José María Sarabia & Markus Jäntti, 2021. "Inequality measurement with grouped data: Parametric and non‐parametric methods," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 184(3), pages 964-984, July.
    6. Gholamreza Hajargasht & William E. Griffiths, 2016. "Inference for Lorenz Curves," Department of Economics - Working Papers Series 2022, The University of Melbourne.
    7. Vladimir Hlasny, 2021. "Parametric representation of the top of income distributions: Options, historical evidence, and model selection," Journal of Economic Surveys, Wiley Blackwell, vol. 35(4), pages 1217-1256, September.
    8. Tsvetana Spasova, 2024. "Estimating Income Distributions From Grouped Data: A Minimum Quantile Distance Approach," Computational Economics, Springer;Society for Computational Economics, vol. 64(4), pages 2079-2096, October.
    9. Lee, Jongchul, 2013. "A provincial perspective on income inequality in urban China and the role of property and business income," China Economic Review, Elsevier, vol. 26(C), pages 140-150.
    10. Jangho Yang, 2023. "Information‐theoretic model of induced technical change: Theory and empirics," Metroeconomica, Wiley Blackwell, vol. 74(1), pages 2-39, February.
    11. Chotikapanich, Duangkamon & Griffiths, William E. & Rao, D.S. Prasada & Karunarathne, Wasana, 2014. "Income Distributions, Inequality, and Poverty in Asia, 1992–2010," ADBI Working Papers 468, Asian Development Bank Institute.
    12. Duangkamon Chotikapanich & William Griffiths & Wasana Karunarathne & D.S. Prasada Rao, 2013. "Calculating Poverty Measures from the Generalised Beta Income Distribution," The Economic Record, The Economic Society of Australia, vol. 89, pages 48-66, June.
    13. Guanghua Wan, 2012. "Towards Greater Equality in China: The Economic Growth Dividend," Working Papers 2012/33, Maastricht School of Management.
    14. Xiaobo Shen & Pingsheng Dai, 2024. "A regression method for estimating Gini index by decile," Palgrave Communications, Palgrave Macmillan, vol. 11(1), pages 1-8, December.
    15. Thomas Blanchet & Juliette Fournier & Thomas Piketty, 2022. "Generalized Pareto Curves: Theory and Applications," Review of Income and Wealth, International Association for Research in Income and Wealth, vol. 68(1), pages 263-288, March.
    16. Camelia Minoiu & Sanjay Reddy, 2014. "Kernel density estimation on grouped data: the case of poverty assessment," The Journal of Economic Inequality, Springer;Society for the Study of Economic Inequality, vol. 12(2), pages 163-189, June.
    17. Tu, Teng-Tsai, 1998. "An entropic approach to equity market integration and consumption-based capital asset pricing models," ISU General Staff Papers 1998010108000012895, Iowa State University, Department of Economics.
    18. Nak-Nyeon Kim, 2018. "Top Incomes in Korea: Update, 1933-2016"," World Inequality Lab Working Papers hal-02878150, HAL.
    19. Melanie Krause, 2014. "Parametric Lorenz Curves and the Modality of the Income Density Function," Review of Income and Wealth, International Association for Research in Income and Wealth, vol. 60(4), pages 905-929, December.
    20. Duangkamon Chotikapanich & William E. Griffiths & Gholamreza Hajargasht & Wasana Karunarathne & D. S. Prasada Rao, 2018. "Using the GB2 Income Distribution," Econometrics, MDPI, vol. 6(2), pages 1-24, April.

    More about this item

    Keywords

    Grouped data; Income distribution; Maximum entropy;
    All these keywords.

    JEL classification:

    • C14 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Semiparametric and Nonparametric Methods: General
    • D31 - Microeconomics - - Distribution - - - Personal Income and Wealth Distribution

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:econom:v:238:y:2024:i:1:s0304407623002841. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/jeconom .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.