IDEAS home Printed from https://ideas.repec.org/a/tsj/stataj/v23y2023i1p53-85.html
   My bibliography  Save this article

Finite mixture models for linked survey and administrative data: Estimation and postestimation

Author

Listed:
  • Stephen P. Jenkins

    (London School of Economics and Political Science)

  • Fernando Rios-Avila

    (The Levy Economics Institute of Bard College)

Abstract

Researchers use finite mixture models to analyze linked survey and administrative data on labor earnings, while also accounting for various types of measurement error in each data source. Different combinations of error-ridden and error-free observations characterize latent classes. Latent class probabilities depend on the probabilities of the different types of error. We introduce a suite of commands to fit finite mixture models to linked survey-administrative data: there is a general model and seven simpler variants. We also provide postestimation commands for assessment of reliability, marginal effects, data simulation, and pre- diction of hybrid variables that combine information from both data sources about the outcome of interest. Our commands can also be used to study measurement errors in other variables besides labor earnings.

Suggested Citation

  • Stephen P. Jenkins & Fernando Rios-Avila, 2023. "Finite mixture models for linked survey and administrative data: Estimation and postestimation," Stata Journal, StataCorp LP, vol. 23(1), pages 53-85, March.
  • Handle: RePEc:tsj:stataj:v:23:y:2023:i:1:p:53-85
    DOI: 10.1177/1536867X231161976
    Note: to access software from within Stata, net describe http://www.stata-journal.com/software/sj23-1/st0701/
    as

    Download full text from publisher

    File URL: http://hdl.handle.net/10.1177/1536867X231161976
    Download Restriction: no

    File URL: https://libkey.io/10.1177/1536867X231161976?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    Other versions of this item:

    References listed on IDEAS

    as
    1. Stephen P. Jenkins & Fernando Rios‐Avila, 2021. "Measurement error in earnings data: Replication of Meijer, Rohwedder, and Wansbeek's mixture model approach to combining survey and register data," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 36(4), pages 474-483, June.
    2. Jenkins, Stephen P. & Rios-Avila, Fernando, 2021. "Reconciling Reports: Modelling Employment Earnings and Measurement Errors Using Linked Survey and Administrative Data," IZA Discussion Papers 14405, Institute of Labor Economics (IZA).
    3. Jenkins, Stephen P. & Rios-Avila, Fernando, 2020. "Modelling errors in survey and administrative data on employment earnings: Sensitivity to the fraction assumed to have error-free earnings," Economics Letters, Elsevier, vol. 192(C).
    4. John M. Abowd & Martha H. Stinson, 2013. "Estimating Measurement Error in Annual Job Earnings: A Comparison of Survey and Administrative Data," The Review of Economics and Statistics, MIT Press, vol. 95(5), pages 1451-1467, December.
    5. Dean R. Hyslop & Wilbur Townsend, 2020. "Earnings Dynamics and Measurement Error in Matched Survey and Administrative Data," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 38(2), pages 457-469, April.
    6. repec:taf:jnlbes:v:30:y:2012:i:2:p:191-201 is not listed on IDEAS
    7. Arie Kapteyn & Jelmer Y. Ypma, 2007. "Measurement Error and Misclassification: A Comparison of Survey and Administrative Data," Journal of Labor Economics, University of Chicago Press, vol. 25(3), pages 513-551.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Jenkins, Stephen P. & Rios-Avila, Fernando, 2021. "Reconciling Reports: Modelling Employment Earnings and Measurement Errors Using Linked Survey and Administrative Data," IZA Discussion Papers 14405, Institute of Labor Economics (IZA).
    2. Apostolos Davillas & Victor Hugo Oliveira & Andrew M. Jones, 2024. "A model of errors in BMI based on self-reported and measured anthropometrics with evidence from Brazilian data," Empirical Economics, Springer, vol. 67(5), pages 2371-2410, November.
    3. Apostolos Davillas & Victor Hugo Oliveira & Andrew M. Jones, 2024. "A model of errors in BMI based on self-reported and measured anthropometrics with evidence from Brazilian data," Empirical Economics, Springer, vol. 67(5), pages 2371-2410, November.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jenkins, Stephen P. & Rios-Avila, Fernando, 2021. "Reconciling Reports: Modelling Employment Earnings and Measurement Errors Using Linked Survey and Administrative Data," IZA Discussion Papers 14405, Institute of Labor Economics (IZA).
    2. Apostolos Davillas & Victor Hugo Oliveira & Andrew M. Jones, 2024. "A model of errors in BMI based on self-reported and measured anthropometrics with evidence from Brazilian data," Empirical Economics, Springer, vol. 67(5), pages 2371-2410, November.
    3. Apostolos Davillas & Victor Hugo Oliveira & Andrew M. Jones, 2024. "A model of errors in BMI based on self-reported and measured anthropometrics with evidence from Brazilian data," Empirical Economics, Springer, vol. 67(5), pages 2371-2410, November.
    4. Emmanuel Flachaire & Nora Lustig & Andrea Vigorito, 2023. "Underreporting of Top Incomes and Inequality: A Comparison of Correction Methods using Simulations and Linked Survey and Tax Data," Review of Income and Wealth, International Association for Research in Income and Wealth, vol. 69(4), pages 1033-1059, December.
    5. R. Bollinger, Christopher & Valentinova Tasseva, Iva, 2022. "Income source confusion using the SILC," ISER Working Paper Series 2022-04, Institute for Social and Economic Research.
    6. Jenkins, Stephen P. & Rios-Avila, Fernando, 2020. "Modelling errors in survey and administrative data on employment earnings: Sensitivity to the fraction assumed to have error-free earnings," Economics Letters, Elsevier, vol. 192(C).
    7. Hyslop, Dean R. & Townsend, Wilbur, 2017. "Employment misclassification in survey and administrative reports," Economics Letters, Elsevier, vol. 155(C), pages 19-23.
    8. Ha Trong Nguyen & Huong Thu Le & Luke Connelly & Francis Mitrou, 2023. "Accuracy of self‐reported private health insurance coverage," Health Economics, John Wiley & Sons, Ltd., vol. 32(12), pages 2709-2729, December.
    9. Crossley, Thomas F. & Fisher, Paul & Hussein, Omar, 2023. "Assessing data from summary questions about earnings and income," Labour Economics, Elsevier, vol. 81(C).
    10. Madeira, Carlos & Margaretic, Paula, 2022. "The impact of financial literacy on the quality of self-reported financial information," Journal of Behavioral and Experimental Finance, Elsevier, vol. 34(C).
    11. R. Bollinger, Christopher & Valentinova Tasseva, Iva, 2022. "Income source confusion using the SILC," ISER Working Paper Series 2022-04, Institute for Social and Economic Research.
    12. Whitaker, Stephan D., 2018. "Big Data versus a survey," The Quarterly Review of Economics and Finance, Elsevier, vol. 67(C), pages 285-296.
    13. Zachary H. Seeskin, 2016. "Evaluating the Use of Commercial Data to Improve Survey Estimates of Property Taxes," CARRA Working Papers 2016-06, Center for Economic Studies, U.S. Census Bureau.
    14. Michele Lalla & Patrizio Frederic & Daniela Mantovani, 2022. "The inextricable association of measurement errors and tax evasion as examined through a microanalysis of survey data matched with fiscal data: a case study," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 31(5), pages 1375-1401, December.
    15. Quinn Moore & Irma Perez-Johnson & Robert Santillano, 2018. "Decomposing Differences in Impacts on Survey- and Administrative-Measured Earnings From a Job Training Voucher Experiment," Evaluation Review, , vol. 42(5-6), pages 515-549, October.
    16. Paulus, Alari, 2015. "Tax evasion and measurement error: An econometric analysis of survey data linked with tax records," ISER Working Paper Series 2015-10, Institute for Social and Economic Research.
    17. Stephen P. Jenkins & Fernando Rios‐Avila, 2021. "Measurement error in earnings data: Replication of Meijer, Rohwedder, and Wansbeek's mixture model approach to combining survey and register data," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 36(4), pages 474-483, June.
    18. Michele Lalla & Maddalena Cavicchioli, 2020. "Nonresponse and measurement errors in income: matching individual survey data with administrative tax data," Department of Economics 0170, University of Modena and Reggio E., Faculty of Economics "Marco Biagi".
    19. Dieter Vandelannoote & André Decoster & Toon Vanheukelom & Gerlinde Verbist, 2016. "Evaluating The Quality Of Gross Incomes In SILC: Compare Them With Fiscal Data And Re-calibrate Them Using EUROMOD," International Journal of Microsimulation, International Microsimulation Association, vol. 9(3), pages 5-34.
    20. Bruce D. Meyer & Nikolas Mittag, 2015. "Using Linked Survey and Administrative Data to Better Measure Income: Implications for Poverty, Program Effectiveness and Holes in the Safety Net," NBER Working Papers 21676, National Bureau of Economic Research, Inc.

    More about this item

    Keywords

    ky_fit; ky_estat; ky_sim; linked survey and administrative data; measurement error; finite mixture models; latent class models;
    All these keywords.

    JEL classification:

    • C81 - Mathematical and Quantitative Methods - - Data Collection and Data Estimation Methodology; Computer Programs - - - Methodology for Collecting, Estimating, and Organizing Microeconomic Data; Data Access
    • C83 - Mathematical and Quantitative Methods - - Data Collection and Data Estimation Methodology; Computer Programs - - - Survey Methods; Sampling Methods
    • D31 - Microeconomics - - Distribution - - - Personal Income and Wealth Distribution

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:tsj:stataj:v:23:y:2023:i:1:p:53-85. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Christopher F. Baum or Lisa Gilmore (email available below). General contact details of provider: http://www.stata-journal.com/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.