IDEAS home Printed from https://ideas.repec.org/a/jns/jbstat/v225y2005i5p567-583.html
   My bibliography  Save this article

Microdata Disclosure Control by Resampling - Effects on Regression Results

Author

Listed:
  • Gottschalk Sandra

    (Centre for European Economic Research (ZEW), L 7, 1, D-68161 Mannheim, Germany)

Abstract

Nonparametric resampling is a method for generating synthetic microdata and is introduced as a procedure for microdata disclosure limitation. Theoretically, re-identification of individuals or firms is not possible with synthetic data. The resampling procedure creates datasets - the resample - which nearly have the same empirical cumulative distribution functions as the original survey data and thus permit econometricians to calculate meaningful regression results. The idea of nonparametric resampling, especially, is to draw from univariate or multivariate empirical distribution functions without having to estimate these explicitly. Until now, the resampling procedure shown here has only been applicable to variables with continuous distribution functions. Monte Carlo simulations and applications with data from the Mannheim Innovation Panel show that results of linear and nonlinear regression analyses can be reproduced quite precisely by nonparametric resamples. A univariate and a multivariate resampling version are examined. The univariate version as well as the multivariate version which is using the correlation structure of the original data as a scaling instrument turn out to be able to retain the coefficients of model estimations. Furthermore, multivariate resampling best reproduces regression results if all variables are anonymised.

Suggested Citation

  • Gottschalk Sandra, 2005. "Microdata Disclosure Control by Resampling - Effects on Regression Results," Journal of Economics and Statistics (Jahrbuecher fuer Nationaloekonomie und Statistik), De Gruyter, vol. 225(5), pages 567-583, October.
  • Handle: RePEc:jns:jbstat:v:225:y:2005:i:5:p:567-583
    DOI: 10.1515/jbnst-2005-0506
    as

    Download full text from publisher

    File URL: https://doi.org/10.1515/jbnst-2005-0506
    Download Restriction: no

    File URL: https://libkey.io/10.1515/jbnst-2005-0506?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Hardle, Wolfgang & Linton, Oliver, 1986. "Applied nonparametric methods," Handbook of Econometrics, in: R. F. Engle & D. McFadden (ed.), Handbook of Econometrics, edition 1, volume 4, chapter 38, pages 2295-2339, Elsevier.
    2. Almus, Matthias & Engel, Dirk & Prantl, Susanne, 2000. "The Mannheim Foundation Panels of the Centre for European Economic Research (ZEW)," ZEW Dokumentationen 00-02, ZEW - Leibniz Centre for European Economic Research.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Gottschalk, Sandra, 2013. "The Research Data Centre of the Centre for European Economic Research (ZEW-FDZ)," ZEW Discussion Papers 13-051, ZEW - Leibniz Centre for European Economic Research.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Herwartz, Helmut & Reimers, Hans-Eggert, 2006. "Modelling the Fisher hypothesis: World wide evidence," Economics Working Papers 2006-04, Christian-Albrechts-University of Kiel, Department of Economics.
    2. Severance-Lossin, E. & Sperlich, S., 1995. "Estimation of Derivatives for Additive Separable Models," SFB 373 Discussion Papers 1995,60, Humboldt University of Berlin, Interdisciplinary Research Project 373: Quantification and Simulation of Economic Processes.
    3. Hjalmarsson, Erik, 2003. "Does the Black-Scholes formula work for electricity markets? A nonparametric approach," Working Papers in Economics 101, University of Gothenburg, Department of Economics.
    4. Ichimura, Hidehiko & Todd, Petra E., 2007. "Implementing Nonparametric and Semiparametric Estimators," Handbook of Econometrics, in: J.J. Heckman & E.E. Leamer (ed.), Handbook of Econometrics, edition 1, volume 6, chapter 74, Elsevier.
    5. Geng, Xin & Janssens, Wendy & Kramer, Berber, 2018. "Liquid milk: Cash Constraints and Recurring Savings among Dairy Farmers in Kenya," 2018 Annual Meeting, August 5-7, Washington, D.C. 273823, Agricultural and Applied Economics Association.
    6. BERTINELLI, Luisito & STROBL, Eric, 2003. "Urbanization, urban concentration and economic growth in developing countries," LIDAM Discussion Papers CORE 2003076, Université catholique de Louvain, Center for Operations Research and Econometrics (CORE).
    7. Bonsoo Koo & Oliver Linton, 2010. "Semiparametric Estimation of Locally Stationary Diffusion Models," STICERD - Econometrics Paper Series 551, Suntory and Toyota International Centres for Economics and Related Disciplines, LSE.
    8. Zhijie Xiao & Oliver Linton & Raymond J. Carroll & E. Mammen, 2002. "More Efficient Kernel Estimation in Nonparametric Regression with Autocorrelated Errors," Cowles Foundation Discussion Papers 1375, Cowles Foundation for Research in Economics, Yale University.
    9. Dabo-Niang, Sophie & Francq, Christian & Zakoïan, Jean-Michel, 2010. "Combining Nonparametric and Optimal Linear Time Series Predictions," Journal of the American Statistical Association, American Statistical Association, vol. 105(492), pages 1554-1565.
    10. Koop, Gary & Poirier, Dale J., 2004. "Bayesian variants of some classical semiparametric regression techniques," Journal of Econometrics, Elsevier, vol. 123(2), pages 259-282, December.
    11. Engel, Dirk, 2002. "The Impact of Venture Capital on Firm Growth: An Empirical Investigation," ZEW Discussion Papers 02-02, ZEW - Leibniz Centre for European Economic Research.
    12. Labandeira, Xavier & Labeaga, José M. & López-Otero, Xiral, 2017. "A meta-analysis on the price elasticity of energy demand," Energy Policy, Elsevier, vol. 102(C), pages 549-568.
    13. Emmanuel Saez, 2010. "Do Taxpayers Bunch at Kink Points?," American Economic Journal: Economic Policy, American Economic Association, vol. 2(3), pages 180-212, August.
    14. Linton, Oliver, 1995. "Second Order Approximation in the Partially Linear Regression Model," Econometrica, Econometric Society, vol. 63(5), pages 1079-1112, September.
    15. Oliver Linton & Pedro Gozalo, 1996. "Conditional Independence Restrictions: Testing and Estimation," Cowles Foundation Discussion Papers 1140, Cowles Foundation for Research in Economics, Yale University.
    16. Michael LaCour-Little & Michael Marschoun & Clark L. Maxam, 2002. "Improving Parametric Mortgage Prepayment Models with Non-parametric Kernel Regression," Journal of Real Estate Research, American Real Estate Society, vol. 24(3), pages 299-328.
    17. Bolancé, Catalina & Guillén, Montserrat & Pinquet, Jean, 2008. "On the link between credibility and frequency premium," Insurance: Mathematics and Economics, Elsevier, vol. 43(2), pages 209-213, October.
    18. McMillen, Daniel P., 2001. "Nonparametric Employment Subcenter Identification," Journal of Urban Economics, Elsevier, vol. 50(3), pages 448-473, November.
    19. Creemers, An & Aerts, Marc & Hens, Niel & Molenberghs, Geert, 2012. "A nonparametric approach to weighted estimating equations for regression analysis with missing covariates," Computational Statistics & Data Analysis, Elsevier, vol. 56(1), pages 100-113, January.
    20. Das, J.W.M. & Dominitz, J. & van Soest, A.H.O., 1997. "Comparing Predictions and Outcomes : Theory and Application to Income Changes," Discussion Paper 1997-45, Tilburg University, Center for Economic Research.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:jns:jbstat:v:225:y:2005:i:5:p:567-583. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Peter Golla (email available below). General contact details of provider: https://www.degruyter.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.