IDEAS home Printed from https://ideas.repec.org/a/eee/quaeco/v67y2018icp285-296.html
   My bibliography  Save this article

Big Data versus a survey

Author

Listed:
  • Whitaker, Stephan D.

Abstract

Economists are shifting resources from work on survey data to work involving “Big Data.” This analysis is an empirical exploration of the trade-offs this substitution requires. Parallel models are estimated using Equifax credit bureau data and Survey of Consumer Finances data. After adjustments to account for different variable definitions and sampled populations, it is possible to arrive at similar models of total household debt. However, the estimates are sensitive to the adjustments. In this example, some external education and income measures are successfully integrated with the big data, but other external aggregates fail to adequately substitute for survey responses.

Suggested Citation

  • Whitaker, Stephan D., 2018. "Big Data versus a survey," The Quarterly Review of Economics and Finance, Elsevier, vol. 67(C), pages 285-296.
  • Handle: RePEc:eee:quaeco:v:67:y:2018:i:c:p:285-296
    DOI: 10.1016/j.qref.2017.07.011
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S1062976917300170
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.qref.2017.07.011?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to look for a different version below or search for a different version of it.

    Other versions of this item:

    References listed on IDEAS

    as
    1. Céline CARRERE & Jaime MELO DE, 2009. "Non-Tariff Measures: What do we Know, What Should be Done?," Working Papers 200933, CERDI.
    2. Liran Einav & Jonathan Levin, 2014. "The Data Revolution and Economic Analysis," Innovation Policy and the Economy, University of Chicago Press, vol. 14(1), pages 1-24.
    3. Sonka, Steve, 2014. "Big Data and the Ag Sector: More than Lots of Numbers," International Food and Agribusiness Management Review, International Food and Agribusiness Management Association, vol. 17(1), pages 1-20, February.
    4. Ansolabehere, Stephen & Hersh, Eitan, 2012. "Validation: What Big Data Reveal About Survey Misreporting and the Real Electorate," Political Analysis, Cambridge University Press, vol. 20(4), pages 437-459.
    5. John M. Abowd & Martha H. Stinson, 2013. "Estimating Measurement Error in Annual Job Earnings: A Comparison of Survey and Administrative Data," The Review of Economics and Statistics, MIT Press, vol. 95(5), pages 1451-1467, December.
    6. Milton Friedman, 1957. "Introduction to "A Theory of the Consumption Function"," NBER Chapters, in: A Theory of the Consumption Function, pages 1-6, National Bureau of Economic Research, Inc.
    7. Philippe Liégeois & Frédéric Berger & Nizamul Islam & Raymond Wagener, 2011. "Cross-validating administrative and survey datasets through microsimulation," International Journal of Microsimulation, International Microsimulation Association, vol. 4(1), pages 54-71.
    8. Sarah Brown & Gaia Garino & Karl Taylor, 2013. "Household Debt And Attitudes Toward Risk," Review of Income and Wealth, International Association for Research in Income and Wealth, vol. 59(2), pages 283-304, June.
    9. Wickens, Michael R, 1972. "A Note on the Use of Proxy Variables," Econometrica, Econometric Society, vol. 40(4), pages 759-761, July.
    10. John L. Czajka, "undated". "Can Administrative Records Be Used to Reduce Nonresponse Bias?," Mathematica Policy Research Reports 5a88b9fed835433f943c08646, Mathematica Policy Research.
    11. Peter Lynn & Annette Jäckle & Stephen P. Jenkins & Emanuela Sala, 2012. "The impact of questioning method on measurement error in panel survey measures of benefit receipt: evidence from a validation study," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 175(1), pages 289-308, January.
    12. Thomas D. Cook, 2014. "“Big Data” In Research On Social Policy," Journal of Policy Analysis and Management, John Wiley & Sons, Ltd., vol. 33(2), pages 544-547, March.
    13. Kevin Pugh & Gigi Foster, 2014. "Australia's National School Data and the ‘Big Data’ Revolution in Education Economics," Australian Economic Review, The University of Melbourne, Melbourne Institute of Applied Economic and Social Research, vol. 47(2), pages 258-268, June.
    14. McCallum, B T, 1972. "Relative Asymptotic Bias from Errors of Omission and Measurement," Econometrica, Econometric Society, vol. 40(4), pages 757-758, July.
    15. Chungui Qiao, 2005. "Combining Administrative and Survey Data to Derive Small‐area Estimates Using Loglinear Modelling," LABOUR, CEIS, vol. 19(4), pages 767-800, December.
    16. Merxe Tudela & Garry Young, 2005. "The determinants of household debt and balance sheets in the United Kingdom," Bank of England working papers 266, Bank of England.
    17. Milton Friedman, 1957. "A Theory of the Consumption Function," NBER Books, National Bureau of Economic Research, Inc, number frie57-1.
    18. repec:mpr:mprres:7586 is not listed on IDEAS
    19. Arie Kapteyn & Jelmer Y. Ypma, 2007. "Measurement Error and Misclassification: A Comparison of Survey and Administrative Data," Journal of Labor Economics, University of Chicago Press, vol. 25(3), pages 513-551.
    20. Meta Brown & Andrew F. Haughwout & Donghoon Lee & Wilbert Van der Klaauw, 2011. "Do we know what we owe? A comparison of borrower- and lender-reported consumer debt," Staff Reports 523, Federal Reserve Bank of New York.
    21. Hazen, Benjamin T. & Boone, Christopher A. & Ezell, Jeremy D. & Jones-Farmer, L. Allison, 2014. "Data quality for data science, predictive analytics, and big data in supply chain management: An introduction to the problem and suggestions for research and applications," International Journal of Production Economics, Elsevier, vol. 154(C), pages 72-80.
    22. Hal R. Varian, 2014. "Big Data: New Tricks for Econometrics," Journal of Economic Perspectives, American Economic Association, vol. 28(2), pages 3-28, Spring.
    23. David W. Nickerson & Todd Rogers, 2014. "Political Campaigns and Big Data," Journal of Economic Perspectives, American Economic Association, vol. 28(2), pages 51-74, Spring.
    24. Diane K Schooley & Debra Drecnik Worden, 2010. "Fueling the Credit Crisis: Who Uses Consumer Credit and What Drives Debt Burden?," Business Economics, Palgrave Macmillan;National Association for Business Economics, vol. 45(4), pages 266-276, October.
    25. Donghoon Lee & Wilbert Van der Klaauw, 2010. "An introduction to the FRBNY Consumer Credit Panel," Staff Reports 479, Federal Reserve Bank of New York.
    26. Molly Dahl & Thomas DeLeire & Jonathan A. Schwabish, 2011. "Estimates of Year-to-Year Volatility in Earnings and in Household Incomes from Administrative, Survey, and Matched Data," Journal of Human Resources, University of Wisconsin Press, vol. 46(4), pages 750-774.
    27. Rebecca N. Warburton & William P. Warburton, 2004. "Canada Needs Better Data for Evidence-Based Policy: Inconsistencies Between Administrative and Survey Data on Welfare Dependence and Education," Canadian Public Policy, University of Toronto Press, vol. 30(3), pages 241-256, September.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Catalina Anampa Castro & Katherine Curtis & Jack DeWaard & Elizabeth Fussell & Kathryn McConnell & Kobie Price & Michael Soto & Stephan D. Whitaker, 2021. "Migration as a Vector of Economic Losses from Disaster-Affected Areas in the United States," Working Papers 21-22, Federal Reserve Bank of Cleveland.
    2. Jack DeWaard & Janna Johnson & Stephan Whitaker, 2019. "Internal migration in the United States: A comprehensive comparative assessment of the Consumer Credit Panel," Demographic Research, Max Planck Institute for Demographic Research, Rostock, Germany, vol. 41(33), pages 953-1006.
    3. Kathryn McConnell & Elizabeth Fussell & Jack DeWaard & Stephan Whitaker & Katherine J. Curtis & Lise Denis & Jennifer Balch & Kobie Price, 2024. "Rare and highly destructive wildfires drive human migration in the U.S," Nature Communications, Nature, vol. 15(1), pages 1-12, December.
    4. Madeleine I. G. Daepp, 2022. "Small-area moving ratios and the spatial connectivity of neighborhoods: Insights from consumer credit data," Environment and Planning B, , vol. 49(3), pages 1129-1146, March.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Hermansson, Cecilia, 2016. "Relationships between bank customers’ risk attitudes and their balance sheets," Working Paper Series 15/12, Royal Institute of Technology, Department of Real Estate and Construction Management & Banking and Finance.
    2. Du Caju, Philip & Rycx, François & Tojerow, Ilan, 2016. "Unemployment risk and over-indebtedness," Working Paper Series 1908, European Central Bank.
    3. Cappellari, Lorenzo & Jenkins, Stephen P., 2014. "Earnings and labour market volatility in Britain, with a transatlantic comparison," Labour Economics, Elsevier, vol. 30(C), pages 201-211.
    4. Dettling, Lisa J. & Hsu, Joanne W., 2018. "Returning to the nest: Debt and parental co-residence among young adults," Labour Economics, Elsevier, vol. 54(C), pages 225-236.
    5. Robert Moffitt & Sisi Zhang, 2018. "Income Volatility and the PSID: Past Research and New Results," AEA Papers and Proceedings, American Economic Association, vol. 108, pages 277-280, May.
    6. Bruce D. Meyer & Nikolas Mittag, 2015. "Using Linked Survey and Administrative Data to Better Measure Income: Implications for Poverty, Program Effectiveness and Holes in the Safety Net," NBER Working Papers 21676, National Bureau of Economic Research, Inc.
    7. Bogdan Andrei Dumitrescu & Adrian Enciu & Cătălina Adriana Hândoreanu & Carmen Obreja & Florin Blaga, 2022. "Macroeconomic Determinants of Household Debt in OECD Countries," Sustainability, MDPI, vol. 14(7), pages 1-14, March.
    8. Philip Du Caju & François Rycx & Ilan Tojerow, 2015. "Unemployment Risk and Over-indebtedness A Micro-econometric Perspective," Working Papers CEB 15-046, ULB -- Universite Libre de Bruxelles.
    9. Herrala, Risto & Kauko, Karlo, 2007. "Household loan loss risk in Finland: estimations and simulations with micro data," Bank of Finland Research Discussion Papers 5/2007, Bank of Finland.
    10. Robert Moffitt & John Abowd & Christopher Bollinger & Michael Carr & Charles Hokayem & Kevin McKinney & Emily Wiemers & Sisi Zhang & James Ziliak, 2022. "Reconciling Trends in U.S. Male Earnings Volatility: Results from Survey and Administrative Data," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 41(1), pages 1-11, December.
    11. Meyer, Bruce D. & Mittag, Nikolas, 2017. "Using Linked Survey and Administrative Data to Better Measure Income: Implications for Poverty, Program Effectiveness and Holes in the Safety Net," IZA Discussion Papers 10943, Institute of Labor Economics (IZA).
    12. Eva Branten, 2022. "The role of risk attitudes and expectations in household borrowing: evidence from Estonia," Baltic Journal of Economics, Baltic International Centre for Economic Policy Studies, vol. 22(2), pages 126-145.
    13. repec:zbw:bofrdp:2007_005 is not listed on IDEAS
    14. Mary Eschelbach Hansen & Julie Routzahn, 2014. "Gender Differences in Attitudes Toward Debt and Financial Position: The Impact of the Great Recession," Working Papers 2014-10, American University, Department of Economics.
    15. Chichaibelu, Bezawit Beyene & Waibel, Hermann, 2018. "Over-indebtedness and its persistence in rural households in Thailand and Vietnam," Journal of Asian Economics, Elsevier, vol. 56(C), pages 1-23.
    16. Bruce Meyer & Nikolas Mittag, 2017. "Using Linked Survey and Administrative Data to Better Measure Income: Implications for Poverty, Program Effectiveness and Holes in the Safety Net," Working Papers 2017-075, Human Capital and Economic Opportunity Working Group.
    17. Herrala, Risto & Kauko, Karlo, 2007. "Household loan loss risk in Finland: estimations and simulations with micro data," Bank of Finland Research Discussion Papers 5/2007, Bank of Finland.
    18. Robert A. Moffitt & Peter Gottschalk, 2012. "Trends in the Transitory Variance of Male Earnings: Methods and Evidence," Journal of Human Resources, University of Wisconsin Press, vol. 47(1), pages 204-236.
    19. Cappellari, Lorenzo & Jenkins, Stephen P., 2014. "Earnings and labour market volatility in Britain, with a transatlantic comparison," Labour Economics, Elsevier, vol. 30(C), pages 201-211.
    20. Ntebogang Dinah Moroke, 2014. "Household Debts-and Macroeconomic factors Nexus in the United States: A Cointegration and Vector Error Correction Approach," Journal of Economics and Behavioral Studies, AMH International, vol. 6(6), pages 452-465.
    21. Chul‐Woo Kwon & Peter F. Orazem & Daniel M. Otto, 2006. "Off‐farm labor supply responses to permanent and transitory farm income," Agricultural Economics, International Association of Agricultural Economists, vol. 34(1), pages 59-67, January.

    More about this item

    Keywords

    Big Data; Survey data; Household debt;
    All these keywords.

    JEL classification:

    • C55 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Large Data Sets: Modeling and Analysis
    • C81 - Mathematical and Quantitative Methods - - Data Collection and Data Estimation Methodology; Computer Programs - - - Methodology for Collecting, Estimating, and Organizing Microeconomic Data; Data Access
    • D12 - Microeconomics - - Household Behavior - - - Consumer Economics: Empirical Analysis

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:quaeco:v:67:y:2018:i:c:p:285-296. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/inca/620167 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.