IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2202.08370.html
   My bibliography  Save this paper

CAREER: A Foundation Model for Labor Sequence Data

Author

Listed:
  • Keyon Vafa
  • Emil Palikot
  • Tianyu Du
  • Ayush Kanodia
  • Susan Athey
  • David M. Blei

Abstract

Labor economists regularly analyze employment data by fitting predictive models to small, carefully constructed longitudinal survey datasets. Although machine learning methods offer promise for such problems, these survey datasets are too small to take advantage of them. In recent years large datasets of online resumes have also become available, providing data about the career trajectories of millions of individuals. However, standard econometric models cannot take advantage of their scale or incorporate them into the analysis of survey data. To this end we develop CAREER, a foundation model for job sequences. CAREER is first fit to large, passively-collected resume data and then fine-tuned to smaller, better-curated datasets for economic inferences. We fit CAREER to a dataset of 24 million job sequences from resumes, and adjust it on small longitudinal survey datasets. We find that CAREER forms accurate predictions of job sequences, outperforming econometric baselines on three widely-used economics datasets. We further find that CAREER can be used to form good predictions of other downstream variables. For example, incorporating CAREER into a wage model provides better predictions than the econometric models currently in use.

Suggested Citation

  • Keyon Vafa & Emil Palikot & Tianyu Du & Ayush Kanodia & Susan Athey & David M. Blei, 2022. "CAREER: A Foundation Model for Labor Sequence Data," Papers 2202.08370, arXiv.org, revised Feb 2024.
  • Handle: RePEc:arx:papers:2202.08370
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2202.08370
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. McCall, Brian P, 1990. "Occupational Matching: A Test of Sorts," Journal of Political Economy, University of Chicago Press, vol. 98(1), pages 45-69, February.
    2. Robert W. Fairlie & William A. Sundstrom, 1999. "The Emergence, Persistence, and Recent Widening of the Racial Unemployment Gap," ILR Review, Cornell University, ILR School, vol. 52(2), pages 252-270, January.
    3. Fatih Guvenen & Burhan Kuruscu & Satoshi Tanaka & David Wiczer, 2020. "Multidimensional Skill Mismatch," American Economic Journal: Macroeconomics, American Economic Association, vol. 12(1), pages 210-244, January.
    4. David H. Autor & David Dorn, 2013. "The Growth of Low-Skill Service Jobs and the Polarization of the US Labor Market," American Economic Review, American Economic Association, vol. 103(5), pages 1553-1597, August.
    5. Bernd Fitzenberger & Aderonke Osikominu & Robert Völter, 2008. "Get Training or Wait? Long-Run Employment Effects of Training Programs for the Unemployed in West Germany," Annals of Economics and Statistics, GENES, issue 91-92, pages 321-355.
    6. Francine D. Blau & Lawrence M. Kahn, 2017. "The Gender Wage Gap: Extent, Trends, and Explanations," Journal of Economic Literature, American Economic Association, vol. 55(3), pages 789-865, September.
    7. Martin Henning & Rikard H Eriksson, 2021. "Labour market polarisation as a localised process: evidence from Sweden," Cambridge Journal of Regions, Economy and Society, Cambridge Political Economy Society, vol. 14(1), pages 69-91.
    8. Fane Groes & Philipp Kircher & Iourii Manovskii, 2015. "The U-Shapes of Occupational Mobility," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 82(2), pages 659-692.
    9. Carolyn Heinrich & Peter Mueser & Kenneth Troske & Kyung-Seong Jeon & Daver Kahvecioglu, 2013. "Do Public Employment and Training Programs Work?," IZA Journal of Labor Economics, Springer;Forschungsinstitut zur Zukunft der Arbeit GmbH (IZA), vol. 2(1), pages 1-23, December.
    10. Poterba, James M & Summers, Lawrence H, 1986. "Reporting Errors and Labor Market Dynamics," Econometrica, Econometric Society, vol. 54(6), pages 1319-1338, November.
    11. Schmidt, Peter & Strauss, Robert P, 1975. "The Prediction of Occupation Using Multiple Logit Models," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 16(2), pages 471-486, June.
    12. Robert E. Hall, 1972. "Turnover in the Labor Force," Brookings Papers on Economic Activity, Economic Studies Program, The Brookings Institution, vol. 3(3), pages 709-764.
    13. Randall S. Brown & Marilyn Moon & Barbara S. Zoloth, 1980. "Incorporating Occupational Attainment in Studies of Male-Female Earnings Differentials," Journal of Human Resources, University of Wisconsin Press, vol. 15(1), pages 3-28.
    14. Gueorgui Kambourov & Iourii Manovskii, 2008. "Rising Occupational And Industry Mobility In The United States: 1968-97," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 49(1), pages 41-79, February.
    15. Ruhose, Jens & Thomsen, Stephan L. & Weilage, Insa, 2019. "The benefits of adult learning: Work-related training, social capital, and earnings," Economics of Education Review, Elsevier, vol. 72(C), pages 166-186.
    16. Boskin, Michael J, 1974. "A Conditional Logit Model of Occupational Choice," Journal of Political Economy, University of Chicago Press, vol. 82(2), pages 389-398, Part I, M.
    17. Neal, Derek, 1999. "The Complexity of Job Mobility among Young Men," Journal of Labor Economics, University of Chicago Press, vol. 17(2), pages 237-261, April.
    18. Keane, Michael P & Wolpin, Kenneth I, 1997. "The Career Decisions of Young Men," Journal of Political Economy, University of Chicago Press, vol. 105(3), pages 473-522, June.
    19. Fredrik Andersson & Harry J. Holzer & Julia I. Lane & David Rosenblum & Jeffrey Smith, 2024. "Does Federally Funded Job Training Work? Nonexperimental Estimates of WIA Training Impacts Using Longitudinal Data on Workers and Firms," Journal of Human Resources, University of Wisconsin Press, vol. 59(4), pages 1244-1283.
    20. Guido Matias Cortes, 2016. "Where Have the Middle-Wage Workers Gone? A Study of Polarization Using Panel Data," Journal of Labor Economics, University of Chicago Press, vol. 34(1), pages 63-105.
    21. David Hummels & Rasmus J?rgensen & Jakob Munch & Chong Xiang, 2014. "The Wage Effects of Offshoring: Evidence from Danish Matched Worker-Firm Data," American Economic Review, American Economic Association, vol. 104(6), pages 1597-1629, June.
    22. repec:adr:anecst:y:2008:i:91-92:p:15 is not listed on IDEAS
    23. Jana Stefanová Lauerová & Katherine Terrell, 2007. "What Drives Gender Differences in Unemployment?," Comparative Economic Studies, Palgrave Macmillan;Association for Comparative Economic Studies, vol. 49(1), pages 128-155, March.
    24. Sharon Traiberman, 2019. "Occupations and Import Competition: Evidence from Denmark," American Economic Review, American Economic Association, vol. 109(12), pages 4260-4301, December.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Tianyu Du & Ayush Kanodia & Herman Brunborg & Keyon Vafa & Susan Athey, 2024. "LABOR-LLM: Language-Based Occupational Representations with Large Language Models," Papers 2406.17972, arXiv.org.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Christian vom Lehn & Cache Ellsworth & Zachary Kroff, 2022. "Reconciling Occupational Mobility in the Current Population Survey," Journal of Labor Economics, University of Chicago Press, vol. 40(4), pages 1005-1051.
    2. Nicolas A. Roys & Christopher R. Taber, 2019. "Skill Prices, Occupations, and Changes in the Wage Structure for Low Skilled Men," NBER Working Papers 26453, National Bureau of Economic Research, Inc.
    3. Keller, Wolfgang & Utar, Hale, 2023. "International trade and job polarization: Evidence at the worker level," Journal of International Economics, Elsevier, vol. 145(C).
    4. Guido Matias Cortes & Giovanni Gallipoli, 2018. "The Costs of Occupational Mobility: An Aggregate Analysis," Journal of the European Economic Association, European Economic Association, vol. 16(2), pages 275-315.
    5. Guido Matias Cortes, 2016. "Where Have the Middle-Wage Workers Gone? A Study of Polarization Using Panel Data," Journal of Labor Economics, University of Chicago Press, vol. 34(1), pages 63-105.
    6. Ronald Bachmann & Peggy Bechara & Christina Vonnahme, 2020. "Occupational Mobility in Europe: Extent, Determinants and Consequences," De Economist, Springer, vol. 168(1), pages 79-108, March.
    7. Carl Sanders, 2012. "Skill Uncertainty, Skill Accumulation, and Occupational Choice," 2012 Meeting Papers 633, Society for Economic Dynamics.
    8. Fitzenberger, Bernd & Licklederer, Stefanie & Zwiener, Hanna, 2015. "Mobility across firms and occupations among graduates from apprenticeship," Labour Economics, Elsevier, vol. 34(C), pages 138-151.
    9. Stijepic Damir, 2020. "Job Mobility and Sorting: Theory and Evidence," Journal of Economics and Statistics (Jahrbuecher fuer Nationaloekonomie und Statistik), De Gruyter, vol. 240(1), pages 19-49, February.
    10. Genz, Sabrina & Schnabel, Claus, 2021. "Digging into the digital divide: Workers' exposure to digitalization and its consequences for individual employment," Discussion Papers 118, Friedrich-Alexander University Erlangen-Nuremberg, Chair of Labour and Regional Economics.
    11. Guido Matias Cortes & Giovanni Gallipoli, 2014. "The Costs of Occupational Mobility: An Aggregate Analysis," Working Papers 2014-015, Human Capital and Economic Opportunity Working Group.
    12. Wang, Xiupeng, 2020. "Labor market polarization in Britain and Germany: A cross-national comparison using longitudinal household data," Labour Economics, Elsevier, vol. 65(C).
    13. Stijepic Damir, 2020. "Job Mobility and Sorting: Theory and Evidence," Journal of Economics and Statistics (Jahrbuecher fuer Nationaloekonomie und Statistik), De Gruyter, vol. 240(1), pages 19-49, February.
    14. Papageorgiou, Theodore, 2018. "Large firms and within firm occupational reallocation," Journal of Economic Theory, Elsevier, vol. 174(C), pages 184-223.
    15. Baird, Matthew D. & Engberg, John & Gutierrez, Italo A., 2022. "RCT evidence on differential impact of US job training programmes by pre-training employment status," Labour Economics, Elsevier, vol. 75(C).
    16. Demiralp, Berna, 2011. "Occupational self-selection in a labor market with moral hazard," European Economic Review, Elsevier, vol. 55(4), pages 497-519, May.
    17. Pedros Silos & Eric Smith, 2015. "Human Capital Portfolios," Review of Economic Dynamics, Elsevier for the Society for Economic Dynamics, vol. 18(3), pages 635-652, July.
    18. Isaac Baley & Ana Figueiredo & Robert Ulbricht, 2022. "Mismatch Cycles," Journal of Political Economy, University of Chicago Press, vol. 130(11), pages 2943-2984.
    19. Maximiliano Dvorkin, 2021. "International trade and labor reallocation: misclassification errors, mobility, and switching costs," Working Papers 2021-014, Federal Reserve Bank of St. Louis, revised Jun 2024.
    20. Theodore Papageorgiou, 2022. "Occupational Matching and Cities," American Economic Journal: Macroeconomics, American Economic Association, vol. 14(3), pages 82-132, July.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2202.08370. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.