IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2111.02528.html
   My bibliography  Save this paper

occ2vec: A principal approach to representing occupations using natural language processing

Author

Listed:
  • Nicolaj S{o}ndergaard Muhlbach

Abstract

We propose \textbf{occ2vec}, a principal approach to representing occupations, which can be used in matching, predictive and causal modeling, and other economic areas. In particular, we use it to score occupations on any definable characteristic of interest, say the degree of \textquote{greenness}. Using more than 17,000 occupation-specific text descriptors, we transform each occupation into a high-dimensional vector using natural language processing. Similar, we assign a vector to the target characteristic and estimate the occupational degree of this characteristic as the cosine similarity between the vectors. The main advantages of this approach are its universal applicability and verifiability contrary to existing ad-hoc approaches. We extensively validate our approach on several exercises and then use it to estimate the occupational degree of charisma and emotional intelligence (EQ). We find that occupations that score high on these tend to have higher educational requirements. Turning to wages, highly charismatic occupations are either found in the lower or upper tail in the wage distribution. This is not found for EQ, where higher levels of EQ are generally correlated with higher wages.

Suggested Citation

  • Nicolaj S{o}ndergaard Muhlbach, 2021. "occ2vec: A principal approach to representing occupations using natural language processing," Papers 2111.02528, arXiv.org, revised Jul 2022.
  • Handle: RePEc:arx:papers:2111.02528
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2111.02528
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Edward W. Felten & Manav Raj & Robert Seamans, 2018. "A Method to Link Advances in Artificial Intelligence to Occupational Abilities," AEA Papers and Proceedings, American Economic Association, vol. 108, pages 54-57, May.
    2. Simon Mongey & Laura Pilossoph & Alex Weinberg, 2020. "Which Workers Bear the Burden of Social Distancing Policies?," Working Papers 2020-51, Becker Friedman Institute for Research In Economics.
    3. David H. Autor & David Dorn, 2013. "The Growth of Low-Skill Service Jobs and the Polarization of the US Labor Market," American Economic Review, American Economic Association, vol. 103(5), pages 1553-1597, August.
    4. David H. Autor & Frank Levy & Richard J. Murnane, 2003. "The skill content of recent technological change: an empirical exploration," Proceedings, Federal Reserve Bank of San Francisco, issue Nov.
    5. Dingel, Jonathan I. & Neiman, Brent, 2020. "How many jobs can be done at home?," Journal of Public Economics, Elsevier, vol. 189(C).
    6. David J. Deming, 2017. "The Growing Importance of Social Skills in the Labor Market," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 132(4), pages 1593-1640.
    7. David H. Autor & Michael J. Handel, 2013. "Putting Tasks to the Test: Human Capital, Job Tasks, and Wages," Journal of Labor Economics, University of Chicago Press, vol. 31(S1), pages 59-96.
    8. Claudia Goldin, 2014. "A Grand Gender Convergence: Its Last Chapter," American Economic Review, American Economic Association, vol. 104(4), pages 1091-1119, April.
    9. Erik Brynjolfsson & Tom Mitchell & Daniel Rock, 2018. "What Can Machines Learn, and What Does It Mean for Occupations and the Economy?," AEA Papers and Proceedings, American Economic Association, vol. 108, pages 43-47, May.
    10. Frey, Carl Benedikt & Osborne, Michael A., 2017. "The future of employment: How susceptible are jobs to computerisation?," Technological Forecasting and Social Change, Elsevier, vol. 114(C), pages 254-280.
    11. Simon Mongey & Laura Pilossoph & Alexander Weinberg, 2021. "Which workers bear the burden of social distancing?," The Journal of Economic Inequality, Springer;Society for the Study of Economic Inequality, vol. 19(3), pages 509-526, September.
    12. Daron Acemoglu & David Autor & Jonathon Hazell & Pascual Restrepo, 2020. "AI and Jobs: Evidence from Online Vacancies," NBER Working Papers 28257, National Bureau of Economic Research, Inc.
    13. Acemoglu, Daron & Autor, David, 2011. "Skills, Tasks and Technologies: Implications for Employment and Earnings," Handbook of Labor Economics, in: O. Ashenfelter & D. Card (ed.), Handbook of Labor Economics, edition 1, volume 4, chapter 12, pages 1043-1171, Elsevier.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Songul Tolan & Annarosa Pesole & Fernando Martinez-Plumed & Enrique Fernandez-Macias & José Hernandez-Orallo & Emilia Gomez, 2020. "Measuring the Occupational Impact of AI: Tasks, Cognitive Abilities and AI Benchmarks," JRC Working Papers on Labour, Education and Technology 2020-02, Joint Research Centre.
    2. Carbonero, Francesco & Scicchitano, Sergio, 2021. "Labour and technology at the time of Covid-19. Can artificial intelligence mitigate the need for proximity?," GLO Discussion Paper Series 765, Global Labor Organization (GLO).
    3. Blanas, Sotiris & Oikonomou, Rigas, 2023. "COVID-induced economic uncertainty, tasks and occupational demand," Labour Economics, Elsevier, vol. 81(C).
    4. Alex Chernoff & Casey Warman, 2023. "COVID-19 and implications for automation," Applied Economics, Taylor & Francis Journals, vol. 55(17), pages 1939-1957, April.
    5. Lucas van der Velde, 2020. "Within Occupation Wage Dispersion and the Task Content of Jobs," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 82(5), pages 1161-1197, October.
    6. Hensvik, Lena & Skans, Oskar Nordström, 2023. "The skill-specific impact of past and projected occupational decline," Labour Economics, Elsevier, vol. 81(C).
    7. Consoli, Davide & Marin, Giovanni & Rentocchini, Francesco & Vona, Francesco, 2023. "Routinization, within-occupation task changes and long-run employment dynamics," Research Policy, Elsevier, vol. 52(1).
    8. Fossen, Frank M. & Sorgner, Alina, 2019. "New Digital Technologies and Heterogeneous Employment and Wage Dynamics in the United States: Evidence from Individual-Level Data," IZA Discussion Papers 12242, Institute of Labor Economics (IZA).
    9. Albanesi, Stefania & Dias da Silva, Antonio & Jimeno, Juan Francisco & Lamo, Ana & Wabitsch, Alena, 2023. "New Technologies and Jobs in Europe," CEPR Discussion Papers 18220, C.E.P.R. Discussion Papers.
    10. Rita Pető & Balázs Reizer, 2021. "Gender differences in the skill content of jobs," Journal of Population Economics, Springer;European Society for Population Economics, vol. 34(3), pages 825-864, July.
    11. Fossen, Frank M. & Sorgner, Alina, 2022. "New digital technologies and heterogeneous wage and employment dynamics in the United States: Evidence from individual-level data," Technological Forecasting and Social Change, Elsevier, vol. 175(C).
    12. Genz, Sabrina & Schnabel, Claus, 2021. "Digging into the Digital Divide: Workers' Exposure to Digitalization and Its Consequences for Individual Employment," IZA Discussion Papers 14649, Institute of Labor Economics (IZA).
    13. Parteka, Aleksandra & Wolszczak-Derlacz, Joanna & Nikulin, Dagmara, 2024. "How digital technology affects working conditions in globally fragmented production chains: Evidence from Europe," Technological Forecasting and Social Change, Elsevier, vol. 198(C).
    14. Cristian Bonavida & Irene Brambilla & Leonardo Gasparini, 2021. "Automatización y Pandemia: Amenazas sobre el Empleo en América Latina," CEDLAS, Working Papers 0288, CEDLAS, Universidad Nacional de La Plata.
    15. Edward Felten & Manav Raj & Robert Seamans, 2021. "Occupational, industry, and geographic exposure to artificial intelligence: A novel dataset and its potential uses," Strategic Management Journal, Wiley Blackwell, vol. 42(12), pages 2195-2217, December.
    16. Sorgner, Alina & Bode, Eckhardt & Krieger-Boden, Christiane & Aneja, Urvashi & Coleman, Susan & Mishra, Vidisha & Robb, Alicia M., 2017. "The effects of digitalization on gender equaliy in the G20 economies: Women20 study," Kiel E-Books, Kiel Institute for the World Economy (IfW Kiel), number 170571.
    17. Jean-Philippe Deranty & Thomas Corbin, 2022. "Artificial Intelligence and work: a critical review of recent research from the social sciences," Papers 2204.00419, arXiv.org.
    18. Vahagn Jerbashian, 2019. "Automation and Job Polarization: On the Decline of Middling Occupations in Europe," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 81(5), pages 1095-1116, October.
    19. David J. Deming, 2017. "The Growing Importance of Social Skills in the Labor Market," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 132(4), pages 1593-1640.
    20. Armanda Cetrulo & Dario Guarascio & Maria Enrica Virgillito, 2020. "Anatomy of the Italian occupational structure: concentrated power and distributed knowledge," Industrial and Corporate Change, Oxford University Press and the Associazione ICC, vol. 29(6), pages 1345-1379.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2111.02528. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.