IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2304.03464.html
   My bibliography  Save this paper

Linking Representations with Multimodal Contrastive Learning

Author

Listed:
  • Abhishek Arora
  • Xinmei Yang
  • Shao-Yu Jheng
  • Melissa Dell

Abstract

Many applications require linking individuals, firms, or locations across datasets. Most widely used methods, especially in social science, do not employ deep learning, with record linkage commonly approached using string matching techniques. Moreover, existing methods do not exploit the inherently multimodal nature of documents. In historical record linkage applications, documents are typically noisily transcribed by optical character recognition (OCR). Linkage with just OCR'ed texts may fail due to noise, whereas linkage with just image crops may also fail because vision models lack language understanding (e.g., of abbreviations or other different ways of writing firm names). To leverage multimodal learning, this study develops CLIPPINGS (Contrastively LInking Pooled Pre-trained Embeddings). CLIPPINGS aligns symmetric vision and language bi-encoders, through contrastive language-image pre-training on document images and their corresponding OCR'ed texts. It then contrastively learns a metric space where the pooled image-text embedding for a given instance is close to embeddings in the same class (e.g., the same firm or location) and distant from embeddings of a different class. Data are linked by treating linkage as a nearest neighbor retrieval problem with the multimodal embeddings. CLIPPINGS outperforms widely used string matching methods by a wide margin in linking mid-20th century Japanese firms across financial documents. A purely self-supervised model - trained only by aligning the embeddings for the image crop of a firm name and its corresponding OCR'ed text - also outperforms popular string matching methods. Fascinatingly, a multimodally pre-trained vision-only encoder outperforms a unimodally pre-trained vision-only encoder, illustrating the power of multimodal pre-training even if only one modality is available for linking at inference time.

Suggested Citation

  • Abhishek Arora & Xinmei Yang & Shao-Yu Jheng & Melissa Dell, 2023. "Linking Representations with Multimodal Contrastive Learning," Papers 2304.03464, arXiv.org, revised Jun 2024.
  • Handle: RePEc:arx:papers:2304.03464
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2304.03464
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Lane, Nathaniel, 2016. "Manufacturing Revolutions: Industrial Policy and Industrialization in South Korea," SocArXiv 6tqax, Center for Open Science.
    2. Ran Abramitzky & Leah Boustan & Katherine Eriksson & James Feigenbaum & Santiago Pérez, 2021. "Automated Linking of Historical Data," Journal of Economic Literature, American Economic Association, vol. 59(3), pages 865-918, September.
    3. Ventura, Samuel L. & Nugent, Rebecca & Fuchs, Erica R.H., 2015. "Seeing the non-stars: (Some) sources of bias in past disambiguation approaches and a new public tool leveraging labeled records," Research Policy, Elsevier, vol. 44(9), pages 1672-1701.
    4. Martha J. Bailey & Connor Cole & Morgan Henderson & Catherine Massey, 2020. "How Well Do Automated Linking Methods Perform? Lessons from US Historical Data," Journal of Economic Literature, American Economic Association, vol. 58(4), pages 997-1044, December.
    5. Dominick Bartelme & Yuriy Gorodnichenko, 2015. "Linkages and Economic Development," NBER Working Papers 21251, National Bureau of Economic Research, Inc.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Melissa Dell, 2024. "Deep Learning for Economists," Papers 2407.15339, arXiv.org, revised Sep 2024.
    2. Xinmei Yang & Abhishek Arora & Shao-Yu Jheng & Melissa Dell, 2023. "Quantifying Character Similarity with Vision Transformers," Papers 2305.14672, arXiv.org.
    3. Emily Silcock & Melissa Dell, 2023. "A Massive Scale Semantic Similarity Dataset of Historical English," Papers 2306.17810, arXiv.org, revised Aug 2023.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Xinmei Yang & Abhishek Arora & Shao-Yu Jheng & Melissa Dell, 2023. "Quantifying Character Similarity with Vision Transformers," Papers 2305.14672, arXiv.org.
    2. Elisa Jácome & Ilyana Kuziemko & Suresh Naidu, 2021. "Mobility for All: Representative Intergenerational Mobility Estimates over the 20th Century," Working Papers 302, Princeton University, Department of Economics, Center for Economic Policy Studies..
    3. Collins, William J. & Zimran, Ariell, 2019. "The economic assimilation of Irish Famine migrants to the United States," Explorations in Economic History, Elsevier, vol. 74(C).
    4. Matthew Jaremski, 2020. "Today’s economic history and tomorrow’s scholars," Cliometrica, Springer;Cliometric Society (Association Francaise de Cliométrie), vol. 14(1), pages 169-180, January.
    5. Escamilla-Guerrero, David & Kosack, Edward & Ward, Zachary, 2021. "Life after crossing the border: Assimilation during the first Mexican mass migration," Explorations in Economic History, Elsevier, vol. 82(C).
    6. Hanlon, W.Walker & Heblich, Stephan, 2022. "History and urban economics," Regional Science and Urban Economics, Elsevier, vol. 94(C).
    7. Philipp Ager & Leah Boustan & Katherine Eriksson, 2021. "The Intergenerational Effects of a Large Wealth Shock: White Southerners after the Civil War," American Economic Review, American Economic Association, vol. 111(11), pages 3767-3794, November.
    8. Berger, Thor & Engzell, Per & Eriksson, Björn & Molinder, Jakob, 2023. "Social Mobility in Sweden before the Welfare State," The Journal of Economic History, Cambridge University Press, vol. 83(2), pages 431-463, June.
    9. Andreas Vortisch, 2023. "The impact of the Johnson–Reed Act on Filipino labor market outcomes," French Stata Users' Group Meetings 2023 12, Stata Users Group.
    10. Dahl, Christian M. & Johansen, Torben S.D. & Sørensen, Emil N. & Wittrock, Simon, 2023. "HANA: A handwritten name database for offline handwritten text recognition," Explorations in Economic History, Elsevier, vol. 87(C).
    11. Baker, Richard B. & Blanchette, John & Eriksson, Katherine, 2020. "Long-Run Impacts of Agricultural Shocks on Educational Attainment: Evidence from the Boll Weevil," The Journal of Economic History, Cambridge University Press, vol. 80(1), pages 136-174, March.
    12. Zachary Ward, 2023. "Intergenerational Mobility in American History: Accounting for Race and Measurement Error," American Economic Review, American Economic Association, vol. 113(12), pages 3213-3248, December.
    13. Obolensky, Marguerite & Tabellini, Marco & Taylor, Charles A., 2024. "Homeward Bound: How Migrants Seek Out Familiar Climates," IZA Discussion Papers 16710, Institute of Labor Economics (IZA).
    14. Sarah Tahamont & Zubin Jelveh & Aaron Chalfin & Shi Yan & Benjamin Hansen, 2019. "Administrative Data Linking and Statistical Power Problems in Randomized Experiments," NBER Working Papers 25657, National Bureau of Economic Research, Inc.
    15. Krzysztof Karbownik & Anthony Wray, 2019. "Educational, Labor-market and Intergenerational Consequences of Poor Childhood Health," NBER Working Papers 26368, National Bureau of Economic Research, Inc.
    16. Karapanagiotis, Pantelis & Liebald, Marius, 2023. "Entity matching with similarity encoding: A supervised learning recommendation framework for linking (big) data," SAFE Working Paper Series 398, Leibniz Institute for Financial Research SAFE.
    17. Inwood, Kris & Oxley, Les & Roberts, Evan, 2022. "The mortality risk of being overweight in the twentieth century: Evidence from two cohorts of New Zealand men," Explorations in Economic History, Elsevier, vol. 86(C).
    18. Wolfgang Keller & Carol H. Shiue, 2023. "Intergenerational Mobility of Daughters and Marital Sorting: New Evidence from Imperial China," NBER Working Papers 31695, National Bureau of Economic Research, Inc.
    19. Marguerite Obolensky & Marco Tabellini & Charles Taylor, 2024. "Homeward Bound: How Migrants Seek Out Familiar Climates," RF Berlin - CReAM Discussion Paper Series 2401, Rockwool Foundation Berlin (RF Berlin) - Centre for Research and Analysis of Migration (CReAM).
    20. Anna Aizer & Shari Eli & Adriana Lleras-Muney, 2020. "The Incentive Effects of Cash Transfers to the Poor," NBER Working Papers 27523, National Bureau of Economic Research, Inc.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2304.03464. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.