IDEAS home Printed from https://ideas.repec.org/a/tsj/stataj/v15y2015i3p672-697.html
   My bibliography  Save this article

Record linkage using Stata: Preprocessing, linking, and reviewing utilities

Author

Listed:
  • Nada Wasi

    (Institute for Social Research, University of Michigan)

  • Aaron Flaaen

    (Division of Research and Statistics, Federal Reserve Board of Governors)

Abstract

In this article, we describe Stata utilities that facilitate probabilistic record linkage—the technique typically used for merging two datasets with no common record identifier. While the preprocessing tools are developed specifically for linking two company databases, the other tools can be used for many different types of linkage. Specifically, the stnd compname and stnd address commands parse and standardize company names and addresses to improve the match quality when linking. The reclink2 command is a generalized version of Blasnik’s reclink (2010, Statistical Software Components S456876, Department of Economics, Boston College) that allows for many-to-one matching. Finally, clrevmatch is an interactive tool that allows the user to review matched results in an efficient and seamless manner. Rather than exporting results to another file format (for example, Excel), inputting clerical reviews, and importing back into Stata, one can use the clrevmatch tool to conduct all of these steps within Stata. This helps improve the speed and flexibility of matching, which often involves multiple runs. Copyright 2015 by StataCorp LP.

Suggested Citation

  • Nada Wasi & Aaron Flaaen, 2015. "Record linkage using Stata: Preprocessing, linking, and reviewing utilities," Stata Journal, StataCorp LP, vol. 15(3), pages 672-697, September.
  • Handle: RePEc:tsj:stataj:v:15:y:2015:i:3:p:672-697
    Note: to access software from within Stata, net describe http://www.stata-journal.com/software/sj15-3/dm0082/
    as

    Download full text from publisher

    File URL: http://www.stata-journal.com/article.html?article=dm0082
    File Function: link to article purchase
    Download Restriction: no
    ---><---

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:tsj:stataj:v:15:y:2015:i:3:p:672-697. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Christopher F. Baum or Lisa Gilmore (email available below). General contact details of provider: http://www.stata-journal.com/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.