Printed from https://ideas.repec.org/a/cup/polals/v25y2017i02p223-240_00.html

Two Wrongs Make a Right: Addressing Underreporting in Binary Data from Multiple Sources

Author

Listed:
  • Cook, Scott J.
  • Blas, Betsabe
  • Carroll, Raymond J.
  • Sinha, Samiran

Abstract

Media-based event data, i.e., data compiled from reporting by media outlets, are widely used in political science research. However, events of interest (e.g., strikes, protests, conflict) are often underreported by these primary and secondary sources, producing incomplete data that risk inconsistency and bias in subsequent analysis. While general strategies exist to help ameliorate this bias, these methods do not make full use of the information often available to researchers. Specifically, much of the event data used in the social sciences is drawn from multiple, overlapping news sources (e.g., Agence France-Presse, Reuters). Therefore, we propose a novel maximum likelihood estimator that corrects for misclassification in data arising from multiple sources. In the most general formulation of our estimator, researchers can specify separate sets of predictors for the true-event model and for each of the misclassification models characterizing whether a source fails to report on an event. As such, researchers are able to accurately test theories about both the causes of an event of interest and the reporting on it. Simulations show that our technique regularly outperforms current strategies that neglect misclassification, the unique features of the data-generating process, or both. We also illustrate the utility of this method with a model of repression using the Social Conflict in Africa Database.
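
To make the idea of a likelihood that accounts for underreporting across several sources concrete, the following is a minimal sketch and not the authors' estimator. It assumes a logistic true-event model, sources that never report an event that did not occur, sources that miss events independently of one another, and a constant reporting probability per source (the abstract describes a more general formulation with predictors in each misclassification model). All function and parameter names are hypothetical.

```python
import math


def logistic(t):
    """Standard logistic function."""
    return 1.0 / (1.0 + math.exp(-t))


def observation_likelihood(x, reports, beta, report_probs):
    """Likelihood of one unit's vector of 0/1 source reports.

    Illustrative assumptions (not taken from the paper):
      - the true event follows a logistic model in covariates x,
      - a source can miss a real event but never reports a non-event,
      - sources miss events independently given the true event,
      - report_probs[j] is a constant P(source j reports | event occurred).
    """
    p_event = logistic(sum(b * v for b, v in zip(beta, x)))

    # Probability of this exact report pattern given that the event occurred.
    pattern = 1.0
    for y, r in zip(reports, report_probs):
        pattern *= r if y == 1 else (1.0 - r)

    if any(y == 1 for y in reports):
        # At least one source reported, so the event must have occurred.
        return p_event * pattern

    # No source reported: either no event, or an event every source missed.
    return (1.0 - p_event) + p_event * pattern


def log_likelihood(X, Y, beta, report_probs):
    """Sum of log-likelihood contributions over all units."""
    return sum(
        math.log(observation_likelihood(x, y, beta, report_probs))
        for x, y in zip(X, Y)
    )
```

Under these assumptions, a unit with no reports from any source contributes a mixture of "no event" and "event missed by all sources", which is where the overlap between sources carries information about underreporting. Maximizing `log_likelihood` over `beta` and `report_probs` with a numerical optimizer would give estimates for this simplified model.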

Suggested Citation

  • Cook, Scott J. & Blas, Betsabe & Carroll, Raymond J. & Sinha, Samiran, 2017. "Two Wrongs Make a Right: Addressing Underreporting in Binary Data from Multiple Sources," Political Analysis, Cambridge University Press, vol. 25(2), pages 223-240, April.
  • Handle: RePEc:cup:polals:v:25:y:2017:i:02:p:223-240_00

    Download full text from publisher

    File URL: https://www.cambridge.org/core/product/identifier/S1047198716000139/type/journal_article
    File Function: link to article abstract page
    Download Restriction: no

    Citations

    Cited by:

    1. Okeke, Edward N., 2021. "Money and my mind: Maternal cash transfers and mental health," Health Economics, John Wiley & Sons, Ltd., vol. 30(11), pages 2879-2904, November.
    2. von Borzyskowski, Inken & Wahman, Michael, 2018. "Systematic measurement error in election violence data: causes and consequences," LSE Research Online Documents on Economics 90450, London School of Economics and Political Science, LSE Library.
