IDEAS home Printed from https://ideas.repec.org/a/cup/polals/v27y2019i03p339-359_00.html
   My bibliography  Save this article

Testing the Validity of Automatic Speech Recognition for Political Text Analysis

Author

Listed:
  • Proksch, Sven-Oliver
  • Wratil, Christopher
  • Wäckerle, Jens

Abstract

The analysis of political texts from parliamentary speeches, party manifestos, social media, or press releases forms the basis of major and growing fields in political science, not least since advances in “text-as-data” methods have rendered the analysis of large text corpora straightforward. However, a lot of sources of political speech are not regularly transcribed, and their on-demand transcription by humans is prohibitively expensive for research purposes. This class includes political speech in certain legislatures, during political party conferences as well as television interviews and talk shows. We showcase how scholars can use automatic speech recognition systems to analyze such speech with quantitative text analysis models of the “bag-of-words” variety. To probe results for robustness to transcription error, we present an original “word error rate simulation” (WERSIM) procedure implemented in $R$. We demonstrate the potential of automatic speech recognition to address open questions in political science with two substantive applications and discuss its limitations and practical challenges.

Suggested Citation

  • Proksch, Sven-Oliver & Wratil, Christopher & Wäckerle, Jens, 2019. "Testing the Validity of Automatic Speech Recognition for Political Text Analysis," Political Analysis, Cambridge University Press, vol. 27(3), pages 339-359, July.
  • Handle: RePEc:cup:polals:v:27:y:2019:i:03:p:339-359_00
    as

    Download full text from publisher

    File URL: https://www.cambridge.org/core/product/identifier/S1047198718000621/type/journal_article
    File Function: link to article abstract page
    Download Restriction: no
    ---><---

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. repec:nad:wpaper:20220080 is not listed on IDEAS
    2. Mónica D. Oliveira & Inês Mataloto & Panos Kanavos, 2019. "Multi-criteria decision analysis for health technology assessment: addressing methodological challenges to improve the state of the art," The European Journal of Health Economics, Springer;Deutsche Gesellschaft für Gesundheitsökonomie (DGGÖ), vol. 20(6), pages 891-918, August.
    3. Konstantin Gavras & Jan Karem Höhne & Annelies G. Blom & Harald Schoen, 2022. "Innovating the collection of open‐ended answers: The linguistic and content characteristics of written and oral answers to political attitude questions," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 185(3), pages 872-890, July.
    4. Baranski, Andrzej & Haas, Nicholas, 2023. "The timing of communication and retaliation in bargaining: An experimental study," Journal of Economic Psychology, Elsevier, vol. 96(C).
    5. Christopher Wratil & Sara B Hobolt, 2019. "Public deliberations in the Council of the European Union: Introducing and validating DICEU," European Union Politics, , vol. 20(3), pages 511-531, September.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:cup:polals:v:27:y:2019:i:03:p:339-359_00. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Kirk Stebbing (email available below). General contact details of provider: https://www.cambridge.org/pan .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.