IDEAS home Printed from https://ideas.repec.org/p/osf/socarx/vrt4a.html
   My bibliography  Save this paper

An Online Structured Political Event Dataset based on CAMEO Ontology

Author

Listed:
  • Salam, Sayeed
  • Brandt, Patrick
  • D'Orazio, Vito
  • Holmes, Jennifer
  • Osorio, Javiar
  • Khan, Latifur

Abstract

Political activities and interactions between different global entities are becoming growing field for data-intensive computing with a wide scope of research opportunities for both social science and computer science researchers. This research needs to be carried out at a local (limited to a particular region) and global scale, often divided in temporal manner. It is also useful to have the most recently updated dataset for relevant analysis. For these purposes, we need timestamped, geolocaated structured information about political interactions. Keeping this in mind, we develop a datatset that complies with Conflict and Mediation Event Observation (CAMEO) ontology inspired by the ”who-did-what-to- whom” format. We use a distributed framework for data collection and processing that works in real-time with Apache Kafka and SPARK in order to process a global collection of news data in different languages (i.e., Spanish, Arabic) and generate those structured event data in real-time. We also provide an API for easy access to the data. In this paper, we describe how the data is represented, collected, and processed, how we generate the most up-to-date dataset with dynamic ontology extension, and how to access the data and possible analytical problems that can be addressed by building a model on the dataset.

Suggested Citation

  • Salam, Sayeed & Brandt, Patrick & D'Orazio, Vito & Holmes, Jennifer & Osorio, Javiar & Khan, Latifur, 2020. "An Online Structured Political Event Dataset based on CAMEO Ontology," SocArXiv vrt4a, Center for Open Science.
  • Handle: RePEc:osf:socarx:vrt4a
    DOI: 10.31219/osf.io/vrt4a
    as

    Download full text from publisher

    File URL: https://osf.io/download/5e722a270cd06c046c001ec7/
    Download Restriction: no

    File URL: https://libkey.io/10.31219/osf.io/vrt4a?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Matthias Studer & Gilbert Ritschard, 2016. "What matters in differences between life trajectories: a comparative review of sequence dissimilarity measures," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 179(2), pages 481-511, February.
    2. Javier Osorio & Viveca Pavon & Sayeed Salam & Jennifer Holmes & Patrick T. Brandt & Latifur Khan, 2019. "Translating CAMEO verbs for automated coding of event data," International Interactions, Taylor & Francis Journals, vol. 45(6), pages 1049-1064, November.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Marcel Raab & Emanuela Struffolino, 2020. "The Heterogeneity of Partnership Trajectories to Childlessness in Germany," European Journal of Population, Springer;European Association for Population Studies, vol. 36(1), pages 53-70, March.
    2. Júlia Mikolai & Hill Kulu, 2019. "Union dissolution and housing trajectories in Britain," Demographic Research, Max Planck Institute for Demographic Research, Rostock, Germany, vol. 41(7), pages 161-196.
    3. Marc A. Scott & Kaushik Mohan & Jacques‐Antoine Gauthier, 2020. "Model‐based clustering and analysis of life history data," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 183(3), pages 1231-1251, June.
    4. Devillanova, Carlo & Raitano, Michele & Struffolino, Emanuela, 2019. "Longitudinal employment trajectories and health in middle life: Insights from linked administrative and survey data," EconStor Open Access Articles and Book Chapters, ZBW - Leibniz Information Centre for Economics, vol. 40, pages 1375-1412.
    5. Andy Dickerson & Emily McDool & Damon Morris, 2023. "Post-compulsory education pathways and labour market outcomes," Education Economics, Taylor & Francis Journals, vol. 31(3), pages 326-352, May.
    6. Cees H. Elzinga & Matthias Studer, 2019. "Normalization of Distance and Similarity in Sequence Analysis," Sociological Methods & Research, , vol. 48(4), pages 877-904, November.
    7. Kandt, Jens & Leak, Alistair, 2019. "Examining inclusive mobility through smartcard data: What shall we make of senior citizens' declining bus patronage in the West Midlands?," Journal of Transport Geography, Elsevier, vol. 79(C), pages 1-1.
    8. Mathias Voigt & Antonio Abellán & Julio Pérez & Diego Ramiro, 2020. "The effects of socioeconomic conditions on old-age mortality within shared disability pathways," PLOS ONE, Public Library of Science, vol. 15(9), pages 1-17, September.
    9. Niklas Mäkinen & Jussi Tanskanen & Satu Ojala & Pasi Pyöriä, 2023. "Part-Time Workers’ Employment Trajectories by Length of Hours and Reason for Working Part-Time: An 8-Year Follow-Up Study," SAGE Open, , vol. 13(4), pages 21582440231, November.
    10. Borgna, Camilla & Struffolino, Emanuela, 2018. "Unpacking Configurational Dynamics: Sequence Analysis and Qualitative Comparative Analysis as a Mixed-Method Design," EconStor Open Access Articles and Book Chapters, ZBW - Leibniz Information Centre for Economics, pages 167-184.
    11. Morten Dybdahl Krebs & Gonçalo Espregueira Themudo & Michael Eriksen Benros & Ole Mors & Anders D. Børglum & David Hougaard & Preben Bo Mortensen & Merete Nordentoft & Michael J. Gandal & Chun Chieh F, 2021. "Associations between patterns in comorbid diagnostic trajectories of individuals with schizophrenia and etiological factors," Nature Communications, Nature, vol. 12(1), pages 1-12, December.
    12. Wiebke Schmitz & L. Naegele & F. Frerichs & L. Ellwardt, 2023. "Gendered late working life trajectories, family history and welfare regimes: evidence from SHARELIFE," European Journal of Ageing, Springer, vol. 20(1), pages 1-15, December.
    13. Cristina Samper Mejia, 2023. "The Interplay Between the Early Work and Family Trajectories of Young Adult Women Born in West Germany: Differences by Parental Origins," Journal of International Migration and Integration, Springer, vol. 24(1), pages 345-368, March.
    14. Beusch, Elisabeth, 2020. "Essays on the self-employed in the Netherlands and Europe," Other publications TiSEM e3c09995-aac0-4c99-b88e-d, Tilburg University, School of Economics and Management.
    15. Andrés F. Castro Torres, 2020. "Family formation trajectories and migration status in the United States, 1970-2010," MPIDR Working Papers WP-2020-008, Max Planck Institute for Demographic Research, Rostock, Germany.
    16. Lídia Montero & Lucía Mejía-Dorantes & Jaume Barceló, 2023. "Applying Data Analytics to Analyze Activity Sequences for an Assessment of Fragmentation in Daily Travel Patterns: A Case Study of the Metropolitan Region of Barcelona," Sustainability, MDPI, vol. 15(19), pages 1-22, September.
    17. Struffolino, Emanuela & Van Winkle, Zachary, 2019. "Is there only one way out of in-work poverty? Difference by gender and race in the US," Discussion Papers, Research Group Demography and Inequality SP I 2019-601, WZB Berlin Social Science Center.
    18. Piccarreta, Raffaella & Struffolino, Emanuela, 2019. "An Integrated Heuristic for Validation in Sequence Analysis," SocArXiv v7mj8, Center for Open Science.
    19. Liao, Tim F. & Bolano, Danilo & Brzinsky-Fay, Christian & Cornwell, Benjamin & Fasang, Anette Eva & Helske, Satu & Piccarreta, Raffaella & Raab, Marcel & Ritschard, Gilbert & Struffolino, Emanuela & S, 2022. "Sequence analysis: Its past, present, and future," EconStor Open Access Articles and Book Chapters, ZBW - Leibniz Information Centre for Economics, vol. 107, pages 1-1.
    20. Montorsi, Carlotta & Fusco, Alessio & Van Kerm, Philippe & Bordas, Stéphane P.A., 2024. "Predicting depression in old age: Combining life course data with machine learning," Economics & Human Biology, Elsevier, vol. 52(C).

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:osf:socarx:vrt4a. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: OSF (email available below). General contact details of provider: https://arabixiv.org .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.