IDEAS home Printed from https://ideas.repec.org/p/osf/socarx/v7u3h_v1.html
   My bibliography  Save this paper

A network-based matching design for text mining of hyper-polarised online reviews

Author

Listed:
  • Cantone, Giulio Giacomo
  • Tomaselli, Venera

Abstract

Online reviews provide users with the opportunity to rate various types of items such as movies, music, and video games using a combination of numeric scores and textual comments. The study proposes a novel method that combines network modeling with statistical matching to estimate the unbiased association between words and hyper-polarized items in online reviews. The application of this method to a sample of 40,665 items from the website Metacritic detects 218 hyper-polarized items; these are matched with an equal number of items using 8 covariates of item quality and network centrality. Application of the method reveals an unbiased association between hyper-polarization and semantics indicating reactive social action in online reviews, especially related to controversial political issues in the USA.

Suggested Citation

  • Cantone, Giulio Giacomo & Tomaselli, Venera, 2023. "A network-based matching design for text mining of hyper-polarised online reviews," SocArXiv v7u3h_v1, Center for Open Science.
  • Handle: RePEc:osf:socarx:v7u3h_v1
    DOI: 10.31219/osf.io/v7u3h_v1
    as

    Download full text from publisher

    File URL: https://osf.io/download/6449d088c76c076b16136469/
    Download Restriction: no

    File URL: https://libkey.io/10.31219/osf.io/v7u3h_v1?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Deutsch, Joseph & Fusco, Alessio & Silber, Jacques, 2013. "The BIP Trilogy (bipolarization, inequality and polarization): One saga but three different stories," Economics - The Open-Access, Open-Assessment E-Journal (2007-2020), Kiel Institute for the World Economy (IfW Kiel), vol. 7, pages 1-33.
    2. Esteban, Joan & Ray, Debraj, 1994. "On the Measurement of Polarization," Econometrica, Econometric Society, vol. 62(4), pages 819-851, July.
    3. Bart de Langhe & Philip M. Fernbach & Donald R. Lichtenstein, 2016. "Navigating by the Stars: Investigating the Actual and Perceived Validity of Online User Ratings," Journal of Consumer Research, Oxford University Press, vol. 42(6), pages 817-833.
    4. Domenico Piccolo & Rosaria Simone, 2019. "The class of cub models: statistical foundations, inferential issues and empirical evidence," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 28(3), pages 389-435, September.
    5. Petter Holme, 2019. "Rare and everywhere: Perspectives on scale-free networks," Nature Communications, Nature, vol. 10(1), pages 1-3, December.
    6. King, Gary & Nielsen, Richard, 2019. "Why Propensity Scores Should Not Be Used for Matching," Political Analysis, Cambridge University Press, vol. 27(4), pages 435-454, October.
    7. Sanjeev Dewan & Yi-Jen (Ian) Ho & Jui Ramaprasad, 2017. "Popularity or Proximity: Characterizing the Nature of Social Influence in an Online Music Community," Information Systems Research, INFORMS, vol. 28(1), pages 117-136, March.
    8. Kosuke Imai & Gary King & Elizabeth A. Stuart, 2008. "Misunderstandings between experimentalists and observationalists about causal inference," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 171(2), pages 481-502, April.
    9. Heeseung Andrew Lee & Angela Aerry Choi & Tianshu Sun & Wonseok Oh, 2021. "Reviewing Before Reading? An Empirical Investigation of Book-Consumption Patterns and Their Effects on Reviews and Sales," Information Systems Research, INFORMS, vol. 32(4), pages 1368-1389, December.
    10. Iacus, Stefano M. & King, Gary & Porro, Giuseppe, 2012. "Causal Inference without Balance Checking: Coarsened Exact Matching," Political Analysis, Cambridge University Press, vol. 20(1), pages 1-24, January.
    11. Domenico Piccolo & Rosaria Simone, 2019. "Rejoinder to the discussion of “The class of cub models: statistical foundations, inferential issues and empirical evidence”," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 28(3), pages 477-493, September.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Cantone, Giulio Giacomo & Tomaselli, Venera, 2023. "Quasi-experimental network-based design for semantic analysis of small clusters of bi-polar online reviews," SocArXiv v7u3h, Center for Open Science.
    2. Christophe Loussouarn & Carine Franc & Yann Videau & Julien Mousquès, 2021. "Can General Practitioners Be More Productive? The Impact of Teamwork and Cooperation with Nurses on GP Activities," Health Economics, John Wiley & Sons, Ltd., vol. 30(3), pages 680-698, March.
    3. Philipp vom Berge & Achim Schmillen, 2023. "Effects of mass layoffs on local employment—evidence from geo-referenced data," Journal of International Economic Law, Oxford University Press, vol. 23(3), pages 509-539.
    4. Ravi Bapna & Alok Gupta & Gautam Ray & Shweta Singh, 2023. "Single-Sourcing vs. Multisourcing: An Empirical Analysis of Large Information Technology Outsourcing Arrangements," Information Systems Research, INFORMS, vol. 34(3), pages 1109-1130, September.
    5. Wildmer Daniel Gregori & Maria Martinez Cillero & Michela Nardo, 2022. "The effects of cross-border acquisitions on firms’ productivity in the EU," Working Papers 2022.10, International Network for Economic Research - INFER.
    6. Cigdem Gedikli & Robert Hill & Oleksandr Talavera & Okan Yilmaz, 2025. "Online Real Estate Agencies and their Impact on the Housing Market," Discussion Papers 25-01, Department of Economics, University of Birmingham.
    7. Jan Stede, 2019. "Do Energy Efficiency Networks Save Energy? Evidence from German Plant-Level Data," Discussion Papers of DIW Berlin 1813, DIW Berlin, German Institute for Economic Research.
    8. Hans Degryse & Yalin Gündüz & Kuchulain O'Flynn & Steven Ongena, 2020. "Identifying Empty Creditors with a Shock and Micro-Data," Swiss Finance Institute Research Paper Series 20-15, Swiss Finance Institute.
    9. Heyna, Philipp, 2024. "Can TikTok Drive Support for Populist Radical Right Parties? Causal Evidence From Germany," OSF Preprints yju9n, Center for Open Science.
    10. Anthony Howell & Chong Liu & Rudai Yang, 2020. "Explaining the urban premium in Chinese cities and the role of place-based policies," Environment and Planning A, , vol. 52(7), pages 1332-1356, October.
    11. Nazareno Panichella & Stefano Cantalini, 2023. "Is Geographical Mobility Beneficial? The Impact of the South-to-North Internal Migration on Occupational Achievement in Italy," Population Research and Policy Review, Springer;Southern Demographic Association (SDA), vol. 42(5), pages 1-22, October.
    12. Mitra, Aniruddha & Bang, James T. & Abbas, Faisal, 2021. "Do remittances reduce women’s acceptance of domestic violence? Evidence from Pakistan," World Development, Elsevier, vol. 138(C).
    13. Rossignoli, Domenico & Trombetta, Federico, 2024. "Ora et Guberna. The Economic Impact of the Rule of St Benedict in Medieval England," The Journal of Economic History, Cambridge University Press, vol. 84(3), pages 838-873, September.
    14. Mussini Mauro, 2018. "On Measuring Polarization For Ordinal Data: An Approach Based On The Decomposition Of The Leti Index," Statistics in Transition New Series, Statistics Poland, vol. 19(2), pages 277-296, June.
    15. Markku Maula & Wouter Stam, 2020. "Enhancing Rigor in Quantitative Entrepreneurship Research," Entrepreneurship Theory and Practice, , vol. 44(6), pages 1059-1090, November.
    16. Giampaolo Arachi & Debora Assisi & Berardino Cesi & Michele G. Giuranno & Felice Russo, 2024. "Intermunicipal cooperation in public procurement," Regional Studies, Taylor & Francis Journals, vol. 58(11), pages 2055-2073, November.
    17. Stefania Capecchi & Francesca Di Iorio & Nunzia Nappo, 2024. "A mixture model for self-assessed stress at work across EU 163," RIEDS - Rivista Italiana di Economia, Demografia e Statistica - The Italian Journal of Economic, Demographic and Statistical Studies, SIEDS Societa' Italiana di Economia Demografia e Statistica, vol. 78(2), pages 163-174, April-Jun.
    18. Cigdem Gedikli & Robert Hill & Oleksandr Talavera & Okan Yilmaz, 2023. "The hidden cost of smoking: Rent premia in the housing market," Real Estate Economics, American Real Estate and Urban Economics Association, vol. 51(3), pages 611-629, May.
    19. Serena Fatica & Roberto Panzica, 2021. "Green bonds as a tool against climate change?," Business Strategy and the Environment, Wiley Blackwell, vol. 30(5), pages 2688-2701, July.
    20. Costa-Font, Joan & Knapp, Martin & Vilaplana-Prieto, Cristina, 2023. "The ‘welcomed lockdown’ hypothesis? Mental wellbeing and mobility restrictions," LSE Research Online Documents on Economics 115323, London School of Economics and Political Science, LSE Library.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:osf:socarx:v7u3h_v1. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: OSF (email available below). General contact details of provider: https://arabixiv.org .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.