IDEAS home Printed from https://ideas.repec.org/a/pal/palcom/v6y2020i1d10.1057_s41599-020-0436-1.html
   My bibliography  Save this article

Using data science to understand the film industry’s gender gap

Author

Listed:
  • Dima Kagan

    (Ben-Gurion University of the Negev)

  • Thomas Chesney

    (Nottingham University Business School)

  • Michael Fire

    (Ben-Gurion University of the Negev)

Abstract

Data science can offer answers to a wide range of social science questions. Here we turn attention to the portrayal of women in movies, an industry that has a significant influence on society, impacting such aspects of life as self-esteem and career choice. To this end, we fused data from the online movie database IMDb with a dataset of movie dialogue subtitles to create the largest available corpus of movie social networks (15,540 networks). Analyzing this data, we investigated gender bias in on-screen female characters over the past century. We find a trend of improvement in all aspects of women‘s roles in movies, including a constant rise in the centrality of female characters. There has also been an increase in the number of movies that pass the well-known Bechdel test, a popular—albeit flawed—measure of women in fiction. Here we propose a new and better alternative to this test for evaluating female roles in movies. Our study introduces fresh data, an open-code framework, and novel techniques that present new opportunities in the research and analysis of movies.

Suggested Citation

  • Dima Kagan & Thomas Chesney & Michael Fire, 2020. "Using data science to understand the film industry’s gender gap," Palgrave Communications, Palgrave Macmillan, vol. 6(1), pages 1-16, December.
  • Handle: RePEc:pal:palcom:v:6:y:2020:i:1:d:10.1057_s41599-020-0436-1
    DOI: 10.1057/s41599-020-0436-1
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1057/s41599-020-0436-1
    File Function: Abstract
    Download Restriction: Access to full text is restricted to subscribers.

    File URL: https://libkey.io/10.1057/s41599-020-0436-1?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Sen Jia & Thomas Lansdall-Welfare & Saatviga Sudhahar & Cynthia Carter & Nello Cristianini, 2016. "Women Are Seen More than Heard in Online Newspapers," PLOS ONE, Public Library of Science, vol. 11(2), pages 1-11, February.
    2. Vincent Larivière & Chaoqun Ni & Yves Gingras & Blaise Cronin & Cassidy R. Sugimoto, 2013. "Bibliometrics: Global gender disparities in science," Nature, Nature, vol. 504(7479), pages 211-213, December.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Muhammad Junaid Haris & Aanchal Upreti & Melih Kurtaran & Filip Ginter & Sebastien Lafond & Sepinoud Azimi, 2023. "Identifying gender bias in blockbuster movies through the lens of machine learning," Palgrave Communications, Palgrave Macmillan, vol. 10(1), pages 1-8, December.
    2. Johann Valentowitsch, 2023. "Hollywood caught in two worlds? The impact of the Bechdel test on the international box office performance of cinematic films," Marketing Letters, Springer, vol. 34(2), pages 293-308, June.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Lin Zhang & Yuanyuan Shang & Ying Huang & Gunnar Sivertsen, 2022. "Gender differences among active reviewers: an investigation based on publons," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(1), pages 145-179, January.
    2. Wu, Jiang & Ou, Guiyan & Liu, Xiaohui & Dong, Ke, 2022. "How does academic education background affect top researchers’ performance? Evidence from the field of artificial intelligence," Journal of Informetrics, Elsevier, vol. 16(2).
    3. Chaojiang Wu & Erjia Yan & Yongjun Zhu & Kai Li, 2021. "Gender imbalance in the productivity of funded projects: A study of the outputs of National Institutes of Health R01 grants," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 72(11), pages 1386-1399, November.
    4. Kwiek, Marek & Roszka, Wojciech, 2021. "Gender-based homophily in research: A large-scale study of man-woman collaboration," Journal of Informetrics, Elsevier, vol. 15(3).
    5. Kwiek, Marek & Szymula, Łukasz, 2024. "Growth of Science and Women: Methodological Challenges of Using Structured Big Data," SocArXiv w34pr, Center for Open Science.
    6. Josh Yamamoto & Eitan Frachtenberg, 2022. "Gender Differences in Collaboration Patterns in Computer Science," Publications, MDPI, vol. 10(1), pages 1-21, February.
    7. Sorana-Alexandra Constantinescu & Maria-Henriete Pozsar, 2022. "Was This Supposed to Be on the Test? Academic Leadership, Gender and the COVID-19 Pandemic in Denmark, Hungary, Romania, and United Kingdom," Publications, MDPI, vol. 10(2), pages 1-13, April.
    8. Ann-Maree Vallence & Mark R Hinder & Hakuei Fujiyama, 2019. "Data-driven selection of conference speakers based on scientific impact to achieve gender parity," PLOS ONE, Public Library of Science, vol. 14(7), pages 1-10, July.
    9. Lee, Jangwook & Chung, Jiyoon, 2022. "Women in top management teams and their impact on innovation," Technological Forecasting and Social Change, Elsevier, vol. 183(C).
    10. Fengyuan Liu & Petter Holme & Matteo Chiesa & Bedoor AlShebli & Talal Rahwan, 2023. "Gender inequality and self-publication are common among academic editors," Nature Human Behaviour, Nature, vol. 7(3), pages 353-364, March.
    11. Lorenzo Ductor & Sanjeev Goyal & Anja Prummer, 2023. "Gender and Collaboration," The Review of Economics and Statistics, MIT Press, vol. 105(6), pages 1366-1378, November.
    12. Abramo, Giovanni & D'Angelo, Ciriaco Andrea & Grilli, Leonardo, 2021. "The effects of citation-based research evaluation schemes on self-citation behavior," Journal of Informetrics, Elsevier, vol. 15(4).
    13. Yining Wang & Qiang Wu & Liangyu Li, 2024. "Examining the influence of women scientists on scientific impact and novelty: insights from top business journals," Scientometrics, Springer;Akadémiai Kiadó, vol. 129(6), pages 3517-3542, June.
    14. Gita Ghiasi & Matthew Harsh & Andrea Schiffauerova, 2018. "Inequality and collaboration patterns in Canadian nanotechnology: implications for pro-poor and gender-inclusive policy," Scientometrics, Springer;Akadémiai Kiadó, vol. 115(2), pages 785-815, May.
    15. Zhang, Lin & Shang, Yuanyuan & HUANG, Ying & Sivertsen, Gunnar, 2021. "Gender differences among active reviewers: an investigation based on Publons," SocArXiv 4z6w8, Center for Open Science.
    16. Letki, Natalia & Biały, Grzegorz & Sankowski, Piotr & Walentek, Dawid, 2022. "Streamlining for excellence discriminates against women: A study of research productivity of 2.7 mln scientists in 45 countries," OSF Preprints yr8me, Center for Open Science.
    17. Zhou, Sifan & Chai, Sen & Freeman, Richard B., 2024. "Gender homophily: In-group citation preferences and the gender disadvantage," Research Policy, Elsevier, vol. 53(1).
    18. Gómez-Ferri, Javier & González-Alcaide, Gregorio & LLopis-Goig, Ramón, 2019. "Measuring dissatisfaction with coauthorship: An empirical approach based on the researchers’ perception," Journal of Informetrics, Elsevier, vol. 13(4).
    19. Mike Thelwall & Tamara Nevill, 2019. "No evidence of citation bias as a determinant of STEM gender disparities in US biochemistry, genetics and molecular biology research," Scientometrics, Springer;Akadémiai Kiadó, vol. 121(3), pages 1793-1801, December.
    20. Xinyi Zhao & Samin Aref & Emilio Zagheni & Guy Stecklov, 2022. "Return migration of German-affiliated researchers: analyzing departure and return by gender, cohort, and discipline using Scopus bibliometric data 1996–2020," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(12), pages 7707-7729, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:pal:palcom:v:6:y:2020:i:1:d:10.1057_s41599-020-0436-1. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: https://www.nature.com/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.