IDEAS home Printed from https://ideas.repec.org/a/pal/palcom/v6y2020i1d10.1057_s41599-020-0436-1.html
   My bibliography  Save this article

Using data science to understand the film industry’s gender gap

Author

Listed:
  • Dima Kagan

    (Ben-Gurion University of the Negev)

  • Thomas Chesney

    (Nottingham University Business School)

  • Michael Fire

    (Ben-Gurion University of the Negev)

Abstract

Data science can offer answers to a wide range of social science questions. Here we turn attention to the portrayal of women in movies, an industry that has a significant influence on society, impacting such aspects of life as self-esteem and career choice. To this end, we fused data from the online movie database IMDb with a dataset of movie dialogue subtitles to create the largest available corpus of movie social networks (15,540 networks). Analyzing this data, we investigated gender bias in on-screen female characters over the past century. We find a trend of improvement in all aspects of women‘s roles in movies, including a constant rise in the centrality of female characters. There has also been an increase in the number of movies that pass the well-known Bechdel test, a popular—albeit flawed—measure of women in fiction. Here we propose a new and better alternative to this test for evaluating female roles in movies. Our study introduces fresh data, an open-code framework, and novel techniques that present new opportunities in the research and analysis of movies.

Suggested Citation

  • Dima Kagan & Thomas Chesney & Michael Fire, 2020. "Using data science to understand the film industry’s gender gap," Palgrave Communications, Palgrave Macmillan, vol. 6(1), pages 1-16, December.
  • Handle: RePEc:pal:palcom:v:6:y:2020:i:1:d:10.1057_s41599-020-0436-1
    DOI: 10.1057/s41599-020-0436-1
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1057/s41599-020-0436-1
    File Function: Abstract
    Download Restriction: Access to full text is restricted to subscribers.

    File URL: https://libkey.io/10.1057/s41599-020-0436-1?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Sen Jia & Thomas Lansdall-Welfare & Saatviga Sudhahar & Cynthia Carter & Nello Cristianini, 2016. "Women Are Seen More than Heard in Online Newspapers," PLOS ONE, Public Library of Science, vol. 11(2), pages 1-11, February.
    2. Vincent Larivière & Chaoqun Ni & Yves Gingras & Blaise Cronin & Cassidy R. Sugimoto, 2013. "Bibliometrics: Global gender disparities in science," Nature, Nature, vol. 504(7479), pages 211-213, December.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Muhammad Junaid Haris & Aanchal Upreti & Melih Kurtaran & Filip Ginter & Sebastien Lafond & Sepinoud Azimi, 2023. "Identifying gender bias in blockbuster movies through the lens of machine learning," Palgrave Communications, Palgrave Macmillan, vol. 10(1), pages 1-8, December.
    2. Johann Valentowitsch, 2023. "Hollywood caught in two worlds? The impact of the Bechdel test on the international box office performance of cinematic films," Marketing Letters, Springer, vol. 34(2), pages 293-308, June.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Wu, Jiang & Ou, Guiyan & Liu, Xiaohui & Dong, Ke, 2022. "How does academic education background affect top researchers’ performance? Evidence from the field of artificial intelligence," Journal of Informetrics, Elsevier, vol. 16(2).
    2. Chaojiang Wu & Erjia Yan & Yongjun Zhu & Kai Li, 2021. "Gender imbalance in the productivity of funded projects: A study of the outputs of National Institutes of Health R01 grants," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 72(11), pages 1386-1399, November.
    3. Sorana-Alexandra Constantinescu & Maria-Henriete Pozsar, 2022. "Was This Supposed to Be on the Test? Academic Leadership, Gender and the COVID-19 Pandemic in Denmark, Hungary, Romania, and United Kingdom," Publications, MDPI, vol. 10(2), pages 1-13, April.
    4. Abramo, Giovanni & D'Angelo, Ciriaco Andrea & Grilli, Leonardo, 2021. "The effects of citation-based research evaluation schemes on self-citation behavior," Journal of Informetrics, Elsevier, vol. 15(4).
    5. Letki, Natalia & Biały, Grzegorz & Sankowski, Piotr & Walentek, Dawid, 2022. "Streamlining for excellence discriminates against women: A study of research productivity of 2.7 mln scientists in 45 countries," OSF Preprints yr8me, Center for Open Science.
    6. Mike Thelwall & Tamara Nevill, 2019. "No evidence of citation bias as a determinant of STEM gender disparities in US biochemistry, genetics and molecular biology research," Scientometrics, Springer;Akadémiai Kiadó, vol. 121(3), pages 1793-1801, December.
    7. Xinyi Zhao & Samin Aref & Emilio Zagheni & Guy Stecklov, 2022. "Return migration of German-affiliated researchers: analyzing departure and return by gender, cohort, and discipline using Scopus bibliometric data 1996–2020," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(12), pages 7707-7729, December.
    8. Zhang, Ming-Ze & Wang, Tang-Rong & Lyu, Peng-Hui & Chen, Qi-Mei & Li, Ze-Xia & Ngai, Eric W.T., 2024. "Impact of gender composition of academic teams on disruptive output," Journal of Informetrics, Elsevier, vol. 18(2).
    9. Abramo, Giovanni & Cicero, Tindaro & D’Angelo, Ciriaco Andrea, 2015. "Should the research performance of scientists be distinguished by gender?," Journal of Informetrics, Elsevier, vol. 9(1), pages 25-38.
    10. Tahmooresnejad, Leila & Turkina, Ekaterina, 2022. "Female inventors over time: Factors affecting female Inventors’ innovation performance," Journal of Informetrics, Elsevier, vol. 16(1).
    11. Diego Kozlowski & Dakota S Murray & Alexis Bell & Will Hulsey & Vincent Larivière & Thema Monroe-White & Cassidy R Sugimoto, 2022. "Avoiding bias when inferring race using name-based approaches," PLOS ONE, Public Library of Science, vol. 17(3), pages 1-16, March.
    12. Hanne Vandenberghe, 2019. "Representation of Women in the News: Balancing between Career and Family Life," Media and Communication, Cogitatio Press, vol. 7(1), pages 4-12.
    13. Helena Mihaljević-Brandt & Lucía Santamaría & Marco Tullney, 2016. "The Effect of Gender in the Publication Patterns in Mathematics," PLOS ONE, Public Library of Science, vol. 11(10), pages 1-23, October.
    14. Hamid R. Jamali & Alireza Abbasi, 2023. "Gender gaps in Australian research publishing, citation and co-authorship," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(5), pages 2879-2893, May.
    15. Shang, Yuanyuan & Sivertsen, Gunnar & Cao, Zhe & Zhang, Lin, 2021. "Gender differences in research focused on the Sustainable Development Goal of Gender Equality," SocArXiv 3fapz, Center for Open Science.
    16. Nakajima, Kazuki & Liu, Ruodan & Shudo, Kazuyuki & Masuda, Naoki, 2023. "Quantifying gender imbalance in East Asian academia: Research career and citation practice," Journal of Informetrics, Elsevier, vol. 17(4).
    17. Anna Kiss & Sándor Soós & Andrea Petróczi, 2024. "Impact as equalizer: the demise of gender-related differences in anti-doping research," Scientometrics, Springer;Akadémiai Kiadó, vol. 129(7), pages 4071-4108, July.
    18. Zhu, Wanying & Jin, Ching & Ma, Yifang & Xu, Cong, 2023. "Earlier recognition of scientific excellence enhances future achievements and promotes persistence," Journal of Informetrics, Elsevier, vol. 17(2).
    19. Shubhanshu Mishra & Brent D Fegley & Jana Diesner & Vetle I Torvik, 2018. "Self-citation is the hallmark of productive authors, of any gender," PLOS ONE, Public Library of Science, vol. 13(9), pages 1-21, September.
    20. Moustafa, Khaled, 2018. "Aberration of the citation," arabixiv.org gn8zb, Center for Open Science.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:pal:palcom:v:6:y:2020:i:1:d:10.1057_s41599-020-0436-1. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: https://www.nature.com/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.