IDEAS home Printed from https://ideas.repec.org/a/spr/trosos/v14y2020i1d10.1007_s12626-019-00040-6.html
   My bibliography  Save this article

Prediction Model for Bollywood Movie Success: A Comparative Analysis of Performance of Supervised Machine Learning Algorithms

Author

Listed:
  • Hemraj Verma

    (DIT University)

  • Garima Verma

    (DIT University)

Abstract

The main purpose of this paper is to do a comparative analysis of prediction models using various machine learning techniques. The models will be used to predict whether a movie will be a hit or flop before it is actually released. The techniques used for comparisons are decision tree, random forest (RF), support vector machine, logistics regression, adaptive tree boosting, and artificial neural network algorithms. The major predictors used in the models are the ratings of the lead actor, IMDb ranking of a movie, music rank of the movie, and total number of screens planned for the release of a movie. The results of most models indicated a reasonable accuracy, ranging from 80 to 90%. However, models based on two techniques, RF and logistic regression, achieved an accuracy of 92%. From the results, the most important predictors of a movie’s success are music rating, followed by its IMDb rating and total screens used for release.

Suggested Citation

  • Hemraj Verma & Garima Verma, 2020. "Prediction Model for Bollywood Movie Success: A Comparative Analysis of Performance of Supervised Machine Learning Algorithms," The Review of Socionetwork Strategies, Springer, vol. 14(1), pages 1-17, April.
  • Handle: RePEc:spr:trosos:v:14:y:2020:i:1:d:10.1007_s12626-019-00040-6
    DOI: 10.1007/s12626-019-00040-6
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s12626-019-00040-6
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s12626-019-00040-6?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Laura Auria & Rouslan A. Moro, 2008. "Support Vector Machines (SVM) as a Technique for Solvency Analysis," Discussion Papers of DIW Berlin 811, DIW Berlin, German Institute for Economic Research.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Aurelia Rybak & Aleksandra Rybak & Spas D. Kolev, 2023. "Modeling the Photovoltaic Power Generation in Poland in the Light of PEP2040: An Application of Multiple Regression," Energies, MDPI, vol. 16(22), pages 1-17, November.
    2. Kyuhan Lee & Jinsoo Park & Iljoo Kim & Youngseok Choi, 2018. "Predicting movie success with machine learning techniques: ways to improve accuracy," Information Systems Frontiers, Springer, vol. 20(3), pages 577-588, June.
    3. Sagnik Anupam & Arpan Kumar Kar, 2021. "Phishing website detection using support vector machines and nature-inspired optimization algorithms," Telecommunication Systems: Modelling, Analysis, Design and Management, Springer, vol. 76(1), pages 17-32, January.
    4. Mostafaei, Kamran & maleki, Shaho & Zamani Ahmad Mahmoudi, Mohammad & Knez, Dariusz, 2022. "Risk management prediction of mining and industrial projects by support vector machine," Resources Policy, Elsevier, vol. 78(C).
    5. Yoon-Joo Park, 2018. "Predicting the Helpfulness of Online Customer Reviews across Different Product Types," Sustainability, MDPI, vol. 10(6), pages 1-20, May.
    6. Abdulkadir Atalan, 2023. "Forecasting drinking milk price based on economic, social, and environmental factors using machine learning algorithms," Agribusiness, John Wiley & Sons, Ltd., vol. 39(1), pages 214-241, January.
    7. Junlong Zhang & Youbin He & Yuan Zhang & Weifeng Li & Junjie Zhang, 2022. "Well-Logging-Based Lithology Classification Using Machine Learning Methods for High-Quality Reservoir Identification: A Case Study of Baikouquan Formation in Mahu Area of Junggar Basin, NW China," Energies, MDPI, vol. 15(10), pages 1-15, May.
    8. Ni, Ji & Chen, Bowei & Allinson, Nigel M. & Ye, Xujiong, 2020. "A hybrid model for predicting human physical activity status from lifelogging data," European Journal of Operational Research, Elsevier, vol. 281(3), pages 532-542.
    9. Petrakova Aleksandra & Merkurjeva Galina & Affenzeller Michael, 2015. "Heterogeneous versus Homogeneous Machine Learning Ensembles," Information Technology and Management Science, Sciendo, vol. 18(1), pages 135-140, December.
    10. Mukhtar Sani & Maxime Piffard & Vincent Heiries, 2023. "Fault Detection for PEM Fuel Cells via Analytical Redundancy: A Critical Review and Prospects," Energies, MDPI, vol. 16(14), pages 1-16, July.
    11. Wang, Xinlin & Ahn, Sung-Hoon, 2020. "Real-time prediction and anomaly detection of electrical load in a residential community," Applied Energy, Elsevier, vol. 259(C).
    12. Söhnke M. Bartram & Jürgen Branke & Mehrshad Motahari, 2020. "Artificial intelligence in asset management," Working Papers 20202001, Cambridge Judge Business School, University of Cambridge.
    13. Thierry Delahaye & Rodrigo Acuna-Agost & Nicolas Bondoux & Anh-Quan Nguyen & Mourad Boudia, 2017. "Data-driven models for itinerary preferences of air travelers and application for dynamic pricing optimization," Journal of Revenue and Pricing Management, Palgrave Macmillan, vol. 16(6), pages 621-639, December.
    14. Yee-Fan Tan & Lee-Yeng Ong & Meng-Chew Leow & Yee-Xian Goh, 2021. "Exploring Time-Series Forecasting Models for Dynamic Pricing in Digital Signage Advertising," Future Internet, MDPI, vol. 13(10), pages 1-24, September.
    15. Nehrebecka Natalia, 2018. "Predicting the Default Risk of Companies. Comparison of Credit Scoring Models: Logit Vs Support Vector Machines," Econometrics. Advances in Applied Data Analysis, Sciendo, vol. 22(2), pages 54-73, June.
    16. Changju Lee & Sunghoon Lee, 2022. "Exploring the Contributions by Transportation Features to Urban Economy: An Experiment of a Scalable Tree-Boosting Algorithm with Big Data," Land, MDPI, vol. 11(4), pages 1-30, April.
    17. Amarda Cano, 2021. "Evolution of Public Debt in Albania during 1990-2017 and its impact on the Economic Growth," European Journal of Marketing and Economics Articles, Revistia Research and Publishing, vol. 4, ejme_v4_i.
    18. Swati Anand & Kushendra Mishra, 2022. "Identifying potential millennial customers for financial institutions using SVM," Journal of Financial Services Marketing, Palgrave Macmillan, vol. 27(4), pages 335-345, December.
    19. Hossein Hassani & Emmanuel S. Silva & Marine Combe & Demetra Andreou & Mansi Ghodsi & Mohammad Reza Yeganegi & Rodolphe E. Gozlan, 2019. "A Support Vector Machine Based Approach for Predicting the Risk of Freshwater Disease Emergence in England," Stats, MDPI, vol. 2(1), pages 1-15, February.
    20. Kyuhan Lee & Jinsoo Park & Iljoo Kim & Youngseok Choi, 0. "Predicting movie success with machine learning techniques: ways to improve accuracy," Information Systems Frontiers, Springer, vol. 0, pages 1-12.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:trosos:v:14:y:2020:i:1:d:10.1007_s12626-019-00040-6. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.