IDEAS home Printed from https://ideas.repec.org/a/ids/wremsd/v17y2021i2-3p343-357.html
   My bibliography  Save this article

A hybrid ensemble machine learning model to predict success of Bollywood movies

Author

Listed:
  • Garima Verma
  • Hemraj Verma
  • Sushil Kumar Dixit

Abstract

Bollywood is a multi-billion industry. Hundreds of films are released every year, where each film is an investment of multi-crores. In terms of awards or marketing it has found a place in almost every country and culture. It also contributes and attracts skilled and passionate people to become entrepreneurs. Therefore, it becomes a need as well as a huge concern of the director, producer and all stakeholders involved in a particular film to know the chances of the success of a film on the box office before its release. To address this concern, a hybrid ensemble machine learning model has been proposed. The model uses data sets collected from various sources, such as Boxofficeindia, cinemalytics, YouTube, etc. The model performed pre-processing on data set, which included handling of missing values with mean, cleaning of data, and removal of text values. Feature engineering has been applied in the model to create a new feature called act_direct to make the model more robust. Further, the effectiveness of the model has been tested in terms of accuracy and the AUC-ROC curve. From the experimental results, it is evident that the proposed model ensures relatively better accuracy compared to some recent state-of-art models.

Suggested Citation

  • Garima Verma & Hemraj Verma & Sushil Kumar Dixit, 2021. "A hybrid ensemble machine learning model to predict success of Bollywood movies," World Review of Entrepreneurship, Management and Sustainable Development, Inderscience Enterprises Ltd, vol. 17(2/3), pages 343-357.
  • Handle: RePEc:ids:wremsd:v:17:y:2021:i:2/3:p:343-357
    as

    Download full text from publisher

    File URL: http://www.inderscience.com/link.php?id=114439
    Download Restriction: Access to full text is restricted to subscribers.
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:ids:wremsd:v:17:y:2021:i:2/3:p:343-357. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sarah Parker (email available below). General contact details of provider: http://www.inderscience.com/browse/index.php?journalID=173 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.