Author
Abstract
Political researchers and scientists have long sought to forecast the results of upcoming elections in a democratic setup. Prediction approaches have been combined to improve prediction accuracy. In this work, we focus on India, utilizing exit polls spanning the last 8 years to develop machine learning-based prediction models. Exit polls provided by major media agencies for both provincial as well as the union legislative assemblies have been used as inputs to improve prediction accuracy. The training set comprises of the seats projected for the United Progressive Alliance (UPA) and the National Democratic Alliance (NDA) in state and central government elections from 2014 through 2020, providing ample data for training across a wide range of short and long-term trends. The testing set includes seats projected for the UPA and the NDA in 2021, across states including Assam, Kerala, Pondicherry, Tamil Nadu and West Bengal, ensuring validation of the prediction models using undisclosed ‘out of sample’ data. Trained models have been used to provide projected seats for the 2021 legislative assembly elections in the states of Assam, Kerala, Pondicherry, Tamil Nadu and West Bengal. Efficient machine learning approaches viz. support vector regression (SVR), extreme gradient boosting (XGBoost) and deep CNN-LSTM networks have been explored as predictors. The performance of the predictive models is evaluated by utilizing the metrics such as relative error (RE%), normalised mean square error (NRMSE), fractional bias (FB), coefficient of determination (R2) and normalised absolute error (NAE). XGBoost exhibits the lowest R2 value of 0.97, indicating superior fit to the observed data compared to other methods. Additionally, it demonstrates the lowest FB value of −0.03, suggesting minimal prediction bias. Notably, XGBoost also yields the lowest NRMSE, NAE, RE% values (4.11, 2.73 and 0.97% respectively), highlighting its superior accuracy in predicting election outcomes. XGBoost is followed by the deep CNN-LSTM model, while the SVR model exhibits the lowest prediction accuracy among the proposed forecasting methods.
Suggested Citation
Anurag Barthwal & Mamta Bhatt & Shwetank Avikal & Chandra Prakash, 2025.
"Machine learning-based prediction models for electoral outcomes in India: a comparative analysis of exit polls from 2014–2021,"
Quality & Quantity: International Journal of Methodology, Springer, vol. 59(1), pages 313-338, February.
Handle:
RePEc:spr:qualqt:v:59:y:2025:i:1:d:10.1007_s11135-024-01937-3
DOI: 10.1007/s11135-024-01937-3
Download full text from publisher
As the access to this document is restricted, you may want to search for a different version of it.
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:qualqt:v:59:y:2025:i:1:d:10.1007_s11135-024-01937-3. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.