IDEAS home Printed from https://ideas.repec.org/a/inm/ordeca/v14y2017i4p274-297.html
   My bibliography  Save this article

Identifying Soccer Players on Facebook Through Predictive Analytics

Author

Listed:
  • Matthias Bogaert

    (Department of Marketing, Ghent University, 9000 Ghent, Belgium)

  • Michel Ballings

    (Department of Business Analytics and Statistics, University of Tennessee, Knoxville, Tennessee 37996)

  • Martijn Hosten

    (Department of Marketing, Ghent University, 9000 Ghent, Belgium)

  • Dirk Van den Poel

    (Department of Marketing, Ghent University, 9000 Ghent, Belgium)

Abstract

This study assesses the feasibility of identifying self-reported sports practitioners (soccer players) on Facebook. The main goal is to develop a system to support marketers with the decision as to which prospects to target for advertising purposes. To do so, we benchmark several algorithms (i.e., random forest, logistic regression, adaboost, rotation forest, neural networks, and kernel factory) using five times twofold cross-validation. To evaluate performance and variable importances, we build a fusion model, which combines the results of the other algorithms using the weighted average. This technique is also referred to as information-fusion sensitivity analysis. The results reveal that Facebook data provide a viable basis to come up with sports predictions as the predictive performance ranges from 72.01% to 80.43% for area under the receiver operating characteristic curve (AUC), from 81.96% to 83.95% for accuracy, and from 2.41 to 3.06 for top-decile lift. Our benchmark study indicates that stochastic adaboost, the fusion model, random forest, rotation forest, and regularized logistic regression are the best-performing algorithms. Furthermore, the results show that the most important variables are the average number of friends that play soccer, membership of a soccer group , and the number of favorite teams . We also assess the impact of our results on profitability by conducting a thorough sensitivity analysis. Our analysis reveals that our approach can be beneficial for a wide range of companies. The analysis and results in this study will assist sports brands with decisions regarding their implementation of targeted marketing approaches.

Suggested Citation

  • Matthias Bogaert & Michel Ballings & Martijn Hosten & Dirk Van den Poel, 2017. "Identifying Soccer Players on Facebook Through Predictive Analytics," Decision Analysis, INFORMS, vol. 14(4), pages 274-297, December.
  • Handle: RePEc:inm:ordeca:v:14:y:2017:i:4:p:274-297
    DOI: 10.1287/deca.2017.0354
    as

    Download full text from publisher

    File URL: https://doi.org/10.1287/deca.2017.0354
    Download Restriction: no

    File URL: https://libkey.io/10.1287/deca.2017.0354?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. K.W. de Bock & D. van den Poel, 2011. "An empirical evaluation of rotation-based ensemble classifiers for customer churn prediction," Post-Print hal-00800160, HAL.
    2. Hoch, Stephen J, 1988. "Who Do We Know: Predicting the Interests and Opinions of the American Consumer," Journal of Consumer Research, Journal of Consumer Research Inc., vol. 15(3), pages 315-324, December.
    3. Joel B. Predd & Daniel N. Osherson & Sanjeev R. Kulkarni & H. Vincent Poor, 2008. "Aggregating Probabilistic Forecasts from Incoherent and Abstaining Experts," Decision Analysis, INFORMS, vol. 5(4), pages 177-189, December.
    4. Venkatesh, Kamini & Ravi, Vadlamani & Prinzie, Anita & Poel, Dirk Van den, 2014. "Cash demand forecasting in ATMs by clustering and neural networks," European Journal of Operational Research, Elsevier, vol. 232(2), pages 383-392.
    5. Wei, Pengfei & Lu, Zhenzhou & Song, Jingwen, 2015. "Variable importance analysis: A comprehensive review," Reliability Engineering and System Safety, Elsevier, vol. 142(C), pages 399-432.
    6. Dudoit S. & Fridlyand J. & Speed T. P, 2002. "Comparison of Discrimination Methods for the Classification of Tumors Using Gene Expression Data," Journal of the American Statistical Association, American Statistical Association, vol. 97, pages 77-87, March.
    7. Guanchun Wang & Sanjeev R. Kulkarni & H. Vincent Poor & Daniel N. Osherson, 2011. "Aggregating Large Sets of Probabilistic Forecasts by Weighted Coherent Adjustment," Decision Analysis, INFORMS, vol. 8(2), pages 128-144, June.
    8. Friedman, Jerome H., 2002. "Stochastic gradient boosting," Computational Statistics & Data Analysis, Elsevier, vol. 38(4), pages 367-378, February.
    9. Coussement, Kristof & Benoit, Dries Frederik & Van den Poel, Dirk, 2009. "Improved Marketing Decision Making in a Customer Churn Prediction Context Using Generalized Additive Models," Working Papers 2009/18, Hogeschool-Universiteit Brussel, Faculteit Economie en Management.
    10. Kevin Filo & Daniel Lock & Adam Karg, 2015. "Sport and social media research: A review," Sport Management Review, Taylor & Francis Journals, vol. 18(2), pages 166-181, April.
    11. Malthouse, Edward C. & Haenlein, Michael & Skiera, Bernd & Wege, Egbert & Zhang, Michael, 2013. "Managing Customer Relationships in the Social Media Era: Introducing the Social CRM House," Journal of Interactive Marketing, Elsevier, vol. 27(4), pages 270-280.
    12. Jacob W. Ulvila & John E. Gaffney, 2004. "A Decision Analysis Method for Evaluating Computer Intrusion Detection Systems," Decision Analysis, INFORMS, vol. 1(1), pages 35-50, March.
    13. M. Ballings & D. Van Den Poel, 2012. "The Relevant Length of Customer Event History for Churn Prediction: How long is long enough?," Working Papers of Faculty of Economics and Business Administration, Ghent University, Belgium 12/804, Ghent University, Faculty of Economics and Business Administration.
    14. Lemmens, A. & Croux, C., 2006. "Bagging and boosting classification trees to predict churn," Other publications TiSEM d5cb664d-5859-44db-a621-e, Tilburg University, School of Economics and Management.
    15. Sevim, Cuneyt & Oztekin, Asil & Bali, Ozkan & Gumus, Serkan & Guresen, Erkam, 2014. "Developing an early warning system to predict currency crises," European Journal of Operational Research, Elsevier, vol. 237(3), pages 1095-1104.
    16. Filo, Kevin & Lock, Daniel & Karg, Adam, 2015. "Sport and social media research: A review," Sport Management Review, Elsevier, vol. 18(2), pages 166-181.
    17. Kizilaslan, Recep & Freund, Steven & Iseri, Ali, 2016. "A data analytic approach to forecasting daily stock returns in an emerging marketAuthor-Name: Oztekin, Asil," European Journal of Operational Research, Elsevier, vol. 253(3), pages 697-710.
    18. Culp, Mark & Johnson, Kjell & Michailides, George, 2006. "ada: An R Package for Stochastic Boosting," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 17(i02).
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Sangjae Lee & Kun Chang Lee & Joon Yeon Choeh, 2020. "Using Bayesian Network to Predict Online Review Helpfulness," Sustainability, MDPI, vol. 12(17), pages 1-17, August.
    2. Ali E. Abbas & Jay Simon & Chris Smith, 2017. "Introduction to the Special Issue on Decision Analysis and Social Media," Decision Analysis, INFORMS, vol. 14(4), pages 227-228, December.
    3. Bogaert, Matthias & Lootens, Justine & Van den Poel, Dirk & Ballings, Michel, 2019. "Evaluating multi-label classifiers and recommender systems in the financial service sector," European Journal of Operational Research, Elsevier, vol. 279(2), pages 620-634.
    4. Vicki M. Bier & Simon French, 2020. "From the Editors: Decision Analysis Focus and Trends," Decision Analysis, INFORMS, vol. 17(1), pages 1-8, March.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ballings, Michel & Van den Poel, Dirk, 2015. "CRM in social media: Predicting increases in Facebook usage frequency," European Journal of Operational Research, Elsevier, vol. 244(1), pages 248-260.
    2. Matthias Bogaert & Lex Delaere, 2023. "Ensemble Methods in Customer Churn Prediction: A Comparative Analysis of the State-of-the-Art," Mathematics, MDPI, vol. 11(5), pages 1-28, February.
    3. M. Ballings & D. Van Den Poel & E. Verhagen, 2013. "Evaluating the Added Value of Pictorial Data for Customer Churn Prediction," Working Papers of Faculty of Economics and Business Administration, Ghent University, Belgium 13/869, Ghent University, Faculty of Economics and Business Administration.
    4. K. W. De Bock & D. Van Den Poel, 2012. "Reconciling Performance and Interpretability in Customer Churn Prediction using Ensemble Learning based on Generalized Additive Models," Working Papers of Faculty of Economics and Business Administration, Ghent University, Belgium 12/805, Ghent University, Faculty of Economics and Business Administration.
    5. Annamalai, Balamurugan & Yoshida, Masayuki & Varshney, Sanjeev & Pathak, Atul Arun & Venugopal, Pingali, 2021. "Social media content strategy for sport clubs to drive fan engagement," Journal of Retailing and Consumer Services, Elsevier, vol. 62(C).
    6. M. Ballings & D. Van Den Poel, 2012. "The Relevant Length of Customer Event History for Churn Prediction: How long is long enough?," Working Papers of Faculty of Economics and Business Administration, Ghent University, Belgium 12/804, Ghent University, Faculty of Economics and Business Administration.
    7. Schaeffer, Satu Elisa & Rodriguez Sanchez, Sara Veronica, 2020. "Forecasting client retention — A machine-learning approach," Journal of Retailing and Consumer Services, Elsevier, vol. 52(C).
    8. Giovanni Bernardo & Massimo Ruberti & Roberto Verona, 2022. "Image is everything! Professional football players' visibility and wages: evidence from the Italian Serie A," Applied Economics, Taylor & Francis Journals, vol. 54(5), pages 595-614, January.
    9. Matthias Bogaert & Michel Ballings & Dirk Van den Poel, 2018. "Evaluating the importance of different communication types in romantic tie prediction on social media," Annals of Operations Research, Springer, vol. 263(1), pages 501-527, April.
    10. De Caigny, Arno & Coussement, Kristof & De Bock, Koen W., 2018. "A new hybrid classification algorithm for customer churn prediction based on logistic regression and decision trees," European Journal of Operational Research, Elsevier, vol. 269(2), pages 760-772.
    11. repec:cup:judgdm:v:13:y:2018:i:6:p:607-621 is not listed on IDEAS
    12. Albrecht, Tobias & Rausch, Theresa Maria & Derra, Nicholas Daniel, 2021. "Call me maybe: Methods and practical implementation of artificial intelligence in call center arrivals’ forecasting," Journal of Business Research, Elsevier, vol. 123(C), pages 267-278.
    13. Adler, Werner & Lausen, Berthold, 2009. "Bootstrap estimated true and false positive rates and ROC curve," Computational Statistics & Data Analysis, Elsevier, vol. 53(3), pages 718-729, January.
    14. Coussement, Kristof & De Bock, Koen W., 2013. "Customer churn prediction in the online gambling industry: The beneficial effect of ensemble learning," Journal of Business Research, Elsevier, vol. 66(9), pages 1629-1636.
    15. Hayes, Michelle & Filo, Kevin & Geurin, Andrea & Riot, Caroline, 2020. "An exploration of the distractions inherent to social media use among athletes," Sport Management Review, Elsevier, vol. 23(5), pages 852-868.
    16. Næss, Hans Erik, 2017. "Authenticity matters: A digital ethnography of FIA World Rally Championship fan forums," Sport Management Review, Elsevier, vol. 20(1), pages 105-113.
    17. Nicolas Scelles & Boris Helleu & Christophe Durand & Liliane Bonnal & Stephen Morrow, 2017. "Explaining the Number of Social Media Fans for North American and European Professional Sports Clubs with Determinants of Their Financial Value," IJFS, MDPI, vol. 5(4), pages 1-19, November.
    18. Kriebel, Johannes & Stitz, Lennart, 2022. "Credit default prediction from user-generated text in peer-to-peer lending using deep learning," European Journal of Operational Research, Elsevier, vol. 302(1), pages 309-323.
    19. Kharouf, Husni & Biscaia, Rui & Garcia-Perez, Alexeis & Hickman, Ellie, 2020. "Understanding online event experience: The importance of communication, engagement and interaction," Journal of Business Research, Elsevier, vol. 121(C), pages 735-746.
    20. Boris Helleu, 2016. "Un état de la recherche en management du #Digisport : enjeux et perspectives," Post-Print hal-01715966, HAL.
    21. Yuyu Fan & David V. Budescu & David Mandel & Mark Himmelstein, 2019. "Improving Accuracy by Coherence Weighting of Direct and Ratio Probability Judgments," Decision Analysis, INFORMS, vol. 16(3), pages 197-217, September.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:inm:ordeca:v:14:y:2017:i:4:p:274-297. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Asher (email available below). General contact details of provider: https://edirc.repec.org/data/inforea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.