Printed from https://ideas.repec.org/p/ant/wpaper/2017009.html

Attributing value in a data pooling setting for predictive modeling

Author

Listed:
  • MOEYERSOMS, Julie
  • D'ALESSANDRO, Brian
  • PROVOST, Foster
  • MARTENS, David

Abstract

The rapid growth of data sources comes with numerous challenges. One of them is determining their value. That is, when building prediction models based on different data sources, it is interesting to know how much each feature has contributed to a specific prediction. This gives an idea of how the benefits created by the prediction model could be divided among the features responsible for them. The goal of this paper is to define, solve and evaluate a data attribution scheme for predictive modeling that is “fair”, where fairness is defined using concepts from game theory. We use two methods from different research fields to distribute the value both on an instance level and, ultimately, on a feature level: the (approximate) Shapley value and an explanation approach for high-dimensional data. Using a high-dimensional and sparse data set, consisting of website visits for each user, we show that the proposed methods (i) make it possible to create a fair value distribution among a very large number of data sources (websites in this case) in a prediction model, and (ii) explain twice as many instances for a given number of features as just looking at the high-coefficient features. Interestingly, (iii) although the proposed methods come from different sources and motivations, the two new alternatives provide strikingly similar rankings of important features and divisions of the revenues.
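
To make the idea of an approximate Shapley value on an instance level concrete, here is a minimal sketch that estimates each feature's share of one prediction by averaging marginal contributions over random feature orderings. It is an illustration only, not the authors' implementation: the function name `approx_shapley`, the toy `weights`, the `visited` website indices, and the choice of an additive scoring function are all assumptions made for this example.

```python
import random
from typing import Callable, Dict, FrozenSet, Sequence


def approx_shapley(
    value: Callable[[FrozenSet[int]], float],
    players: Sequence[int],
    n_permutations: int = 200,
    seed: int = 0,
) -> Dict[int, float]:
    """Estimate Shapley values by sampling random orderings of the players.

    `value` maps a coalition (set of feature indices) to a payoff, here the
    model score obtained when only those features are "switched on".
    """
    rng = random.Random(seed)
    totals = {p: 0.0 for p in players}
    order = list(players)
    for _ in range(n_permutations):
        rng.shuffle(order)
        coalition = set()
        previous = value(frozenset(coalition))
        for p in order:
            coalition.add(p)
            current = value(frozenset(coalition))
            # Marginal contribution of p given the players that came before it.
            totals[p] += current - previous
            previous = current
    return {p: total / n_permutations for p, total in totals.items()}


# Hypothetical toy model: one weight per website (assumed values).
weights = {0: 1.5, 1: -0.5, 2: 2.0, 3: 0.25}
# Hypothetical user: the websites this user visited (the active features).
visited = [0, 2, 3]


def linear_score(coalition: FrozenSet[int]) -> float:
    # Additive score: unvisited or excluded websites contribute nothing.
    return sum(weights[j] for j in coalition)


phi = approx_shapley(linear_score, visited)
print(phi)
```

Because the toy score is additive, every ordering gives a feature the same marginal contribution, so each estimate coincides with that feature's weight and the shares sum to the full score of the instance. For a non-additive value function the estimates would vary with the sampled orderings and only approximate the exact Shapley values.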

Suggested Citation

  • MOEYERSOMS, Julie & D'ALESSANDRO, Brian & PROVOST, Foster & MARTENS, David, 2017. "Attributing value in a data pooling setting for predictive modeling," Working Papers 2017009, University of Antwerp, Faculty of Business and Economics.
  • Handle: RePEc:ant:wpaper:2017009

    Download full text from publisher

    File URL: https://repository.uantwerpen.be/docman/irua/ab3df5/145523.pdf
    Download Restriction: no


    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Alexei A. Gaivoronski & Per Jonny Nesse & Olai Bendik Erdal, 2017. "Internet service provision and content services: paid peering and competition between internet providers," Netnomics, Springer, vol. 18(1), pages 43-79, May.
    2. Battigalli, Pierpaolo & Leonetti, Paolo & Maccheroni, Fabio, 2020. "Behavioral equivalence of extensive game structures," Games and Economic Behavior, Elsevier, vol. 121(C), pages 533-547.
    3. Thijssen, J.J.J., 2003. "Investment under uncertainty, market evolution and coalition spillovers in a game theoretic perspective," Other publications TiSEM 672073a6-492e-4621-8d4a-0, Tilburg University, School of Economics and Management.
    4. Daley, Brendan & Sadowski, Philipp, 2017. "Magical thinking: A representation result," Theoretical Economics, Econometric Society, vol. 12(2), May.
    5. Horaguchi, Haruo, 1996. "The role of information processing cost as the foundation of bounded rationality in game theory," Economics Letters, Elsevier, vol. 51(3), pages 287-294, June.
    6. Albert Banal-Estañol & Inés Macho-Stadler, 2007. "Financial Incentives in Academia: Research versus Development," Working Papers 295, Barcelona School of Economics.
    7. Felix J. Bierbrauer & Pierre C. Boyer, 2016. "Efficiency, Welfare, and Political Competition," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 131(1), pages 461-518.
    8. Saša Zorc & Ilia Tsetlin, 2020. "Deadlines, Offer Timing, and the Search for Alternatives," Operations Research, INFORMS, vol. 68(3), pages 927-948, May.
    9. , & ,, 2013. "Implementation of communication equilibria by correlated cheap talk: The two-player case," Theoretical Economics, Econometric Society, vol. 8(1), January.
    10. Inohara, Takehiro, 2023. "Similarities, differences, and preservation of efficiencies, with application to attitude analysis, within the Graph Model for Conflict Resolution," European Journal of Operational Research, Elsevier, vol. 306(3), pages 1330-1348.
    11. Huiye Ma & Nicole Ronald & Theo Arentze & Harry Timmermans, 2013. "Negotiating on location, timing, duration, and participant in agent-mediated joint activity-travel scheduling," Journal of Geographical Systems, Springer, vol. 15(4), pages 427-451, October.
    12. Giacomo Bonanno, 2016. "Exploring the Gap between Perfect Bayesian Equilibrium and Sequential Equilibrium," Games, MDPI, vol. 7(4), pages 1-23, November.
    13. Josep Maria Izquierdo & Carlos Rafels, 2017. "The incentive core in co-investment problems," UB School of Economics Working Papers 2017/369, University of Barcelona School of Economics.
    14. Alex Garivaltis, 2021. "Grade Inflation and Stunted Effort in a Curved Economics Course," Papers 2108.03709, arXiv.org, revised Aug 2021.
    15. Hitoshi Matsushima, 2005. "On Detail‐Free Mechanism Design And Rationality," The Japanese Economic Review, Japanese Economic Association, vol. 56(1), pages 41-54, March.
    16. Gömöri, András, 2005. "Nyugdíjrendszer és játékelmélet. Megjegyzések Mészáros József cikkéhez [The pension system and game theory. Remarks on the article by József Mészáros]," Közgazdasági Szemle (Economic Review - monthly of the Hungarian Academy of Sciences), Közgazdasági Szemle Alapítvány (Economic Review Foundation), vol. 0(7), pages 732-742.
    17. Carlos Alós-Ferrer & Klaus Ritzberger, 2013. "Large extensive form games," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 52(1), pages 75-102, January.
    18. Shimoji, Makoto, 2004. "On the equivalence of weak dominance and sequential best response," Games and Economic Behavior, Elsevier, vol. 48(2), pages 385-402, August.
    19. Fatma Aslan & Papatya Duman & Walter Trockel, 2019. "Duality for General TU-games Redefined," Working Papers CIE 121, Paderborn University, CIE Center for International Economics.
    20. Sanjith Gopalakrishnan & Daniel Granot & Frieda Granot, 2021. "Consistent Allocation of Emission Responsibility in Fossil Fuel Supply Chains," Management Science, INFORMS, vol. 67(12), pages 7637-7668, December.
