Author
Listed:
- Jayden Fitzsimon
(School of Information Technology and Electrical Engineering, The University of Queensland, Brisbane, Queensland, Australia)
- Shrikant Agrawal
(School of Information Technology and Electrical Engineering, The University of Queensland, Brisbane, Queensland, Australia)
- Kirti Khade
(School of Information Technology and Electrical Engineering, The University of Queensland, Brisbane, Queensland, Australia)
- Evan Shellshear
(��Biarri, Brisbane, Queensland, Australia)
- Jonathon Allport
(��Biarri, Brisbane, Queensland, Australia)
- Archie C. Chapman
(School of Information Technology and Electrical Engineering, The University of Queensland, Brisbane, Queensland, Australia)
Abstract
Market basket analysis (MBA) aims to discover purchasing patterns and item associations from customer transaction data. A major drawback of current techniques for MBA is a lack of quantitative metrics to measure the real value associated with basket items. This paper addresses this gap by deriving a practical game-theoretic measure for MBA based on the Shapley value of cooperative games, which we call Shapley value index for MBA (SIMBA). The SIMBA of an item represents the average revenue it earns, including its influence on the revenue earned from sales of other items. A significant challenge when applying Shapley value-inspired approaches in practical domains is the exponential complexity of Shapley value computation. However, for the MBA domain, we show that SIMBA admits a scalable exact computation method that does not require sampling or other approximations. Specifically, a characteristic function for the MBA game is constructed so that the transaction dataset input corresponds to the game’s Harsanyi dividends. The relationship between Harsanyi dividends and the Shapley value is then exploited to efficiently compute SIMBA. This approach scales linearly in the number of transactions, making SIMBA a feasible approach for quantitative MBA. SIMBA can be used to screen conventional MBA techniques, such as association rules, to identify significant rules based on the items’ cross-selling capacity. This combination of existing MBA methods and SIMBA will generate rules based not only on frequency of co-occurrence, but also on the significance of the items. We demonstrate the working of the algorithm by analyzing openly available transaction data from an online retail store. To the best of our knowledge, this is the first time Shapley value is used in this way to solve market basket analyses of a practical size.
Suggested Citation
Jayden Fitzsimon & Shrikant Agrawal & Kirti Khade & Evan Shellshear & Jonathon Allport & Archie C. Chapman, 2022.
"A Shapley Value Index for Market Basket Analysis: Efficient Computation Using an Harsanyi Dividend Representation,"
International Game Theory Review (IGTR), World Scientific Publishing Co. Pte. Ltd., vol. 24(04), pages 1-29, December.
Handle:
RePEc:wsi:igtrxx:v:24:y:2022:i:04:n:s0219198922500153
DOI: 10.1142/S0219198922500153
Download full text from publisher
As the access to this document is restricted, you may want to search for a different version of it.
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:wsi:igtrxx:v:24:y:2022:i:04:n:s0219198922500153. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Tai Tone Lim (email available below). General contact details of provider: http://www.worldscinet.com/igtr/igtr.shtml .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.