IDEAS home Printed from https://ideas.repec.org/p/ems/eureri/12243.html
   My bibliography  Save this paper

Choosing Attribute Weights for Item Dissimilarity using Clikstream Data with an Application to a Product Catalog Map

Author

Listed:
  • Kagie, M.
  • van Wezel, M.C.
  • Groenen, P.J.F.

Abstract

In content- and knowledge-based recommender systems often a measure of (dis)similarity between items is used. Frequently, this measure is based on the attributes of the items. However, which attributes are important for the users of the system remains an important question to answer. In this paper, we present an approach to determine attribute weights in a dissimilarity measure using clickstream data of an e-commerce website. Counted is how many times products are sold and based on this a Poisson regression model is estimated. Estimates of this model are then used to determine the attribute weights in the dissimilarity measure. We show an application of this approach on a product catalog of MP3 players provided by Compare Group, owner of the Dutch price comparison site http://www.vergelijk.nl, and show how the dissimilarity measure can be used to improve 2D product catalog visualizations.

Suggested Citation

  • Kagie, M. & van Wezel, M.C. & Groenen, P.J.F., 2008. "Choosing Attribute Weights for Item Dissimilarity using Clikstream Data with an Application to a Product Catalog Map," ERIM Report Series Research in Management ERS-2008-024-MKT, Erasmus Research Institute of Management (ERIM), ERIM is the joint research institute of the Rotterdam School of Management, Erasmus University and the Erasmus School of Economics (ESE) at Erasmus University Rotterdam.
  • Handle: RePEc:ems:eureri:12243
    as

    Download full text from publisher

    File URL: https://repub.eur.nl/pub/12243/ERS-2008-024-MKT.pdf
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Ciampi, Antonio, 1991. "Generalized regression trees," Computational Statistics & Data Analysis, Elsevier, vol. 12(1), pages 57-78, August.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Kagie, M. & van Wezel, M.C. & Groenen, P.J.F., 2009. "An Empirical Comparison of Dissimilarity Measures for Recommender Systems," ERIM Report Series Research in Management ERS-2009-023-MKT, Erasmus Research Institute of Management (ERIM), ERIM is the joint research institute of the Rotterdam School of Management, Erasmus University and the Erasmus School of Economics (ESE) at Erasmus University Rotterdam.
    2. Kagie, M. & van Wezel, M.C. & Groenen, P.J.F., 2009. "Map Based Visualization of Product Catalogs," ERIM Report Series Research in Management ERS-2009-010-MKT, Erasmus Research Institute of Management (ERIM), ERIM is the joint research institute of the Rotterdam School of Management, Erasmus University and the Erasmus School of Economics (ESE) at Erasmus University Rotterdam.
    3. Kagie, M. & van Wezel, M.C. & Groenen, P.J.F., 2009. "Determination of Attribute Weights for Recommender Systems Based on Product Popularity," ERIM Report Series Research in Management ERS-2009-022-MKT, Erasmus Research Institute of Management (ERIM), ERIM is the joint research institute of the Rotterdam School of Management, Erasmus University and the Erasmus School of Economics (ESE) at Erasmus University Rotterdam.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Tomàs Aluja-Banet & Eduard Nafria, 2003. "Stability and scalability in decision trees," Computational Statistics, Springer, vol. 18(3), pages 505-520, September.
    2. L. Lombardo & M. Cama & C. Conoscenti & M. Märker & E. Rotigliano, 2015. "Binary logistic regression versus stochastic gradient boosted decision trees in assessing landslide susceptibility for multiple-occurring landslide events: application to the 2009 storm event in Messi," Natural Hazards: Journal of the International Society for the Prevention and Mitigation of Natural Hazards, Springer;International Society for the Prevention and Mitigation of Natural Hazards, vol. 79(3), pages 1621-1648, December.
    3. da Rosa, Joel Correa & Veiga, Alvaro & Medeiros, Marcelo C., 2008. "Tree-structured smooth transition regression models," Computational Statistics & Data Analysis, Elsevier, vol. 52(5), pages 2469-2488, January.
    4. Alessandra De Rose & Alessandro Pallara, 1997. "Survival Trees: An Alternative Non-Parametric Multivariate Technique for Life History Analysis," European Journal of Population, Springer;European Association for Population Studies, vol. 13(3), pages 223-241, September.
    5. Christophe Dutang & Quentin Guibert, 2021. "An explicit split point procedure in model-based trees allowing for a quick fitting of GLM trees and GLM forests," Post-Print hal-03448250, HAL.
    6. Nan-Ting Liu & Feng-Chang Lin & Yu-Shan Shih, 2020. "Count regression trees," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 14(1), pages 5-27, March.
    7. Wei-Yin Loh, 2014. "Fifty Years of Classification and Regression Trees," International Statistical Review, International Statistical Institute, vol. 82(3), pages 329-348, December.
    8. Bárcena Ruiz, María Jesús & Tusell Palmer, Fernando Jorge, 2002. "Multivariate Data Imputation using Trees," BILTOKI 1134-8984, Universidad del País Vasco - Departamento de Economía Aplicada III (Econometría y Estadística).
    9. Hecker, Hartmut & Wubbelt, Peter, 1997. "Clustering by response: CBR," Computational Statistics & Data Analysis, Elsevier, vol. 24(2), pages 193-215, April.
    10. Ahn, Hongshik, 1996. "Log-normal regression modeling through recursive partitioning," Computational Statistics & Data Analysis, Elsevier, vol. 21(4), pages 381-398, April.
    11. Joel Corrêa da Rosa & Álvaro Veiga & Marcelo C. Medeiros, 2003. "Three-structured smooth transition regression models based on CART algorithm," Textos para discussão 469, Department of Economics PUC-Rio (Brazil).
    12. Elise Dusseldorp & Jacqueline Meulman, 2004. "The regression trunk approach to discover treatment covariate interaction," Psychometrika, Springer;The Psychometric Society, vol. 69(3), pages 355-374, September.

    More about this item

    Keywords

    attribute weights; clickstream data; comparison; dissimilarity measure;
    All these keywords.

    JEL classification:

    • C44 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods: Special Topics - - - Operations Research; Statistical Decision Theory
    • C53 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Forecasting and Prediction Models; Simulation Methods
    • D8 - Microeconomics - - Information, Knowledge, and Uncertainty
    • M - Business Administration and Business Economics; Marketing; Accounting; Personnel Economics
    • M31 - Business Administration and Business Economics; Marketing; Accounting; Personnel Economics - - Marketing and Advertising - - - Marketing

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:ems:eureri:12243. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: RePub (email available below). General contact details of provider: https://edirc.repec.org/data/erimanl.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.