Author
Listed:
- Nobuhiko Terui
- Yinxing Li
Abstract
In this article, we propose a regression model for high-dimensional sparse data from storelevel aggregated POS systems. The modeling procedure comprises two sub-models--topic model and hierarchical factor regression model--that are applied sequentially not only for accommodating high dimensionality and sparseness but also for managerial interpretation. First, the topic model is applied unusually to aggregated data to decompose the daily aggregated sales volume of a product into sub-sales for several topics by allocating each unit sale ("word" in text analysis) in a day ("document") into one of topics based on joint purchase information. This stage reduces the dimension of data inside topics because topic distribution is not uniform and product sales are allocated mostly into smaller numbers of topics. Next, the market response regression model in the topic is estimated by using information about other items in the same topic. That is, we construct a topic-wise market response function by using explanatory variables not only of itself but also of other items belonging to the same topic. Additional reduction of dimensionality remains necessary for each topic, and we propose a hierarchical factor regression model based on canonical correlation analysis for original high-dimensional sample spaces. Then we discuss feature selection based on a credible interval of parameters' posterior density. Empirical study shows that (i) our model has the advantage of managerial implications obtained from topic-hierarchical factor regression defined according to their contexts, and (ii) it offers better fit than does conventional category regressions in-sample as well as out of sample.
Suggested Citation
Download full text from publisher
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:toh:dssraa:66. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Tohoku University Library (email available below). General contact details of provider: https://edirc.repec.org/data/fetohjp.html .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.