IDEAS home Printed from https://ideas.repec.org/a/inm/ormksc/v37y2018i6p930-952.html
   My bibliography  Save this article

A Semantic Approach for Estimating Consumer Content Preferences from Online Search Queries

Author

Listed:
  • Jia Liu

    (Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong)

  • Olivier Toubia

    (Columbia Business School, Columbia University, New York, New York 10025)

Abstract

We extend latent Dirichlet allocation by introducing a topic model, hierarchically dual latent Dirichlet allocation (HDLDA), for contexts in which one type of document (e.g., search queries) are semantically related to another type of document (e.g., search results). In the context of online search engines, HDLDA identifies not only topics in short search queries and web pages, but also how the topics in search queries relate to the topics in the corresponding top search results. The output of HDLDA provides a basis for estimating consumers’ content preferences on the fly from their search queries given a set of assumptions on how consumers translate their content preferences into search queries. We apply HDLDA and explore its use in the estimation of content preferences in two studies. The first is a lab experiment in which we manipulate participants’ content preferences and observe the queries they formulate and their browsing behavior across different product categories. The second is a field study, which allows us to explore whether the content preferences estimated based on HDLDA may be used to explain and predict click-through rates in online search advertising.

Suggested Citation

  • Jia Liu & Olivier Toubia, 2018. "A Semantic Approach for Estimating Consumer Content Preferences from Online Search Queries," Marketing Science, INFORMS, vol. 37(6), pages 930-952, November.
  • Handle: RePEc:inm:ormksc:v:37:y:2018:i:6:p:930-952
    DOI: 10.1287/mksc.2018.1112
    as

    Download full text from publisher

    File URL: https://doi.org/10.1287/mksc.2018.1112
    Download Restriction: no

    File URL: https://libkey.io/10.1287/mksc.2018.1112?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Przemyslaw Jeziorski & Ilya Segal, 2010. "What Makes them Click: Empirical Analysis of Consumer Demand for Search Advertising," Economics Working Paper Archive 569, The Johns Hopkins University,Department of Economics.
    2. Sridhar Narayanan & Kirthi Kalyanam, 2015. "Position Effects in Search Advertising and their Moderators: A Regression Discontinuity Approach," Marketing Science, INFORMS, vol. 34(3), pages 388-407, May.
    3. Jun B. Kim & Paulo Albuquerque & Bart J. Bronnenberg, 2010. "Online Demand Under Limited Consumer Search," Marketing Science, INFORMS, vol. 29(6), pages 1001-1023, 11-12.
    4. Nikolay Archak & Anindya Ghose & Panagiotis G. Ipeirotis, 2011. "Deriving the Pricing Power of Product Features by Mining Consumer Reviews," Management Science, INFORMS, vol. 57(8), pages 1485-1509, August.
    5. Grün, Bettina & Hornik, Kurt, 2011. "topicmodels: An R Package for Fitting Topic Models," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 40(i13).
    6. David J. Spiegelhalter & Nicola G. Best & Bradley P. Carlin & Angelika Van Der Linde, 2002. "Bayesian measures of model complexity and fit," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 64(4), pages 583-639, October.
    7. Anindya Ghose & Panagiotis G. Ipeirotis & Beibei Li, 2012. "Designing Ranking Systems for Hotels on Travel Search Engines by Mining User-Generated and Crowdsourced Content," Marketing Science, INFORMS, vol. 31(3), pages 493-520, May.
    8. Kihlstrom, Richard E & Riordan, Michael H, 1984. "Advertising as a Signal," Journal of Political Economy, University of Chicago Press, vol. 92(3), pages 427-450, June.
    9. John Geweke, 1991. "Evaluating the accuracy of sampling-based approaches to the calculation of posterior moments," Staff Report 148, Federal Reserve Bank of Minneapolis.
    10. Green, Paul E & Srinivasan, V, 1978. "Conjoint Analysis in Consumer Research: Issues and Outlook," Journal of Consumer Research, Journal of Consumer Research Inc., vol. 5(2), pages 103-123, Se.
    11. Peiling Wang & Michael W. Berry & Yiheng Yang, 2003. "Mining longitudinal web queries: Trends and patterns," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 54(8), pages 743-758, June.
    12. Oded Netzer & Ronen Feldman & Jacob Goldenberg & Moshe Fresko, 2012. "Mine Your Own Business: Market-Structure Surveillance Through Text Mining," Marketing Science, INFORMS, vol. 31(3), pages 521-543, May.
    13. Michael Trusov & Liye Ma & Zainab Jamal, 2016. "Crumbs of the Cookie: User Profiling in Customer-Base Analysis and Behavioral Targeting," Marketing Science, INFORMS, vol. 35(3), pages 405-426, May.
    14. Zhang, Yuchi & Moe, Wendy W. & Schweidel, David A., 2017. "Modeling the role of message content and influencers in social media rebroadcasting," International Journal of Research in Marketing, Elsevier, vol. 34(1), pages 100-119.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Savannah Wei Shi & Michael Trusov, 2021. "The Path to Click: Are You on It?," Marketing Science, INFORMS, vol. 40(2), pages 344-365, March.
    2. Carl F. Mela & Jason M. T. Roos & Tulio Sousa, 2023. "Advertiser Learning in Direct Advertising Markets," Papers 2307.07015, arXiv.org, revised Apr 2024.
    3. Jia Liu & Olivier Toubia, 2020. "Search query formation by strategic consumers," Quantitative Marketing and Economics (QME), Springer, vol. 18(2), pages 155-194, June.
    4. Ruomeng Cui & Meng Li & Qiang Li, 2020. "Value of High-Quality Logistics: Evidence from a Clash Between SF Express and Alibaba," Management Science, INFORMS, vol. 66(9), pages 3879-3902, September.
    5. Wang, Xin (Shane) & Ryoo, Jun Hyun (Joseph) & Bendle, Neil & Kopalle, Praveen K., 2021. "The role of machine learning analytics and metrics in retailing research," Journal of Retailing, Elsevier, vol. 97(4), pages 658-675.
    6. Hyowon Kim & Greg M. Allenby, 2022. "Integrating Textual Information into Models of Choice and Scaled Response Data," Marketing Science, INFORMS, vol. 41(4), pages 815-830, July.
    7. Yuping Liu-Thompkins & Shintaro Okazaki & Hairong Li, 2022. "Artificial empathy in marketing interactions: Bridging the human-AI gap in affective and social customer experience," Journal of the Academy of Marketing Science, Springer, vol. 50(6), pages 1198-1218, November.
    8. Martin Reisenbichler & Thomas Reutterer & David A. Schweidel & Daniel Dan, 2022. "Frontiers: Supporting Content Marketing with Natural Language Generation," Marketing Science, INFORMS, vol. 41(3), pages 441-452, May.
    9. Honka, Elisabeth & Seiler, Stephan & Ursu, Raluca, 2024. "Consumer search: What can we learn from pre-purchase data?," Journal of Retailing, Elsevier, vol. 100(1), pages 114-129.
    10. Peiyao Li & Noah Castelo & Zsolt Katona & Miklos Sarvary, 2024. "Frontiers: Determining the Validity of Large Language Models for Automated Perceptual Analysis," Marketing Science, INFORMS, vol. 43(2), pages 254-266, March.
    11. Bruno Jacobs & Dennis Fok & Bas Donkers, 2021. "Understanding Large-Scale Dynamic Purchase Behavior," Marketing Science, INFORMS, vol. 40(5), pages 844-870, September.
    12. Kaatz, Christopher & Brock, Christian & Figura, Lilli, 2019. "Are you still online or are you already mobile? – Predicting the path to successful conversions across different devices," Journal of Retailing and Consumer Services, Elsevier, vol. 50(C), pages 10-21.
    13. Ning Zhong & David A. Schweidel, 2020. "Capturing Changes in Social Media Content: A Multiple Latent Changepoint Topic Model," Marketing Science, INFORMS, vol. 39(4), pages 827-846, July.
    14. Jia Liu & Olivier Toubia & Shawndra Hill, 2021. "Content-Based Model of Web Search Behavior: An Application to TV Show Search," Management Science, INFORMS, vol. 67(10), pages 6378-6398, October.
    15. Paramveer S. Dhillon & Sinan Aral, 2021. "Modeling Dynamic User Interests: A Neural Matrix Factorization Approach," Marketing Science, INFORMS, vol. 40(6), pages 1059-1080, November.
    16. Daria Dzyabura & Siham El Kihal & John R. Hauser & Marat Ibragimov, 2023. "Leveraging the Power of Images in Managing Product Return Rates," Marketing Science, INFORMS, vol. 42(6), pages 1125-1142, November.
    17. Huang, Ming-Hui & Rust, Roland T., 2022. "A Framework for Collaborative Artificial Intelligence in Marketing," Journal of Retailing, Elsevier, vol. 98(2), pages 209-223.
    18. Hongshuang (Alice) Li, 2022. "Converting free users to paid subscribers in the SaaS context: The impact of marketing touchpoints, message content, and usage," Production and Operations Management, Production and Operations Management Society, vol. 31(5), pages 2185-2203, May.
    19. Ma, Liye & Sun, Baohong, 2020. "Machine learning and AI in marketing – Connecting computing power to human insights," International Journal of Research in Marketing, Elsevier, vol. 37(3), pages 481-504.
    20. Venkatesh Shankar & Sohil Parsana, 2022. "An overview and empirical comparison of natural language processing (NLP) models and an introduction to and empirical application of autoencoder models in marketing," Journal of the Academy of Marketing Science, Springer, vol. 50(6), pages 1324-1350, November.
    21. Lijia Ma & Xingchen Xu & Yong Tan, 2024. "Crafting Knowledge: Exploring the Creative Mechanisms of Chat-Based Search Engines," Papers 2402.19421, arXiv.org.
    22. Shah Jahan Miah & Huy Quan Vu & Damminda Alahakoon, 2022. "A social media analytics perspective for human‐oriented smart city planning and management," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 73(1), pages 119-135, January.
    23. Peter Landry, 2021. "Keywords, limited consideration, and organic product listings," Quantitative Marketing and Economics (QME), Springer, vol. 19(3), pages 505-566, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Bitty Balducci & Detelina Marinova, 2018. "Unstructured data in marketing," Journal of the Academy of Marketing Science, Springer, vol. 46(4), pages 557-590, July.
    2. Anindya Ghose & Panagiotis G. Ipeirotis & Beibei Li, 2019. "Modeling Consumer Footprints on Search Engines: An Interplay with Social Media," Management Science, INFORMS, vol. 65(3), pages 1363-1385, March.
    3. Jia Liu & Olivier Toubia & Shawndra Hill, 2021. "Content-Based Model of Web Search Behavior: An Application to TV Show Search," Management Science, INFORMS, vol. 67(10), pages 6378-6398, October.
    4. Sheng, Jie & Amankwah-Amoah, Joseph & Wang, Xiaojun, 2017. "A multidisciplinary perspective of big data in management research," International Journal of Production Economics, Elsevier, vol. 191(C), pages 97-112.
    5. Dominik Gutt & Jürgen Neumann & Steffen Zimmermann & Dennis Kundisch & Jianqing Chen, 2018. "Design of Review Systems - A Strategic Instrument to shape Online Review Behavior and Economic Outcomes," Working Papers Dissertations 42, Paderborn University, Faculty of Business Administration and Economics.
    6. Li, Xi & Shi, Mengze & Wang, Xin (Shane), 2019. "Video mining: Measuring visual information using automatic methods," International Journal of Research in Marketing, Elsevier, vol. 36(2), pages 216-231.
    7. Xiao Liu & Param Vir Singh & Kannan Srinivasan, 2016. "A Structured Analysis of Unstructured Big Data by Leveraging Cloud Computing," Marketing Science, INFORMS, vol. 35(3), pages 363-388, May.
    8. Marc R. Dotson & Joachim Büschken & Greg M. Allenby, 2020. "Explaining Preference Heterogeneity with Mixed Membership Modeling," Marketing Science, INFORMS, vol. 39(2), pages 407-426, March.
    9. Dinesh Puranam & Vishal Narayan & Vrinda Kadiyali, 2017. "The Effect of Calorie Posting Regulation on Consumer Opinion: A Flexible Latent Dirichlet Allocation Model with Informative Priors," Marketing Science, INFORMS, vol. 36(5), pages 726-746, September.
    10. Xin (Shane) Wang & Feng Mai & Roger H. L. Chiang, 2014. "Database Submission ---Market Dynamics and User-Generated Content About Tablet Computers," Marketing Science, INFORMS, vol. 33(3), pages 449-458, May.
    11. Hartmann, Jochen & Huppertz, Juliana & Schamp, Christina & Heitmann, Mark, 2019. "Comparing automated text classification methods," International Journal of Research in Marketing, Elsevier, vol. 36(1), pages 20-38.
    12. Saeed Tajdini, 2023. "The effects of internet search intensity for products on companies’ stock returns: a competitive intelligence perspective," Journal of Marketing Analytics, Palgrave Macmillan, vol. 11(3), pages 352-365, September.
    13. Alantari, Huwail J. & Currim, Imran S. & Deng, Yiting & Singh, Sameer, 2022. "An empirical comparison of machine learning methods for text-based sentiment analysis of online consumer reviews," International Journal of Research in Marketing, Elsevier, vol. 39(1), pages 1-19.
    14. Ning Zhong & David A. Schweidel, 2020. "Capturing Changes in Social Media Content: A Multiple Latent Changepoint Topic Model," Marketing Science, INFORMS, vol. 39(4), pages 827-846, July.
    15. Moon, Sangkil & Kamakura, Wagner A., 2017. "A picture is worth a thousand words: Translating product reviews into a product positioning map," International Journal of Research in Marketing, Elsevier, vol. 34(1), pages 265-285.
    16. Jorge Mejia & Shawn Mankad & Anandasivam Gopal, 2019. "A for Effort? Using the Crowd to Identify Moral Hazard in New York City Restaurant Hygiene Inspections," Information Systems Research, INFORMS, vol. 30(4), pages 1363-1386, December.
    17. Mengxia Zhang & Lan Luo, 2023. "Can Consumer-Posted Photos Serve as a Leading Indicator of Restaurant Survival? Evidence from Yelp," Management Science, INFORMS, vol. 69(1), pages 25-50, January.
    18. Hasmat Malik & Asyraf Afthanorhan & Noor Aina Amirah & Nuzhat Fatema, 2021. "Machine Learning Approach for Targeting and Recommending a Product for Project Management," Mathematics, MDPI, vol. 9(16), pages 1-29, August.
    19. Sheng, Jie & Amankwah-Amoah, Joseph & Wang, Xiaojun, 2019. "Technology in the 21st century: New challenges and opportunities," Technological Forecasting and Social Change, Elsevier, vol. 143(C), pages 321-335.
    20. Dokyun Lee & Kartik Hosanagar & Harikesh S. Nair, 2018. "Advertising Content and Consumer Engagement on Social Media: Evidence from Facebook," Management Science, INFORMS, vol. 64(11), pages 5105-5131, November.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:inm:ormksc:v:37:y:2018:i:6:p:930-952. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Asher (email available below). General contact details of provider: https://edirc.repec.org/data/inforea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.