IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2412.15239.html
   My bibliography  Save this paper

Modeling Story Expectations to Understand Engagement: A Generative Framework Using LLMs

Author

Listed:
  • Hortense Fong
  • George Gui

Abstract

Understanding when and why consumers engage with stories is crucial for content creators and platforms. While existing theories suggest that audience beliefs of what is going to happen should play an important role in engagement decisions, empirical work has mostly focused on developing techniques to directly extract features from actual content, rather than capturing forward-looking beliefs, due to the lack of a principled way to model such beliefs in unstructured narrative data. To complement existing feature extraction techniques, this paper introduces a novel framework that leverages large language models to model audience forward-looking beliefs about how stories might unfold. Our method generates multiple potential continuations for each story and extracts features related to expectations, uncertainty, and surprise using established content analysis techniques. Applying our method to over 30,000 book chapters, we demonstrate that our framework complements existing feature engineering techniques by amplifying their marginal explanatory power on average by 31%. The results reveal that different types of engagement-continuing to read, commenting, and voting-are driven by distinct combinations of current and anticipated content features. Our framework provides a novel way to study and explore how audience forward-looking beliefs shape their engagement with narrative media, with implications for marketing strategy in content-focused industries.

Suggested Citation

  • Hortense Fong & George Gui, 2024. "Modeling Story Expectations to Understand Engagement: A Generative Framework Using LLMs," Papers 2412.15239, arXiv.org, revised Mar 2025.
  • Handle: RePEc:arx:papers:2412.15239
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2412.15239
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Rust, John, 1987. "Optimal Replacement of GMC Bus Engines: An Empirical Model of Harold Zurcher," Econometrica, Econometric Society, vol. 55(5), pages 999-1033, September.
    2. Sanjog Misra & Harikesh Nair, 2011. "A structural model of sales-force compensation dynamics: Estimation and field implementation," Quantitative Marketing and Economics (QME), Springer, vol. 9(3), pages 211-257, September.
    3. Tülin Erdem & Michael P. Keane, 1996. "Decision-Making Under Uncertainty: Capturing Dynamic Brand Choice Processes in Turbulent Consumer Goods Markets," Marketing Science, INFORMS, vol. 15(1), pages 1-20.
    4. Charles F. Manski, 2004. "Measuring Expectations," Econometrica, Econometric Society, vol. 72(5), pages 1329-1376, September.
    5. John J. Horton, 2023. "Large Language Models as Simulated Economic Agents: What Can We Learn from Homo Silicus?," NBER Working Papers 31122, National Bureau of Economic Research, Inc.
    6. John J. Horton, 2023. "Large Language Models as Simulated Economic Agents: What Can We Learn from Homo Silicus?," Papers 2301.07543, arXiv.org.
    7. Peiyao Li & Noah Castelo & Zsolt Katona & Miklos Sarvary, 2024. "Frontiers: Determining the Validity of Large Language Models for Automated Perceptual Analysis," Marketing Science, INFORMS, vol. 43(2), pages 254-266, March.
    8. Jeffrey Ely & Alexander Frankel & Emir Kamenica, 2015. "Suspense and Surprise," Journal of Political Economy, University of Chicago Press, vol. 123(1), pages 215-260.
    9. Harikesh Nair, 2007. "Intertemporal price discrimination with forward-looking consumers: Application to the US market for console video-games," Quantitative Marketing and Economics (QME), Springer, vol. 5(3), pages 239-292, September.
    10. Jehoshua Eliashberg & Sam K. Hui & Z. John Zhang, 2007. "From Story Line to Box Office: A New Approach for Green-Lighting Movie Scripts," Management Science, INFORMS, vol. 53(6), pages 881-893, June.
    11. Günter J. Hitsch, 2006. "An Empirical Model of Optimal Dynamic Product Launch and Exit Under Demand Uncertainty," Marketing Science, INFORMS, vol. 25(1), pages 25-50, 01-02.
    12. George Gui & Olivier Toubia, 2023. "The Challenge of Using LLMs to Simulate Human Behavior: A Causal Inference Perspective," Papers 2312.15524, arXiv.org, revised Jan 2025.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Guofang Huang & Hong Luo & Jing Xia, 2019. "Invest in Information or Wing It? A Model of Dynamic Pricing with Seller Learning," Management Science, INFORMS, vol. 65(12), pages 5556-5583, December.
    2. Pradeep K. Chintagunta & Harikesh S. Nair, 2011. "Structural Workshop Paper --Discrete-Choice Models of Consumer Demand in Marketing," Marketing Science, INFORMS, vol. 30(6), pages 977-996, November.
    3. Andrew Ching & Susumu Imai & Masakazu Ishihara & Neelam Jain, 2012. "A practitioner’s guide to Bayesian estimation of discrete choice dynamic programming models," Quantitative Marketing and Economics (QME), Springer, vol. 10(2), pages 151-196, June.
    4. Gonca P. Soysal & Lakshman Krishnamurthi, 2012. "Demand Dynamics in the Seasonal Goods Industry: An Empirical Analysis," Marketing Science, INFORMS, vol. 31(2), pages 293-316, March.
    5. Gautam Gowrisankaran & Marc Rysman, 2012. "Dynamics of Consumer Demand for New Durable Goods," Journal of Political Economy, University of Chicago Press, vol. 120(6), pages 1173-1219.
    6. Yuan Gao & Dokyun Lee & Gordon Burtch & Sina Fazelpour, 2024. "Take Caution in Using LLMs as Human Surrogates: Scylla Ex Machina," Papers 2410.19599, arXiv.org, revised Jan 2025.
    7. Jialie Chen, 2023. "Learning and skill set formation: A structural examination of version upgrades, user visibility, and AI strategies," Production and Operations Management, Production and Operations Management Society, vol. 32(12), pages 3856-3872, December.
    8. Brett R. Gordon, 2009. "A Dynamic Model of Consumer Replacement Cycles in the PC Processor Industry," Marketing Science, INFORMS, vol. 28(5), pages 846-867, 09-10.
    9. Daniel Albert & Stephan Billinger, 2024. "Reproducing and Extending Experiments in Behavioral Strategy with Large Language Models," Papers 2410.06932, arXiv.org.
    10. Benjamin R. Handel & Kanishka Misra, 2015. "Robust New Product Pricing," Marketing Science, INFORMS, vol. 34(6), pages 864-881, November.
    11. Peter Arcidiacono & Paul B. Ellickson, 2011. "Practical Methods for Estimation of Dynamic Discrete Choice Models," Annual Review of Economics, Annual Reviews, vol. 3(1), pages 363-394, September.
    12. Yufeng Huang, 2019. "Learning by Doing and the Demand for Advanced Products," Marketing Science, INFORMS, vol. 38(1), pages 107-128, January.
    13. Ricardo Montoya & Oded Netzer & Kamel Jedidi, 2010. "Dynamic Allocation of Pharmaceutical Detailing and Sampling for Long-Term Profitability," Marketing Science, INFORMS, vol. 29(5), pages 909-924, 09-10.
    14. Seyed Morteza Emadi & Bradley R. Staats, 2020. "A Structural Estimation Approach to Study Agent Attrition," Management Science, INFORMS, vol. 66(9), pages 4071-4095, September.
    15. Lizhen Xu & Jason A. Duan & Yu Jeffrey Hu & Yuan Cheng & Yan Zhu, 2019. "Forward-Looking Behavior in Mobile Data Consumption and Targeted Promotion Design: A Dynamic Structural Model," Information Systems Research, INFORMS, vol. 30(2), pages 616-635, June.
    16. Juanjuan Zhang, 2010. "The Sound of Silence: Observational Learning in the U.S. Kidney Market," Marketing Science, INFORMS, vol. 29(2), pages 315-335, 03-04.
    17. Kanishka Misra & Eric M. Schwartz & Jacob Abernethy, 2019. "Dynamic Online Pricing with Incomplete Information Using Multiarmed Bandit Experiments," Marketing Science, INFORMS, vol. 38(2), pages 226-252, March.
    18. Ali Goli & Amandeep Singh, 2024. "Frontiers: Can Large Language Models Capture Human Preferences?," Marketing Science, INFORMS, vol. 43(4), pages 709-722, July.
    19. Andreas Lanz & Gregor Reich & Ole Wilms, 2022. "Adaptive grids for the estimation of dynamic models," Quantitative Marketing and Economics (QME), Springer, vol. 20(2), pages 179-238, June.
    20. Yingyao Hu & Yi Xin, 2019. "Identi?cation and estimation of dynamic structural models with unobserved choices," CeMMAP working papers CWP35/19, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2412.15239. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.