Contractual Reinforcement Learning: Pulling Arms with Invisible Hands

My bibliography Save this paper

Contractual Reinforcement Learning: Pulling Arms with Invisible Hands

Author

Listed:

Jibang Wu
Siyu Chen
Mengdi Wang
Huazheng Wang
Haifeng Xu

Registered:

Abstract

The agency problem emerges in today's large scale machine learning tasks, where the learners are unable to direct content creation or enforce data collection. In this work, we propose a theoretical framework for aligning economic interests of different stakeholders in the online learning problems through contract design. The problem, termed \emph{contractual reinforcement learning}, naturally arises from the classic model of Markov decision processes, where a learning principal seeks to optimally influence the agent's action policy for their common interests through a set of payment rules contingent on the realization of next state. For the planning problem, we design an efficient dynamic programming algorithm to determine the optimal contracts against the far-sighted agent. For the learning problem, we introduce a generic design of no-regret learning algorithms to untangle the challenges from robust design of contracts to the balance of exploration and exploitation, reducing the complexity analysis to the construction of efficient search algorithms. For several natural classes of problems, we design tailored search algorithms that provably achieve $\tilde{O}(\sqrt{T})$ regret. We also present an algorithm with $\tilde{O}(T^{2/3})$ for the general problem that improves the existing analysis in online contract design with mild technical assumptions.

Suggested Citation

Jibang Wu & Siyu Chen & Mengdi Wang & Huazheng Wang & Haifeng Xu, 2024. "Contractual Reinforcement Learning: Pulling Arms with Invisible Hands," Papers 2407.01458, arXiv.org, revised Jul 2024.

Handle: RePEc:arx:papers:2407.01458

Download full text from publisher

References listed on IDEAS

Jibang Wu & Zixuan Zhang & Zhe Feng & Zhaoran Wang & Zhuoran Yang & Michael I. Jordan & Haifeng Xu, 2022. "Sequential Information Design: Markov Persuasion Process and Its Efficient Reinforcement Learning," Papers 2202.10678, arXiv.org.
Hemant K. Bhargava, 2022. "The Creator Economy: Managing Ecosystem Supply, Revenue Sharing, and Platform Design," Management Science, INFORMS, vol. 68(7), pages 5233-5251, July.

Full references (including those not matched with items on IDEAS)

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Naixin Zhu, 2023. "Dissertation on Applied Microeconomics of Freemium Pricing Strategies in Mobile App Market," Papers 2305.09479, arXiv.org.
Siyu Chen & Jibang Wu & Yifan Wu & Zhuoran Yang, 2023. "Learning to Incentivize Information Acquisition: Proper Scoring Rules Meet Principal-Agent Model," Papers 2303.08613, arXiv.org, revised Aug 2023.
Jin Li & Gary Pisano & Yejia Xu & Feng Zhu, 2023. "Marketplace Scalability and Strategic Use of Platform Investment," Management Science, INFORMS, vol. 69(7), pages 3958-3975, July.
Lena Abou El-Komboz & Anna Kerkhof & Johannes Loh, 2023. "Platform Partnership Programs and Content Supply: Evidence from the YouTube “Adpocalypse”," CESifo Working Paper Series 10363, CESifo.
Qitian Ren, 2024. "Advertising and Content Creation on Digital Content Platforms," Marketing Science, INFORMS, vol. 43(4), pages 734-750, July.
Chen Liang & Murat Tunc & Gordon Burtch, 2024. "The Market Consequences of Perceived Strategic Generosity: An Empirical Examination of NFT Charity Fundraisers," Papers 2401.12064, arXiv.org, revised Dec 2024.
Cong, Lin William & Li, Siguang, 2024. "Influencer marketing and product competition," Journal of Economic Theory, Elsevier, vol. 220(C).
Barak Libai & Ana Babić Rosario & Maximilian Beichert & Bas Donkers & Michael Haenlein & Reto Hofstetter & P. K. Kannan & Ralf Lans & Andreas Lanz & H. Alice Li & Dina Mayzlin & Eitan Muller & Daniel , 2025. "Influencer marketing unlocked: Understanding the value chains driving the creator economy," Journal of the Academy of Marketing Science, Springer, vol. 53(1), pages 4-28, January.
Lee, Crystal T. & Shen, Yung-Cheng, 2024. "Exploring determinants of non-fungible token creators’ engagement behaviors on metaverse-based NFT platforms: A multi-analytical SEM-IPMA method," Journal of Business Research, Elsevier, vol. 185(C).
Cai, Yajun & Wu, Yibin & Xue, Weili, 2024. "Social media retailing in the creator economy," Omega, Elsevier, vol. 124(C).
Song, Haiqing & Wang, Rui & Tang, Yanli, 2024. "Competition or cooperation: Strategy analysis for a social commerce platform," European Journal of Operational Research, Elsevier, vol. 318(2), pages 560-574.
Kyungmin Park & Stephanie Lee & Shahryar Doosti & Yong Tan, 2023. "Provision of helpful review videos: Effects of video characteristics on perceived helpfulness," Production and Operations Management, Production and Operations Management Society, vol. 32(7), pages 2031-2048, July.
Foerster, Manuel & Hellmann, Tim & Vega-Redondo, Fernando, 2024. "Strategic use of social media influencer marketing," UC3M Working papers. Economics 43985, Universidad Carlos III de Madrid. Departamento de EconomÃa.
Daniel Huttenlocher & Hannah Li & Liang Lyu & Asuman Ozdaglar & James Siderius, 2023. "Matching of Users and Creators in Two-Sided Markets with Departures," Papers 2401.00313, arXiv.org, revised Jan 2024.
Natalie Collina & Aaron Roth & Han Shao, 2023. "Efficient Prior-Free Mechanisms for No-Regret Agents," Papers 2311.07754, arXiv.org.
Zhang, Xiaojing & Zhang, Yulin, 2024. "Content marketing in the social media platform: Examining the effect of content creation modes on the payoff of participants," Journal of Retailing and Consumer Services, Elsevier, vol. 77(C).
Hemant K. Bhargava & Kitty Wang & Xingyue (Luna) Zhang, 2022. "Fending Off Critics of Platform Power with Differential Revenue Sharing: Doing Well by Doing Good?," Management Science, INFORMS, vol. 68(11), pages 8249-8260, November.
Evangelos Katsamakas & J. Manuel Sanchez-Cartas, 2024. "Generative Artificial Intelligence, Content Creation, and Platforms," Journal of Industry, Competition and Trade, Springer, vol. 24(1), pages 1-20, December.
Bleier, Alexander & Fossen, Beth L. & Shapira, Michal, 2024. "On the role of social media platforms in the creator economy," International Journal of Research in Marketing, Elsevier, vol. 41(3), pages 411-426.
Xueyu Liu & Shue Mei & Weijun Zhong, 2024. "UGC creator's video‐generation and program‐participation decisions in the presence of ad‐revenue‐sharing programs," Managerial and Decision Economics, John Wiley & Sons, Ltd., vol. 45(6), pages 4330-4349, September.

More about this item

NEP fields

This paper has been announced in the following NEP Reports:

NEP-BIG-2024-08-12 (Big Data)
NEP-CMP-2024-08-12 (Computational Economics)
NEP-CTA-2024-08-12 (Contract Theory and Applications)

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2407.01458. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Contractual Reinforcement Learning: Pulling Arms with Invisible Hands

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Most related items

More about this item

NEP fields

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data