Printed from https://ideas.repec.org/p/arx/papers/2410.21109.html

Dual-Agent Deep Reinforcement Learning for Dynamic Pricing and Replenishment

Authors
  • Yi Zheng
  • Zehao Li
  • Peng Jiang
  • Yijie Peng

Abstract

We study dynamic pricing and replenishment when the two decisions are made at different frequencies. Unlike traditional demand assumptions, demand here is discrete and follows a Poisson distribution whose parameter is a function of price, which complicates the analysis of the problem's properties. We demonstrate that the single-period profit function is concave with respect to product price and inventory within their respective domains. We enhance the demand model with a decision-tree-based machine learning approach trained on comprehensive market data. A two-timescale stochastic approximation scheme handles the mismatch in decision frequencies between pricing and replenishment and ensures convergence to a local optimum. We then incorporate deep reinforcement learning (DRL) techniques and propose a fast-slow dual-agent DRL algorithm, in which two agents handle pricing and inventory, respectively, and are updated on different timescales. Numerical results for both single-product and multi-product scenarios validate the effectiveness of our methods.
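
The two ingredients named in the abstract, a single-period profit under price-dependent Poisson demand and a two-timescale update, can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the exponential rate `demand_rate`, the lost-sales profit with unit cost `cost` and holding cost `holding`, the finite-difference gradient estimates, the step-size exponents, and the choice to update price every step and stock only every `slow_every` steps.

```python
import math
import random

def demand_rate(price, a=20.0, b=0.3):
    """Hypothetical price-dependent Poisson rate: lambda(p) = a * exp(-b * p)."""
    return a * math.exp(-b * price)

def poisson_pmf(k, lam):
    """P(D = k) for D ~ Poisson(lam), computed in log space for stability."""
    return math.exp(-lam + k * math.log(lam) - math.lgamma(k + 1))

def expected_profit(price, stock, cost=2.0, holding=0.5):
    """Exact single-period expected profit for integer stock (assumed lost-sales model)."""
    lam = demand_rate(price)
    pmf = [poisson_pmf(k, lam) for k in range(stock)]
    # E[min(D, stock)] = sum_{k < stock} k * P(D = k) + stock * P(D >= stock)
    sales = sum(k * q for k, q in enumerate(pmf)) + stock * (1.0 - sum(pmf))
    # Leftover inventory is stock - sales in expectation.
    return price * sales - cost * stock - holding * (stock - sales)

def sample_poisson(lam, rng):
    """Knuth's multiplication method for drawing one Poisson sample."""
    limit, k, prod = math.exp(-lam), 0, rng.random()
    while prod > limit:
        k += 1
        prod *= rng.random()
    return k

def sampled_profit(price, stock, rng, cost=2.0, holding=0.5):
    """One noisy profit observation from a single simulated demand draw."""
    sales = min(sample_poisson(demand_rate(price), rng), stock)
    return price * sales - cost * stock - holding * (stock - sales)

def two_timescale(steps=20000, slow_every=7, seed=0):
    """Noisy finite-difference ascent: price moves every step, stock every slow_every steps."""
    rng = random.Random(seed)
    price, stock, delta = 5.0, 5.0, 0.5
    for n in range(1, steps + 1):
        fast, slow = n ** -0.6, 1.0 / n  # slow / fast -> 0 as n grows
        y = round(stock)
        grad_p = (sampled_profit(price + delta, y, rng)
                  - sampled_profit(price - delta, y, rng)) / (2 * delta)
        price = min(max(price + fast * grad_p, 2.5), 15.0)  # project onto price bounds
        if n % slow_every == 0:
            lo = max(y - 1, 0)
            grad_y = (sampled_profit(price, y + 1, rng)
                      - sampled_profit(price, lo, rng)) / (y + 1 - lo)
            stock = min(max(stock + slow * grad_y, 0.0), 30.0)  # project onto stock bounds
    return price, round(stock)

if __name__ == "__main__":
    p, y = two_timescale()
    print(p, y, expected_profit(p, y))
```

In this toy model the profit is concave in integer stock for a fixed price, because each extra unit adds `(price + holding) * P(D > stock) - (cost + holding)` and the tail probability shrinks as stock grows. The sketch makes no claim about concavity in price or about convergence for the paper's actual demand model.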

Suggested Citation

  • Yi Zheng & Zehao Li & Peng Jiang & Yijie Peng, 2024. "Dual-Agent Deep Reinforcement Learning for Dynamic Pricing and Replenishment," Papers 2410.21109, arXiv.org.
  • Handle: RePEc:arx:papers:2410.21109

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2410.21109
    File Function: Latest version
    Download Restriction: no


    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Torsten J. Gerpott & Jan Berends, 2022. "Competitive pricing on online markets: a literature review," Journal of Revenue and Pricing Management, Palgrave Macmillan, vol. 21(6), pages 596-622, December.
    2. Li, Mengmeng & Mizuno, Shinji, 2022. "Dynamic pricing and inventory management of a dual-channel supply chain under different power structures," European Journal of Operational Research, Elsevier, vol. 303(1), pages 273-285.
    3. Serguei Netessine & Sergei Savin & Wenqiang Xiao, 2006. "Revenue Management Through Dynamic Cross Selling in E-Commerce Retailing," Operations Research, INFORMS, vol. 54(5), pages 893-913, October.
    4. Ibrahim, Michael Nawar & Atiya, Amir F., 2016. "Analytical solutions to the dynamic pricing problem for time-normalized revenue," European Journal of Operational Research, Elsevier, vol. 254(2), pages 632-643.
    5. Brahimi, Nadjib & Absi, Nabil & Dauzère-Pérès, Stéphane & Nordli, Atle, 2017. "Single-item dynamic lot-sizing problems: An updated survey," European Journal of Operational Research, Elsevier, vol. 263(3), pages 838-863.
    6. Yongbo Xiao, 2018. "Dynamic pricing and replenishment: Optimality, bounds, and asymptotics," Naval Research Logistics (NRL), John Wiley & Sons, vol. 65(1), pages 3-25, February.
    7. Xiting Gong & Youhua (Frank) Chen & Quan Yuan, 2022. "Coordinating Inventory and Pricing Decisions Under Total Minimum Commitment Contracts," Production and Operations Management, Production and Operations Management Society, vol. 31(2), pages 511-528, February.
    8. Sirong Luo & Jianrong Wang, 2017. "A technical note on the dynamic nonstationary inventory-pricing control model with lost sale," International Journal of Production Research, Taylor & Francis Journals, vol. 55(19), pages 5816-5825, October.
    9. Hanzhang Qin & David Simchi-Levi & Li Wang, 2022. "Data-Driven Approximation Schemes for Joint Pricing and Inventory Control Models," Management Science, INFORMS, vol. 68(9), pages 6591-6609, September.
    10. Ilan Lobel, 2021. "Revenue Management and the Rise of the Algorithmic Economy," Management Science, INFORMS, vol. 67(9), pages 5389-5398, September.
    11. Doan, Xuan Vinh & Lei, Xiao & Shen, Siqian, 2020. "Pricing of reusable resources under ambiguous distributions of demand and service time with emerging applications," European Journal of Operational Research, Elsevier, vol. 282(1), pages 235-251.
    12. Bhatia, Nishika & Gülpınar, Nalan & Aydın, Nurşen, 2020. "Dynamic production-pricing strategies for multi-generation products under uncertainty," International Journal of Production Economics, Elsevier, vol. 230(C).
    13. Dasci, A. & Karakul, M., 2009. "Two-period dynamic versus fixed-ratio pricing in a capacity constrained duopoly," European Journal of Operational Research, Elsevier, vol. 197(3), pages 945-968, September.
    14. Nan Yang & Renyu Zhang, 2022. "Dynamic pricing and inventory management in the presence of online reviews," Production and Operations Management, Production and Operations Management Society, vol. 31(8), pages 3180-3197, August.
    15. Gurkan, M. Edib & Tunc, Huseyin & Tarim, S. Armagan, 2022. "The joint stochastic lot sizing and pricing problem," Omega, Elsevier, vol. 108(C).
    16. Lingxiu Dong & Panos Kouvelis & Zhongjun Tian, 2009. "Dynamic Pricing and Inventory Control of Substitute Products," Manufacturing & Service Operations Management, INFORMS, vol. 11(2), pages 317-339, December.
    17. Qi Feng & Sirong Luo & J. George Shanthikumar, 2020. "Integrating Dynamic Pricing with Inventory Decisions Under Lost Sales," Management Science, INFORMS, vol. 66(5), pages 2232-2247, May.
    18. Pavithra Harsha & Shivaram Subramanian & Joline Uichanco, 2019. "Dynamic Pricing of Omnichannel Inventories," Manufacturing & Service Operations Management, INFORMS, vol. 21(1), pages 47-65, January.
    19. Li, Yang & Liu, Feng, 2021. "Joint inventory and pricing control with lagged price responses," International Journal of Production Economics, Elsevier, vol. 241(C).
    20. Lap Mui Ann Chan & David Simchi-Levi & Julie Swann, 2006. "Pricing, Production, and Inventory Policies for Manufacturing with Stochastic Demand and Discretionary Sales," Manufacturing & Service Operations Management, INFORMS, vol. 8(2), pages 149-168, January.
