IDEAS home Printed from https://ideas.repec.org/a/gam/jsusta/v15y2023i13p10480-d1186000.html
   My bibliography  Save this article

Distributed Dynamic Pricing Strategy Based on Deep Reinforcement Learning Approach in a Presale Mechanism

Author

Listed:
  • Yilin Liang

    (School of Informatics, Guangdong University of Finance & Economics, Guangzhou 510320, China
    Guangdong Intelligent Business Engineering Technology Research Center, Guangzhou 510330, China)

  • Yuping Hu

    (School of Informatics, Guangdong University of Finance & Economics, Guangzhou 510320, China
    Guangdong Intelligent Business Engineering Technology Research Center, Guangzhou 510330, China)

  • Dongjun Luo

    (School of Informatics, Guangdong University of Finance & Economics, Guangzhou 510320, China
    Guangdong Intelligent Business Engineering Technology Research Center, Guangzhou 510330, China)

  • Qi Zhu

    (School of Informatics, Guangdong University of Finance & Economics, Guangzhou 510320, China
    Guangdong Intelligent Business Engineering Technology Research Center, Guangzhou 510330, China)

  • Qingxuan Chen

    (School of Informatics, Guangdong University of Finance & Economics, Guangzhou 510320, China
    Guangdong Intelligent Business Engineering Technology Research Center, Guangzhou 510330, China)

  • Chunmei Wang

    (College of Internet Finance & Information Engineering, Guangdong University of Finance, Guangzhou 510521, China)

Abstract

Despite the emergence of a presale mechanism that reduces manufacturing and ordering risks for retailers, optimizing the real-time pricing strategy in this mechanism and unknown demand environment remains an unsolved issue. Consequently, we propose an automatic real-time pricing system for e-retailers under the inventory backlog impact in the presale mode, using deep reinforcement learning technology based on the Dueling DQN algorithm. This system models the multicycle pricing problem with a finite sales horizon as a Markov decision process (MDP) to cope with the uncertain environment. We train and evaluate the proposed environment and agent in a simulation environment and compare it with two tabular reinforcement learning algorithms (Q-learning and SARSA). The computational results demonstrate that our proposed real-time pricing learning framework for joint inventory impact can effectively maximize retailers’ profits and has universal applicability to a wide range of presale models. Furthermore, according to a series of experiments, we find that retailers should not neglect the impact of the presale or previous prices on consumers’ purchase behavior. If consumers pay more attention to past prices, the retailer must decrease the current price. When the cost of inventory backlog increases, they need to offer deeper discounts in the early selling period. Additionally, introducing blockchain technology can improve the transparency of commodity traceability information, thus increasing consumer demand for purchase.

Suggested Citation

  • Yilin Liang & Yuping Hu & Dongjun Luo & Qi Zhu & Qingxuan Chen & Chunmei Wang, 2023. "Distributed Dynamic Pricing Strategy Based on Deep Reinforcement Learning Approach in a Presale Mechanism," Sustainability, MDPI, vol. 15(13), pages 1-20, July.
  • Handle: RePEc:gam:jsusta:v:15:y:2023:i:13:p:10480-:d:1186000
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2071-1050/15/13/10480/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2071-1050/15/13/10480/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Mei, Wanxia & Du, Li & Niu, Baozhuang & Wang, Jincheng & Feng, Jiejian, 2016. "The effects of an undisclosed regular price and a positive leadtime in a presale mechanism," European Journal of Operational Research, Elsevier, vol. 250(3), pages 1013-1025.
    2. Alexei Alexandrov & Özlem Bedre-Defolie, 2014. "The Equivalence of Bundling and Advance Sales," Marketing Science, INFORMS, vol. 33(2), pages 259-272, March.
    3. Rana, Rupal & Oliveira, Fernando S., 2014. "Real-time dynamic pricing in a non-stationary environment using model-free reinforcement learning," Omega, Elsevier, vol. 47(C), pages 116-126.
    4. Zhong, Shengyuan & Wang, Xiaoyuan & Zhao, Jun & Li, Wenjia & Li, Hao & Wang, Yongzhen & Deng, Shuai & Zhu, Jiebei, 2021. "Deep reinforcement learning framework for dynamic pricing demand response of regenerative electric heating," Applied Energy, Elsevier, vol. 288(C).
    5. Chenhang Zeng, 2013. "Optimal Advance Selling Strategy under Price Commitment," Pacific Economic Review, Wiley Blackwell, vol. 18(2), pages 233-258, May.
    6. Lin, Kyle Y., 2006. "Dynamic pricing with real-time demand learning," European Journal of Operational Research, Elsevier, vol. 174(1), pages 522-538, October.
    7. Touzani, Samir & Prakash, Anand Krishnan & Wang, Zhe & Agarwal, Shreya & Pritoni, Marco & Kiran, Mariam & Brown, Richard & Granderson, Jessica, 2021. "Controlling distributed energy resources via deep reinforcement learning for load flexibility and energy efficiency," Applied Energy, Elsevier, vol. 304(C).
    8. Keller, Alisa & Vogelsang, Mila & Totzek, Dirk, 2022. "How displaying price discounts can mitigate negative customer reactions to dynamic pricing," Journal of Business Research, Elsevier, vol. 148(C), pages 277-291.
    9. Wang Chi Cheung & David Simchi-Levi & He Wang, 2017. "Technical Note—Dynamic Pricing and Demand Learning with Limited Price Experimentation," Operations Research, INFORMS, vol. 65(6), pages 1722-1731, December.
    10. Nunan, Daniel & Di Domenico, MariaLaura, 2022. "Value creation in an algorithmic world: Towards an ethics of dynamic pricing," Journal of Business Research, Elsevier, vol. 150(C), pages 451-460.
    11. Wedad Elmaghraby & P{i}nar Keskinocak, 2003. "Dynamic Pricing in the Presence of Inventory Considerations: Research Overview, Current Practices, and Future Directions," Management Science, INFORMS, vol. 49(10), pages 1287-1309, October.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. den Boer, Arnoud V., 2015. "Tracking the market: Dynamic pricing and learning in a changing environment," European Journal of Operational Research, Elsevier, vol. 247(3), pages 914-927.
    2. Omar Al-Ani & Sanjoy Das, 2022. "Reinforcement Learning: Theory and Applications in HEMS," Energies, MDPI, vol. 15(17), pages 1-37, September.
    3. Athanassios N. Avramidis & Arnoud V. Boer, 2021. "Dynamic pricing with finite price sets: a non-parametric approach," Mathematical Methods of Operations Research, Springer;Gesellschaft für Operations Research (GOR);Nederlands Genootschap voor Besliskunde (NGB), vol. 94(1), pages 1-34, August.
    4. Gel, Esma S. & Salman, F. Sibel, 2022. "Dynamic ordering decisions with approximate learning of supply yield uncertainty," International Journal of Production Economics, Elsevier, vol. 243(C).
    5. Yurong Pei & Mengying Xie & Qiuling Yang & Yi Liao & Yuping Wu, 2021. "Effect of Consumer Strategic Behavior on Online Presale Strategy," Sustainability, MDPI, vol. 13(19), pages 1-21, October.
    6. Kimmo Berg & Harri Ehtamo, 2012. "Continuous learning methods in two-buyer pricing problem," Mathematical Methods of Operations Research, Springer;Gesellschaft für Operations Research (GOR);Nederlands Genootschap voor Besliskunde (NGB), vol. 75(3), pages 287-304, June.
    7. Hu Wang & Di Li & Changbin Jiang & Yuxiang Zhang, 2023. "Exploring the Interactive Relationship between Retailers’ Free Shipping Decisions and Manufacturers’ Product Sales in Digital Retailing," Sustainability, MDPI, vol. 15(17), pages 1-19, August.
    8. Diego Escobari, 2012. "Dynamic Pricing, Advance Sales and Aggregate Demand Learning in Airlines," Journal of Industrial Economics, Wiley Blackwell, vol. 60(4), pages 697-724, December.
    9. Tatsiana Levina & Yuri Levin & Jeff McGill & Mikhail Nediak, 2009. "Dynamic Pricing with Online Learning and Strategic Consumers: An Application of the Aggregating Algorithm," Operations Research, INFORMS, vol. 57(2), pages 327-341, April.
    10. Ming Chen & Zhi-Long Chen, 2018. "Robust Dynamic Pricing with Two Substitutable Products," Manufacturing & Service Operations Management, INFORMS, vol. 20(2), pages 249-268, May.
    11. Boxiao Chen & David Simchi-Levi & Yining Wang & Yuan Zhou, 2022. "Dynamic Pricing and Inventory Control with Fixed Ordering Cost and Incomplete Demand Information," Management Science, INFORMS, vol. 68(8), pages 5684-5703, August.
    12. Vincent C. Li & Yat-wah Wan & Chi-Leung Chu & Yi-Cheng Lin, 2020. "A Dynamic Programming-Based Heuristic for Markdown Pricing and Inventory Allocation of a Seasonal Product in a Retail Chain," Asia-Pacific Journal of Operational Research (APJOR), World Scientific Publishing Co. Pte. Ltd., vol. 37(01), pages 1-30, January.
    13. Gao, Yuan & Matsunami, Yuki & Miyata, Shohei & Akashi, Yasunori, 2022. "Multi-agent reinforcement learning dealing with hybrid action spaces: A case study for off-grid oriented renewable building energy system," Applied Energy, Elsevier, vol. 326(C).
    14. Chen, Jing & Dong, Ming & Rong, Ying & Yang, Liang, 2018. "Dynamic pricing for deteriorating products with menu cost," Omega, Elsevier, vol. 75(C), pages 13-26.
    15. Ilan Lobel, 2021. "Revenue Management and the Rise of the Algorithmic Economy," Management Science, INFORMS, vol. 67(9), pages 5389-5398, September.
    16. Sentao Miao & Xi Chen & Xiuli Chao & Jiaxi Liu & Yidong Zhang, 2022. "Context‐based dynamic pricing with online clustering," Production and Operations Management, Production and Operations Management Society, vol. 31(9), pages 3559-3575, September.
    17. Jianjun Li & Xiaodi Xu & Yu Yang, 2023. "Research on the Regulation of Algorithmic Price Discrimination Behaviour of E-Commerce Platform Based on Tripartite Evolutionary Game," Sustainability, MDPI, vol. 15(10), pages 1-19, May.
    18. Rana, Rupal & Oliveira, Fernando S., 2014. "Real-time dynamic pricing in a non-stationary environment using model-free reinforcement learning," Omega, Elsevier, vol. 47(C), pages 116-126.
    19. Vincent Mak & Amnon Rapoport & Eyran J. Gisches & Jiaojie Han, 2014. "Purchasing Scarce Products Under Dynamic Pricing: An Experimental Investigation," Manufacturing & Service Operations Management, INFORMS, vol. 16(3), pages 425-438, July.
    20. Régis Chenavaz & Corina Paraschiv & Gabriel Turinici, 2017. "Dynamic Pricing of New Products in Competitive Markets: A Mean-Field Game Approach," Working Papers hal-01592958, HAL.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jsusta:v:15:y:2023:i:13:p:10480-:d:1186000. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.