IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2301.08360.html
   My bibliography  Save this paper

Domain-adapted Learning and Imitation: DRL for Power Arbitrage

Author

Listed:
  • Yuanrong Wang
  • Vignesh Raja Swaminathan
  • Nikita P. Granger
  • Carlos Ros Perez
  • Christian Michler

Abstract

In this paper, we discuss the Dutch power market, which is comprised of a day-ahead market and an intraday balancing market that operates like an auction. Due to fluctuations in power supply and demand, there is often an imbalance that leads to different prices in the two markets, providing an opportunity for arbitrage. To address this issue, we restructure the problem and propose a collaborative dual-agent reinforcement learning approach for this bi-level simulation and optimization of European power arbitrage trading. We also introduce two new implementations designed to incorporate domain-specific knowledge by imitating the trading behaviours of power traders. By utilizing reward engineering to imitate domain expertise, we are able to reform the reward system for the RL agent, which improves convergence during training and enhances overall performance. Additionally, the tranching of orders increases bidding success rates and significantly boosts profit and loss (P&L). Our study demonstrates that by leveraging domain expertise in a general learning problem, the performance can be improved substantially, and the final integrated approach leads to a three-fold improvement in cumulative P&L compared to the original agent. Furthermore, our methodology outperforms the highest benchmark policy by around 50% while maintaining efficient computational performance.

Suggested Citation

  • Yuanrong Wang & Vignesh Raja Swaminathan & Nikita P. Granger & Carlos Ros Perez & Christian Michler, 2023. "Domain-adapted Learning and Imitation: DRL for Power Arbitrage," Papers 2301.08360, arXiv.org, revised Sep 2023.
  • Handle: RePEc:arx:papers:2301.08360
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2301.08360
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Yuanrong Wang & Tomaso Aste, 2022. "Sparsification and Filtering for Spatial-temporal GNN in Multivariate Time-series," Papers 2203.03991, arXiv.org.
    2. Zou, Peng & Chen, Qixin & Xia, Qing & He, Guannan & Kang, Chongqing & Conejo, Antonio J., 2016. "Pool equilibria including strategic storage," Applied Energy, Elsevier, vol. 177(C), pages 260-270.
    3. Rui Albuquerque, 2012. "Skewness in Stock Returns: Reconciling the Evidence on Firm Versus Aggregate Returns," The Review of Financial Studies, Society for Financial Studies, vol. 25(5), pages 1630-1673.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Huang, Qisheng & Xu, Yunjian & Courcoubetis, Costas, 2020. "Stackelberg competition between merchant and regulated storage investment in wholesale electricity markets," Applied Energy, Elsevier, vol. 264(C).
    2. De Vivero-Serrano, Gustavo & Bruninx, Kenneth & Delarue, Erik, 2019. "Implications of bid structures on the offering strategies of merchant energy storage systems," Applied Energy, Elsevier, vol. 251(C), pages 1-1.
    3. Shahmohammadi, Ali & Sioshansi, Ramteen & Conejo, Antonio J. & Afsharnia, Saeed, 2018. "Market equilibria and interactions between strategic generation, wind, and storage," Applied Energy, Elsevier, vol. 220(C), pages 876-892.
    4. Christos N. Dimitriadis & Evangelos G. Tsimopoulos & Michael C. Georgiadis, 2021. "A Review on the Complementarity Modelling in Competitive Electricity Markets," Energies, MDPI, vol. 14(21), pages 1-27, November.
    5. Do, Hung Xuan & Brooks, Robert & Treepongkaruna, Sirimon & Wu, Eliza, 2014. "How does trading volume affect financial return distributions?," International Review of Financial Analysis, Elsevier, vol. 35(C), pages 190-206.
    6. Chung, Y. Peter & Hong, Hyun A. & Kim, S. Thomas, 2019. "What causes the asymmetric correlation in stock returns?," Journal of Empirical Finance, Elsevier, vol. 54(C), pages 190-212.
    7. Viral V. Acharya & Peter DeMarzo & Ilan Kremer, 2011. "Endogenous Information Flows and the Clustering of Announcements," American Economic Review, American Economic Association, vol. 101(7), pages 2955-2979, December.
    8. Langlois, Hugues, 2020. "Measuring skewness premia," Journal of Financial Economics, Elsevier, vol. 135(2), pages 399-424.
    9. Chang, Bo Young & Christoffersen, Peter & Jacobs, Kris, 2013. "Market skewness risk and the cross section of stock returns," Journal of Financial Economics, Elsevier, vol. 107(1), pages 46-68.
    10. Calvillo, C.F. & Sánchez-Miralles, A. & Villar, J. & Martín, F., 2016. "Optimal planning and operation of aggregated distributed energy resources with market participation," Applied Energy, Elsevier, vol. 182(C), pages 340-357.
    11. Robert Brooks & Robert Faff & Sirimon Treepongkaruna & Eliza Wu, 2015. "Do Sovereign Re-Ratings Destabilize Equity Markets during Financial Crises? New Evidence from Higher Return Moments," Journal of Business Finance & Accounting, Wiley Blackwell, vol. 42(5-6), pages 777-799, June.
    12. Jondeau, Eric & Zhang, Qunzi & Zhu, Xiaoneng, 2019. "Average skewness matters," Journal of Financial Economics, Elsevier, vol. 134(1), pages 29-47.
    13. Aryani, Morteza & Ahmadian, Mohammad & Sheikh-El-Eslami, Mohammad-Kazem, 2020. "Designing a regulatory tool for coordinated investment in renewable and conventional generation capacities considering market equilibria," Applied Energy, Elsevier, vol. 279(C).
    14. Condie, Scott & Ganguli, Jayant, 2017. "The pricing effects of ambiguous private information," Journal of Economic Theory, Elsevier, vol. 172(C), pages 512-557.
    15. Annaert, Jan & De Ceuster, Marc & Van Cappellen, Jef, 2023. "Can average skewness really predict financial returns? The euro area case," Finance Research Letters, Elsevier, vol. 52(C).
    16. Wu, Qi & Yan, Xing, 2019. "Capturing deep tail risk via sequential learning of quantile dynamics," Journal of Economic Dynamics and Control, Elsevier, vol. 109(C).
    17. Stephen G Dimmock & Roy Kouwenberg & Olivia S Mitchell & Kim Peijnenburg, 2021. "Household Portfolio Underdiversification and Probability Weighting: Evidence from the Field," The Review of Financial Studies, Society for Financial Studies, vol. 34(9), pages 4524-4563.
    18. Pfeifer, Antun & Feijoo, Felipe & Duić, Neven, 2023. "Fast energy transition as a best strategy for all? The nash equilibrium of long-term energy planning strategies in coupled power markets," Energy, Elsevier, vol. 284(C).
    19. Alexander Arimond & Damian Borth & Andreas Hoepner & Michael Klawunn & Stefan Weisheit, 2020. "Neural Networks and Value at Risk," Papers 2005.01686, arXiv.org, revised May 2020.
    20. Yuanrong Wang & Yinsen Miao & Alexander CY Wong & Nikita P Granger & Christian Michler, 2023. "Domain-adapted Learning and Interpretability: DRL for Gas Trading," Papers 2301.08359, arXiv.org, revised Sep 2023.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2301.08360. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.