IDEAS home Printed from https://ideas.repec.org/a/eee/reensy/v232y2023ics0951832022006962.html
   My bibliography  Save this article

Safe multi-agent deep reinforcement learning for joint bidding and maintenance scheduling of generation units

Author

Listed:
  • Rokhforoz, Pegah
  • Montazeri, Mina
  • Fink, Olga

Abstract

This paper proposes a safe reinforcement learning algorithm for generation bidding decisions and unit maintenance scheduling in a competitive electricity market environment. In this problem, each unit aims to find a bidding strategy that maximizes its revenue while concurrently retaining its reliability by scheduling preventive maintenance. The maintenance scheduling provides some safety constraints which should be satisfied at all times. Meeting the critical safety and reliability requirements when the generation units have incomplete information regarding each other’s bidding strategy is a challenging problem. Bi-level optimization and reinforcement learning are state-of-the-art approaches for solving this type of problem. However, neither bi-level optimization nor reinforcement learning can handle the challenges of incomplete information and critical safety constraints. To tackle these challenges, we propose the safe deep deterministic policy gradient reinforcement learning algorithm, which is based on a combination of reinforcement learning and a predicted safety filter. The case study demonstrates that the proposed approach can yield a higher profit compared to other state-of-the-art methods while concurrently satisfying the system safety constraints. Moreover, the case study shows that the reward of the learning algorithm with incomplete information can converge to a reward of the complete information game.

Suggested Citation

  • Rokhforoz, Pegah & Montazeri, Mina & Fink, Olga, 2023. "Safe multi-agent deep reinforcement learning for joint bidding and maintenance scheduling of generation units," Reliability Engineering and System Safety, Elsevier, vol. 232(C).
  • Handle: RePEc:eee:reensy:v:232:y:2023:i:c:s0951832022006962
    DOI: 10.1016/j.ress.2022.109081
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0951832022006962
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.ress.2022.109081?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Mohammadi, Reza & He, Qing, 2022. "A deep reinforcement learning approach for rail renewal and maintenance planning," Reliability Engineering and System Safety, Elsevier, vol. 225(C).
    2. Sadeghian, Omid & Mohammadpour Shotorbani, Amin & Mohammadi-Ivatloo, Behnam & Sadiq, Rehan & Hewage, Kasun, 2021. "Risk-averse maintenance scheduling of generation units in combined heat and power systems with demand response," Reliability Engineering and System Safety, Elsevier, vol. 216(C).
    3. BULUT, Merve & ÖZCAN, Evrencan, 2021. "A new approach to determine maintenance periods of the most critical hydroelectric power plant equipment," Reliability Engineering and System Safety, Elsevier, vol. 205(C).
    4. Rokhforoz, Pegah & Gjorgiev, Blazhe & Sansavini, Giovanni & Fink, Olga, 2021. "Multi-agent maintenance scheduling based on the coordination between central operator and decentralized producers in an electricity market," Reliability Engineering and System Safety, Elsevier, vol. 210(C).
    5. Zhou, Yifan & Li, Bangcheng & Lin, Tian Ran, 2022. "Maintenance optimisation of multicomponent systems using hierarchical coordinated reinforcement learning," Reliability Engineering and System Safety, Elsevier, vol. 217(C).
    6. Jagtap, Hanumant P. & Bewoor, Anand K. & Kumar, Ravinder & Ahmadi, Mohammad Hossein & Chen, Lingen, 2020. "Performance analysis and availability optimization to improve maintenance schedule for the turbo-generator subsystem of a thermal power plant using particle swarm optimization," Reliability Engineering and System Safety, Elsevier, vol. 204(C).
    7. Ishai Menache & Shie Mannor & Nahum Shimkin, 2005. "Basis Function Adaptation in Temporal Difference Reinforcement Learning," Annals of Operations Research, Springer, vol. 134(1), pages 215-238, February.
    8. Volkanovski, Andrija & Mavko, Borut & Boševski, Tome & Čauševski, Anton & Čepin, Marko, 2008. "Genetic algorithm optimisation of the maintenance scheduling of generating units in a power system," Reliability Engineering and System Safety, Elsevier, vol. 93(6), pages 779-789.
    9. Min, C.G. & Kim, M.K. & Park, J.K. & Yoon, Y.T., 2013. "Game-theory-based generation maintenance scheduling in electricity markets," Energy, Elsevier, vol. 55(C), pages 310-318.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Panit Prukpanit & Phisan Kaewprapha & Nopbhorn Leeprechanon, 2023. "Optimizing Generation Maintenance Scheduling Considering Emission Factors," Energies, MDPI, vol. 16(23), pages 1-22, November.
    2. Yang, Sen & Zhang, Yi & Lu, Xinzheng & Guo, Wei & Miao, Huiquan, 2024. "Multi-agent deep reinforcement learning based decision support model for resilient community post-hazard recovery," Reliability Engineering and System Safety, Elsevier, vol. 242(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Rokhforoz, Pegah & Gjorgiev, Blazhe & Sansavini, Giovanni & Fink, Olga, 2021. "Multi-agent maintenance scheduling based on the coordination between central operator and decentralized producers in an electricity market," Reliability Engineering and System Safety, Elsevier, vol. 210(C).
    2. Sadeghian, Omid & Mohammadpour Shotorbani, Amin & Mohammadi-Ivatloo, Behnam & Sadiq, Rehan & Hewage, Kasun, 2021. "Risk-averse maintenance scheduling of generation units in combined heat and power systems with demand response," Reliability Engineering and System Safety, Elsevier, vol. 216(C).
    3. Jakov Batelić & Karlo Griparić & Dario Matika, 2021. "Impact of Remediation-Based Maintenance on the Reliability of a Coal-Fired Power Plant Using Generalized Stochastic Petri Nets," Energies, MDPI, vol. 14(18), pages 1-14, September.
    4. Froger, Aurélien & Gendreau, Michel & Mendoza, Jorge E. & Pinson, Éric & Rousseau, Louis-Martin, 2016. "Maintenance scheduling in the electricity industry: A literature review," European Journal of Operational Research, Elsevier, vol. 251(3), pages 695-706.
    5. Stover, Oliver & Karve, Pranav & Mahadevan, Sankaran, 2023. "Reliability and risk metrics to assess operational adequacy and flexibility of power grids," Reliability Engineering and System Safety, Elsevier, vol. 231(C).
    6. Chen, Biyun & Chen, Yanni & Zhou, Hengwang & Bai, Xiaoqing & Li, Bin & Guo, Xiaoxuan, 2023. "A Bi-level gaming programming for regional integrated energy system considering the users’ reliability incentive," Reliability Engineering and System Safety, Elsevier, vol. 229(C).
    7. Lee, Dongkyu & Song, Junho, 2023. "Risk-informed operation and maintenance of complex lifeline systems using parallelized multi-agent deep Q-network," Reliability Engineering and System Safety, Elsevier, vol. 239(C).
    8. Morato, P.G. & Andriotis, C.P. & Papakonstantinou, K.G. & Rigo, P., 2023. "Inference and dynamic decision-making for deteriorating systems with probabilistic dependencies through Bayesian networks and deep reinforcement learning," Reliability Engineering and System Safety, Elsevier, vol. 235(C).
    9. Saleh, Ali & Chiachío, Manuel & Salas, Juan Fernández & Kolios, Athanasios, 2023. "Self-adaptive optimized maintenance of offshore wind turbines by intelligent Petri nets," Reliability Engineering and System Safety, Elsevier, vol. 231(C).
    10. Rokhforoz, Pegah & Fink, Olga, 2022. "Maintenance scheduling of manufacturing systems based on optimal price of the network," Reliability Engineering and System Safety, Elsevier, vol. 217(C).
    11. Nitin Panwar & Sanjeev Kumar, 2022. "Mathematical modelling and performance analysis of screening unit in paper plant," International Journal of System Assurance Engineering and Management, Springer;The Society for Reliability, Engineering Quality and Operations Management (SREQOM),India, and Division of Operation and Maintenance, Lulea University of Technology, Sweden, vol. 13(5), pages 2751-2763, October.
    12. Zhou, Dengji & Yu, Ziqiang & Zhang, Huisheng & Weng, Shilie, 2016. "A novel grey prognostic model based on Markov process and grey incidence analysis for energy conversion equipment degradation," Energy, Elsevier, vol. 109(C), pages 420-429.
    13. Eryilmaz, Serkan & Navarro, Jorge, 2022. "A decision theoretic framework for reliability-based optimal wind turbine selection," Reliability Engineering and System Safety, Elsevier, vol. 221(C).
    14. Zhuang, Liangliang & Xu, Ancha & Wang, Xiao-Lin, 2023. "A prognostic driven predictive maintenance framework based on Bayesian deep learning," Reliability Engineering and System Safety, Elsevier, vol. 234(C).
    15. Zheng, Rui & Najafi, Seyedvahid & Zhang, Yingzhi, 2022. "A recursive method for the health assessment of systems using the proportional hazards model," Reliability Engineering and System Safety, Elsevier, vol. 221(C).
    16. Vu, Hai Canh & Do, Phuc & Barros, Anne & Bérenguer, Christophe, 2014. "Maintenance grouping strategy for multi-component systems with dynamic contexts," Reliability Engineering and System Safety, Elsevier, vol. 132(C), pages 233-249.
    17. Pavel Y. Gubin & Vladislav P. Oboskalov & Anatolijs Mahnitko & Roman Petrichenko, 2020. "Simulated Annealing, Differential Evolution and Directed Search Methods for Generator Maintenance Scheduling," Energies, MDPI, vol. 13(20), pages 1-26, October.
    18. Do, Phuc & Vu, Hai Canh & Barros, Anne & Bérenguer, Christophe, 2015. "Maintenance grouping for multi-component systems with availability constraints and limited maintenance teams," Reliability Engineering and System Safety, Elsevier, vol. 142(C), pages 56-67.
    19. Li, Qing & Li, Mingchu & Tian, Yuan & Gan, Jianyuan, 2023. "A risk-averse tri-level stochastic model for locating and recovering facilities against attacks in an uncertain environment," Reliability Engineering and System Safety, Elsevier, vol. 229(C).
    20. Tseremoglou, Iordanis & Santos, Bruno F., 2024. "Condition-Based Maintenance scheduling of an aircraft fleet under partial observability: A Deep Reinforcement Learning approach," Reliability Engineering and System Safety, Elsevier, vol. 241(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:reensy:v:232:y:2023:i:c:s0951832022006962. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: https://www.journals.elsevier.com/reliability-engineering-and-system-safety .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.