IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v13y2025i1p151-d1559633.html
   My bibliography  Save this article

Deep Reinforcement Learning for Intraday Multireservoir Hydropower Management

Author

Listed:
  • Rodrigo Castro-Freibott

    (baobab soluciones, José Abascal 55, 28003 Madrid, Spain)

  • Álvaro García-Sánchez

    (Industrial Engineering, Business Administration and Statistics Department, Escuela Técnica Superior de Ingenieros Industriales, Universidad Politécnica de Madrid, José Gutierrez Abascal 2, 28006 Madrid, Spain)

  • Francisco Espiga-Fernández

    (Industrial Engineering, Business Administration and Statistics Department, Escuela Técnica Superior de Ingenieros Industriales, Universidad Politécnica de Madrid, José Gutierrez Abascal 2, 28006 Madrid, Spain)

  • Guillermo González-Santander de la Cruz

    (baobab soluciones, José Abascal 55, 28003 Madrid, Spain)

Abstract

This study investigates the application of Reinforcement Learning (RL) to optimize intraday operations of hydropower reservoirs. Unlike previous approaches that focus on long-term planning with coarse temporal resolutions and discretized state-action spaces, we propose an RL framework tailored to the Hydropower Reservoirs Intraday Economic Optimization problem. This framework manages continuous state-action spaces while accounting for fine-grained temporal dynamics, including dam-to-turbine delays, gate movement constraints, and power group operations. Our methodology evaluates three distinct action space formulations (continuous, discrete, and adjustments) implemented using modern RL algorithms (A2C, PPO, and SAC). We compare them against both a greedy baseline and Mixed-Integer Linear Programming (MILP) solutions. Experiments on real-world data from a two-reservoir system and a simulated six-reservoir system demonstrate that while MILP achieves superior performance in the smaller system, its performance degrades significantly when scaled to six reservoirs. In contrast, RL agents, particularly those using discrete action spaces and trained with PPO, maintain consistent performance across both configurations, achieving considerable improvements with less than one second of execution time. These results suggest that RL offers a scalable alternative to traditional optimization methods for hydropower operations, particularly in scenarios requiring real-time decision making or involving larger systems.

Suggested Citation

  • Rodrigo Castro-Freibott & Álvaro García-Sánchez & Francisco Espiga-Fernández & Guillermo González-Santander de la Cruz, 2025. "Deep Reinforcement Learning for Intraday Multireservoir Hydropower Management," Mathematics, MDPI, vol. 13(1), pages 1-18, January.
  • Handle: RePEc:gam:jmathe:v:13:y:2025:i:1:p:151-:d:1559633
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/13/1/151/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/13/1/151/
    Download Restriction: no
    ---><---

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:13:y:2025:i:1:p:151-:d:1559633. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.