IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2308.01910.html
   My bibliography  Save this paper

Deep Policy Gradient Methods in Commodity Markets

Author

Listed:
  • Jonas Hanetho

Abstract

The energy transition has increased the reliance on intermittent energy sources, destabilizing energy markets and causing unprecedented volatility, culminating in the global energy crisis of 2021. In addition to harming producers and consumers, volatile energy markets may jeopardize vital decarbonization efforts. Traders play an important role in stabilizing markets by providing liquidity and reducing volatility. Several mathematical and statistical models have been proposed for forecasting future returns. However, developing such models is non-trivial due to financial markets' low signal-to-noise ratios and nonstationary dynamics. This thesis investigates the effectiveness of deep reinforcement learning methods in commodities trading. It formalizes the commodities trading problem as a continuing discrete-time stochastic dynamical system. This system employs a novel time-discretization scheme that is reactive and adaptive to market volatility, providing better statistical properties for the sub-sampled financial time series. Two policy gradient algorithms, an actor-based and an actor-critic-based, are proposed for optimizing a transaction-cost- and risk-sensitive trading agent. The agent maps historical price observations to market positions through parametric function approximators utilizing deep neural network architectures, specifically CNNs and LSTMs. On average, the deep reinforcement learning models produce an 83 percent higher Sharpe ratio than the buy-and-hold baseline when backtested on front-month natural gas futures from 2017 to 2022. The backtests demonstrate that the risk tolerance of the deep reinforcement learning agents can be adjusted using a risk-sensitivity term. The actor-based policy gradient algorithm performs significantly better than the actor-critic-based algorithm, and the CNN-based models perform slightly better than those based on the LSTM.

Suggested Citation

  • Jonas Hanetho, 2023. "Deep Policy Gradient Methods in Commodity Markets," Papers 2308.01910, arXiv.org.
  • Handle: RePEc:arx:papers:2308.01910
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2308.01910
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Benoit Mandelbrot & Howard M. Taylor, 1967. "On the Distribution of Stock Price Differences," Operations Research, INFORMS, vol. 15(6), pages 1057-1062, December.
    2. Hasbrouck, Joel, 2007. "Empirical Market Microstructure: The Institutions, Economics, and Econometrics of Securities Trading," OUP Catalogue, Oxford University Press, number 9780195301649.
    3. Zhengyao Jiang & Dixing Xu & Jinjun Liang, 2017. "A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem," Papers 1706.10059, arXiv.org, revised Jul 2017.
    4. Makridakis, Spyros & Spiliotis, Evangelos & Assimakopoulos, Vassilios, 2022. "M5 accuracy competition: Results, findings, and conclusions," International Journal of Forecasting, Elsevier, vol. 38(4), pages 1346-1364.
    5. Sinsel, Simon R. & Riemke, Rhea L. & Hoffmann, Volker H., 2020. "Challenges and solution technologies for the integration of variable renewable energy sources—a review," Renewable Energy, Elsevier, vol. 145(C), pages 2271-2285.
    6. Clark, Peter K, 1973. "A Subordinated Stochastic Process Model with Finite Variance for Speculative Prices," Econometrica, Econometric Society, vol. 41(1), pages 135-155, January.
    7. Fama, Eugene F, 1970. "Efficient Capital Markets: A Review of Theory and Empirical Work," Journal of Finance, American Finance Association, vol. 25(2), pages 383-417, May.
    8. Thierry Ané & Hélyette Geman, 2000. "Order Flow, Transaction Clock, and Normality of Asset Returns," Journal of Finance, American Finance Association, vol. 55(5), pages 2259-2284, October.
    9. Sima Siami-Namini & Akbar Siami Namin, 2018. "Forecasting Economics and Financial Time Series: ARIMA vs. LSTM," Papers 1803.06386, arXiv.org.
    10. Chien Yi Huang, 2018. "Financial Trading as a Game: A Deep Reinforcement Learning Approach," Papers 1807.02787, arXiv.org.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Jonas Hanetho, 2023. "Commodities Trading through Deep Policy Gradient Methods," Papers 2309.00630, arXiv.org.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. repec:spo:wpmain:info:hdl:2441/f6h8764enu2lskk9p4oq9ig8k is not listed on IDEAS
    2. Jean-Philippe Bouchaud & J. Doyne Farmer & Fabrizio Lillo, 2008. "How markets slowly digest changes in supply and demand," Papers 0809.0822, arXiv.org.
    3. Leal, Sandrine Jacob & Napoletano, Mauro, 2019. "Market stability vs. market resilience: Regulatory policies experiments in an agent-based model with low- and high-frequency trading," Journal of Economic Behavior & Organization, Elsevier, vol. 157(C), pages 15-41.
    4. repec:hal:spmain:info:hdl:2441/f6h8764enu2lskk9p4oq9ig8k is not listed on IDEAS
    5. Sandrine Jacob Leal & Mauro Napoletano & Andrea Roventini & Giorgio Fagiolo, 2016. "Rock around the clock: An agent-based model of low- and high-frequency trading," Journal of Evolutionary Economics, Springer, vol. 26(1), pages 49-76, March.
    6. Gabriele La Spada & J. Doyne Farmer & Fabrizio Lillo, 2010. "Tick size and price diffusion," Papers 1009.2329, arXiv.org, revised Oct 2010.
    7. Joann Jasiak, 2003. "First‐Order Autoregressive Processes with Heterogeneous Persistence," Journal of Time Series Analysis, Wiley Blackwell, vol. 24(3), pages 283-309, May.
    8. Feng-Tse Tsai, 2019. "Option Implied Stock Buy-Side and Sell-Side Market Depths," Risks, MDPI, vol. 7(4), pages 1-16, October.
    9. Aldrich, Eric M. & Heckenbach, Indra & Laughlin, Gregory, 2016. "A compound duration model for high-frequency asset returns," Journal of Empirical Finance, Elsevier, vol. 39(PA), pages 105-128.
    10. Senarathne, Chamil W & Jayasinghe, Prabhath, 2017. "Information Flow Interpretation of Heteroskedasticity for Capital Asset Pricing: An Expectation-based View of Risk," MPRA Paper 78771, University Library of Munich, Germany, revised 04 Apr 2017.
    11. Scalas, Enrico, 2006. "The application of continuous-time random walks in finance and economics," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 362(2), pages 225-239.
    12. Aldrich, Eric M. & Lee, Seung, 2018. "Relative spread and price discovery," Journal of Empirical Finance, Elsevier, vol. 48(C), pages 81-98.
    13. Seemann, Lars & McCauley, Joseph L. & Gunaratne, Gemunu H., 2011. "Intraday volatility and scaling in high frequency foreign exchange markets," International Review of Financial Analysis, Elsevier, vol. 20(3), pages 121-126, June.
    14. Degiannakis, Stavros & Xekalaki, Evdokia, 2004. "Autoregressive Conditional Heteroskedasticity (ARCH) Models: A Review," MPRA Paper 80487, University Library of Munich, Germany.
    15. Eric M. Aldrich & Indra Heckenbach & Gregory Laughlin, 2014. "A Compound Multifractal Model for High-Frequency Asset Returns," BYU Macroeconomics and Computational Laboratory Working Paper Series 2014-05, Brigham Young University, Department of Economics, BYU Macroeconomics and Computational Laboratory.
    16. Gabaix, Xavier & Gopikrishnan, Parameswaran & Plerou, Vasiliki & Eugene Stanley, H., 2008. "Quantifying and understanding the economics of large financial movements," Journal of Economic Dynamics and Control, Elsevier, vol. 32(1), pages 303-319, January.
    17. James B. Glattfelder & Anton Golub, 2022. "Bridging the Gap: Decoding the Intrinsic Nature of Time in Market Data," Papers 2204.02682, arXiv.org.
    18. Jonas Hanetho, 2023. "Commodities Trading through Deep Policy Gradient Methods," Papers 2309.00630, arXiv.org.
    19. Rene Carmona & Kevin Webster, 2017. "The microstructure of high frequency markets," Papers 1709.02015, arXiv.org.
    20. Aki-Hiro Sato & Takaki Hayashi & Janusz Hołyst, 2012. "Comprehensive analysis of market conditions in the foreign exchange market," Journal of Economic Interaction and Coordination, Springer;Society for Economic Science with Heterogeneous Interacting Agents, vol. 7(2), pages 167-179, October.
    21. Cornelis A. Los, 2004. "Nonparametric Efficiency Testing of Asian Stock Markets Using Weekly Data," Finance 0409033, University Library of Munich, Germany.
    22. Abhinava Tripathi, 2021. "The Arrival of Information and Price Adjustment Across Extreme Quantiles: Global Evidence," IIM Kozhikode Society & Management Review, , vol. 10(1), pages 7-19, January.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2308.01910. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.