IDEAS home Printed from https://ideas.repec.org/a/spr/joptap/v203y2024i1d10.1007_s10957-024-02521-3.html
   My bibliography  Save this article

Linear RNNs Provably Learn Linear Dynamical Systems

Author

Listed:
  • Lifu Wang

    (China Information Technology Security Evaluation Center (CNITSEC))

  • Tianyu Wang

    (China Information Technology Security Evaluation Center (CNITSEC))

  • Shengwei Yi

    (China Information Technology Security Evaluation Center (CNITSEC))

  • Bo Shen

    (Beijing Jiaotong University)

  • Bo Hu

    (Beijing Jiaotong University)

  • Xing Cao

    (Beijing Jiaotong University)

Abstract

In this paper, we investigate the learning abilities of linear recurrent neural networks (RNNs) trained using Gradient Descent. We present a theoretical guarantee demonstrating that these linear RNNs can effectively learn any stable linear dynamical system with polynomial complexity. Importantly, our derived generalization error bound is independent of the episode length. For any stable linear system with a transition matrix C characterized by a parameter $$\rho _C$$ ρ C related to the spectral radius, we prove that despite the non-convexity of the parameter optimization loss, a linear RNN can learn the system with polynomial sample and time complexity in $$\frac{1}{1-\rho _C}$$ 1 1 - ρ C , provided that the RNN has sufficient width. Notably, the required width of the hidden layers does not depend on the length of the input sequence. This work provides the first rigorous theoretical foundation for learning linear RNNs. Our findings suggest that linear RNNs are capable of efficiently learning complex dynamical systems, paving the way for further research into the learning capabilities of more general RNN architectures.

Suggested Citation

  • Lifu Wang & Tianyu Wang & Shengwei Yi & Bo Shen & Bo Hu & Xing Cao, 2024. "Linear RNNs Provably Learn Linear Dynamical Systems," Journal of Optimization Theory and Applications, Springer, vol. 203(1), pages 488-528, October.
  • Handle: RePEc:spr:joptap:v:203:y:2024:i:1:d:10.1007_s10957-024-02521-3
    DOI: 10.1007/s10957-024-02521-3
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10957-024-02521-3
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10957-024-02521-3?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:joptap:v:203:y:2024:i:1:d:10.1007_s10957-024-02521-3. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.