IDEAS home Printed from https://ideas.repec.org/a/spr/snopef/v5y2024i4d10.1007_s43069-024-00377-x.html
   My bibliography  Save this article

A Hybrid Sobolev Gradient Method for Learning NODEs

Author

Listed:
  • George Baravdish

    (Linköping University)

  • Gabriel Eilertsen

    (Linköping University)

  • Rym Jaroudi

    (Linköping University)

  • B. Tomas Johansson

    (Linköping University)

  • Lukáš Malý

    (Linköping University)

  • Jonas Unger

    (Linköping University)

Abstract

The inverse problem of supervised reconstruction of depth-variable (time-dependent) parameters in ordinary differential equations is considered, with the typical application of finding weights of a neural ordinary differential equation (NODE) for a residual network with time continuous layers. The differential equation is treated as an abstract and isolated entity, termed a standalone NODE (sNODE), to facilitate for a wide range of applications. The proposed parameter reconstruction is performed by minimizing a cost functional covering a variety of loss functions and penalty terms. Regularization via penalty terms is incorporated to enhance ethical and trustworthy AI formulations. A nonlinear conjugate gradient mini-batch optimization scheme (NCG) is derived for the training having the benefit of including a sensitivity problem. The model (differential equation)-based approach is thus combined with a data-driven learning procedure. Mathematical properties are stated for the differential equation and the cost functional. The adjoint problem needed is derived together with the sensitivity problem. The sensitivity problem itself can estimate changes in the output under perturbation of the trained parameters. To preserve smoothness during the iterations, the Sobolev gradient is calculated and incorporated. Numerical results are included to validate the procedure for a NODE and synthetic datasets and compared with standard gradient approaches. For stability, using the sensitivity problem, a strategy for adversarial attacks is constructed, and it is shown that the given method with Sobolev gradients is more robust than standard approaches for parameter identification.

Suggested Citation

  • George Baravdish & Gabriel Eilertsen & Rym Jaroudi & B. Tomas Johansson & Lukáš Malý & Jonas Unger, 2024. "A Hybrid Sobolev Gradient Method for Learning NODEs," SN Operations Research Forum, Springer, vol. 5(4), pages 1-39, December.
  • Handle: RePEc:spr:snopef:v:5:y:2024:i:4:d:10.1007_s43069-024-00377-x
    DOI: 10.1007/s43069-024-00377-x
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s43069-024-00377-x
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s43069-024-00377-x?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Bernd Hofmann & Christopher Hofmann, 2020. "The Impact of the Discrepancy Principle on the Tikhonov-Regularized Solutions with Oversmoothing Penalties," Mathematics, MDPI, vol. 8(3), pages 1-16, March.
    2. Kamil A. Khan & Paul I. Barton, 2014. "Generalized Derivatives for Solutions of Parametric Ordinary Differential Equations with Non-differentiable Right-Hand Sides," Journal of Optimization Theory and Applications, Springer, vol. 163(2), pages 355-386, November.
    3. Neculai Andrei, 2020. "Nonlinear Conjugate Gradient Methods for Unconstrained Optimization," Springer Optimization and Its Applications, Springer, number 978-3-030-42950-8, December.
    4. Neculai Andrei, 2020. "General Convergence Results for Nonlinear Conjugate Gradient Methods," Springer Optimization and Its Applications, in: Nonlinear Conjugate Gradient Methods for Unconstrained Optimization, chapter 0, pages 89-123, Springer.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Wumei Sun & Hongwei Liu & Zexian Liu, 2021. "A Class of Accelerated Subspace Minimization Conjugate Gradient Methods," Journal of Optimization Theory and Applications, Springer, vol. 190(3), pages 811-840, September.
    2. Salihu, Nasiru & Kumam, Poom & Sulaiman, Ibrahim Mohammed & Arzuka, Ibrahim & Kumam, Wiyada, 2024. "An efficient Newton-like conjugate gradient method with restart strategy and its application," Mathematics and Computers in Simulation (MATCOM), Elsevier, vol. 226(C), pages 354-372.
    3. Abubakar, Auwal Bala & Kumam, Poom & Malik, Maulana & Ibrahim, Abdulkarim Hassan, 2022. "A hybrid conjugate gradient based approach for solving unconstrained optimization and motion control problems," Mathematics and Computers in Simulation (MATCOM), Elsevier, vol. 201(C), pages 640-657.
    4. Tiantian Zhao & Wei Hong Yang, 2023. "A Nonlinear Conjugate Gradient Method Using Inexact First-Order Information," Journal of Optimization Theory and Applications, Springer, vol. 198(2), pages 502-530, August.
    5. Jose Alberto Gomez & Kai Höffner & Kamil A. Khan & Paul I. Barton, 2018. "Generalized Derivatives of Lexicographic Linear Programs," Journal of Optimization Theory and Applications, Springer, vol. 178(2), pages 477-501, August.
    6. Peter Stechlinski, 2020. "Optimization-Constrained Differential Equations with Active Set Changes," Journal of Optimization Theory and Applications, Springer, vol. 187(1), pages 266-293, October.
    7. Peter G. Stechlinski & Paul I. Barton, 2016. "Generalized Derivatives of Differential–Algebraic Equations," Journal of Optimization Theory and Applications, Springer, vol. 171(1), pages 1-26, October.
    8. Vladimir Rakočević & Milena J. Petrović, 2022. "Comparative Analysis of Accelerated Models for Solving Unconstrained Optimization Problems with Application of Khan’s Hybrid Rule," Mathematics, MDPI, vol. 10(23), pages 1-13, November.
    9. Ackley, Matthew & Stechlinski, Peter, 2021. "Lexicographic derivatives of nonsmooth glucose-insulin kinetics under normal and artificial pancreatic responses," Applied Mathematics and Computation, Elsevier, vol. 395(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:snopef:v:5:y:2024:i:4:d:10.1007_s43069-024-00377-x. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.