IDEAS home Printed from https://ideas.repec.org/a/spr/comgts/v16y2019i4d10.1007_s10287-019-00357-1.html
   My bibliography  Save this article

A simultaneous perturbation weak derivative estimator for stochastic neural networks

Author

Listed:
  • Thomas Flynn

    (Brookhaven National Laboratory)

  • Felisa Vázquez-Abad

    (Hunter College)

Abstract

In this paper we study gradient estimation for a network of nonlinear stochastic units known as the Little model. Many machine learning systems can be described as networks of homogeneous units, and the Little model is of a particularly general form, which includes as special cases several popular machine learning architectures. However, since a closed form solution for the stationary distribution is not known, gradient methods which work for similar models such as the Boltzmann machine or sigmoid belief network cannot be used. To address this we introduce a method to calculate derivatives for this system based on measure-valued differentiation and simultaneous perturbation. This extends previous works in which gradient estimation algorithm’s were presented for networks with restrictive features like symmetry or acyclic connectivity.

Suggested Citation

  • Thomas Flynn & Felisa Vázquez-Abad, 2019. "A simultaneous perturbation weak derivative estimator for stochastic neural networks," Computational Management Science, Springer, vol. 16(4), pages 715-738, October.
  • Handle: RePEc:spr:comgts:v:16:y:2019:i:4:d:10.1007_s10287-019-00357-1
    DOI: 10.1007/s10287-019-00357-1
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10287-019-00357-1
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10287-019-00357-1?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. B. Heidergott & F. J. Vázquez-Abad, 2008. "Measure-Valued Differentiation for Markov Chains," Journal of Optimization Theory and Applications, Springer, vol. 136(2), pages 187-209, February.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Koch, Erwan & Robert, Christian Y., 2022. "Stochastic derivative estimation for max-stable random fields," European Journal of Operational Research, Elsevier, vol. 302(2), pages 575-588.
    2. Bernd Heidergott & Haralambie Leahu, 2010. "Weak Differentiability of Product Measures," Mathematics of Operations Research, INFORMS, vol. 35(1), pages 27-51, February.
    3. Bernd Heidergott & Taoying Farenhorst-Yuan, 2010. "Gradient Estimation for Multicomponent Maintenance Systems with Age-Replacement Policy," Operations Research, INFORMS, vol. 58(3), pages 706-718, June.
    4. Sandjai Bhulai & Taoying Farenhorst-Yuan & Bernd Heidergott & Dinard Laan, 2012. "Optimal balanced control for call centers," Annals of Operations Research, Springer, vol. 201(1), pages 39-62, December.
    5. Kloeden Peter E. & Sanz-Chacón Carlos, 2011. "Efficient price sensitivity estimation of financial derivatives by weak derivatives," Monte Carlo Methods and Applications, De Gruyter, vol. 17(1), pages 47-75, January.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:comgts:v:16:y:2019:i:4:d:10.1007_s10287-019-00357-1. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.