
Glider soaring via reinforcement learning in the field

Author

Listed:
  • Gautam Reddy

    (University of California, San Diego)

  • Jerome Wong-Ng

    (University of California, San Diego)

  • Antonio Celani

    (The Abdus Salam International Center for Theoretical Physics)

  • Terrence J. Sejnowski

    (The Salk Institute for Biological Studies; University of California, San Diego)

  • Massimo Vergassola

    (University of California, San Diego)

Abstract

Soaring birds often rely on ascending thermal plumes (thermals) in the atmosphere as they search for prey or migrate across large distances [1–4]. The landscape of convective currents is rugged and shifts on timescales of a few minutes as thermals constantly form, disintegrate or are transported away by the wind [5,6]. How soaring birds find and navigate thermals within this complex landscape is unknown. Reinforcement learning [7] provides an appropriate framework in which to identify an effective navigational strategy as a sequence of decisions made in response to environmental cues. Here we use reinforcement learning to train a glider in the field to navigate atmospheric thermals autonomously. We equipped a glider of two-metre wingspan with a flight controller that precisely controlled the bank angle and pitch, modulating these at intervals with the aim of gaining as much lift as possible. A navigational strategy was determined solely from the glider’s pooled experiences, collected over several days in the field. The strategy relies on on-board methods to accurately estimate the local vertical wind accelerations and the roll-wise torques on the glider, which serve as navigational cues. We establish the validity of our learned flight policy through field experiments, numerical simulations and estimates of the noise in measurements caused by atmospheric turbulence. Our results highlight the role of vertical wind accelerations and roll-wise torques as effective mechanosensory cues for soaring birds and provide a navigational strategy that is directly applicable to the development of autonomous soaring vehicles.

Suggested Citation

  • Gautam Reddy & Jerome Wong-Ng & Antonio Celani & Terrence J. Sejnowski & Massimo Vergassola, 2018. "Glider soaring via reinforcement learning in the field," Nature, Nature, vol. 562(7726), pages 236-239, October.
  • Handle: RePEc:nat:nature:v:562:y:2018:i:7726:d:10.1038_s41586-018-0533-0
    DOI: 10.1038/s41586-018-0533-0

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41586-018-0533-0
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1038/s41586-018-0533-0?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item


    Citations



    Cited by:

    1. Yoav Flato & Roi Harel & Aviv Tamar & Ran Nathan & Tsevi Beatus, 2024. "Revealing principles of autonomous thermal soaring in windy conditions using vulture-inspired deep reinforcement-learning," Nature Communications, Nature, vol. 15(1), pages 1-14, December.
    2. Joshua D. McGraw & Donsuk Lee & Justin N. Wood, 2024. "Parallel development of social behavior in biological and artificial fish," Nature Communications, Nature, vol. 15(1), pages 1-16, December.
