Printed from https://ideas.repec.org/a/gam/jmathe/v11y2023i18p3893-d1238757.html

A Deep Reinforcement Learning Approach to Optimal Morphologies Generation in Reconfigurable Tiling Robots

Authors

Listed:
  • Manivannan Kalimuthu

    (ROAR Lab, Engineering Product Development Pillar, Singapore University of Technology and Design (SUTD), Singapore 487372, Singapore)

  • Abdullah Aamir Hayat

    (ROAR Lab, Engineering Product Development Pillar, Singapore University of Technology and Design (SUTD), Singapore 487372, Singapore)

  • Thejus Pathmakumar

    (ROAR Lab, Engineering Product Development Pillar, Singapore University of Technology and Design (SUTD), Singapore 487372, Singapore)

  • Mohan Rajesh Elara

    (ROAR Lab, Engineering Product Development Pillar, Singapore University of Technology and Design (SUTD), Singapore 487372, Singapore)

  • Kristin Lee Wood

    (College of Engineering, Design and Computing, University of Colorado Denver, 1200 Larimer St, Ste. 3034, Denver, CO 80204, USA)

Abstract

Reconfigurable robots have the potential to perform complex tasks by adapting their morphology to different environments. However, designing optimal morphologies for these robots is challenging due to the large design space and the complex interactions between the robot and the environment. An in-house robot named Smorphi, which has four holonomic mobile units connected by three hinge joints, is designed using transformation design principles (TDP) to maximize area coverage with its shape-changing features. A reinforcement learning (RL) approach is used to identify the optimal morphologies for a given task from the vast number of hinge-angle combinations by maximizing a reward signal that reflects the robot’s performance. The proposed approach involves three steps: (i) modeling the Smorphi design space as a Markov decision process (MDP) for sequential decision-making; (ii) using a footprint-based complete coverage path planner to compute coverage and path length metrics for various Smorphi morphologies; and (iii) optimizing policies through the proximal policy optimization (PPO) and asynchronous advantage actor–critic (A3C) reinforcement learning techniques, which generates energy-efficient, optimal Smorphi robot configurations by maximizing rewards. The proposed approach is applied and validated on two different environment maps, and the results are compared with suboptimal random shapes and with Pareto front solutions obtained using NSGA-II. The study contributes to the field of reconfigurable robots by providing a systematic approach for generating optimal morphologies that can improve the performance of reconfigurable robots in a variety of tasks.
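The three steps in the abstract can be illustrated with a minimal toy sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the discretization of each hinge into four angle indices, the action set, the reward weights, and the `toy_planner` stand-in for the footprint-based coverage path planner are all hypothetical. The sketch also swaps in exhaustive search for PPO and A3C, which is only feasible because the toy state space has 64 states.

```python
from itertools import product
from typing import Callable, Tuple

# Assumption: each of the three hinges takes one of N_ANGLES discrete angle indices.
N_ANGLES = 4
State = Tuple[int, int, int]   # one angle index per hinge
Action = Tuple[int, int]       # (hinge index, step of -1 or +1)

# Hypothetical action set: rotate one hinge by one discrete step in either direction.
ACTIONS = [(hinge, delta) for hinge in range(3) for delta in (-1, 1)]


def step(state: State, action: Action) -> State:
    """Deterministic MDP transition: apply one hinge step, wrapping around."""
    hinge, delta = action
    angles = list(state)
    angles[hinge] = (angles[hinge] + delta) % N_ANGLES
    return (angles[0], angles[1], angles[2])


def reward(coverage: float, path_length: float,
           w_cov: float = 1.0, w_len: float = 0.1) -> float:
    """Hypothetical scalar reward: favor coverage, penalize path length."""
    return w_cov * coverage - w_len * path_length


def toy_planner(state: State) -> Tuple[float, float]:
    """Placeholder for the coverage path planner; the numbers are made up."""
    coverage = 50.0 + 10.0 * len(set(state))
    path_length = 100.0 + 5.0 * sum(state)
    return coverage, path_length


def best_morphology(planner: Callable[[State], Tuple[float, float]]) -> State:
    """Brute-force baseline over all hinge configurations (not PPO or A3C)."""
    states = product(range(N_ANGLES), repeat=3)
    return max(states, key=lambda s: reward(*planner(s)))


if __name__ == "__main__":
    print(step((0, 0, 0), (1, -1)))
    print(best_morphology(toy_planner))
```

In a learned setting, an agent would choose among `ACTIONS` at each step and receive `reward` from the planner's metrics instead of enumerating every state; the brute-force search only serves as a reference against which such a policy could be checked on small problems.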

Suggested Citation

  • Manivannan Kalimuthu & Abdullah Aamir Hayat & Thejus Pathmakumar & Mohan Rajesh Elara & Kristin Lee Wood, 2023. "A Deep Reinforcement Learning Approach to Optimal Morphologies Generation in Reconfigurable Tiling Robots," Mathematics, MDPI, vol. 11(18), pages 1-22, September.
  • Handle: RePEc:gam:jmathe:v:11:y:2023:i:18:p:3893-:d:1238757

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/11/18/3893/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/11/18/3893/
    Download Restriction: no

    References listed on IDEAS

    1. Manivannan Kalimuthu & Thejus Pathmakumar & Abdullah Aamir Hayat & Prabakaran Veerajagadheswar & Mohan Rajesh Elara & Kristin Lee Wood, 2023. "Optimal Morphologies of n-Omino-Based Reconfigurable Robot for Area Coverage Task Using Metaheuristic Optimization," Mathematics, MDPI, vol. 11(4), pages 1-23, February.
    2. Abdullah Aamir Hayat & Parasuraman Karthikeyan & Manuel Vega-Heredia & Mohan Rajesh Elara, 2019. "Modeling and Assessing of Self-Reconfigurable Cleaning Robot hTetro Based on Energy Consumption," Energies, MDPI, vol. 12(21), pages 1-19, October.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Lim Yi & Anh Vu Le & Joel Chan Cheng Hoong & Abdullah Aamir Hayat & Balakrishnan Ramalingam & Rajesh Elara Mohan & Kristor Leong & Karthikeyan Elangovan & Minh Tran & Minh V. Bui & Phan Van Duc, 2022. "Multi-Objective Instantaneous Center of Rotation Optimization Using Sensors Feedback for Navigation in Self-Reconfigurable Pavement Sweeping Robot," Mathematics, MDPI, vol. 10(17), pages 1-22, September.
    2. Manivannan Kalimuthu & Thejus Pathmakumar & Abdullah Aamir Hayat & Prabakaran Veerajagadheswar & Mohan Rajesh Elara & Kristin Lee Wood, 2023. "Optimal Morphologies of n-Omino-Based Reconfigurable Robot for Area Coverage Task Using Metaheuristic Optimization," Mathematics, MDPI, vol. 11(4), pages 1-23, February.
