IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v11y2023i16p3437-d1212427.html
   My bibliography  Save this article

A Deep Reinforcement Learning Scheme for Spectrum Sensing and Resource Allocation in ITS

Author

Listed:
  • Huang Wei

    (The School of Computer Science and Engineering, Macau University of Science and Technology, Macau 999078, China)

  • Yuyang Peng

    (The School of Computer Science and Engineering, Macau University of Science and Technology, Macau 999078, China)

  • Ming Yue

    (The School of Computer Science and Engineering, Macau University of Science and Technology, Macau 999078, China)

  • Jiale Long

    (Faculty of Intelligent Manufacturing, Wuyi University, Jiangmen 529020, China)

  • Fawaz AL-Hazemi

    (Department of Computer and Network Engineering, University of Jeddah, Jeddah 21959, Saudi Arabia)

  • Mohammad Meraj Mirza

    (Department of Computer Science, College of Computers and Information Technology, Taif University, P.O. Box 11099, Taif 21944, Saudi Arabia)

Abstract

In recent years, the Internet of Vehicles (IoV) has been found to be of huge potential value in the promotion of the development of intelligent transportation systems (ITSs) and smart cities. However, the traditional scheme in IoV has difficulty in dealing with an uncertain environment, while reinforcement learning has the advantage of being able to deal with an uncertain environment. Spectrum resource allocation in IoV faces the uncertain environment in most cases. Therefore, this paper investigates the spectrum resource allocation problem by deep reinforcement learning after using spectrum sensing technology in the ITS, including the vehicle-to-infrastructure (V2I) link and the vehicle-to-vehicle (V2V) link. The spectrum resource allocation is modeled as a reinforcement learning-based multi-agent problem which is solved by using the soft actor critic (SAC) algorithm. Considered an agent, each V2V link interacts with the vehicle environment and makes a joint action. After that, each agent receives different observations as well as the same reward, and updates networks through the experiences from the memory. Therefore, during a certain time, each V2V link can optimize its spectrum allocation scheme to maximize the V2I capacity as well as increase the V2V payload delivery transmission rate. However, the number of SAC networks increases linearly as the number of V2V links increases, which means that the networks may have a problem in terms of convergence when there are an excessive number of V2V links. Consequently, a new algorithm, namely parameter sharing soft actor critic (PSSAC), is proposed to reduce the complexity for which the model is easier to converge. The simulation results show that both SAC and PSSAC can improve the V2I capacity and increase the V2V payload transmission success probability within a certain time. Specifically, these novel schemes have a 10 percent performance improvement compared with the existing scheme in the vehicular environment. Additionally, PSSAC has a lower complexity.

Suggested Citation

  • Huang Wei & Yuyang Peng & Ming Yue & Jiale Long & Fawaz AL-Hazemi & Mohammad Meraj Mirza, 2023. "A Deep Reinforcement Learning Scheme for Spectrum Sensing and Resource Allocation in ITS," Mathematics, MDPI, vol. 11(16), pages 1-15, August.
  • Handle: RePEc:gam:jmathe:v:11:y:2023:i:16:p:3437-:d:1212427
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/11/16/3437/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/11/16/3437/
    Download Restriction: no
    ---><---

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:11:y:2023:i:16:p:3437-:d:1212427. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.