IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v10y2022i24p4735-d1002392.html
   My bibliography  Save this article

Semantic Segmentation of UAV Images Based on Transformer Framework with Context Information

Author

Listed:
  • Satyawant Kumar

    (Department of Artificial Intelligence, Kyungpook National University, Daegu 41566, Republic of Korea)

  • Abhishek Kumar

    (Department of Artificial Intelligence, Kyungpook National University, Daegu 41566, Republic of Korea)

  • Dong-Gyu Lee

    (Department of Artificial Intelligence, Kyungpook National University, Daegu 41566, Republic of Korea)

Abstract

With the advances in Unmanned Aerial Vehicles (UAVs) technology, aerial images with huge variations in the appearance of objects and complex backgrounds have opened a new direction of work for researchers. The task of semantic segmentation becomes more challenging when capturing inherent features in the global and local context for UAV images. In this paper, we proposed a transformer-based encoder-decoder architecture to address this issue for the precise segmentation of UAV images. The inherent feature representation of the UAV images is exploited in the encoder network using a self-attention-based transformer framework to capture long-range global contextual information. A Token Spatial Information Fusion (TSIF) module is proposed to take advantage of a convolution mechanism that can capture local details. It fuses the local contextual details about the neighboring pixels with the encoder network and makes semantically rich feature representations. We proposed a decoder network that processes the output of the encoder network for the final semantic level prediction of each pixel. We demonstrate the effectiveness of this architecture on UAVid and Urban Drone datasets, where we achieved mIoU of 61.93% and 73.65%, respectively.

Suggested Citation

  • Satyawant Kumar & Abhishek Kumar & Dong-Gyu Lee, 2022. "Semantic Segmentation of UAV Images Based on Transformer Framework with Context Information," Mathematics, MDPI, vol. 10(24), pages 1-17, December.
  • Handle: RePEc:gam:jmathe:v:10:y:2022:i:24:p:4735-:d:1002392
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/10/24/4735/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/10/24/4735/
    Download Restriction: no
    ---><---

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Athul Shibu & Dong-Gyu Lee, 2023. "EvolveNet: Evolving Networks by Learning Scale of Depth and Width," Mathematics, MDPI, vol. 11(16), pages 1-14, August.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:10:y:2022:i:24:p:4735-:d:1002392. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.