Printed from https://ideas.repec.org/a/gam/jgeogr/v4y2024i3p25-480d1445012.html

MSCAC: A Multi-Scale Swin–CNN Framework for Progressive Remote Sensing Scene Classification

Author

Listed:
  • A. Arun Solomon

    (Department of Civil Engineering, GMR Institute of Technology, Rajam 532127, India)

  • S. Akila Agnes

    (Department of Computer Science and Engineering, GMR Institute of Technology, Rajam 532127, India)

Abstract

Deep learning has substantially improved remote sensing scene classification, a core task in remote sensing applications. This study presents the Multi-Scale Swin–CNN Aerial Classifier (MSCAC), an aerial scene classification model built on the Swin Transformer, an architecture that has performed strongly across a range of computer vision applications. The Swin Transformer uses a shifted-window mechanism to model both long-range dependencies and local features efficiently, which suits the complex and varied textures of aerial imagery. The proposed framework integrates the Swin Transformer with a multi-scale strategy so that it can extract robust features from aerial images of different resolutions and contexts, capturing spatial hierarchies and diverse scene characteristics at multiple scales. The model thereby learns from both global structures and fine-grained details, both of which are needed for accurate scene classification. Evaluated on several benchmark datasets, including UC-Merced, WHU-RS19, RSSCN7, and AID, it achieves accuracy superior or comparable to that of state-of-the-art models. MSCAC adapts to varying amounts of training data and improves as more data become available, which makes it a promising tool for real-world remote sensing applications such as land cover mapping, urban planning, and environmental monitoring. More broadly, the study highlights the potential of integrating advanced deep-learning architectures like the Swin Transformer into aerial scene classification.
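
The shifted-window idea mentioned in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names (`window_partition`, `shifted_window_partition`), the 8x8 toy feature map, and the window size of 4 are assumptions chosen for illustration. It shows only how a feature map is split into non-overlapping windows and how a cyclic shift by half a window changes which pixels share a window. The full Swin Transformer also computes self-attention inside each window and masks attention between pixels that become neighbours only through the wrap-around; neither step is shown here.

```python
import numpy as np


def window_partition(x, window_size):
    """Split an (H, W, C) feature map into non-overlapping square windows.

    Returns an array of shape (num_windows, window_size, window_size, C).
    """
    h, w, c = x.shape
    assert h % window_size == 0 and w % window_size == 0
    x = x.reshape(h // window_size, window_size, w // window_size, window_size, c)
    # Bring the two window-grid axes together, then flatten them.
    return x.transpose(0, 2, 1, 3, 4).reshape(-1, window_size, window_size, c)


def shifted_window_partition(x, window_size):
    """Cyclically shift the map by half a window, then partition it.

    The shift makes the new windows straddle the borders of the
    unshifted windows, so information can mix across those borders.
    """
    shift = window_size // 2
    shifted = np.roll(x, shift=(-shift, -shift), axis=(0, 1))
    return window_partition(shifted, window_size)


# Toy 8x8 single-channel feature map (illustrative values only).
feature_map = np.arange(8 * 8, dtype=np.float32).reshape(8, 8, 1)
regular = window_partition(feature_map, window_size=4)
shifted = shifted_window_partition(feature_map, window_size=4)
```

With a window size of 4, both calls return four 4x4 windows, but the shifted windows start two pixels further down and to the right, so each one covers parts of four different unshifted windows.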

Suggested Citation

  • A. Arun Solomon & S. Akila Agnes, 2024. "MSCAC: A Multi-Scale Swin–CNN Framework for Progressive Remote Sensing Scene Classification," Geographies, MDPI, vol. 4(3), pages 1-19, July.
  • Handle: RePEc:gam:jgeogr:v:4:y:2024:i:3:p:25-480:d:1445012

    Download full text from publisher

    File URL: https://www.mdpi.com/2673-7086/4/3/25/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2673-7086/4/3/25/
    Download Restriction: no