IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v11y2023i5p1127-d1078787.html
   My bibliography  Save this article

Scene Recognition for Visually-Impaired People’s Navigation Assistance Based on Vision Transformer with Dual Multiscale Attention

Author

Listed:
  • Yahia Said

    (Remote Sensing Unit, College of Engineering, Northern Border University, Arar 91431, Saudi Arabia
    King Salman Center for Disability Research, Riyadh 11614, Saudi Arabia
    Laboratory of Electronics and Microelectronics (LR99ES30), University of Monastir, Monatir 5019, Tunisia)

  • Mohamed Atri

    (College of Computer Sciences, King Khalid University, Abha 62529, Saudi Arabia)

  • Marwan Ali Albahar

    (School of Computer Science, Umm Al-Qura University, Mecca 24382, Saudi Arabia)

  • Ahmed Ben Atitallah

    (Department of Electrical Engineering, College of Engineering, Jouf University, Sakaka 72388, Saudi Arabia)

  • Yazan Ahmad Alsariera

    (College of Science, Northern Border University, Arar 91431, Saudi Arabia)

Abstract

Notable progress was achieved by recent technologies. As the main goal of technology is to make daily life easier, we will investigate the development of an intelligent system for the assistance of impaired people in their navigation. For visually impaired people, navigating is a very complex task that requires assistance. To reduce the complexity of this task, it is preferred to provide information that allows the understanding of surrounding spaces. Particularly, recognizing indoor scenes such as a room, supermarket, or office provides a valuable guide to the visually impaired person to understand the surrounding environment. In this paper, we proposed an indoor scene recognition system based on recent deep learning techniques. Vision transformer (ViT) is a recent deep learning technique that has achieved high performance on image classification. So, it was deployed for indoor scene recognition. To achieve better performance and to reduce the computation complexity, we proposed dual multiscale attention to collect features at different scales for better representation. The main idea was to process small patches and large patches separately and a fusion technique was proposed to combine the features. The proposed fusion technique requires linear time for memory and computation compared to existing techniques that require quadratic time. To prove the efficiency of the proposed technique, extensive experiments were performed on a public dataset which is the MIT67 dataset. The achieved results demonstrated the superiority of the proposed technique compared to the state-of-the-art. Further, the proposed indoor scene recognition system is suitable for implementation on mobile devices with fewer parameters and FLOPs.

Suggested Citation

  • Yahia Said & Mohamed Atri & Marwan Ali Albahar & Ahmed Ben Atitallah & Yazan Ahmad Alsariera, 2023. "Scene Recognition for Visually-Impaired People’s Navigation Assistance Based on Vision Transformer with Dual Multiscale Attention," Mathematics, MDPI, vol. 11(5), pages 1-16, February.
  • Handle: RePEc:gam:jmathe:v:11:y:2023:i:5:p:1127-:d:1078787
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/11/5/1127/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/11/5/1127/
    Download Restriction: no
    ---><---

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:11:y:2023:i:5:p:1127-:d:1078787. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.