Printed from https://ideas.repec.org/a/gam/jmathe/v11y2023i8p1818-d1121094.html

CISA: Context Substitution for Image Semantics Augmentation

Author

Listed:
  • Sergey Nesteruk

    (Skolkovo Institute of Science and Technology (Skoltech), 121205 Moscow, Russia)

  • Ilya Zherebtsov

    (Voronezh State University of Engineering Technology (VSUET), 394036 Voronezh, Russia)

  • Svetlana Illarionova

    (Skolkovo Institute of Science and Technology (Skoltech), 121205 Moscow, Russia)

  • Dmitrii Shadrin

    (Skolkovo Institute of Science and Technology (Skoltech), 121205 Moscow, Russia
    Irkutsk National Research Technical University (INRTU), 664074 Irkutsk, Russia)

  • Andrey Somov

    (Skolkovo Institute of Science and Technology (Skoltech), 121205 Moscow, Russia)

  • Sergey V. Bezzateev

    (Saint-Petersburg State University of Aerospace Instrumentation (SUAI), 190000 Saint Petersburg, Russia)

  • Tatiana Yelina

    (Saint-Petersburg State University of Aerospace Instrumentation (SUAI), 190000 Saint Petersburg, Russia)

  • Vladimir Denisenko

    (Voronezh State University of Engineering Technology (VSUET), 394036 Voronezh, Russia)

  • Ivan Oseledets

    (Skolkovo Institute of Science and Technology (Skoltech), 121205 Moscow, Russia)

Abstract

Large datasets catalyze the rapid expansion of deep learning and computer vision. At the same time, many domains lack training data, which can become an obstacle to the practical application of deep computer vision models. A popular way to overcome this problem is image augmentation. When a dataset contains instance segmentation masks, instance-level augmentation becomes possible: an instance is cut from the original image and pasted onto new backgrounds. This article addresses a dataset in which the same objects appear in various domains. We introduce the Context Substitution for Image Semantics Augmentation framework (CISA), which focuses on choosing good background images. We compare several ways to find backgrounds that match the context of the test set, including Contrastive Language–Image Pre-Training (CLIP) image retrieval and diffusion image generation. We demonstrate that our augmentation method is effective for classification, segmentation, and object detection across different dataset complexities and model types. The average percentage increase in accuracy across all the tasks on a fruits and vegetables recognition dataset is 4.95%. Moreover, we show that the Fréchet Inception Distance (FID) metric correlates strongly with model accuracy and can help to choose better backgrounds without model training. The average negative correlation between model accuracy and the FID between the augmented and test datasets is 0.55 in our experiments.
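
The cut-and-paste step mentioned in the abstract can be illustrated with a minimal NumPy sketch. This is not the CISA implementation: the function name `paste_instance`, its parameters, and the toy arrays are hypothetical, and the sketch assumes a boolean instance mask with the same height and width as the source image and a background large enough to hold the instance at the chosen offset.

```python
import numpy as np


def paste_instance(image, mask, background, top=0, left=0):
    """Copy the masked pixels of `image` onto a copy of `background`.

    Hypothetical helper for illustration only.
    image:      (h, w, 3) array holding the source image.
    mask:       (h, w) boolean array, True where the instance is.
    background: (H, W, 3) array with H >= top + h and W >= left + w.
    """
    h, w = mask.shape
    if top + h > background.shape[0] or left + w > background.shape[1]:
        raise ValueError("instance does not fit into the background")
    out = background.copy()              # leave the original background untouched
    region = out[top:top + h, left:left + w]  # view into the output image
    region[mask] = image[mask]           # overwrite only the instance pixels
    return out


# Toy data: a 2x2 "instance" image pasted into a 4x4 black background.
image = np.full((2, 2, 3), 200, dtype=np.uint8)
mask = np.array([[True, False],
                 [False, True]])
background = np.zeros((4, 4, 3), dtype=np.uint8)

augmented = paste_instance(image, mask, background, top=1, left=1)
```

Only pixels under the mask are replaced, so the rest of the chosen background is preserved. How the background is chosen (for example by CLIP retrieval or diffusion generation, as the abstract describes) is outside the scope of this sketch.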

Suggested Citation

  • Sergey Nesteruk & Ilya Zherebtsov & Svetlana Illarionova & Dmitrii Shadrin & Andrey Somov & Sergey V. Bezzateev & Tatiana Yelina & Vladimir Denisenko & Ivan Oseledets, 2023. "CISA: Context Substitution for Image Semantics Augmentation," Mathematics, MDPI, vol. 11(8), pages 1-24, April.
  • Handle: RePEc:gam:jmathe:v:11:y:2023:i:8:p:1818-:d:1121094

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/11/8/1818/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/11/8/1818/
    Download Restriction: no
