IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v12y2024i19p3032-d1487923.html
   My bibliography  Save this article

Optimizing Convolutional Neural Network Architectures

Author

Listed:
  • Luis Balderas

    (Department of Computer Science and Artificial Intelligence, University of Granada, 18071 Granada, Spain
    Distributed Computational Intelligence and Time Series Lab, University of Granada, 18071 Granada, Spain
    Sport and Health University Research Institute, University of Granada, 18071 Granada, Spain
    Andalusian Research Institute in Data Science and Computational Intelligence, University of Granada, 18071 Granada, Spain)

  • Miguel Lastra

    (Distributed Computational Intelligence and Time Series Lab, University of Granada, 18071 Granada, Spain
    Sport and Health University Research Institute, University of Granada, 18071 Granada, Spain
    Andalusian Research Institute in Data Science and Computational Intelligence, University of Granada, 18071 Granada, Spain
    Department of Software Engineering, University of Granada, 18071 Granada, Spain)

  • José M. Benítez

    (Department of Computer Science and Artificial Intelligence, University of Granada, 18071 Granada, Spain
    Distributed Computational Intelligence and Time Series Lab, University of Granada, 18071 Granada, Spain
    Sport and Health University Research Institute, University of Granada, 18071 Granada, Spain
    Andalusian Research Institute in Data Science and Computational Intelligence, University of Granada, 18071 Granada, Spain)

Abstract

Convolutional neural networks (CNNs) are commonly employed for demanding applications, such as speech recognition, natural language processing, and computer vision. As CNN architectures become more complex, their computational demands grow, leading to substantial energy consumption and complicating their use on devices with limited resources (e.g., edge devices). Furthermore, a new line of research seeking more sustainable approaches to Artificial Intelligence development and research is increasingly drawing attention: Green AI. Motivated by an interest in optimizing Machine Learning models, in this paper, we propose Optimizing Convolutional Neural Network Architectures (OCNNA). It is a novel CNN optimization and construction method based on pruning designed to establish the importance of convolutional layers. The proposal was evaluated through a thorough empirical study including the best known datasets (CIFAR-10, CIFAR-100, and Imagenet) and CNN architectures (VGG-16, ResNet-50, DenseNet-40, and MobileNet), setting accuracy drop and the remaining parameters ratio as objective metrics to compare the performance of OCNNA with the other state-of-the-art approaches. Our method was compared with more than 20 convolutional neural network simplification algorithms, obtaining outstanding results. As a result, OCNNA is a competitive CNN construction method which could ease the deployment of neural networks on the IoT or resource-limited devices.

Suggested Citation

  • Luis Balderas & Miguel Lastra & José M. Benítez, 2024. "Optimizing Convolutional Neural Network Architectures," Mathematics, MDPI, vol. 12(19), pages 1-19, September.
  • Handle: RePEc:gam:jmathe:v:12:y:2024:i:19:p:3032-:d:1487923
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/12/19/3032/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/12/19/3032/
    Download Restriction: no
    ---><---

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:12:y:2024:i:19:p:3032-:d:1487923. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.