Printed from https://ideas.repec.org/a/gam/jftint/v11y2019i2p42-d205468.html

3D-CNN-Based Fused Feature Maps with LSTM Applied to Action Recognition

Author

Listed:
  • Sheeraz Arif

    (Information and Communication Engineering, Beijing Institute of Technology, Beijing 100081, China)

  • Jing Wang

    (Information and Communication Engineering, Beijing Institute of Technology, Beijing 100081, China)

  • Tehseen Ul Hassan

    (Information and Communication Engineering, Beijing Institute of Technology, Beijing 100081, China)

  • Zesong Fei

    (Information and Communication Engineering, Beijing Institute of Technology, Beijing 100081, China)

Abstract

Human activity recognition is an active field of research in computer vision with numerous applications. Recently, deep convolutional networks and recurrent neural networks (RNNs) have received increasing attention in multimedia studies and have yielded state-of-the-art results. In this work, we propose a new framework that combines 3D-CNN and LSTM networks. First, we integrate discriminative information from a video into a map called a ‘motion map’ by using a deep 3-dimensional convolutional network (C3D). A motion map and the next video frame can be integrated into a new motion map; the network is trained by iteratively increasing the length of the training videos, and the final network is then used to generate the motion map of the whole video. Next, a linear weighted fusion scheme fuses the network feature maps into spatio-temporal features. Finally, we use a Long Short-Term Memory (LSTM) encoder-decoder for final predictions. The method is simple to implement and retains discriminative and dynamic information. Improved results on public benchmark datasets demonstrate the effectiveness and practicality of the proposed method.
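
The data flow described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract gives no network details, so the learned C3D integration step is replaced here by a fixed convex blend, and every name (`blend`, `build_motion_map`, `linear_weighted_fusion`, `alpha`, the flat-list representation of a frame) is a hypothetical choice made for illustration only. The sketch shows two things: folding a video into a single map by repeatedly merging the current map with the next frame, and fusing feature maps as a normalised weighted sum.

```python
from typing import Callable, List, Sequence

# Assumption: a frame or feature map is represented as a flat list of floats.
Frame = List[float]


def blend(motion_map: Frame, frame: Frame, alpha: float = 0.5) -> Frame:
    """Stand-in for the learned network: a fixed convex blend of map and frame."""
    return [alpha * m + (1.0 - alpha) * f for m, f in zip(motion_map, frame)]


def build_motion_map(
    frames: Sequence[Frame],
    integrate: Callable[[Frame, Frame], Frame] = blend,
) -> Frame:
    """Fold a video into one map by merging the current map with each next frame."""
    if not frames:
        raise ValueError("video must contain at least one frame")
    motion_map = list(frames[0])
    for frame in frames[1:]:
        motion_map = integrate(motion_map, frame)
    return motion_map


def linear_weighted_fusion(
    feature_maps: Sequence[Frame], weights: Sequence[float]
) -> Frame:
    """Fuse equally sized feature maps as a weighted sum with normalised weights."""
    if not feature_maps or len(feature_maps) != len(weights):
        raise ValueError("need exactly one weight per feature map")
    size = len(feature_maps[0])
    if any(len(fm) != size for fm in feature_maps):
        raise ValueError("feature maps must all have the same size")
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    norm = [w / total for w in weights]
    return [sum(w * fm[i] for w, fm in zip(norm, feature_maps)) for i in range(size)]


# Toy three-frame "video" with two values per frame.
video = [[0.0, 0.0], [1.0, 0.0], [1.0, 1.0]]
motion = build_motion_map(video)                                   # [0.75, 0.5]
fused = linear_weighted_fusion([motion, video[-1]], [3.0, 1.0])    # [0.8125, 0.625]
print(motion, fused)
```

In the framework the abstract describes, the fused spatio-temporal features would then be passed to an LSTM encoder-decoder for prediction; that stage is omitted here because the abstract does not specify its configuration.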

Suggested Citation

  • Sheeraz Arif & Jing Wang & Tehseen Ul Hassan & Zesong Fei, 2019. "3D-CNN-Based Fused Feature Maps with LSTM Applied to Action Recognition," Future Internet, MDPI, vol. 11(2), pages 1-17, February.
  • Handle: RePEc:gam:jftint:v:11:y:2019:i:2:p:42-:d:205468

    Download full text from publisher

    File URL: https://www.mdpi.com/1999-5903/11/2/42/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1999-5903/11/2/42/
    Download Restriction: no
