IDEAS home Printed from https://ideas.repec.org/a/hin/jnlmpe/1440323.html
   My bibliography  Save this article

Transformer-Based Data-Driven Video Coding Acceleration for Industrial Applications

Author

Listed:
  • Yixiao Li
  • Lixiang Li
  • Zirui Zhuang
  • Yuan Fang
  • Haipeng Peng
  • Nam Ling
  • Ardashir Mohammadzadeh

Abstract

With the exploding development of edge intelligence and smart industry, deep learning-based intelligent industrial solutions are promptly applied in the manufacturing process. Many intelligent industrial solutions such as automatic manufacturing inspection are computer vision based and require fast and efficient video encoding techniques so that video streams can be processed as quickly as possible either at the edge cluster or over the cloud. As one of the most popular video coding standards, the high efficiency video coding (HEVC) standard has been applied to various industrial scenes. However, HEVC brings not only a higher compression rate but also a significant increase in encoding complexity, which hinders its practical application in industrial scenarios. Fortunately, a large amount of video coding data makes it possible to accelerate the encoding process in the industry. To speed up the video coding process in some industrial scenes, this paper proposes a data-driven fast approach for coding tree unit (CTU) partitioning in HEVC intracoding. First, we propose a method to represent the partition result of a CTU as a column vector of length 21. Then, we employ lots of encoding data produced in normal industry scenes to train transformer models used to predict the partitioning vector of the CTU. Finally, the final partitioning structure of the CTU is generated from the partitioning vector after a postprocessing operation and used by an industrial encoder. Compared with the original HEVC encoder used by some industrial applications, experiment results show that our approach achieves 58.77% encoding time reduction with 3.9% bit rate loss, which indicates that our data-driven approach for video coding has great capacity working in industrial applications.

Suggested Citation

  • Yixiao Li & Lixiang Li & Zirui Zhuang & Yuan Fang & Haipeng Peng & Nam Ling & Ardashir Mohammadzadeh, 2022. "Transformer-Based Data-Driven Video Coding Acceleration for Industrial Applications," Mathematical Problems in Engineering, Hindawi, vol. 2022, pages 1-11, September.
  • Handle: RePEc:hin:jnlmpe:1440323
    DOI: 10.1155/2022/1440323
    as

    Download full text from publisher

    File URL: http://downloads.hindawi.com/journals/mpe/2022/1440323.pdf
    Download Restriction: no

    File URL: http://downloads.hindawi.com/journals/mpe/2022/1440323.xml
    Download Restriction: no

    File URL: https://libkey.io/10.1155/2022/1440323?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:hin:jnlmpe:1440323. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Mohamed Abdelhakeem (email available below). General contact details of provider: https://www.hindawi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.