Author
Listed:
- Wu Wen
(School of Communication and Information Engineering, Chongqing University of Posts and Telecommunications, Chongqing 400065, China)
- Tianhao Li
(School of Communication and Information Engineering, Chongqing University of Posts and Telecommunications, Chongqing 400065, China)
- Amr Tolba
(Computer Science Department, Community College, King Saud University, Riyadh 11437, Saudi Arabia)
- Ziyi Liu
(School of Communication and Information Engineering, Chongqing University of Posts and Telecommunications, Chongqing 400065, China)
- Kai Shao
(School of Software, Dalian University of Technology, Dalian 116024, China)
Abstract
The rapid advancement of Wise Information Technology of med (WITMED) has made the integration of traditional Chinese medicine tongue diagnosis and computer technology an increasingly significant area of research. The doctor obtains patient’s tongue images to make a further diagnosis. However, the tongue image may be broken during the process of collecting the tongue image. Due to the extremely complex texture of the tongue and significant individual differences, existing methods fail to fully obtain sufficient feature information, which result in inaccurate inpainted tongue images. To address this problem, we propose a recurrent tongue image inpainting algorithm based on multi-scale feature fusion called Multi-Scale Fusion Module and Recurrent Attention Mechanism Network (MSFM-RAM-Net). We first propose Multi-Scale Fusion Module (MSFM), which preserves the feature information of tongue images at different scales and enhances the consistency between structures. To simultaneously accelerate the inpainting process and enhance the quality of the inpainted results, Recurrent Attention Mechanism (RAM) is proposed. RAM focuses the network’s attention on important areas and uses known information to gradually inpaint image, which can avoid redundant feature information and the problem of texture confusion caused by large missing areas. Finally, we establish a tongue image dataset and use this dataset to qualitatively and quantitatively evaluate the MSFM-RAM-Net. The results shows that the MSFM-RAM-Net has a better effect on tongue image inpainting, with PSNR and SSIM increasing by 2.1% and 3.3%, respectively.
Suggested Citation
Wu Wen & Tianhao Li & Amr Tolba & Ziyi Liu & Kai Shao, 2023.
"Progressively Multi-Scale Feature Fusion for Image Inpainting,"
Mathematics, MDPI, vol. 11(24), pages 1-20, December.
Handle:
RePEc:gam:jmathe:v:11:y:2023:i:24:p:4908-:d:1296888
Download full text from publisher
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:11:y:2023:i:24:p:4908-:d:1296888. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.