IDEAS home Printed from https://ideas.repec.org/a/spr/aodasc/v11y2024i5d10.1007_s40745-024-00531-6.html
   My bibliography  Save this article

Unified Image Harmonization with Region Augmented Attention Normalization

Author

Listed:
  • Junjie Hou

    (University of Chinese Academy of Sciences
    Chinese Academy of Sciences
    Chinese Academy of Sciences)

  • Yuqi Zhang

    (University of Chinese Academy of Sciences
    Chinese Academy of Sciences
    Chinese Academy of Sciences)

  • Duo Su

    (University of Chinese Academy of Sciences
    Chinese Academy of Sciences
    Chinese Academy of Sciences)

Abstract

The image harmonization task endeavors to adjust foreground information within an image synthesis process to achieve visual consistency by leveraging background information. In academic research, this task conventionally involves the utilization of simple synthesized images and matching masks as inputs. However, obtaining precise masks for image harmonization in practical applications poses a significant challenge, thereby creating a notable disparity between research findings and real-world applicability. To mitigate this disparity, we propose a redefinition of the image harmonization task as “Unified Image Harmonization,” where the input comprises only a single image, thereby enhancing its applicability in real-world scenarios. To address this challenge, we have developed a novel framework. Within this framework, we initially employ inharmonious region localization to detect the mask, which is subsequently utilized for harmonization tasks. The pivotal aspect of the harmonization process lies in normalization, which is accountable for information transfer. Nonetheless, the current background-to-foreground information transfer and guidance mechanisms are limited by single-layer guidance, thereby constraining their effectiveness. To overcome this limitation, we introduce Region Augmented Attention Normalization (RA2N), which enhances the attention mechanism for foreground feature alignment, consequently leading to improved alignment and transfer capabilities. Through qualitative and quantitative comparisons on the iHarmony4 dataset, our model exhibits exceptional performance not only in unified image harmonization but also in conventional image harmonization tasks.

Suggested Citation

  • Junjie Hou & Yuqi Zhang & Duo Su, 2024. "Unified Image Harmonization with Region Augmented Attention Normalization," Annals of Data Science, Springer, vol. 11(5), pages 1865-1886, October.
  • Handle: RePEc:spr:aodasc:v:11:y:2024:i:5:d:10.1007_s40745-024-00531-6
    DOI: 10.1007/s40745-024-00531-6
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s40745-024-00531-6
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s40745-024-00531-6?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. James M. Tien, 2017. "Internet of Things, Real-Time Decision Making, and Artificial Intelligence," Annals of Data Science, Springer, vol. 4(2), pages 149-178, June.
    2. Fadi Thabtah & Li Zhang & Neda Abdelhamid, 2019. "NBA Game Result Prediction Using Feature Analysis and Machine Learning," Annals of Data Science, Springer, vol. 6(1), pages 103-116, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Shah Hussain & Muhammad Qasim Khan, 2023. "Student-Performulator: Predicting Students’ Academic Performance at Secondary and Intermediate Level Using Machine Learning," Annals of Data Science, Springer, vol. 10(3), pages 637-655, June.
    2. Manoj Verma & Harish Kumar Ghritlahre, 2023. "Forecasting of Wind Speed by Using Three Different Techniques of Prediction Models," Annals of Data Science, Springer, vol. 10(3), pages 679-711, June.
    3. Durgesh Samariya & Amit Thakkar, 2023. "A Comprehensive Survey of Anomaly Detection Algorithms," Annals of Data Science, Springer, vol. 10(3), pages 829-850, June.
    4. Aidin Zehtab-Salmasi & Ali-Reza Feizi-Derakhshi & Narjes Nikzad-Khasmakhi & Meysam Asgari-Chenaghlu & Saeideh Nabipour, 2023. "Multimodal Price Prediction," Annals of Data Science, Springer, vol. 10(3), pages 619-635, June.
    5. Anthony Gramaje & Fadi Thabtah & Neda Abdelhamid & Sayan Kumar Ray, 2021. "Patient Discharge Classification Using Machine Learning Techniques," Annals of Data Science, Springer, vol. 8(4), pages 755-767, December.
    6. Heba Soltan Mohamed & M. Masoom Ali & Haitham M. Yousof, 2023. "The Lindley Gompertz Model for Estimating the Survival Rates: Properties and Applications in Insurance," Annals of Data Science, Springer, vol. 10(5), pages 1199-1216, October.
    7. Patrick Osatohanmwen & Eferhonore Efe-Eyefia & Francis O. Oyegue & Joseph E. Osemwenkhae & Sunday M. Ogbonmwan & Benson A. Afere, 2022. "The Exponentiated Gumbel–Weibull {Logistic} Distribution with Application to Nigeria’s COVID-19 Infections Data," Annals of Data Science, Springer, vol. 9(5), pages 909-943, October.
    8. Petar Radanliev & David Roure & Rob Walton & Max Kleek & Omar Santos & La’Treall Maddox, 2022. "What Country, University, or Research Institute, Performed the Best on Covid-19 During the First Wave of the Pandemic?," Annals of Data Science, Springer, vol. 9(5), pages 1049-1067, October.
    9. Roberto Moro-Visconti & Salvador Cruz Rambaud & Joaquín López Pascual, 2023. "Artificial intelligence-driven scalability and its impact on the sustainability and valuation of traditional firms," Palgrave Communications, Palgrave Macmillan, vol. 10(1), pages 1-14, December.
    10. Anjan Mukherjee & Abhik Mukherjee, 2022. "Interval-Valued Intuitionistic Fuzzy Soft Rough Approximation Operators and Their Applications in Decision Making Problem," Annals of Data Science, Springer, vol. 9(3), pages 611-625, June.
    11. Mansoureh Beheshti Nejad & Seyed Mahmoud Zanjirchi & Seyed Mojtaba Hosseini Bamakan & Negar Jalilian, 2024. "Blockchain Adoption in Operations Management: A Systematic Literature Review of 14 Years of Research," Annals of Data Science, Springer, vol. 11(4), pages 1361-1389, August.
    12. M. Sridharan, 2023. "Generalized Regression Neural Network Model Based Estimation of Global Solar Energy Using Meteorological Parameters," Annals of Data Science, Springer, vol. 10(4), pages 1107-1125, August.
    13. Guangrui Tang & Neng Fan, 2022. "A Survey of Solution Path Algorithms for Regression and Classification Models," Annals of Data Science, Springer, vol. 9(4), pages 749-789, August.
    14. Amaal Elsayed Mubarak & Ehab Mohamed Almetwally, 2024. "Modelling and Forecasting of Covid-19 Using Periodical ARIMA Models," Annals of Data Science, Springer, vol. 11(4), pages 1483-1502, August.
    15. Xueyan Xu & Fusheng Yu & Runjun Wan, 2023. "A Determining Degree-Based Method for Classification Problems with Interval-Valued Attributes," Annals of Data Science, Springer, vol. 10(2), pages 393-413, April.
    16. Qinghua Zheng & Chutong Yang & Haijun Yang & Jianhe Zhou, 2020. "A Fast Exact Algorithm for Deployment of Sensor Nodes for Internet of Things," Information Systems Frontiers, Springer, vol. 22(4), pages 829-842, August.
    17. Prashant Singh & Prashant Verma & Nikhil Singh, 2022. "Offline Signature Verification: An Application of GLCM Features in Machine Learning," Annals of Data Science, Springer, vol. 9(6), pages 1309-1321, December.
    18. Terence D. Agbeyegbe, 2023. "The Link Between Output Growth and Output Growth Volatility: Barbados," Annals of Data Science, Springer, vol. 10(3), pages 787-804, June.
    19. Ali Najafi & Araz Gholipour-Shilabin & Rahim Dehkharghani & Ali Mohammadpur-Fard & Meysam Asgari-Chenaghlu, 2023. "ComStreamClust: a Communicative Multi-Agent Approach to Text Clustering in Streaming Data," Annals of Data Science, Springer, vol. 10(6), pages 1583-1605, December.
    20. A. R. Sherwani & Q. M. Ali, 2023. "Parametric Classification using Fuzzy Approach for Handling the Problem of Mixed Pixels in Ground Truth Data for a Satellite Image," Annals of Data Science, Springer, vol. 10(6), pages 1459-1472, December.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:aodasc:v:11:y:2024:i:5:d:10.1007_s40745-024-00531-6. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.