Figure 1.

Figure 2.

Figure 3.

Comparison of image inpainting methods
| Feature | Progressive Image Inpainting | CNN | Transformer | Diffusion Models |
|---|---|---|---|---|
| Core Principles | Multi-stage processing (e.g., structure recovery followed by detail refinement, as in EdgeConnect's edge-prediction-and-filling stages) | Local feature extraction via convolutional kernels (e.g., Partial Conv's mask-aware convolution) | Global dependency modeling via self-attention (e.g., MAT's long-range reasoning) | Iterative denoising process for image generation (e.g., RePaint's stepwise restoration) |
| Key Strengths | High structural integrity | Strong local feature extraction | Robust global semantics | Highest generation quality |
| Key Weaknesses | High computational complexity | Limited receptive field | High resource consumption | Slow inference |
| Typical Use Cases | Complex structural restoration (e.g., artifact crack repair) | Small-area fast restoration (e.g., watermark removal from phone photos) | Large-area semantic restoration (e.g., street view occlusion removal) | High-fidelity detail generation (e.g., medical image super-resolution) |
| Computational Efficiency | Moderate (requires multiple forward passes) | High (parallelizable computations) | Low (quadratic attention complexity) | Very Low (hundreds of denoising steps) |
| Training Data Needs | Moderate (requires structural annotations like edge maps) | Moderate (millions of images) | Very High (billion-scale pretraining) | Very High (massive high-quality datasets) |
| Representative Methods | EdgeConnect, RFR-Net | Partial Conv, DeepFill | MAT, SwinIR | RePaint, DiffBIR |
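To make the CNN column's "mask-aware convolution" concrete, the following is a minimal single-channel NumPy sketch of the partial-convolution idea: only valid (unmasked) pixels contribute to each output, the result is renormalized by the fraction of valid pixels in the window, and the mask is updated for the next layer. The function name `partial_conv2d` and the plain-loop formulation are illustrative assumptions, not the original library implementation.

```python
import numpy as np

def partial_conv2d(image, mask, kernel):
    """Mask-aware (partial) convolution sketch.

    image  : 2-D array of pixel values
    mask   : 2-D array, 1 = valid pixel, 0 = hole
    kernel : 2-D convolution kernel

    Returns (output, updated_mask). Outputs over windows with at least
    one valid pixel are renormalized by (kernel area / valid count);
    windows with no valid pixels stay zero and remain masked.
    """
    kh, kw = kernel.shape
    h, w = image.shape
    out = np.zeros((h - kh + 1, w - kw + 1))
    new_mask = np.zeros_like(out)
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            patch = image[i:i + kh, j:j + kw]
            mpatch = mask[i:i + kh, j:j + kw]
            valid = mpatch.sum()
            if valid > 0:
                # Renormalize so partially masked windows are not dimmed.
                out[i, j] = (patch * mpatch * kernel).sum() * (kh * kw / valid)
                new_mask[i, j] = 1.0  # window now considered filled
    return out, new_mask
```

With an all-ones image and a small hole in the mask, the renormalization keeps the output at the same level as in fully valid regions, which is exactly why partial convolutions avoid the darkening artifacts of naive zero-filled convolution near hole boundaries.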