Have a personal or library account? Click to login
Image Super-Resolution Reconstruction Method Based on Improved Generative Adversarial Network Cover

Image Super-Resolution Reconstruction Method Based on Improved Generative Adversarial Network

Open Access
|Sep 2025

Figures & Tables

Figure 1.

Replace the original Resblock with the RRDB structure
Replace the original Resblock with the RRDB structure

Figure 2.

RRDB structure
RRDB structure

Figure 3.

Representative feature maps before and after activation
Representative feature maps before and after activation

Figure 4.

The architecture diagram of ViT
The architecture diagram of ViT

Figure 5.

Comparison of CNN's local receptive field (left) and ViT's global attention weight distribution (right).
Comparison of CNN's local receptive field (left) and ViT's global attention weight distribution (right).

Figure 6.

The improved generator network structure
The improved generator network structure

Figure 7.

The improved generator network structure
The improved generator network structure

Figure 8.

Comparison of 4x upsampling experimental results for img062.png in the Urban100 dataset.
Comparison of 4x upsampling experimental results for img062.png in the Urban100 dataset.

PARAMETER CONFIGURATION OF THE VIT-BASE MODULE

Module ComponentKey ParametersMain Function
Patch EmbeddingInput Size:64×64×64Divide the feature map into 64 patches to reduce computational complexity.
Patch Size:8×8
Output Dimension:768
Multi-Head AttentionNumber of Heads:12Establish global correlations to enhance complex feature modeling.
Dimension per Head:64
Feed-Forward NetworkMLP Structure:768→3072→768Perform nonlinear transformations of features to stabilize training in conjunction with LayerNorm.
Activation Function:GeLU
Residual ConnectionApplication Position:After each MSA/FFN sublayerPrevent gradient vanishing and accelerate convergence.
Stacked StructureNumber of Encoder Layers:12Construct a deep feature transformer to improve global context modeling capabilities.
Total Parameters:86M

COMPARISON OF PSNR (DB) / SSIM FOR DIFFERENT METHODS ON BENCHMARK DATASETS UNDER ×2 AND ×4 SUPER-RESOLUTION

MultipleModelSet5PSNR (dB)/SSIMSet14PSNR (dB)/SSIMBSD100PSNR (dB)/SSIMUrban100PSNR (dB)/SSIM
×2SRCNN36.66/0.954232.42/0.906331.36/0.891829.50/0.8946
EDSR38.11/0.960333.92/0.918032.46/0.901532.93/0.9355
RCAN38.31/0.961434.15/0.920932.63/0.902733.34/0.9410
HAN38.34/0.961834.18/0.921432.68/0.903233.41/0.9422
NLSA38.35 / 0.962034.20 / 0.921732.69 / 0.903633.48 / 0.9430
SwinIR38.40 / 0.962434.25 / 0.922132.71 / 0.904033.57 / 0.9438
Our38.51 / 0.96334.40 / 0.923532.85 / 0.905433.95 / 0.9465
×4SRCNN30.49 / 0.863027.50 / 0.751026.91 / 0.710524.54 / 0.7260
EDSR32.65 / 0.900028.95 / 0.791827.80 / 0.744926.98 / 0.8085
RCAN32.78 / 0.900429.06 / 0.793227.89 / 0.746827.14 / 0.8125
HAN32.80 / 0.900829.08 / 0.793927.91 / 0.747327.24 / 0.8140
NLSA32.82 / 0.901029.10 / 0.794227.93 / 0.748027.31 / 0.8153
SwinIR32.88 / 0.901429.14 / 0.794827.96 / 0.748827.45 / 0.8165
Our33.00 / 0.902529.28 / 0.796228.10 / 0.750527.73 / 0.8200
Language: English
Page range: 11 - 19
Published on: Sep 30, 2025
Published by: Xi’an Technological University
In partnership with: Paradigm Publishing Services
Publication frequency: 4 issues per year

© 2025 Feng Xiong, Jun Yu, Zhiyi Hu, Yu Li, Chaoyi Dong, Hongpei Zhang, published by Xi’an Technological University
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.