Single-image generation models produce high-quality and diverse images by learning the internal distribution of patches within a single image, which mitigates data scarcity and has attracted increasing attention. However, existing methods perform poorly on images with strong global structure, such as animal images. To address this issue, we propose Semantic fusion and Structure-guided global generation from a Single image with Diffusion models (S3Diff). Specifically, during training, a semantic extractor extracts high-level semantic features from the training image, and the proposed semantic fusion block fuses these semantic features with the image features, enhancing the model's understanding of image semantics and improving the quality of the generated images. During sampling, we apply a manifold constrained gradient based on image structure that steers the generation path back toward the manifold of the original image, preserving a reasonable global structure. Extensive experiments on public datasets, including a thorough exploration of hyperparameters, ablations of the key design choices, and quantitative and qualitative comparisons against baseline methods, show that our method preserves reasonable semantic and structural relationships and generates high-quality, diverse images, significantly improving the model's global generation capability.
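To make the two mechanisms named above concrete, the sketch below gives one plausible PyTorch realization: a semantic fusion block implemented as cross-attention from image features to semantic tokens, and a single manifold-constrained-gradient correction applied at a sampling step. This is a minimal illustration, not the authors' released code; the module names, the `denoiser(x_t, t)` signature, and the structure function `structure_fn` are all assumptions.

```python
# Hypothetical sketch of S3Diff's two components; names and signatures
# are illustrative assumptions, not the paper's actual implementation.
import torch
import torch.nn as nn
import torch.nn.functional as F

class SemanticFusionBlock(nn.Module):
    """Fuses high-level semantic features into image features.
    Here realized as cross-attention with a residual connection;
    the paper's exact block design may differ."""
    def __init__(self, img_dim, sem_dim, num_heads=4):
        super().__init__()
        self.norm = nn.LayerNorm(img_dim)
        self.attn = nn.MultiheadAttention(
            embed_dim=img_dim, num_heads=num_heads,
            kdim=sem_dim, vdim=sem_dim, batch_first=True)

    def forward(self, img_feat, sem_feat):
        # img_feat: (B, N, img_dim) flattened patch features
        # sem_feat: (B, M, sem_dim) tokens from the semantic extractor
        fused, _ = self.attn(self.norm(img_feat), sem_feat, sem_feat)
        return img_feat + fused  # residual keeps the original features

def mcg_step(x_t, t, denoiser, alpha_bar_t,
             structure_ref, structure_fn, scale=1.0):
    """One manifold constrained gradient correction during sampling:
    nudge x_t along the negative gradient of a structural loss between
    the predicted clean image and the original image's structure, so
    the trajectory regresses toward the original image's manifold."""
    x_t = x_t.detach().requires_grad_(True)
    eps = denoiser(x_t, t)  # assumed noise-prediction network
    # Standard DDPM posterior-mean estimate of the clean image x0
    x0_hat = (x_t - (1 - alpha_bar_t) ** 0.5 * eps) / alpha_bar_t ** 0.5
    # structure_fn maps an image to its structure representation
    # (e.g. an edge map); its exact form is an assumption here
    loss = F.mse_loss(structure_fn(x0_hat), structure_ref)
    grad = torch.autograd.grad(loss, x_t)[0]
    return (x_t - scale * grad).detach()
```

In this reading, the fusion block conditions the denoiser on semantics during training, while `mcg_step` is interleaved with the usual reverse-diffusion updates at inference time; `scale` trades structural fidelity against sample diversity.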
© 2025 Xianjie Zhang, Yusen Zhang, Yujie He, Min Li, published by SAN University
This work is licensed under the Creative Commons Attribution 4.0 License.