gltamiosso@inf.ufrgs.br
cbmuller@inf.ufrgs.br
lsbombana@inf.ufrgs.br
oliveira@inf.ufrgs.br
Computers & Graphics.
Volume 132 (2025), Article 104389, pp. 1-10. DOI: 10.1016/j.cag.2025.104389
Diffusion models are powerful tools for image synthesis and editing, yet preserving structural content from a guidance image remains challenging. Filter-Guided Diffusion (FGD) tackles this by applying edge-preserving filtering at each denoising step. However, the original FGD relies on joint bilateral filtering, which incurs high VRAM and computational costs, limiting its scalability to high-resolution images. We propose Domain Transform Filter-Guided Diffusion (DT-FGD), a lightweight variant that replaces bilateral filtering with the efficient domain transform filter and introduces a normalization strategy for the guidance image’s latent representation. DT-FGD achieves significantly lower VRAM usage and faster inference while improving structural consistency. Our method produces images that better align with the text prompt and vary smoothly under filter parameter changes, leading to more predictable outcomes. Experiments show that DT-FGD reduces VRAM consumption by over 50%, accelerates inference, and, unlike prior approaches, scales to high resolutions on a single GPU. We further present a variant that offers even greater memory savings at the cost of additional inference time. DT-FGD enables structure-preserving diffusion on resource-constrained hardware and opens new directions for high-resolution, controllable image synthesis.
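To give a feel for the kind of edge-preserving filter the abstract refers to, the following is a minimal sketch of a recursive domain transform filter applied jointly to a 1D signal, with edges taken from a separate guide signal. It is an illustration only and not the DT-FGD implementation: the function name, parameter names (`sigma_s`, `sigma_r`, `iterations`), and default values are assumptions, and the paper's latent normalization strategy is not shown.

```python
import math

def domain_transform_rf_1d(signal, guide, sigma_s=8.0, sigma_r=0.1, iterations=3):
    """Smooth `signal` while preserving the edges found in `guide` (illustrative sketch)."""
    n = len(signal)
    if n != len(guide):
        raise ValueError("signal and guide must have the same length")

    # Distance between neighbouring samples in the transformed domain:
    # large jumps in the guide push samples far apart, which blocks smoothing.
    dist = [1.0 + (sigma_s / sigma_r) * abs(guide[k] - guide[k - 1]) for k in range(1, n)]

    out = [float(v) for v in signal]
    for i in range(iterations):
        # Per-iteration standard deviation, halved each pass so the total matches sigma_s.
        sigma_h = (sigma_s * math.sqrt(3.0) * 2.0 ** (iterations - i - 1)
                   / math.sqrt(4.0 ** iterations - 1.0))
        a = math.exp(-math.sqrt(2.0) / sigma_h)   # feedback coefficient
        w = [a ** d for d in dist]                # w[k] links samples k and k + 1

        # Left-to-right recursive pass.
        for k in range(1, n):
            out[k] += w[k - 1] * (out[k - 1] - out[k])
        # Right-to-left recursive pass, making the response symmetric.
        for k in range(n - 2, -1, -1):
            out[k] += w[k] * (out[k + 1] - out[k])
    return out

# Usage: a noisy step filtered with a clean step as the guide.
guide = [0.0] * 8 + [1.0] * 8
noisy = [g + (0.05 if k % 2 == 0 else -0.05) for k, g in enumerate(guide)]
smoothed = domain_transform_rf_1d(noisy, guide)
```

Each pass needs only one weight per neighbouring pair and updates the signal in place, which suggests why a filter of this family can be lighter on memory than a bilateral filter that compares every sample against a whole neighbourhood.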
Diffusion Models; Structure Guidance; Domain Transform Filter; Edge-preserving Filtering; Image Synthesis.
Gustavo L. Tamiosso, Caetano B. Müller, Lucas S. Bombana, Manuel M. Oliveira. Memory-Efficient Filter-Guided Diffusion with Domain Transform Filtering, Computers & Graphics, Volume 132 (2025) 104389.
@article{TamiossoEtAl2025DT-FGD,
author = {Gustavo L. Tamiosso and Caetano B. Müller and Lucas S. Bombana and Manuel M. Oliveira},
title = {Memory-Efficient Filter-Guided Diffusion with Domain Transform Filtering},
journal = {Computers \& Graphics},
volume = {132},
number = {104389},
DOI = {10.1016/j.cag.2025.104389},
ISSN = {0097-8493},
pages = {1--10},
year = {2025}
}
This work was sponsored by CNPq-Brazil fellowship 305474/2022-7 and CAPES, Brazil Finance Code 001.