floyo logobeta logo
Powered by
ThinkDiffusion
floyo logobeta logo
Powered by
ThinkDiffusion

sd-perturbed-attention

263

Last updated
2025-06-24

Various guidance techniques have been implemented for ComfyUI and SD WebUI (reForge), including Perturbed-Attention Guidance (PAG), Smoothed Energy Guidance (SEG), Sliding Window Guidance (SWG), PLADIS, Normalized Attention Guidance (NAG), and Token Perturbation Guidance (TPG). These tools enhance the image generation process by providing advanced methods for guiding diffusion models, thereby improving the quality and control of generated outputs.

  • Supports multiple guidance techniques tailored for different diffusion model frameworks.
  • Offers adjustable parameters allowing users to fine-tune the guidance effects on image generation.
  • Compatible with both SD1.5 and SDXL models, ensuring broad usability across different versions of Stable Diffusion.

Context

This repository provides a collection of advanced guidance implementations designed to work specifically with ComfyUI and SD WebUI (reForge). The primary aim is to enhance the performance of diffusion models by leveraging various guidance strategies, which help in achieving more coherent and visually appealing images during the generation process.

Key Features & Benefits

The tool includes several unique guidance methods such as Perturbed-Attention Guidance and Smoothed Energy Guidance, which allow users to manipulate the generation process effectively. By adjusting parameters like guidance scale and adaptive scaling, users can control the structural integrity and clarity of the generated images, leading to higher quality outputs.

Advanced Functionalities

The repository features specialized capabilities like the ability to apply guidance selectively to specific U-Net layers, allowing for a more nuanced control over the image generation process. Users can specify blocks and layers within the U-Net architecture, tailoring the guidance application to optimize performance based on their specific needs.

Practical Benefits

By integrating these guidance techniques into ComfyUI, users gain improved workflow efficiency and enhanced control over the image generation process. The ability to fine-tune various parameters not only boosts the quality of the generated images but also allows for a more streamlined creative process, reducing the time spent on adjustments and iterations.

Credits/Acknowledgments

The project is developed by contributors to the ComfyUI community and is based on various academic papers that detail the guidance methodologies. The repository is open-source, allowing for collaboration and further development by the community.