floyo logo
Powered by
ThinkDiffusion
⚡️Nano Banana 2 ⚡️ just landed. Start creating now.
floyo logo
Powered by
ThinkDiffusion
⚡️Nano Banana 2 ⚡️ just landed. Start creating now.

Wan 2.1 Text2Image

Created by @yanokusnir on Reddit, please support the original creator! https://www.reddit.com/r/StableDiffusion/comments/1lu7nxx/wan_21_txt2img_is_amazing/ If this is your workflow, please contact us at team@floyo.ai to claim it! Original post from the creator: Hello. This may not be news to some of you, but Wan 2.1 can generate beautiful cinematic images. I was wondering how Wan would work if I generated only one frame, so to use it as a txt2img model. I am honestly shocked by the results. All the attached images were generated in fullHD (1920x1080px) and on my RTX 4080 graphics card (16GB VRAM) it took about 42s per image. I used the GGUF model Q5_K_S, but I also tried Q3_K_S and the quality was still great. The only postprocessing I did was adding film grain. It adds the right vibe to the images and it wouldn't be as good without it. Last thing: For the first 5 images I used sampler euler with beta scheluder - the images are beautiful with vibrant colors. For the last three I used ddim_uniform as the scheluder and as you can see they are different, but I like the look even though it is not as striking. :) Enjoy.

287

Generates in about 2 mins 39 secs

Nodes & Models

EmptyHunyuanLatentVideo
VAELoader
wan_2.1_vae.safetensors
Label (rgthree)
MarkdownNote
Note
LoraLoader
Wan21_T2V_14B_lightx2v_cfg_step_distill_lora_rank32.safetensors
CLIPTextEncode
ModelSamplingSD3
KSampler
VAEDecode
SaveImage
UnetLoaderGGUF
CLIPLoaderGGUF
UnetLoaderGGUF
CLIPLoaderGGUF
UnetLoaderGGUF
CLIPLoaderGGUF
PathchSageAttentionKJ
ModelPatchTorchSettings
WanVideoNAG
WanVideoNAG
FastFilmGrain
easy cleanGpuUsed
easy clearCacheAll

Created by @yanokusnir on Reddit, please support the original creator!

https://www.reddit.com/r/StableDiffusion/comments/1lu7nxx/wan_21_txt2img_is_amazing/

If this is your workflow, please contact us at team@floyo.ai to claim it!

Original post from the creator:

Hello. This may not be news to some of you, but Wan 2.1 can generate beautiful cinematic images.

I was wondering how Wan would work if I generated only one frame, so to use it as a txt2img model. I am honestly shocked by the results.

All the attached images were generated in fullHD (1920x1080px) and on my RTX 4080 graphics card (16GB VRAM) it took about 42s per image. I used the GGUF model Q5_K_S, but I also tried Q3_K_S and the quality was still great.

The only postprocessing I did was adding film grain. It adds the right vibe to the images and it wouldn't be as good without it.

Last thing: For the first 5 images I used sampler euler with beta scheluder - the images are beautiful with vibrant colors. For the last three I used ddim_uniform as the scheluder and as you can see they are different, but I like the look even though it is not as striking. :) Enjoy.

Read more

N