ThinkDiffusion

Product

Pricing

Enterprise

Docs

ThinkDiffusion

Flux 2 Text-to-Image Generation

Flux2

Marketing

Photography

Text2Image

969

Flux 2 is Black Forest Labs' latest image generation model, and it's a huge step up from Flux 1. If you've been working with AI image tools, you'll notice the difference immediately. The text actually renders properly now, you can use multiple reference images without everything falling apart, and prompts work the way you'd expect them to.

Run Flux 2 directly on Floyo - no installation, no local GPU required. Generate images in your browser in seconds.

Flux.2 Pro

Best for: production workflows and fast turnaround.
Designed to balance high image quality with lower cost and faster generation.
Supports text-to-image and image editing up to 4MP output.
Multi-reference: up to 8 reference images via API (9MP total limit).

Flux.2 Flex

Best for: maximum quality and fine control over the image.
Higher latency and higher price per megapixel than Pro, in exchange for better detail and control.
Supports text-to-image and image editing up to 4MP output, same resolution rules as Pro.
Multi-reference: up to 10 reference images via API.

The Real Improvements

Text that actually works

This was a major pain point in Flux 1. It could barely handle a sentence without garbling half the letters. Flux 2 can now generate complex typography, infographics, even UI mockups with legible fine text. Results are reliable enough for production use - things like coffee shop menus, posters, or social media graphics come out with clean text on the first try in most cases.

Multiple reference images

Flux 2 accepts up to 10 reference images at once. The character consistency is noticeably better. If you're trying to keep a person or product looking the same across multiple generations, this actually delivers consistent results now. Great for maintaining brand assets across different compositions without the usual drift from text descriptions alone.

Better at following instructions

The prompt adherence got a real boost. When you write a structured prompt with multiple elements (say, "person in foreground, mountain in background, golden hour lighting"), Flux 2 is more likely to give you all those things in the right places. Less trial and error, fewer completely off-base results.

Higher resolution editing

Flux 2 handles image editing at up to 4 megapixels. That's high enough for most practical applications. The detail preservation during edits is better too. There's less quality loss when modifying an existing image.

The Model Variants

Flux 2 [dev] - The open-weight model at 32B parameters. Available through various API providers and can be run locally. This is what most developers use since it offers full control and supports LoRA fine-tuning. There's an optimized fp8 version that runs on consumer GPUs like RTX cards.

Flux 2 [flex] - Provides parameter control (steps, guidance scale) for balancing speed vs quality. More steps = better typography and detail, fewer steps = faster generation.

Flux 2 [pro] - The managed API version with the highest quality output. More expensive but generates faster than self-hosted [dev] in most scenarios.

Flux 2 [klein] - Coming soon. Smaller, Apache 2.0 licensed model distilled from the base model.

How It Actually Works

Flux 2 uses a latent flow matching architecture that couples a Mistral-3 24B vision-language model with a rectified flow transformer. The VLM handles world knowledge and context, while the transformer manages spatial relationships and material properties.

Black Forest Labs also rebuilt the latent space from scratch with a new VAE, which is part of why image quality improved while keeping the model efficient.

Practical Use Cases

Generate product shots with proper lighting and realistic materials
Create infographics where the text is actually readable
Maintain character consistency across a series of images using multiple references
Edit existing images at high resolution without destroying details
Use JSON-structured prompts for precise control over complex scenes
Specify exact colors with HEX codes for brand consistency

When to Use Flux 2

The upgrade makes the most sense when:

Text rendering quality directly impacts your work (marketing materials, infographics, UI mockups)
Character or product consistency across multiple images is critical
You're doing production work where reliability matters more than experimentation
You need high-resolution image editing capabilities

For simple one-off image generation or casual experimentation, Flux 1 might still meet your needs. The improvements in Flux 2 really show up in repetitive production workflows where consistency and reliability are essential.

Generates in about 1 min 47 secs

floyoofficial

Nodes & Models

ComfyUI Official

KSamplerSelect

Note

RandomNoise

UNETLoader

FLUX2/flux2_dev_fp8mixed.safetensors

CLIPLoader

mistral_3_small_flux2_fp8.safetensors

VAELoader

flux2-vae.safetensors

LoadImage

EmptyFlux2LatentImage

Flux2Scheduler

CLIPTextEncode

ImageScaleToTotalPixels

FluxGuidance

VAEEncode

ReferenceLatent

BasicGuider

SamplerCustomAdvanced

VAEDecode

SaveImage

Best for: production workflows and fast turnaround.
Designed to balance high image quality with lower cost and faster generation.
Supports text-to-image and image editing up to 4MP output.
Multi-reference: up to 8 reference images via API (9MP total limit).