
COMMUNITY PAGE
Run Nano Banana on Floyo
Home / Model / Nano Banana on Floyo
AI IMAGE GENERATION
Run Nano Banana on Floyo
Google's Gemini-powered image generation and editing models. Nano Banana Pro delivers 4K output, 94%+ text accuracy, and character consistency for up to 5 people. Nano Banana 2 brings near-Pro quality at Flash speed.
Run Google's Nano Banana models through ComfyUI in your browser. No API key, no installs, no local GPU.
|
Max Resolution 4K (Pro) / 2K (Banana 2) |
Text Accuracy 94%+ (Pro) |
|
Character Consistency Up to 5 people |
Reference Inputs Up to 14 images |
| Try Nano Banana Now → | Browse All Models |
No installation. Runs in browser. Updated April 2026.









What you get?
Nano Banana is Google's family of Gemini-powered image generation and editing models. Nano Banana Pro (Gemini 3 Pro Image) delivers native 4K output, 94%+ text rendering accuracy, character consistency for up to 5 people, and up to 14 reference images per generation. Nano Banana 2 (Gemini 3.1 Flash Image) brings near-Pro quality at 3-5 second speeds. Both support multi-turn conversational editing, thinking mode for complex prompts, and SynthID watermarking. Available as ComfyUI API nodes on Floyo with 10+ ready-to-run workflows.
NANO BANANA WORKFLOWS ON FLOYO
Nano Banana Pro Text-to-Image: Gemini 3 Pro
Nano Banana 2 - Google's #1 Ranked Image Model
Nano Banana Pro Edit Image to Image
Nano Banana Pro for Multi Grid View of Product Ads
360 Degree Product Video Using Nano Banana Pro
Craft Stunning Edits Instantly with Nano Banana Edit
AI Influencer Video Maker (Nano Banana + Kling)
AI Influencer Ad Generator (Nano Banana + Wan 2.6)
What is Nano Banana?
Nano Banana is Google DeepMind's family of Gemini-powered image generation and editing models. There are three variants: Nano Banana (Gemini 2.5 Flash Image, August 2025), Nano Banana Pro (Gemini 3 Pro Image, November 2025), and Nano Banana 2 (Gemini 3.1 Flash Image, February 2026). Each is built on a different Gemini backbone, trading off speed, quality, and cost.
Nano Banana Pro is the quality tier. Built on Gemini 3 Pro, it uses the model's reasoning capabilities to decompose complex prompts before rendering. It generates native 4K images (up to 4096x4096) with 94%+ text rendering accuracy, maintains character consistency for up to 5 people, and handles up to 14 reference images in a single generation. Generation takes 8-12 seconds per image.
Nano Banana 2 is the speed tier. Built on Gemini 3.1 Flash, it generates in 3-5 seconds with quality that closes the gap with Pro. Max resolution is 2K. Text accuracy is around 85%. It makes the Pro features (world knowledge, text rendering, editing) accessible for rapid iteration workflows where speed matters more than final polish.
Both models support multi-turn conversational editing. You can generate an image, then ask the model to change the background, adjust lighting, add text, or swap elements, all in the same conversation. This is different from single-shot models where editing requires a separate pass.
On Floyo, Nano Banana runs through ComfyUI API nodes. The workflows cover text-to-image, image editing, multi-grid product photography, 360-degree product videos, AI influencer generation, and beauty/travel ad creation pipelines. You can chain Nano Banana with other models (Kling, Wan 2.6/2.7) in the same workflow for full production pipelines.
What are Nano Banana's technical specifications?
The Nano Banana family spans three Gemini-based models. Nano Banana Pro (Gemini 3 Pro Image) delivers native 4K at 94%+ text accuracy with 8-12 second generation. Nano Banana 2 (Gemini 3.1 Flash Image) delivers 2K at 85% text accuracy in 3-5 seconds. Both support thinking mode, multi-turn editing, character consistency, and web search grounding for real-time context.
| Spec | Details |
|---|---|
| Developer | Google DeepMind |
| Nano Banana Pro | Gemini 3 Pro Image (gemini-3-pro-image-preview) |
| Nano Banana 2 | Gemini 3.1 Flash Image (gemini-3.1-flash-image-preview) |
| Nano Banana (original) | Gemini 2.5 Flash Image (gemini-2.5-flash-image) |
| Max Resolution (Pro) | 4K (4096x4096) |
| Max Resolution (Banana 2) | 2K |
| Text Accuracy (Pro) | 94-96% |
| Text Accuracy (Banana 2) | ~85% |
| Character Consistency | Up to 5 people (95%+ resemblance) |
| Reference Images | Up to 14 inputs in a single generation |
| Generation Speed (Pro) | 8-12 seconds per image |
| Generation Speed (Banana 2) | 3-5 seconds per image |
| FID Score (Pro) | 12.4 (best in class) |
| Thinking Mode | Yes (reasoning chain for complex prompts) |
| Multi-Turn Editing | Yes (conversational image editing) |
| Web Search Grounding | Yes (real-time information in generated images) |
| Text Rendering Languages | Multilingual (English, Chinese, Japanese, Korean, and more) |
| Watermark | SynthID (imperceptible, on all outputs) |
| ComfyUI Access | API-based nodes on Floyo (10+ workflows) |
| Release Dates | Nano Banana: Aug 2025 / Pro: Nov 2025 / Banana 2: Feb 2026 |
What can you create with Nano Banana?
Nano Banana covers text-to-image generation, conversational image editing, product photography (multi-grid and 360-degree), character-consistent ad pipelines, infographics, mockups, diagrams, multilingual marketing assets, and multi-model production workflows. On Floyo, ready-to-run workflows combine Nano Banana with Kling and Wan 2.6 for full video ad pipelines.
| Capability | What It Does | Use Case |
|---|---|---|
| 4K Image Generation | Generate native 4K images (4096x4096) with photorealistic quality. Pro uses reasoning to plan composition before rendering. | Hero images, print materials, large-format displays |
| Text-in-Image | 94%+ accurate text rendering in multiple languages. Signs, labels, infographics, menus, and marketing copy render legibly. | Ad creatives, posters, social graphics, multilingual content |
| Character Consistency | Maintain visual resemblance of up to 5 people across unlimited generations. Works with reference images for identity locking. | AI influencer content, brand ambassadors, storytelling |
| Multi-Turn Editing | Generate an image, then edit it conversationally. Change backgrounds, adjust lighting, add or remove elements, swap styles. | Client revisions, iterative design, product mockups |
| Product Photography | Multi-grid product views and 360-degree product video generation. Create e-commerce assets from a single product image. | E-commerce listings, product catalogs, Amazon/Shopify stores |
| Ad Production Pipelines | Chain Nano Banana with Kling or Wan 2.6 video models. Generate character-consistent influencer images, then animate into video ads. | Social ads, beauty campaigns, travel content, UGC-style videos |
What are Nano Banana's key features?
Nano Banana's feature set is built around two ideas: Gemini's reasoning backbone makes image generation smarter (not just prettier), and conversational editing replaces the traditional generate-then-edit-in-Photoshop pipeline. Every feature benefits from the fact that the model understands what you're asking, not the keywords you use.
Thinking Mode
Nano Banana Pro uses Gemini 3 Pro's reasoning capabilities to decompose complex prompts before rendering. A prompt like "an infographic showing smartphone market share by region with a pie chart and bar graph side by side" gets planned structurally before any pixels are generated. This is why it handles layouts, diagrams, and data visualizations better than models that go straight from prompt to pixels.
4K Native Resolution
Nano Banana Pro generates at up to 4096x4096 natively. This is not upscaling. The model renders at 4K directly. Fine details, textures, and text are sharp at print resolution. Generation takes 8-12 seconds per 4K image. Nano Banana 2 caps at 2K but generates in 3-5 seconds.
94%+ Text Rendering Accuracy
Signs, labels, posters, menus, infographics, and marketing copy render legibly in multiple languages. This is a major differentiator: most image models garble text. Nano Banana Pro makes in-image text reliable enough for production use. Translation and localization of text within images is also supported.
Character Consistency (Up to 5 People)
Maintain facial resemblance and visual identity of up to 5 different people across unlimited generations. This is what makes the AI influencer and ad production workflows possible. Define your characters once, then generate them in any scene, outfit, or setting while keeping them recognizable.
Up to 14 Reference Images
Blend products, logos, characters, backgrounds, and style references into a single generation. Six high-fidelity reference slots plus up to fourteen standard inputs total. This enables complex compositions for advertising: combine a product photo, a brand style guide, a model reference, and a background scene in one prompt.
Web Search Grounding
The model pulls from Gemini's real-world knowledge and real-time web information. Ask for "the current Tesla Model Y in a mountain setting" and it uses current product knowledge to render the right model year. This extends to diagrams, data visualizations, and infographics that reference real-world information.
Professional Camera Controls
Control lighting, camera angle, depth of field, focus, and color grading through natural language. Describe the shot the way a photographer would: "golden hour backlighting with shallow depth of field and warm color grading." The model interprets these as physical parameters, not style keywords.
How does Nano Banana compare to other image models?
Nano Banana 2 is ranked #1 on the Artificial Analysis Image Arena and LM Arena as of March 2026. Nano Banana Pro leads on 4K resolution, text rendering accuracy, and character consistency. Midjourney leads on aesthetic variation and community ecosystem. Uni-1 leads on structured reasoning with role-labeled references. FLUX leads on open-source flexibility and LoRA support.
| Model | Max Resolution | Text Accuracy | Character Lock | Arena Rank |
|---|---|---|---|---|
| Nano Banana Pro | 4K native | 94%+ | 5 people | Top tier |
| Nano Banana 2 | 2K | ~85% | Yes | #1 overall |
| Midjourney v6.1 | 2K (upscale to 4K) | Moderate | Style refs only | Top tier |
| Uni-1 (Luma AI) | Up to 4K | EN + CN | 9 role-labeled refs | #1 Elo (quality) |
| FLUX.2 Pro | Up to 2K | Moderate | Via LoRA/IP-Adapter | Top tier |
Source: Artificial Analysis Image Arena (March 2026), LM Arena rankings, Google DeepMind documentation, and third-party benchmark reports. Arena rankings change frequently; check current standings for the latest.
How does Nano Banana work?
Nano Banana is built on Gemini's multimodal architecture. The models natively process text and images in the same token space, which is why they can generate, edit, and reason about images conversationally. Nano Banana Pro uses Gemini 3 Pro's full reasoning stack. Nano Banana 2 uses Gemini 3.1 Flash's optimized inference path for speed.
When you write a prompt, the model first reasons through the composition. For complex requests (infographics, multi-object scenes, structured layouts), it plans the spatial arrangement, determines object relationships, and resolves text placement before generating pixels. This is visible in thinking mode, where the model shows its planning steps.
Character consistency works through reference image embedding. You provide a photo, and the model extracts identity features (facial structure, skin tone, hair) that persist across subsequent generations. The model can maintain up to 5 separate identity embeddings in a single scene, which is why the ad production workflows on Floyo work: generate a consistent AI influencer, then place them in different product scenarios.
On Floyo, Nano Banana runs as API-based ComfyUI nodes. Your prompt and reference images are sent to Google's inference servers, and the generated image returns to your ComfyUI canvas. You can chain Nano Banana with local processing nodes (upscaling, color grading) or with other API models (Kling for video, Wan 2.6 for animation) in the same workflow. The 10+ pre-built workflows on Floyo cover the most common production use cases.
Note: Nano Banana is API-based, not a local model. Generation runs on Google's servers with content filtering active. All outputs include SynthID watermarks (imperceptible). The model can still struggle with small faces, complex spelling, and fine details. Infographics and data visualizations may contain factual errors. Always verify data-driven outputs. API pricing applies through your Floyo API Wallet.
Frequently Asked Questions
Common questions about running Nano Banana on Floyo.
Nano Banana runs as an API node, so generation costs come from your API Wallet (separate from FloTime). Floyo gives $0.25 in free API credits on signup. After that, pricing depends on the variant and resolution. Pro costs about $0.134 per 2K image and $0.24 per 4K image. Banana 2 costs about $0.067 per 2K image.
Open Floyo in your browser, search "Nano Banana" in the template library, and pick a workflow. Click Run, write your prompt, and generate. Floyo handles the ComfyUI environment and API connection to Google's servers. No local install, no Python setup, no API key management required.
Google DeepMind. Nano Banana (original) launched August 2025 on Gemini 2.5 Flash. Nano Banana Pro launched November 2025 on Gemini 3 Pro. Nano Banana 2 launched February 2026 on Gemini 3.1 Flash. After Nano Banana Pro's release, the Gemini app became the #1 free app on the US iPhone App Store.
Nano Banana Pro is the quality tier: 4K native, 94%+ text accuracy, 8-12 second generation. Nano Banana 2 is the speed tier: 2K max, ~85% text accuracy, 3-5 second generation. Pro is better for final production assets. Banana 2 is better for rapid iteration and exploration. Use Banana 2 to generate 20+ variations, then regenerate the winner with Pro for maximum quality.
Nano Banana Pro generates native 4K (Midjourney upscales to 4K from lower resolution). Text rendering is significantly more accurate in Nano Banana. Character consistency works for up to 5 people vs. Midjourney's style references. Midjourney has broader aesthetic range and a stronger community ecosystem. Nano Banana has multi-turn editing (Midjourney does not). Nano Banana runs on Floyo inside ComfyUI pipelines; Midjourney is Discord-only or standalone.
Yes. That is what the Floyo production workflows are built for. Generate a character-consistent AI influencer with Nano Banana, then animate them into a video with Kling or Wan 2.6. Ready-to-run workflows cover AI influencer video makers, beauty ad builders, travel vlogger generators, and 360-degree product videos.
Yes. Nano Banana Pro renders text at 94%+ accuracy in multiple languages. Signs, posters, menus, infographics, and marketing copy come out legible. Nano Banana 2 is at ~85%, which is adequate for social media and web but not for print materials with critical text. Both models can translate and localize text within images.
Yes. Images generated through the Gemini API (which Floyo uses) can be used for commercial purposes. All outputs include SynthID watermarks for transparency. Check Google's terms of service for full usage details, especially around generated images of people and branded content.
Try Nano Banana on Floyo
4K image generation, 94%+ text rendering, character consistency for 5 people, and multi-turn editing. Run it in your browser.
| Try Nano Banana Now → | Browse All Models |
Related Reading
AI Ad Creatives for Social and Web
Character and Concept Design on Floyo
Last updated: April 2026. Specs from Google DeepMind official documentation, Google AI for Developers API docs, Artificial Analysis Image Arena, LM Arena rankings, and third-party benchmark comparisons.
floyoofficial
12.9k
API
Floyo API
Image2Image
Nano Banana Pro
Google just released Nano Banana Pro, and honestly, it's a pretty big step up from the original Nano Banana. The main thing? It can actually put legible text in images now. Like, real text that you can read, not the garbled nonsense most AI models spit out.
Nano Banana Pro Text-to-Image: Gemini 3 Pro
Google just released Nano Banana Pro, and honestly, it's a pretty big step up from the original Nano Banana. The main thing? It can actually put legible text in images now. Like, real text that you can read, not the garbled nonsense most AI models spit out.
floyoofficial
2.9k
API
Image2Image
Nano Banana
Nano Banana 2
Text2Image
The top-ranked image model on Artificial Analysis and LM Arena. 4K output, text rendering, and subject consistency across 5 characters.
Nano Banana 2 - Google's #1 ranked image model
The top-ranked image model on Artificial Analysis and LM Arena. 4K output, text rendering, and subject consistency across 5 characters.
Nano Banana Pro Edit Image to Image
API
Ecommerce
Image2Image
Nano Banana Pro
Product Ads
Create grids of different angles for your ecommerce products.
Nano Banana Pro for Multi Grid View of Product Ads
Create grids of different angles for your ecommerce products.
360 Degree Product Video Using Nano Banana Pro
Craft Stunning Edits Instantly with Nano Banana Edit
nikhil07
264
Influencer
kling
Nano banana Pro
Product
Build your AI influencer, stage the product moment, and animate the full promo in one workflow.
AI Influencer Video Maker (Nano Banana + kling)
Build your AI influencer, stage the product moment, and animate the full promo in one workflow.
nikhil07
233
Influencer
Product
Build your AI influencer, stage the product moment, and animate the full promo in one workflow.
AI Influencer Ad Generator (Nano Banana + Wan 2.6)
Build your AI influencer, stage the product moment, and animate the full promo in one workflow.
nikhil07
207
Beauty
Influencer
kling
Makeup
Nano banana Pro
Product
UGC
Generate a realistic beauty influencer photo and transform it into a talking promotional video.
AI Beauty Ad Builder (Nano Banana + Kling)
Generate a realistic beauty influencer photo and transform it into a talking promotional video.
nikhil07
122
Influencer
kling
Nano banana Pro
Travel
Vlog
Create social-ready travel reels from a single image and location reference.
AI Travel Vlogger Generator (Nano Banana + Kling)
Create social-ready travel reels from a single image and location reference.



_1767784624270.png?width=400&height=300&quality=80&resize=cover)
_1767112903535.webp?width=400&height=300&quality=80&resize=cover)

_1772104822143.gif?width=400&height=300&quality=80&resize=cover)
_1771834011124.gif?width=400&height=300&quality=80&resize=cover)
_1772486002401.gif?width=400&height=300&quality=80&resize=cover)
_1772477545790.gif?width=400&height=300&quality=80&resize=cover)