API

Pricing

Workflows

API

Pricing

ComfyUI-Gemini

Author Visionatrix

https://github.com/Visionatrix/ComfyUI-Gemini

Last updated

2025-07-07

Run hundreds of ComfyUI nodes and workflows in your browser.

These nodes facilitate the integration of the Gemini API within ComfyUI, enabling users to send prompts and images to Gemini AI models for various AI-driven tasks. This tool enhances the capabilities of ComfyUI by leveraging advanced AI models for generating responses based on user inputs.

Allows for the inclusion of multiple images to enhance prompt context.
Offers customizable system instructions to tailor AI behavior.
Provides adjustable safety settings to manage content filtering effectively.

Context

This tool serves as a bridge between ComfyUI and the Gemini API, allowing users to utilize the advanced AI models offered by Google Gemini. Its primary purpose is to enhance the user experience by enabling the submission of prompts and images, thereby expanding the creative possibilities within the ComfyUI environment.

Key Features & Benefits

The tool includes several practical features such as the ability to specify an error fallback value, which ensures that users receive a default response when the API is inaccessible. Users can also select the response format, either plain text or JSON, catering to different workflow needs. Additionally, the option to include up to three images alongside prompts provides valuable visual context, enhancing the quality of AI-generated outputs.

Advanced Functionalities

One of the standout capabilities of this tool is the ability to set custom system instructions, which guides the AI's responses more effectively. Users can define how the AI should behave, ensuring that the outputs align with specific requirements or tones. The adjustable safety settings further allow users to control the filtering of inappropriate content, making it suitable for a wide range of applications.

Practical Benefits

This integration significantly improves workflow efficiency by allowing users to interact with AI models directly within ComfyUI. By providing options for customized instructions and content filtering, users gain greater control over the output quality and relevance. The inclusion of images alongside prompts not only enhances the context but also elevates the overall results of AI interactions.

Credits/Acknowledgments

The tool was developed by contributors to the ComfyUI-Gemini repository, with the implementation based on the Gemini API from Google. The repository is open source, allowing for community contributions and improvements.

Discover most popular workflows

Hand-picked based on what hundreds of other artists looked at.

Z-Image Turbo: Fast Image Generation in Seconds

floyoofficial

21.9k

Marketing

Photography

Production

Text2Image

Z-Image Turbo

Fast Image Generation in Seconds

Z-Image Turbo: Fast Image Generation in Seconds

Fast Image Generation in Seconds

Nano Banana 2: Fast Image Generation & Editing

floyoofficial

4.6k

API

gemini flash image

Image2Image

Text2Image

typography

The top-ranked image model on Artificial Analysis and LM Arena. 4K output, text rendering, and subject consistency across 5 characters.

Nano Banana 2: Fast Image Generation & Editing

The top-ranked image model on Artificial Analysis and LM Arena. 4K output, text rendering, and subject consistency across 5 characters.

floyoofficial

25.2k

AiVideo

API

image to video

video generation

wan 2.5

Wan 2.5: Image to Video with Audio

goshnii

10.7k

Face swap

Flux

flux 2 klein

Flux 2 Klein face swap

Flux face swap

head swap

image 2 image

image editing

Instead of using outdated or unstable techniques, this workflow was designed to take full advantage of FLUX 2 KLEIN's editing capabilities—using a face image and a reference character image to produce clean, highly consistent results.

Flux 2 Klein 9b - Perfect Face swap

floyoofficial

4.7k

API

Image to Video

LTX2.3

LTX 2.3

LTX 2.3 Pro Image to Video

LTX 2.3

Author

Visionatrix