floyo logobeta logo
Powered by
ThinkDiffusion
floyo logobeta logo
Powered by
ThinkDiffusion

ComfyUI Gemini Pro Node

7

Last updated
2025-07-05

This tool is a comprehensive node for ComfyUI that integrates the Google Gemini Pro API, enabling users to work with various input types, including text, images, videos, and audio. It is designed to facilitate advanced image generation and editing capabilities, streamlining the creative process for users.

  • Supports a variety of input formats, allowing for versatile content creation.
  • Features built-in retry mechanisms and timeout controls to enhance reliability during API calls.
  • Offers customization options for system prompts, output tokens, and temperature settings, providing users with greater control over the generation process.

Context

The ComfyUI Gemini Pro Node serves as a powerful extension for ComfyUI, leveraging the capabilities of the Google Gemini Pro API. Its main purpose is to allow users to generate and manipulate content across multiple media formats, enhancing the creative workflow within ComfyUI.

Key Features & Benefits

This node supports multiple input formats, including text, images, videos, and audio, making it a versatile tool for various creative projects. The ability to configure system prompts and customize parameters such as temperature and maximum output tokens gives users fine-tuned control over the generated content, ensuring it meets specific needs.

Advanced Functionalities

Advanced functionalities include a built-in retry mechanism that automatically attempts to resend requests in case of failures, as well as timeout controls that prevent long waits during API calls. Users can also set up proxy configurations for improved network management and performance.

Practical Benefits

By integrating the Google Gemini Pro API, this tool significantly enhances the efficiency and quality of content generation within ComfyUI. It allows for seamless transitions between different media types and provides users with the flexibility to adjust parameters according to their creative requirements, ultimately improving overall workflow and output quality.

Credits/Acknowledgments

The development of this tool is credited to the original authors and contributors, including ZHO and CY-CHENYUE, who have provided valuable resources and support in the creation of the ComfyUI Gemini Pro Node. The tool is released under the MIT License, allowing for broad usage and modification.