API

Pricing

Workflows

API

Pricing

ComfyUI-QwenVL-Mod

Author huchukato

https://github.com/huchukato/ComfyUI-QwenVL-Mod

Last updated

2026-05-13

Run hundreds of ComfyUI nodes and workflows in your browser.

The ComfyUI-QwenVL custom node integrates advanced Qwen-VL vision-language models, including Qwen2.5-VL and the latest Qwen3-VL, into the ComfyUI framework, supporting GGUF for enhanced multimodal AI applications in text generation, image comprehension, and video analysis.

Supports both standard and advanced nodes for flexible usage and detailed control.
Features smart prompt caching and a bypass mode for efficient workflow management.
Offers specialized prompts for cinematic video generation, including I2V (image-to-video) and T2V (text-to-video) capabilities.

Context

The ComfyUI-QwenVL custom node is designed to enhance the ComfyUI environment by incorporating powerful Qwen-VL series models from Alibaba Cloud. Its primary purpose is to facilitate advanced multimodal AI operations, allowing users to seamlessly generate text, analyze images, and process video content.

Key Features & Benefits

This tool provides a range of practical features that significantly enhance user productivity:

Standard and Advanced Nodes: Users can choose between a straightforward node for quick tasks and an advanced version that offers fine-tuned control over generation parameters.
Smart Prompt Caching: This feature prevents the regeneration of identical prompts, improving performance during repeated inputs and maintaining cache across sessions.
Bypass Mode: Users can preserve previously generated prompts without needing to regenerate them, which conserves computational resources and streamlines workflows.

Advanced Functionalities

The node includes specialized capabilities for cinematic video generation:

WAN 2.2 Integration: This allows for detailed cinematic scene descriptions in video outputs, enhancing the quality and coherence of generated videos.
Fixed Seed Mode: This feature ensures consistent output by maintaining the same seed value, regardless of variations in input media, which is vital for reproducible results.

Practical Benefits

The integration of the Qwen-VL node into ComfyUI enhances workflows by providing users with greater control and efficiency. The ability to handle both text and visual data in a cohesive manner allows for higher-quality outputs and a more streamlined creative process, ultimately improving the overall efficiency of AI-driven projects.

Credits/Acknowledgments

This project is developed by huchukato, with contributions from the Qwen Team at Alibaba Cloud, and is built on the ComfyUI framework by comfyanonymous. The code is released under the GPL-3.0 License, ensuring open-source accessibility and collaboration within the community.

Inner Nodes

VRAMCleanup

Discover most popular workflows

Hand-picked based on what hundreds of other artists looked at.

Z-Image Turbo: Fast Image Generation in Seconds

floyoofficial

21.9k

Marketing

Photography

Production

Text2Image

Z-Image Turbo

Fast Image Generation in Seconds

Z-Image Turbo: Fast Image Generation in Seconds

Fast Image Generation in Seconds

Nano Banana 2: Fast Image Generation & Editing

floyoofficial

4.6k

API

gemini flash image

Image2Image

Text2Image

typography

The top-ranked image model on Artificial Analysis and LM Arena. 4K output, text rendering, and subject consistency across 5 characters.

Nano Banana 2: Fast Image Generation & Editing

The top-ranked image model on Artificial Analysis and LM Arena. 4K output, text rendering, and subject consistency across 5 characters.

floyoofficial

25.2k

AiVideo

API

image to video

video generation

wan 2.5

Wan 2.5: Image to Video with Audio

goshnii

10.6k

Face swap

Flux

flux 2 klein

Flux 2 Klein face swap

Flux face swap

head swap

image 2 image

image editing

Instead of using outdated or unstable techniques, this workflow was designed to take full advantage of FLUX 2 KLEIN's editing capabilities—using a face image and a reference character image to produce clean, highly consistent results.

Flux 2 Klein 9b - Perfect Face swap

floyoofficial

4.7k

API

Image to Video

LTX2.3

LTX 2.3

LTX 2.3 Pro Image to Video

LTX 2.3

Author

huchukato