API

Pricing

Workflows

API

Pricing

ComfyUI_CaptionThis

Author MieMieeeee

https://github.com/MieMieeeee/ComfyUI-CaptionThis

Last updated

2025-07-04

Run hundreds of ComfyUI nodes and workflows in your browser.

ComfyUI-CaptionThis is a versatile tool designed for generating captions for images, utilizing advanced models like Janus Pro and Florence2, with plans to incorporate additional models such as JoyCaption. Its primary goal is to facilitate image-to-image tasks and assist in preparing datasets for LoRA training, streamlining the process of describing both individual images and entire directories of images.

Supports multiple captioning models, allowing users to select the best fit for their needs.
Enables batch processing of images, automatically generating and saving captions for multiple files.
Provides a user-friendly interface for both single image and directory caption generation.

Context

ComfyUI-CaptionThis serves as an extension within the ComfyUI framework, focusing on the generation of descriptive captions for images. It is particularly useful for users involved in training machine learning models, as it simplifies the dataset creation process by providing detailed descriptions of images.

Key Features & Benefits

The tool offers the capability to generate captions for both single images and batches of images, significantly enhancing the efficiency of dataset preparation. By supporting multiple captioning models, it allows users to choose the most suitable model for their specific tasks, thereby improving the quality and relevance of the generated captions.

Advanced Functionalities

ComfyUI-CaptionThis includes the ability to customize prompts or guiding questions when describing individual images, which can lead to more tailored and informative captions. Additionally, the tool is designed to evolve with future updates, including the integration of new models and advanced configuration options for fine-tuning caption outputs.

Practical Benefits

This tool enhances workflow efficiency by automating the caption generation process for multiple images, reducing the time and effort required for dataset preparation. Users gain greater control over the quality of captions, which can lead to improved outcomes in training AI models.

Credits/Acknowledgments

The development of ComfyUI-CaptionThis is built upon the contributions of various authors, including DeepSeek-AI for the Janus Pro model, and contributors like CY-CHENYUE and kijai for their implementations of Janus Pro and Florence2. The project acknowledges these foundational works while introducing a multi-model architecture that enhances user flexibility and functionality.

Discover most popular workflows

Hand-picked based on what hundreds of other artists looked at.

Z-Image Turbo: Fast Image Generation in Seconds

floyoofficial

21.9k

Marketing

Photography

Production

Text2Image

Z-Image Turbo

Fast Image Generation in Seconds

Z-Image Turbo: Fast Image Generation in Seconds

Fast Image Generation in Seconds

Nano Banana 2: Fast Image Generation & Editing

floyoofficial

4.6k

API

gemini flash image

Image2Image

Text2Image

typography

The top-ranked image model on Artificial Analysis and LM Arena. 4K output, text rendering, and subject consistency across 5 characters.

Nano Banana 2: Fast Image Generation & Editing

The top-ranked image model on Artificial Analysis and LM Arena. 4K output, text rendering, and subject consistency across 5 characters.

floyoofficial

25.2k

AiVideo

API

image to video

video generation

wan 2.5

Wan 2.5: Image to Video with Audio

goshnii

10.7k

Face swap

Flux

flux 2 klein

Flux 2 Klein face swap

Flux face swap

head swap

image 2 image

image editing

Instead of using outdated or unstable techniques, this workflow was designed to take full advantage of FLUX 2 KLEIN's editing capabilities—using a face image and a reference character image to produce clean, highly consistent results.

Flux 2 Klein 9b - Perfect Face swap

floyoofficial

4.7k

API

Image to Video

LTX2.3

LTX 2.3

LTX 2.3 Pro Image to Video

LTX 2.3

Author

MieMieeeee