API

Pricing

Workflows

API

Pricing

ComfyUI-Whisper

Author yuvraj108c

https://github.com/yuvraj108c/ComfyUI-Whisper

249

Last updated

2026-06-07

Run hundreds of ComfyUI nodes and workflows in your browser.

Transcribe audio content and generate subtitles for videos utilizing the Whisper model within the ComfyUI framework. This tool supports a variety of languages and offers customizable subtitle options to enhance video accessibility.

Supports multiple Whisper models, allowing users to choose the one that best fits their needs.
Provides options for adding subtitles directly to video frames or as a word cloud on blank frames.
Exports subtitle alignments in SRT format, facilitating easy integration with video editing software.

Context

This tool, known as ComfyUI Whisper, is an extension designed to integrate the Whisper speech-to-text model into the ComfyUI environment. Its primary function is to transcribe audio tracks and create subtitles for videos, making content more accessible to a broader audience.

Key Features & Benefits

The ComfyUI Whisper extension includes several practical features such as the ability to transcribe audio and generate timestamps for each spoken segment. Users can customize subtitle appearance by selecting font styles, colors, and positions, which enhances the viewing experience and ensures clarity. Additionally, the tool supports multiple languages and various Whisper models, giving users flexibility in transcription quality and accuracy.

Advanced Functionalities

ComfyUI Whisper includes advanced capabilities like the experimental feature of adding subtitles in a word cloud format on blank frames, which can be useful for creative presentations. The transcription process also allows for detailed timestamps, enabling precise synchronization of subtitles with video content.

Practical Benefits

By integrating this tool into their workflows, users can significantly enhance their video production processes, improving both efficiency and control over subtitle quality. The ability to export SRT files simplifies the task of incorporating subtitles into video editing software, streamlining the overall workflow.

Credits/Acknowledgments

This project is credited to the original authors and contributors from the ComfyUI community, including notable contributions from various developers. The tool is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International license.

Inner Nodes

Add Subtitles To Background

Add Subtitles To Frames

Apply Whisper

Resize Cropped Subtitles

Save SRT

Discover most popular workflows

Hand-picked based on what hundreds of other artists looked at.

Z-Image Turbo: Fast Image Generation in Seconds

floyoofficial

21.9k

Marketing

Photography

Production

Text2Image

Z-Image Turbo

Fast Image Generation in Seconds

Z-Image Turbo: Fast Image Generation in Seconds

Fast Image Generation in Seconds

Nano Banana 2: Fast Image Generation & Editing

floyoofficial

4.6k

API

gemini flash image

Image2Image

Text2Image

typography

The top-ranked image model on Artificial Analysis and LM Arena. 4K output, text rendering, and subject consistency across 5 characters.

Nano Banana 2: Fast Image Generation & Editing

The top-ranked image model on Artificial Analysis and LM Arena. 4K output, text rendering, and subject consistency across 5 characters.

floyoofficial

25.2k

AiVideo

API

image to video

video generation

wan 2.5

Wan 2.5: Image to Video with Audio

goshnii

10.7k

Face swap

Flux

flux 2 klein

Flux 2 Klein face swap

Flux face swap

head swap

image 2 image

image editing

Instead of using outdated or unstable techniques, this workflow was designed to take full advantage of FLUX 2 KLEIN's editing capabilities—using a face image and a reference character image to produce clean, highly consistent results.

Flux 2 Klein 9b - Perfect Face swap

floyoofficial

4.7k

API

Image to Video

LTX2.3

LTX 2.3

LTX 2.3 Pro Image to Video

LTX 2.3

Author

yuvraj108c