API

Pricing

Workflows

API

Pricing

ComfyUI_ChatterBox_Voice

Author ShmuelRonen

https://github.com/ShmuelRonen/ComfyUI_ChatterBox_Voice

Last updated

2025-06-04

Run hundreds of ComfyUI nodes and workflows in your browser.

An unofficial integration for ComfyUI, this tool provides high-quality Text-to-Speech (TTS) and Voice Conversion (VC) capabilities using ResembleAI's ChatterboxTTS, allowing for unlimited text length processing. It enhances the ComfyUI experience by enabling advanced audio functionalities such as voice cloning and intelligent text chunking.

Offers production-grade TTS that surpasses other popular services in quality and performance.
Includes a voice capture node with smart silence detection for effective audio recording.
Features emotion control for expressive speech, allowing users to adjust the intensity of delivered emotions.

Context

This tool serves as a custom node integration within ComfyUI, a user-friendly interface for AI applications. Its primary purpose is to facilitate high-quality speech generation and voice transformation, leveraging the capabilities of ChatterboxTTS to expand the creative possibilities for users.

Key Features & Benefits

The integration provides several practical features, such as:

ChatterBox TTS: Capable of generating speech from any length of text, with options for voice cloning, ensuring versatility in audio production.
Voice Conversion: Enables users to transform one speaker's voice to another, enhancing flexibility in audio projects.
Audio Capture Node: A dedicated node for recording voice with advanced features like silence detection, which streamlines the recording process and improves audio quality.

Advanced Functionalities

The tool introduces advanced text processing capabilities, including:

Intelligent Text Chunking: Automatically splits lengthy text into manageable segments while preserving sentence integrity, which is crucial for maintaining natural speech patterns.
Emotion Control: Offers parameters to adjust the expressiveness of speech, allowing for nuanced vocal performances tailored to specific contexts.

Practical Benefits

By integrating this tool, users experience significant improvements in workflow efficiency and audio quality within ComfyUI. The ability to handle unlimited text lengths and the smart chunking feature reduce the hassle of preparing text for TTS, while the voice capture and conversion functionalities provide users with greater creative control over their audio projects.

Credits/Acknowledgments

This integration is based on the work of ResembleAI, which developed the ChatterboxTTS model, and is supported by the ComfyUI team. The tool is released under the MIT License, ensuring open access and collaboration within the community.

Discover most popular workflows

Hand-picked based on what hundreds of other artists looked at.

Z-Image Turbo: Fast Image Generation in Seconds

floyoofficial

21.9k

Marketing

Photography

Production

Text2Image

Z-Image Turbo

Fast Image Generation in Seconds

Z-Image Turbo: Fast Image Generation in Seconds

Fast Image Generation in Seconds

Nano Banana 2: Fast Image Generation & Editing

floyoofficial

4.6k

API

gemini flash image

Image2Image

Text2Image

typography

The top-ranked image model on Artificial Analysis and LM Arena. 4K output, text rendering, and subject consistency across 5 characters.

Nano Banana 2: Fast Image Generation & Editing

The top-ranked image model on Artificial Analysis and LM Arena. 4K output, text rendering, and subject consistency across 5 characters.

floyoofficial

25.2k

AiVideo

API

image to video

video generation

wan 2.5

Wan 2.5: Image to Video with Audio

goshnii

10.7k

Face swap

Flux

flux 2 klein

Flux 2 Klein face swap

Flux face swap

head swap

image 2 image

image editing

Instead of using outdated or unstable techniques, this workflow was designed to take full advantage of FLUX 2 KLEIN's editing capabilities—using a face image and a reference character image to produce clean, highly consistent results.

Flux 2 Klein 9b - Perfect Face swap

floyoofficial

4.7k

API

Image to Video

LTX2.3

LTX 2.3

LTX 2.3 Pro Image to Video

LTX 2.3

Author

ShmuelRonen