API

Pricing

Workflows

API

Pricing

ComfyUI-ChatterboxTTS

Author Yuan-ManX

https://github.com/Yuan-ManX/ComfyUI-ChatterboxTTS

Last updated

2026-04-15

Run hundreds of ComfyUI nodes and workflows in your browser.

ComfyUI-ChatterboxTTS integrates an advanced text-to-speech (TTS) model, Chatterbox, into the ComfyUI framework, marking it as the first open-source TTS solution suitable for production use. This tool enhances the capabilities of ComfyUI by providing high-quality, expressive voice synthesis.

Offers production-grade TTS capabilities through the Chatterbox model.
Supports customizable parameters for improved speech pacing and expressiveness.
Facilitates seamless integration within the ComfyUI environment for enhanced user experience.

Context

ComfyUI-ChatterboxTTS is a specialized extension for ComfyUI that introduces the Chatterbox TTS model, which is designed to generate high-quality speech from text inputs. Its primary function is to provide users with a reliable and flexible TTS solution that can be utilized for various applications, including voice agents and content creation.

Key Features & Benefits

This tool stands out by delivering a production-ready TTS experience, allowing users to generate realistic speech that can be finely tuned. The ability to adjust parameters such as cfg_weight and exaggeration enables users to customize the speech output to suit specific needs, whether for dramatic readings or conversational agents.

Advanced Functionalities

Chatterbox TTS offers advanced features like adjustable speaking styles, where users can manipulate the cfg_weight to alter the pacing and exaggeration to control expressiveness. This flexibility allows for nuanced speech synthesis, making it suitable for diverse applications from storytelling to interactive dialogues.

Practical Benefits

By incorporating ComfyUI-ChatterboxTTS into their workflows, users can significantly enhance their control over voice output quality and pacing. This tool streamlines the process of generating expressive speech, improving overall efficiency and effectiveness in projects that require TTS functionalities.

Credits/Acknowledgments

The Chatterbox TTS model is developed by Resemble AI, and the ComfyUI-ChatterboxTTS extension is maintained by Yuan-ManX. The repository is open-source, contributing to the collaborative nature of the AI art and TTS community.

Inner Nodes

ChatterboxTTS

Discover most popular workflows

Hand-picked based on what hundreds of other artists looked at.

Z-Image Turbo: Fast Image Generation in Seconds

floyoofficial

21.9k

Marketing

Photography

Production

Text2Image

Z-Image Turbo

Fast Image Generation in Seconds

Z-Image Turbo: Fast Image Generation in Seconds

Fast Image Generation in Seconds

Nano Banana 2: Fast Image Generation & Editing

floyoofficial

4.6k

API

gemini flash image

Image2Image

Text2Image

typography

The top-ranked image model on Artificial Analysis and LM Arena. 4K output, text rendering, and subject consistency across 5 characters.

Nano Banana 2: Fast Image Generation & Editing

The top-ranked image model on Artificial Analysis and LM Arena. 4K output, text rendering, and subject consistency across 5 characters.

floyoofficial

25.2k

AiVideo

API

image to video

video generation

wan 2.5

Wan 2.5: Image to Video with Audio

goshnii

10.6k

Face swap

Flux

flux 2 klein

Flux 2 Klein face swap

Flux face swap

head swap

image 2 image

image editing

Instead of using outdated or unstable techniques, this workflow was designed to take full advantage of FLUX 2 KLEIN's editing capabilities—using a face image and a reference character image to produce clean, highly consistent results.

Flux 2 Klein 9b - Perfect Face swap

floyoofficial

4.7k

API

Image to Video

LTX2.3

LTX 2.3

LTX 2.3 Pro Image to Video

LTX 2.3

Author

Yuan-ManX