API

Pricing

Workflows

API

Pricing

ComfyUI-Geeky-Kokoro-TTS

Author GeekyGhost

https://github.com/GeekyGhost/ComfyUI-Geeky-Kokoro-TTS

Last updated

2025-03-21

Run hundreds of ComfyUI nodes and workflows in your browser.

A custom node wrapper designed for the Kokoro Text-to-Speech (TTS) system, this tool enhances ComfyUI by enabling advanced voice modification and improved text processing capabilities. It integrates the latest Kokoro TTS models, ensuring compatibility and performance improvements for a seamless user experience.

Integrates the latest Kokoro TTS v0.19+ with over 27 premium voice options, allowing for diverse audio outputs.
Features advanced text chunking that maintains sentence structure and natural pauses, improving speech flow and clarity.
Offers real-time voice modulation and effects, enabling users to customize audio output with professional-grade processing.

Context

This tool serves as a specialized extension for ComfyUI, facilitating high-quality text-to-speech functionality through the Kokoro TTS system. Its main objective is to provide users with enhanced audio outputs while maintaining the integrity of the original text.

Key Features & Benefits

The tool boasts several practical features, including:

Advanced Voice Options: Users can select from a variety of premium voices, enhancing the versatility of audio output for different applications.
Intelligent Text Processing: The custom node ensures that text is chunked intelligently, preserving sentence boundaries and paragraph structures, which is crucial for maintaining the natural flow of speech.
Voice Effects and Modulation: Real-time voice transformation capabilities allow users to apply effects and blend voices, creating unique audio experiences tailored to specific needs.

Advanced Functionalities

This tool includes advanced capabilities such as:

Voice Blending: Users can combine two distinct voices with adjustable ratios, allowing for creative audio outputs that can suit various contexts.
Real-time Audio Processing: The node supports multiple audio effects, including pitch shifting and reverb, providing users with the ability to create complex soundscapes without additional software.
Debug Logging: Enhanced logging features offer transparency during the text chunking process, making it easier for users to troubleshoot issues.

Practical Benefits

By incorporating this tool into their workflows, users can significantly improve the quality and efficiency of their text-to-speech projects. The advanced chunking and voice modulation capabilities not only enhance the auditory experience but also streamline the overall process, allowing for quicker production times and more polished outputs.

Credits/Acknowledgments

The development of this tool is credited to the original authors and contributors, with the Kokoro TTS model licensed under Apache 2.0. Special thanks to the ComfyUI team for their foundational framework and to community testers who contributed to identifying and resolving issues during development.

Discover most popular workflows

Hand-picked based on what hundreds of other artists looked at.

Z-Image Turbo: Fast Image Generation in Seconds

floyoofficial

21.9k

Marketing

Photography

Production

Text2Image

Z-Image Turbo

Fast Image Generation in Seconds

Z-Image Turbo: Fast Image Generation in Seconds

Fast Image Generation in Seconds

Nano Banana 2: Fast Image Generation & Editing

floyoofficial

4.6k

API

gemini flash image

Image2Image

Text2Image

typography

The top-ranked image model on Artificial Analysis and LM Arena. 4K output, text rendering, and subject consistency across 5 characters.

Nano Banana 2: Fast Image Generation & Editing

The top-ranked image model on Artificial Analysis and LM Arena. 4K output, text rendering, and subject consistency across 5 characters.

floyoofficial

25.2k

AiVideo

API

image to video

video generation

wan 2.5

Wan 2.5: Image to Video with Audio

goshnii

10.7k

Face swap

Flux

flux 2 klein

Flux 2 Klein face swap

Flux face swap

head swap

image 2 image

image editing

Instead of using outdated or unstable techniques, this workflow was designed to take full advantage of FLUX 2 KLEIN's editing capabilities—using a face image and a reference character image to produce clean, highly consistent results.

Flux 2 Klein 9b - Perfect Face swap

floyoofficial

4.7k

API

Image to Video

LTX2.3

LTX 2.3

LTX 2.3 Pro Image to Video

LTX 2.3

Author

GeekyGhost