API

Pricing

Workflows

API

Pricing

Dia realistic TTS

Author rkfg

https://github.com/rkfg/ComfyUI-Dia_tts

Last updated

2025-04-27

Run hundreds of ComfyUI nodes and workflows in your browser.

It's a wrapper for the Dia TTS system designed to integrate seamlessly with ComfyUI, leveraging specific code segments for inference purposes. This tool enhances text-to-speech capabilities within the ComfyUI environment, allowing users to generate audio outputs from textual input effectively.

Provides a streamlined interface for utilizing Dia TTS within ComfyUI.
Facilitates high-quality text-to-speech generation with customizable options.
Integrates directly with existing ComfyUI workflows for enhanced user experience.

Context

This tool serves as a bridge between ComfyUI and the Dia TTS text-to-speech engine developed by Nari Labs. Its primary aim is to enrich the ComfyUI framework by incorporating advanced speech synthesis capabilities, making it easier for users to generate audio directly from text.

Key Features & Benefits

The integration allows users to access Dia TTS's functionality through a user-friendly interface within ComfyUI. This means that users can easily convert written content into spoken words, which is beneficial for applications like voiceovers, accessibility features, and interactive media.

Advanced Functionalities

The tool supports various customization options, enabling users to adjust parameters such as voice pitch, speed, and tone. This flexibility allows for a more tailored audio output that can meet specific project requirements or personal preferences.

Practical Benefits

By incorporating this tool, users can significantly enhance their workflow in ComfyUI, as it allows for seamless text-to-speech conversion without needing to switch between different applications or interfaces. This leads to improved efficiency and control over audio output quality, ultimately contributing to a more cohesive user experience.

Credits/Acknowledgments

This tool is based on the Dia TTS project by Nari Labs, utilizing portions of their code for inference. The integration is made possible through the collaborative efforts of the open-source community.

Discover most popular workflows

Hand-picked based on what hundreds of other artists looked at.

Z-Image Turbo: Fast Image Generation in Seconds

floyoofficial

21.9k

Marketing

Photography

Production

Text2Image

Z-Image Turbo

Fast Image Generation in Seconds

Z-Image Turbo: Fast Image Generation in Seconds

Fast Image Generation in Seconds

Nano Banana 2: Fast Image Generation & Editing

floyoofficial

4.6k

API

gemini flash image

Image2Image

Text2Image

typography

The top-ranked image model on Artificial Analysis and LM Arena. 4K output, text rendering, and subject consistency across 5 characters.

Nano Banana 2: Fast Image Generation & Editing

The top-ranked image model on Artificial Analysis and LM Arena. 4K output, text rendering, and subject consistency across 5 characters.

floyoofficial

25.2k

AiVideo

API

image to video

video generation

wan 2.5

Wan 2.5: Image to Video with Audio

goshnii

10.7k

Face swap

Flux

flux 2 klein

Flux 2 Klein face swap

Flux face swap

head swap

image 2 image

image editing

Instead of using outdated or unstable techniques, this workflow was designed to take full advantage of FLUX 2 KLEIN's editing capabilities—using a face image and a reference character image to produce clean, highly consistent results.

Flux 2 Klein 9b - Perfect Face swap

floyoofficial

4.7k

API

Image to Video

LTX2.3

LTX 2.3

LTX 2.3 Pro Image to Video

LTX 2.3

Author

rkfg