API

Pricing

Workflows

API

Pricing

ComfyUI-OpenAI-FM

Author fairy-root

https://github.com/fairy-root/ComfyUI-OpenAI-FM

Last updated

2025-05-09

Run hundreds of ComfyUI nodes and workflows in your browser.

The OpenAI FM TTS node is a specialized component for ComfyUI that integrates the OpenAI FM Text-to-Speech service, enabling users to convert written text into spoken audio with diverse voice options and emotional expressions. This functionality enhances audio projects by allowing for the creation of realistic voiceovers and dynamic audio experiences.

Supports high-quality text-to-speech conversion through the OpenAI FM API.
Provides a selection of voices and emotional styles, allowing for customized vocal performances.
Outputs audio that seamlessly fits into the ComfyUI audio processing framework and saves generated files automatically.

Context

The OpenAI FM TTS node is designed to facilitate audio workflows within ComfyUI by enabling users to transform text into speech effortlessly. It serves as a bridge between text input and audio output, making it easier to incorporate voice into multimedia projects.

Key Features & Benefits

The node's primary feature is its ability to convert text into speech using the OpenAI FM API, ensuring high-quality audio output. Users can select from a variety of voices and emotional tones, making it possible to tailor the audio to fit the context of their projects. This flexibility is crucial for enhancing user engagement and delivering more immersive audio experiences.

Advanced Functionalities

This tool allows for nuanced control over the emotional delivery of the speech through its "vibe" feature, which lets users choose different emotional styles. This capability is particularly useful for projects that require a specific tone, such as storytelling or character-driven narratives. Additionally, the node supports multiline text input, accommodating longer scripts or dialogues.

Practical Benefits

By integrating this node into their workflows, users can significantly streamline the process of adding voiceovers to their projects. The direct compatibility with ComfyUI's audio output framework enhances workflow efficiency, while the automatic saving of audio files simplifies file management. Overall, this tool improves control over audio quality and performance, allowing for more polished and professional results.

Credits/Acknowledgments

The OpenAI FM TTS node was developed by FairyRoot and is available under the MIT License. Contributions to the project are encouraged, and the original author can be contacted through their GitHub profile or Telegram.

Discover most popular workflows

Hand-picked based on what hundreds of other artists looked at.

Z-Image Turbo: Fast Image Generation in Seconds

floyoofficial

21.9k

Marketing

Photography

Production

Text2Image

Z-Image Turbo

Fast Image Generation in Seconds

Z-Image Turbo: Fast Image Generation in Seconds

Fast Image Generation in Seconds

Nano Banana 2: Fast Image Generation & Editing

floyoofficial

4.6k

API

gemini flash image

Image2Image

Text2Image

typography

The top-ranked image model on Artificial Analysis and LM Arena. 4K output, text rendering, and subject consistency across 5 characters.

Nano Banana 2: Fast Image Generation & Editing

The top-ranked image model on Artificial Analysis and LM Arena. 4K output, text rendering, and subject consistency across 5 characters.

floyoofficial

25.2k

AiVideo

API

image to video

video generation

wan 2.5

Wan 2.5: Image to Video with Audio

goshnii

10.7k

Face swap

Flux

flux 2 klein

Flux 2 Klein face swap

Flux face swap

head swap

image 2 image

image editing

Instead of using outdated or unstable techniques, this workflow was designed to take full advantage of FLUX 2 KLEIN's editing capabilities—using a face image and a reference character image to produce clean, highly consistent results.

Flux 2 Klein 9b - Perfect Face swap

floyoofficial

4.7k

API

Image to Video

LTX2.3

LTX 2.3

LTX 2.3 Pro Image to Video

LTX 2.3

Author

fairy-root