API

Pricing

Workflows

API

Pricing

ComfyUI-AudioX

Author Yuan-ManX

https://github.com/Yuan-ManX/ComfyUI-AudioX

Last updated

2025-05-27

Run hundreds of ComfyUI nodes and workflows in your browser.

Make AudioX functionality accessible within ComfyUI, allowing users to leverage advanced audio generation capabilities. This integration enables the utilization of a diffusion transformer model specifically designed for generating audio from various inputs.

Integrates AudioX, a powerful diffusion transformer for audio generation, directly into the ComfyUI framework.
Facilitates seamless audio production workflows by providing a user-friendly interface for generating audio from text or other inputs.
Supports pretrained models, enabling users to quickly implement high-quality audio generation without extensive setup.

Context

This tool, ComfyUI-AudioX, serves as an extension for ComfyUI, enabling users to incorporate AudioX, a diffusion transformer specialized in generating audio from diverse input types. Its primary purpose is to enhance the audio generation capabilities of ComfyUI, making it easier for users to create and manipulate audio content.

Key Features & Benefits

ComfyUI-AudioX allows users to leverage the advanced capabilities of the AudioX model, which is designed for "Anything-to-Audio" generation. This integration simplifies the process of audio creation, providing a streamlined interface that enhances user experience and productivity.

Advanced Functionalities

The tool supports pretrained checkpoints, allowing users to quickly download and implement the necessary models for audio generation. This feature significantly reduces setup time and ensures that users can start generating audio with high-quality models right away.

Practical Benefits

By incorporating ComfyUI-AudioX into their workflows, users can expect improved efficiency in audio generation, greater control over the audio creation process, and enhanced overall quality of the audio outputs. This tool effectively bridges the gap between text and audio, enabling a more versatile creative process within ComfyUI.

Credits/Acknowledgments

The original AudioX model is developed by HKUSTAudio and is available on Hugging Face. The ComfyUI-AudioX repository is maintained by Yuan-ManX, who has contributed to making AudioX accessible within the ComfyUI ecosystem.

Discover most popular workflows

Hand-picked based on what hundreds of other artists looked at.

Z-Image Turbo: Fast Image Generation in Seconds

floyoofficial

21.9k

Marketing

Photography

Production

Text2Image

Z-Image Turbo

Fast Image Generation in Seconds

Z-Image Turbo: Fast Image Generation in Seconds

Fast Image Generation in Seconds

Nano Banana 2: Fast Image Generation & Editing

floyoofficial

4.6k

API

gemini flash image

Image2Image

Text2Image

typography

The top-ranked image model on Artificial Analysis and LM Arena. 4K output, text rendering, and subject consistency across 5 characters.

Nano Banana 2: Fast Image Generation & Editing

The top-ranked image model on Artificial Analysis and LM Arena. 4K output, text rendering, and subject consistency across 5 characters.

floyoofficial

25.2k

AiVideo

API

image to video

video generation

wan 2.5

Wan 2.5: Image to Video with Audio

goshnii

10.7k

Face swap

Flux

flux 2 klein

Flux 2 Klein face swap

Flux face swap

head swap

image 2 image

image editing

Instead of using outdated or unstable techniques, this workflow was designed to take full advantage of FLUX 2 KLEIN's editing capabilities—using a face image and a reference character image to produce clean, highly consistent results.

Flux 2 Klein 9b - Perfect Face swap

floyoofficial

4.7k

API

Image to Video

LTX2.3

LTX 2.3

LTX 2.3 Pro Image to Video

LTX 2.3

Author

Yuan-ManX