floyo logobeta logo
Powered by
ThinkDiffusion

ComfyUI-AudioX

24

Last updated
2025-06-24

ComfyUI-AudioX is an audio generation extension for ComfyUI that uses AudioX models to synthesize high-quality audio from both text and video inputs. It is designed for systems with at least 16GB of VRAM and adds advanced audio processing capabilities to creative workflows.

  • Enables text-to-audio and text-to-music generation with customizable styles, tempos, and moods.
  • Supports video-to-audio generation, and lets users mute a video's original audio and combine the video with the generated sound.
  • Offers professional audio processing features, including LUFS normalization and precise volume control.
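
As a rough illustration of what loudness normalization and decibel-based volume control involve, here is a minimal Python sketch. It is not the extension's code: the function names are hypothetical, and it measures plain RMS level rather than true LUFS, which additionally applies K-weighting and gating as defined in ITU-R BS.1770.

```python
import math


def db_to_linear(gain_db):
    """Convert a gain in decibels to a linear amplitude factor."""
    return 10.0 ** (gain_db / 20.0)


def rms_dbfs(samples):
    """Return the RMS level of float samples in dB relative to full scale (1.0).

    Simplified stand-in for a loudness meter; real LUFS metering also
    applies K-weighting and gating.
    """
    if not samples:
        raise ValueError("samples must not be empty")
    mean_square = sum(s * s for s in samples) / len(samples)
    if mean_square == 0.0:
        return float("-inf")  # digital silence
    return 10.0 * math.log10(mean_square)


def normalize_rms(samples, target_dbfs=-14.0, peak_ceiling=1.0):
    """Scale samples so their RMS level reaches target_dbfs.

    If the required gain would push the peak above peak_ceiling, the gain
    is reduced so the peak lands exactly on the ceiling instead.
    """
    level = rms_dbfs(samples)
    if level == float("-inf"):
        return list(samples)  # nothing to scale
    gain = db_to_linear(target_dbfs - level)
    peak = max(abs(s) for s in samples)
    if peak * gain > peak_ceiling:
        gain = peak_ceiling / peak
    return [s * gain for s in samples]


# Example: a 0.5-amplitude sine wave brought to -20 dBFS RMS.
tone = [0.5 * math.sin(2.0 * math.pi * k / 100.0) for k in range(1000)]
quiet_tone = normalize_rms(tone, target_dbfs=-20.0)
```

The peak check trades loudness accuracy for safety: when the target cannot be reached without clipping, the result is quieter than requested rather than distorted.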

Context

ComfyUI-AudioX is an extension that adds audio generation capabilities to the ComfyUI framework. Its primary purpose is to provide high-quality audio synthesis from text and video inputs, enabling more dynamic and versatile audio content creation.

Key Features & Benefits

The extension includes several practical features for audio generation. Users can generate audio from text descriptions with enhanced conditioning, which gives a higher degree of control over the output. Generating audio from video content offers a direct way to create soundtracks that match visual media, while the audio processing features (LUFS normalization and volume control) help keep output at a professional standard.

Advanced Functionalities

ComfyUI-AudioX includes enhanced conditioning controls that provide separate CFG (classifier-free guidance) scales and conditioning weights for text and video inputs. This granularity lets users fine-tune how strongly each input type influences the final audio output, resulting in more tailored and precise audio generation.
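
To make the idea of separate CFG scales concrete, the sketch below shows one common way to combine model predictions from two conditions with independent guidance scales. The source does not state how ComfyUI-AudioX combines them internally, so the formula, the function name `combine_guidance`, and the default scale values are all assumptions for illustration. Conditioning weights, which scale the conditioning inputs themselves, are not modeled here.

```python
def combine_guidance(uncond, text_cond, video_cond, text_cfg=7.0, video_cfg=3.0):
    """Combine three model predictions using per-condition guidance scales.

    Assumed formula (one common multi-condition form, not confirmed for
    this extension):
        out = uncond + text_cfg * (text_cond - uncond)
                     + video_cfg * (video_cond - uncond)

    Each argument is a flat list of floats of equal length, standing in
    for the model's prediction tensor at one sampling step.
    """
    if not (len(uncond) == len(text_cond) == len(video_cond)):
        raise ValueError("all predictions must have the same length")
    return [
        u + text_cfg * (t - u) + video_cfg * (v - u)
        for u, t, v in zip(uncond, text_cond, video_cond)
    ]


# Toy example: text pushes the first component, video pushes the second.
uncond = [0.0, 0.0]
text_cond = [1.0, 0.0]
video_cond = [0.0, 1.0]
guided = combine_guidance(uncond, text_cond, video_cond, text_cfg=2.0, video_cfg=3.0)
```

Under this formulation, setting a scale to zero removes that input's influence entirely, while raising it pushes the output further in the direction that input suggests.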

Practical Benefits

This tool streamlines audio workflows by providing a user-friendly interface for generating and processing audio. It enhances control over audio characteristics, improves overall audio quality, and increases efficiency in content creation, allowing users to focus on creativity rather than technical limitations.

Credits/Acknowledgments

The ComfyUI-AudioX extension is developed by the AudioX team and is supported by contributions from the ComfyUI community. The project is licensed under the MIT License, allowing for open collaboration and further development.