floyo logo
Powered by
ThinkDiffusion
Pricing
Wan 2.7 is now live. Check it out 👉🏼
floyo logo
Powered by
ThinkDiffusion
Pricing
Wan 2.7 is now live. Check it out 👉🏼
Last updated
2025-10-13

Lightweight ComfyUI wrapper designed for IndexTTS 2, this tool facilitates voice cloning and emotion control by utilizing nodes that maintain fidelity to the original inference code. It enhances the ComfyUI experience by providing intuitive audio generation capabilities with advanced customization features.

  • Supports voice cloning and emotion control through a series of specialized nodes.
  • Features an inline audio player for immediate playback and preview of generated audio.
  • Offers advanced configuration options for audio parameters, including sampling rate and output gain control.

Context

This tool serves as a streamlined interface for IndexTTS 2 within the ComfyUI framework, allowing users to leverage voice synthesis and emotional modulation effectively. By invoking the original inference code, it ensures that the functionality mirrors that of the foundational repository while enhancing usability.

Key Features & Benefits

The ComfyUI-IndexTTS2 wrapper includes several practical nodes: the IndexTTS2 Simple node for basic audio generation, the IndexTTS2 Advanced node for more granular control over speech parameters, and the IndexTTS2 Emotion Vector node for nuanced emotional expression. Additionally, the IndexTTS2 Save Audio node simplifies the process of saving generated audio files, complete with an integrated player for quick previews.

Advanced Functionalities

Advanced capabilities include the ability to manipulate various audio parameters such as sampling rate, speech speed, and emotional expression through sliders. The IndexTTS2 Emotion From Text node allows users to convert text into an emotion vector, enhancing the expressiveness of the generated audio.

Practical Benefits

This tool significantly enhances workflow efficiency by combining multiple audio generation features into a cohesive interface. Users can easily generate, preview, and save audio without needing separate nodes for each task, thus streamlining the creative process and improving overall control over audio outputs.

Credits/Acknowledgments

The original development of IndexTTS 2 is credited to the contributors at the IndexTTS GitHub repository. The ComfyUI-IndexTTS2 wrapper is a community-driven extension that builds upon their work, adhering to open-source principles.

Inner Nodes

IndexTTS2Advanced
IndexTTS2EmotionFromText
IndexTTS2EmotionVector
IndexTTS2SaveAudio
IndexTTS2Simple