floyo logo
Powered by
ThinkDiffusion
Pricing
🔥 Seedance 2.0 is here! Create now 👉🏼

VibeVoice 1.5B

TTS, 1.5B, Microsoft


Nodes & Models

LoadTextFromFileNode
VibeVoiceSingleSpeakerNode
Note
LoadAudio
PreviewAudio

VibeVoice ComfyUI Nodes

A comprehensive ComfyUI integration for Microsoft's VibeVoice text-to-speech model, enabling high-quality single- and multi-speaker voice synthesis directly within your ComfyUI workflows.

✨ Features

Core Functionality

  • 🎤 Single Speaker TTS: Generate natural speech with optional voice cloning

  • 👥 Multi-Speaker Conversations: Support for up to 4 distinct speakers

  • 🎯 Voice Cloning: Clone voices from audio samples

  • 🎨 LoRA Support: Fine-tune voices with custom LoRA adapters (v1.4.0+)

  • 🎚️ Voice Speed Control: Adjust speech rate by modifying reference voice speed (v1.5.0+)

  • 📝 Text File Loading: Load scripts from text files

  • 📚 Automatic Text Chunking: Handles long texts seamlessly with configurable chunk size

  • ⏸️ Custom Pause Tags: Insert silences with [pause] and [pause:ms] tags (wrapper feature)

  • 🔄 Node Chaining: Connect multiple VibeVoice nodes for complex workflows

  • ⏹️ Interruption Support: Cancel operations before or between generations

  • 🔧 Flexible Configuration: Control temperature, sampling, and guidance scale
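
The pause-tag and chunking features above can be illustrated with a small standalone sketch. This is not the wrapper's actual code: the function names, the 1000 ms default for a bare [pause], the 250-character chunk limit, and the 24 kHz sample rate are all assumptions made for illustration.

```python
import re

# Assumption: a bare [pause] means 1000 ms; the wrapper's real default may differ.
DEFAULT_PAUSE_MS = 1000

# Matches [pause] and [pause:ms], capturing the optional millisecond value.
PAUSE_TAG = re.compile(r"\[pause(?::(\d+))?\]")


def split_on_pauses(text, default_ms=DEFAULT_PAUSE_MS):
    """Split a script into ("text", str) and ("pause", milliseconds) segments."""
    segments = []
    pos = 0
    for match in PAUSE_TAG.finditer(text):
        spoken = text[pos:match.start()].strip()
        if spoken:
            segments.append(("text", spoken))
        ms = int(match.group(1)) if match.group(1) else default_ms
        segments.append(("pause", ms))
        pos = match.end()
    tail = text[pos:].strip()
    if tail:
        segments.append(("text", tail))
    return segments


def silence_samples(ms, sample_rate=24000):
    """Number of zero samples needed for a pause (sample rate is an assumption)."""
    return ms * sample_rate // 1000


def chunk_text(text, max_chars=250):
    """Greedily pack whole sentences into chunks of at most max_chars characters.

    A single sentence longer than max_chars is kept whole in its own chunk.
    """
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks, current = [], ""
    for sentence in sentences:
        if current and len(current) + 1 + len(sentence) > max_chars:
            chunks.append(current)
            current = sentence
        else:
            current = f"{current} {sentence}".strip()
    if current:
        chunks.append(current)
    return chunks
```

In a pipeline like this, each "text" segment would be chunked and synthesized separately, and each "pause" segment would be rendered as silence of the computed length before the audio is concatenated.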

Performance & Optimization

  • ⚡ Attention Mechanisms: Choose between auto, eager, sdpa, flash_attention_2, or sage

  • 🎛️ Diffusion Steps: Adjustable quality vs speed trade-off (default: 20)

  • 💾 Memory Management: Toggle automatic VRAM cleanup after generation

  • 🧹 Free Memory Node: Manual memory control for complex workflows

  • 🍎 Apple Silicon Support: Native GPU acceleration on M1/M2/M3 Macs via MPS

  • 🔢 8-Bit Quantization: Near-original audio quality with a large VRAM reduction

  • 🔢 4-Bit Quantization: Maximum VRAM savings with minimal quality loss
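
To give a rough sense of why quantization reduces VRAM, the arithmetic below estimates weight storage at different bit widths. It is a back-of-the-envelope sketch only: it assumes the nominal 1.5B parameter count from the model name, counts weights alone (no activations, caches, or framework overhead), and assumes every weight is quantized, which may not match what the nodes actually do.

```python
def weight_memory_gb(num_params, bits_per_param):
    """Approximate storage for model weights in gigabytes (1 GB = 1e9 bytes)."""
    return num_params * bits_per_param / 8 / 1e9


# Assumption: nominal parameter count taken from the "1.5B" model name.
NOMINAL_PARAMS = 1.5e9

for bits in (16, 8, 4):
    print(f"{bits}-bit weights: {weight_memory_gb(NOMINAL_PARAMS, bits):.2f} GB")
```

Under these assumptions, halving the bit width halves the weight footprint; actual savings in a running workflow will differ.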


calmconqueror • 3 months ago
Credit: https://github.com/Enemyx-net/VibeVoice-ComfyUI
