Workflows

API

Pricing

Workflows

API

Pricing

SOUND & VOICE

Build repeatable audio pipelines for voice, sound design, and post.

1. Music and SFX Pipelines

Workflows for creating, enhancing, and shaping with audio

floyoofficial

1.9k

MMaudio

Video to Video

Generate synchronized audio with a given video input. It can be combined with video models to get videos with audio.

MMAudio: Video to Synced Audio

Generate synchronized audio with a given video input. It can be combined with video models to get videos with audio.

aiandpixels

3.3k

aiandpixels

image to video

LTX2

sound image to video

Image + Sound to VIDEO

LTX2 Image + Sound to VIDEO

Image + Sound to VIDEO

HunyuanVideo Foley: Create a Lifelike Sound

floyoofficial

942

HunyuanVideo Foley

Video2Video

HunyuanVideo Foley: Create a Lifelike Sound

aiandpixels

671

aiandpixels

Ltx2

ltx2video

sound to video

video

Sound to Video

LTX2 Sound to Video

Sound to Video

mdmz

2.7k

ai avatar

image to video

infinite talk

Infinitetalk

lip-sync

InfiniteTalk | Image to Video: Unlimited Talking Avatar with Lip-sync

2. Script to Voice

Turn scripts into clean voice tracks fast. Ideal for narration, explainer videos, product walkthroughs, localization drafts, and rapid iteration before final VO.

VibeVoice: Single-Speaker Text to Speech

floyoofficial

995

text to speech

TTS

VibeVoice

voice cloning

VibeVoice

VibeVoice: Single-Speaker Text to Speech

VibeVoice

floyoofficial

483

API

ElevenLabs

Floyo API

TTS

ElevenLabs Text to Speech

floyoofficial

486

Multi Speaker

TTS

VibeVoice

Speech Multi Speaker

VibeVoice Text to Speech Multi Speaker

Speech Multi Speaker

floyoofficial

433

Chatterbox

TTS

Text to speech workflow using Chatterbox

Chatterbox Text to Speech

Text to speech workflow using Chatterbox

Multi Model for Voice Convesion and Text to Speech

floyoofficial

367

ChatterBox

Higgs

Text to Speech

TTS

VibeVoice

A workflow of TTS Audio Suite which can to use different type of audio models.

Multi Model for Voice Convesion and Text to Speech

A workflow of TTS Audio Suite which can to use different type of audio models.

Minimax Speech 2.8 HD for Text to Speech

floyoofficial

538

Minimax

Minimax Speech 2.8 HD

TTS

Create realistic speech using Minimax speech 2.8

Minimax Speech 2.8 HD for Text to Speech

Create realistic speech using Minimax speech 2.8

Table of Contents

OVERVIEW

Build repeatable audio pipelines for voice, sound design, and post.