floyo logo
Powered by
ThinkDiffusion
floyo logo
Powered by
ThinkDiffusion
Sound & Voice hero

SOUND & VOICE

Build repeatable audio pipelines for voice, sound design, and post.

1. Music and SFX Pipelines

Workflows for creating, enhancing, and shaping with audio

MMAudio: Video to Synced Audio

MMaudio

Video to Video

Generate synchronized audio with a given video input. It can be combined with video models to get videos with audio.

MMAudio: Video to Synced Audio

Generate synchronized audio with a given video input. It can be combined with video models to get videos with audio.

LTX2 Image + Sound to VIDEO

aiandpixels

image to video

LTX2

sound image to video

Image + Sound to VIDEO

LTX2 Image + Sound to VIDEO

Image + Sound to VIDEO

 HunyuanVideo Foley: Create a Lifelike Sound

HunyuanVideo Foley

Video2Video

HunyuanVideo Foley: Create a Lifelike Sound

LTX2 Sound to Video

aiandpixels

Ltx2

ltx2video

sound to video

video

Sound to Video

LTX2 Sound to Video

Sound to Video

InfiniteTalk | Image to Video: Unlimited Talking Avatar with Lip-sync
mdmz

mdmz

944

ai avatar

image to video

infinite talk

Infinitetalk

lip-sync

InfiniteTalk | Image to Video: Unlimited Talking Avatar with Lip-sync

2. Script to Voice

Turn scripts into clean voice tracks fast. Ideal for narration, explainer videos, product walkthroughs, localization drafts, and rapid iteration before final VO.

VibeVoice Text to Speech Single Speaker

TTS

VibeVoice

VibeVoice Text to Speech Single Speaker

ElevenLabs Text to Speech

API

ElevenLabs

Floyo API

TTS

ElevenLabs Text to Speech

ElevenLabs Text to Speech

ElevenLabs Text to Speech

VibeVoice Text to Speech Multi Speaker

Multi Speaker

TTS

VibeVoice

Speech Multi Speaker

VibeVoice Text to Speech Multi Speaker

Speech Multi Speaker

Chatterbox Text to Speech

Chatterbox

TTS

Text to speech workflow using Chatterbox

Chatterbox Text to Speech

Text to speech workflow using Chatterbox

Multi Model for Voice Convesion and Text to Speech

ChatterBox

Higgs

Text to Speech

TTS

VibeVoice

A workflow of TTS Audio Suite which can to use different type of audio models.

Multi Model for Voice Convesion and Text to Speech

A workflow of TTS Audio Suite which can to use different type of audio models.

Minimax Speech 2.8 HD for Text to Speech

Minimax

Minimax Speech 2.8 HD

TTS

Create realistic speech using Minimax speech 2.8

Minimax Speech 2.8 HD for Text to Speech

Create realistic speech using Minimax speech 2.8

Table of Contents
OVERVIEW

Build repeatable audio pipelines for voice, sound design, and post.