API

Pricing

Workflows

API

Pricing

Kling AI Avatar V2 Pro - Photo to Talking Video

Turn a photo into a talking video with Kling Avatar V2 Pro.

animation

character design

image to video

kling

Lipsync

video generation

284

_MConverter.eu_BdSf0HJmqs_pty_L9Ls0L_output_1775556674464.webp

Generates in about -- secs

nikhil07

Nodes & Models

Floyo Partner Nodes

KlingAIAvatarV2Pro_floyo

Ver Private

Comm Use

VideoToFrames

ComfyUI Official

WorkflowGraphics

LoadAudio

LoadImage

ComfyUI-VideoHelperSuite

VHS_VideoCombine

ComfyUI-S3-IO

VHS_VideoCombine

Kling AI Avatar V2 Pro turns a single photo into a talking video where the person in the image speaks the words from your audio track.

Upload a portrait and an audio file. Add a short description of who is in the photo. Kling Avatar V2 Pro animates the face with realistic mouth movements, expressions, and head motion that match the speech in your audio.

One photo, one audio file. One talking avatar video out.

How do you use Kling AI Avatar V2 Pro?

Upload a portrait photo and an audio file. Write a short prompt describing who is in the image. Kling Avatar V2 Pro generates a video where the person appears to speak the words in your audio, with natural facial movement and expressions.

Here is the setup, step by step:

Step 1: Upload your photo Upload a clear portrait of the person you want to animate. A front-facing shot with good lighting and a visible face works best. The cleaner and more detailed the photo, the more realistic the output. Blurry images, heavy shadows, or side profiles will reduce quality.

Step 2: Upload your audio Upload the speech track you want the avatar to say. This can be a voiceover, a recording, or any speech audio file. The model reads the audio and drives the mouth and face movements to match it.

Step 3: Write a prompt Add a short description of the person and what they are doing. One line is enough: "man teaching mathematics" or "woman giving a presentation." This helps the model generate the right expression and tone for the video.

Step 4: Run Hit run. Kling Avatar V2 Pro generates a video where the person in your photo appears to speak the words from your audio file, with realistic facial animation throughout.

What is Kling AI Avatar V2 Pro good for?

Creating spokesperson videos from a single photo, producing talking head content without a camera, generating avatar videos for presentations or courses, and animating portraits for social content.

The most direct use is spokesperson video. If you have a photo of a person and a script recorded as audio, you can generate a full talking head video without any filming. The output looks like a real person speaking directly to camera.

It also works well for content creators who want to generate avatar-style videos at scale. Upload one portrait, swap in different audio tracks, and generate multiple talking videos from the same face.

V2 Pro is the higher-quality tier with sharper facial detail, more natural expression, and better motion coherence than the standard Avatar model. Use it when the output is going in front of an audience.

Discover more workflows

You might like these too.

floyoofficial

319

animation

character design

image to video

kling

video generation

Apply motion from a reference video to a still image with Kling 3.0 Pro.

Kling 3.0 Pro Motion Control

Apply motion from a reference video to a still image with Kling 3.0 Pro.

nikhil07

310

animation

image to video

LatentSync

Lipsync

vfx

video generation

Make anyone in a video match your audio with LatentSync

LatentSync - Lip Sync Video from Audio

Make anyone in a video match your audio with LatentSync

floyoofficial

25.2k

AiVideo

API

image to video

video generation

wan 2.5

Wan 2.5: Image to Video with Audio

Wan 2.1 FusionX: Cinematic Image to Video

floyoofficial

4.6k

FusionX

Image to Video

Video Generation

Wan

Created by @vrgamedevgirl on Civitai, please support the original creator!

Wan 2.1 FusionX: Cinematic Image to Video

Created by @vrgamedevgirl on Civitai, please support the original creator!

Z-Image Turbo: Fast Image Generation in Seconds

floyoofficial

21.9k

Marketing

Photography

Production

Text2Image

Z-Image Turbo

Fast Image Generation in Seconds

Z-Image Turbo: Fast Image Generation in Seconds

Fast Image Generation in Seconds

floyoofficial

14.6k

VFX

Video2Video

Video Production

Wan2.6

Wan 2.6 Reference to Video

floyoofficial

14.6k

API

gemini 3 pro

Image2Image

typography

Google just released Nano Banana Pro, and honestly, it's a pretty big step up from the original Nano Banana. The main thing? It can actually put legible text in images now. Like, real text that you can read, not the garbled nonsense most AI models spit out.

Nano Banana Pro: Generate & Edit Images