floyo logobeta logo
Powered by
ThinkDiffusion
floyo logobeta logo
Powered by
ThinkDiffusion

ComfyUI_Float_Animator

4

Last updated
2025-06-16

A custom node package designed for ComfyUI, the ComfyUI_Float_Animator integrates the FLOAT project to facilitate the generation of animated talking portraits from still images and audio files. This tool allows users to create synchronized video frames that reflect lip movements and emotional expressions based on audio input.

  • Generates animated video frames from a still portrait and audio, allowing for realistic lip synchronization.
  • Offers control over emotional expression through an intuitive interface, enhancing the realism of the animations.
  • Supports various output configurations, including frame rates and automatic face cropping for optimal results.

Context

The ComfyUI_Float_Animator is a specialized node within the ComfyUI framework that leverages the FLOAT model to produce animated video sequences from static images and accompanying audio. Its primary purpose is to enhance user workflows by providing a straightforward method for creating dynamic visual content that reflects spoken audio.

Key Features & Benefits

The Float_Animator node enables users to input a still portrait image and an audio file, generating a sequence of animated frames where lip movements are synchronized with the audio. Key features include adjustable parameters for emotional expression, frame rates, and guidance scales, which allow for fine-tuning of the animation quality and emotional depth. This capability is particularly beneficial for projects requiring high levels of realism in animated characters or presentations.

Advanced Functionalities

The node allows for advanced customization options, such as selecting different models for varying animation styles and adjusting classifier-free guidance scales for both audio and visual inputs. Users can specify target emotions or let the system infer emotions from the audio, providing flexibility in how the final output is perceived. The automatic cropping feature can enhance the focus on the subject's face, improving the overall quality of the animation.

Practical Benefits

By integrating the Float_Animator into their workflows, users can significantly enhance their creative projects with minimal effort. The tool streamlines the process of creating animated content, providing high-quality outputs that can be directly utilized in videos or presentations. This not only saves time but also elevates the standard of visual storytelling through precise synchronization of audio and visual elements.

Credits/Acknowledgments

The development of this tool is credited to the original authors of the FLOAT project, including Taekyung Ki, Dongchan Min, and Gyeongsu Chae. The code for the ComfyUI_Float_Animator is released under the MIT License, while the underlying FLOAT model is governed by the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International Public License. For further details, users are encouraged to refer to the repository's LICENSE.md file.