floyo logo
Powered by
ThinkDiffusion
Webinar: Qwen 2511 for Multi Angle & Relighting w Sebastian Kamph. Sign up now 👉🏽

ComfyUI-SAM3

415

Last updated
2026-02-09

ComfyUI-SAM3 integrates Meta's Segment Anything Model 3 (SAM3) into ComfyUI for open-vocabulary segmentation of images and videos. Users describe the objects they want in natural language, and the model segments them, with no fixed list of classes to choose from.

  • Enables text-based segmentation of both images and videos, so objects are identified by description rather than by predefined category.
  • Provides interactive tools for collecting point and bounding-box prompts.
  • Supports GPU acceleration for video tracking, which can reduce tracking time.

Context

ComfyUI-SAM3 bridges ComfyUI and Meta's SAM3, a model that segments images and videos from user-supplied text prompts. Because objects are described in natural language instead of being picked from a predefined set, the same workflow can target a wider range of objects.

Key Features & Benefits

The LoadSAM3Model node loads the SAM3 model, and the SAM3Segmentation node segments the objects named in a text prompt. The interactive SAM3PointCollector and SAM3BBoxCollector nodes let users gather point and bounding-box prompts, which helps in producing precise segmentation masks.
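As a rough illustration of how these nodes might be wired together, the sketch below builds a graph in ComfyUI's API (JSON) workflow format, where each node has a class_type and an inputs mapping, and a link is written as [source_node_id, output_index]. The class names LoadImage, LoadSAM3Model, and SAM3Segmentation come from ComfyUI and from the node list on this page. The input names sam3_model, image, and text_prompt, the output indices, and the file name are assumptions made for illustration and may not match the actual node definitions.

```python
import json


def build_sam3_workflow(image_name: str, text_prompt: str) -> dict:
    """Build an API-format graph: LoadImage and LoadSAM3Model feed SAM3Segmentation."""
    return {
        # Core ComfyUI node that reads an image from the input folder.
        "1": {"class_type": "LoadImage", "inputs": {"image": image_name}},
        # Model loader; its real inputs (if any) are not documented here, so none are set.
        "2": {"class_type": "LoadSAM3Model", "inputs": {}},
        "3": {
            "class_type": "SAM3Segmentation",
            "inputs": {
                # Hypothetical input names; check the node in ComfyUI for the real ones.
                "sam3_model": ["2", 0],  # link: output 0 of node "2"
                "image": ["1", 0],       # link: output 0 of node "1"
                "text_prompt": text_prompt,
            },
        },
    }


workflow = build_sam3_workflow("example.png", "red car")
print(json.dumps(workflow, indent=2))
```

Keeping the graph as a plain dictionary makes it easy to change the text prompt per image before submitting the workflow to a ComfyUI server.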

Advanced Functionalities

ComfyUI-SAM3 supports video tracking through nodes such as SAM3VideoModelLoader and SAM3InitVideoSession, which let users track objects across video frames. GPU acceleration for video processing can significantly reduce tracking times, which matters for long or high-resolution clips. Nodes such as SAM3CombineBoxes and SAM3CombinePoints merge multiple prompts into one, giving users more flexibility when defining complex segmentation tasks.
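To make the idea of combining prompts concrete, here is a minimal conceptual sketch of merging several groups of point prompts into one list. It is not the implementation of SAM3CombinePoints: the PointPrompt type, the combine_points function, and the choice to drop exact duplicates are all hypothetical, chosen only to show what merging positive (include) and negative (exclude) points could look like.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PointPrompt:
    """Hypothetical point prompt: pixel coordinates plus an include/exclude flag."""
    x: float
    y: float
    positive: bool  # True = part of the object, False = background to exclude


def combine_points(*groups):
    """Concatenate groups of points in order, keeping the first copy of any duplicate."""
    seen = set()
    merged = []
    for group in groups:
        for point in group:
            if point not in seen:  # frozen dataclasses are hashable
                seen.add(point)
                merged.append(point)
    return merged


first = [PointPrompt(120, 80, True)]
second = [PointPrompt(120, 80, True), PointPrompt(300, 200, False)]
combined = combine_points(first, second)
print(combined)
```

In this example the duplicated positive point is kept once, so the combined prompt holds one positive and one negative point.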

Practical Benefits

Integrating SAM3 into ComfyUI gives users more control over segmentation inside their existing workflows. Natural-language prompts simplify interaction, while the interactive prompt collectors and video tracking nodes help speed up image and video processing.

Credits/Acknowledgments

  • SAM3: Developed by Meta AI Research (https://github.com/facebookresearch/sam3).
  • ComfyUI Integration: Created by ComfyUI-SAM3 contributors.
  • Interactive Points Editor: Adapted from ComfyUI-KJNodes by kijai (Apache 2.0 License). The SAM3PointsEditor node is based on the PointsEditor implementation from KJNodes and tailored for SAM3-specific segmentation needs.

Inner Nodes

LoadSAM3Model, SAM3BBoxCollector, SAM3CombineBoxes, SAM3CombinePoints, SAM3CreateBox, SAM3CreatePoint, SAM3Grounding, SAM3MultiRegionCollector, SAM3MultipromptSegmentation, SAM3PointCollector, SAM3Propagate, SAM3Segmentation, SAM3VideoOutput, SAM3VideoSegmentation