Segment Anything 2 for Creating Video Mask
Create a video mask frame by frame using Segment Anything 2
SAM2
Segment Anything 2
video2video
Video Mask
Segment Anything 2 (SAM 2) can generate a temporally consistent mask for an object across an entire video, starting from just a few clicks or a box on one frame.
Overview
SAM 2 is a promptable segmentation model for both images and videos: you give a point, box, or initial mask on a target object, and the model tracks and segments that object through the video using an internal memory mechanism. The memory encoder and mask decoder work frame‑by‑frame, using past predictions as context so the mask stays locked to the same object even as it moves, deforms, or is briefly occluded.
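The frame-by-frame, memory-conditioned data flow described above can be pictured with a deliberately tiny sketch. The `track_object` function below is a toy placeholder, not SAM 2's actual encoder/decoder; only the shape of the loop — a rolling bank of past masks feeding each new prediction — mirrors the mechanism:

```python
from collections import deque

def track_object(frames, init_mask, memory_size=7):
    """Toy memory-conditioned tracker (illustrative only, NOT SAM 2's
    real model). Each frame's mask prediction is conditioned on a
    rolling memory bank of recent masks, mirroring how SAM 2 reuses
    past predictions as context for the current frame."""
    memory = deque([init_mask], maxlen=memory_size)  # bank of recent masks
    masks = [init_mask]
    for frame in frames[1:]:
        # Placeholder "model": carry the latest mask forward. The real
        # model would encode `frame` and attend over `memory` to produce
        # a mask that follows the object as it moves or deforms.
        pred = memory[-1]
        memory.append(pred)
        masks.append(pred)
    return masks
```

The bounded memory (here `memory_size=7`) is what lets the real model re-acquire an object after a brief occlusion without reprocessing the whole clip.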
Why use SAM 2 for video masks
It can track objects over time with much less manual keyframing than classic rotoscoping.
It accepts flexible prompts (point, box, or mask), so you can start from whatever annotation is easiest in your tool.
Its memory bank makes masks more stable in real‑world footage (camera shake, motion blur, partial occlusions) than single‑frame segmentation models.
Typical mask‑creation flow
Provide a prompt for the object on one frame (commonly the first frame):
A click on the object,
A bounding box around it, or
A rough mask from another tool.
Run the video predictor to propagate that prompt across all frames, which yields logits or masks per frame plus stable object IDs.
Threshold the mask logits to binary masks and export them (for example as per‑frame PNG alpha or as a mask video) for compositing, background removal, or targeted effects.
That gives you a clean, frame‑aligned video mask that can be used downstream for things like background replacement, localized stylization, or feeding into other I2V/V2V models.
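The threshold-and-export step above can be sketched with plain NumPy. The function consumes a SAM 2-style prediction stream yielding `(frame_idx, object_ids, mask_logits)` tuples, the shape emitted by the reference repo's `propagate_in_video` generator (that API name and the `(num_objs, 1, H, W)` logit layout are assumptions from the `sam2` repository; verify against your installed version):

```python
import numpy as np

def binarize_masks(prediction_stream, threshold=0.0):
    """Turn a stream of (frame_idx, object_ids, mask_logits) tuples,
    as yielded by a SAM 2-style video predictor, into binary masks.

    Returns {frame_idx: {obj_id: boolean mask of shape (H, W)}}.
    """
    per_frame = {}
    for frame_idx, obj_ids, mask_logits in prediction_stream:
        logits = np.asarray(mask_logits)   # assumed (num_objs, 1, H, W)
        masks = logits > threshold         # logits > 0 ~ probability > 0.5
        per_frame[frame_idx] = {
            obj_id: masks[i, 0]
            for i, obj_id in enumerate(obj_ids)
        }
    return per_frame
```

Each boolean mask can then be written out as an 8-bit alpha frame (`mask.astype(np.uint8) * 255`) with any image library, or stacked into a mask video for compositing.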
Read more
Nodes & Models
Int Input [Dream]
DownloadAndLoadSAM2Model
sam2.1_hiera_large.safetensors
Sam2Segmentation
DownloadAndLoadSAM2Model
sam2.1_hiera_large.safetensors
Sam2Segmentation
WorkflowGraphics
PointsEditor
ResizeMask
MaskToImage
VHS_LoadVideo
VHS_VideoInfo
VHS_VideoCombine
VHS_LoadVideo
VHS_VideoInfo
VHS_VideoCombine
Image Blank
ImageResize+
ImageResize+
easy imageSwitch