Computer Vision - a Norm Collection

Norm 's Collections

VAE

Image / Video Gen

Multimodal Language Model

Fundamental Research

Computer Vision

Computer Vision

updated Oct 8, 2024

Do we still need a network for specific computer vision tasks anymore today?

SAM 2: Segment Anything in Images and Videos

Paper • 2408.00714 • Published Aug 1, 2024 • 112

Note 1. Process video frames one at a time, equipped with a memory attention module to attend to the previous memories of the target object.
facebook/sam2.1-hiera-large

Mask Generation • Updated Sep 24, 2024 • 320k • 72