Collections

Collections including paper arxiv:2412.04432
video LM
Collection by Jan 9
Video
Collection by Jan 4
Unified MLLM
Unified models that generate text, images, and video
Cognition
Perception and abstraction. Each modality is tokenized and embedded into vectors for the model to comprehend.
VisionLM
Collection by about 21 hours ago
video
Collection by 3 days ago
daily papers
Collection by 14 days ago