1 59 55

Unknown Entity

unknownentity

AI & ML interests

None yet

Recent Activity

upvoted an article about 22 hours ago

Open R1: Update #3

liked a model 3 days ago

primecai/dsd_model

liked a Space 3 days ago

primecai/diffusion-self-distillation

View all activity

Organizations

None yet

unknownentity's activity

upvoted an article about 22 hours ago

Article

Open R1: Update #3

and 9 others •

1 day ago

• 163

upvoted a paper 3 days ago

R1-Omni: Explainable Omni-Multimodal Emotion Recognition with Reinforcing Learning

Paper • 2503.05379 • Published 6 days ago • 22

upvoted a collection 13 days ago

SkyReels-V1

Collection

SkyReels V1 open models collections • 2 items • Updated 24 days ago • 18

upvoted 2 papers 29 days ago

Enhance-A-Video: Better Generated Video for Free

Paper • 2502.07508 • Published 30 days ago • 21

Magic 1-For-1: Generating One Minute Video Clips within One Minute

Paper • 2502.07701 • Published 29 days ago • 34

upvoted a paper about 1 month ago

VideoJAM: Joint Appearance-Motion Representations for Enhanced Motion Generation in Video Models

Paper • 2502.02492 • Published Feb 4 • 61

upvoted 2 papers about 2 months ago

UI-TARS: Pioneering Automated GUI Interaction with Native Agents

Paper • 2501.12326 • Published Jan 21 • 54

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published Jan 14 • 275

upvoted a paper 4 months ago

Loopy: Taming Audio-Driven Portrait Avatar with Long-Term Motion Dependency

Paper • 2409.02634 • Published Sep 4, 2024 • 94

upvoted a paper 6 months ago

OmniGen: Unified Image Generation

Paper • 2409.11340 • Published Sep 17, 2024 • 112

upvoted 10 papers over 1 year ago

Mirasol3B: A Multimodal Autoregressive model for time-aligned and contextual modalities

Paper • 2311.05698 • Published Nov 9, 2023 • 14

JARVIS-1: Open-World Multi-task Agents with Memory-Augmented Multimodal Language Models

Paper • 2311.05997 • Published Nov 10, 2023 • 37

OtterHD: A High-Resolution Multi-modality Model

Paper • 2311.04219 • Published Nov 7, 2023 • 33

AnyMAL: An Efficient and Scalable Any-Modality Augmented Language Model

Paper • 2309.16058 • Published Sep 27, 2023 • 55

Vision Transformers Need Registers

Paper • 2309.16588 • Published Sep 28, 2023 • 79