Taha Ansari

Tahahah

AI & ML interests

None yet

Recent Activity

updated a model 1 day ago

Tahahah/LunarLander_HF_Deep_RL_Course

published a model 1 day ago

Tahahah/LunarLander_HF_Deep_RL_Course

updated a model 1 day ago

Tahahah/pacman-sana-3.2m-taesd-driftfix

Organizations

None yet

Tahahah's activity

upvoted a paper 19 days ago

DarwinLM: Evolutionary Structured Pruning of Large Language Models

Paper • 2502.07780 • Published 25 days ago • 17

upvoted a paper 20 days ago

CineMaster: A 3D-Aware and Controllable Framework for Cinematic Text-to-Video Generation

Paper • 2502.08639 • Published 24 days ago • 37

upvoted a paper 26 days ago

History-Guided Video Diffusion

Paper • 2502.06764 • Published 26 days ago • 12

upvoted 2 papers about 1 month ago

ACECODER: Acing Coder RL via Automated Test-Case Synthesis

Paper • 2502.01718 • Published Feb 3 • 29

SliderSpace: Decomposing the Visual Capabilities of Diffusion Models

Paper • 2502.01639 • Published Feb 3 • 25

upvoted an article about 1 month ago

Open-source DeepResearch – Freeing our search agents

Article • Feb 4 • 1.14k

upvoted 5 papers about 1 month ago

DeepFlow: Serverless Large Language Model Serving at Scale

Paper • 2501.14417 • Published Jan 24 • 3

DiffSplat: Repurposing Image Diffusion Models for Scalable Gaussian Splat Generation

Paper • 2501.16764 • Published Jan 28 • 22

DINO-WM: World Models on Pre-trained Visual Features enable Zero-shot Planning

Paper • 2411.04983 • Published Nov 7, 2024 • 12

s1: Simple test-time scaling

Paper • 2501.19393 • Published Jan 31 • 109

Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs

Paper • 2501.18585 • Published Jan 30 • 56

upvoted a paper 3 months ago

DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation

Paper • 2412.07589 • Published Dec 10, 2024 • 46

upvoted a collection 4 months ago

VILA-U-7B

VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation • 2 items • Updated Jan 13 • 5

upvoted a paper about 1 year ago

Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision, Language, Audio, and Action

Paper • 2312.17172 • Published Dec 28, 2023 • 28