Gyanateet Dutta

Ryukijano

https://ryukijano.github.io

AI & ML interests

Computer Graphics, General Artificial Intelligence,model merging, massive ASR for data collection, 3D ML, on-device ML, quantization, model judging, ML in browser, healthcare applications, education, intersection of art and ML.

Recent Activity

liked a Space 5 days ago

THUDM/CogVideoX-5B-Space

updated a Space 7 days ago

Ryukijano/TimeForge

liked a Space 7 days ago

merve/vision_papers

View all activity

Organizations

Ryukijano's activity

upvoted a collection 27 days ago

VILA: On Pre-training for Visual Language Models

Collection

10 items • Updated Oct 31, 2024 • 47

upvoted an article about 1 month ago

Article

LLaVA-o1: Let Vision Language Models Reason Step-by-Step

•

Nov 19, 2024

• 11

upvoted a paper about 1 month ago

LLaVA-o1: Let Vision Language Models Reason Step-by-Step

Paper • 2411.10440 • Published Nov 15, 2024 • 111

upvoted a paper about 2 months ago

Grounding Image Matching in 3D with MASt3R

Paper • 2406.09756 • Published Jun 14, 2024 • 1

upvoted an article about 2 months ago

Article

How to run Gemini Nano locally in your browser

•

Jul 11, 2024

• 43

upvoted 3 collections 2 months ago

upvoted an article 2 months ago

Article

Advanced Flux Dreambooth LoRA Training with 🧨 diffusers

•

Oct 21, 2024

• 32

upvoted 2 papers 3 months ago

Cavia: Camera-controllable Multi-view Video Diffusion with View-Integrated Attention

Paper • 2410.10774 • Published Oct 14, 2024 • 25

MonoFormer: One Transformer for Both Diffusion and Autoregression

Paper • 2409.16280 • Published Sep 24, 2024 • 17

upvoted a collection 3 months ago

Llama 3.2

Collection

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated 28 days ago • 551

upvoted 3 collections 4 months ago

3D

Collection

Stability AI's suite of models for 3D generation • 5 items • Updated Aug 9, 2024 • 33

SAM2

Collection

All the models and demos for SAM2 • 8 items • Updated Aug 2, 2024 • 13

NVEagle

Collection

4 items • Updated Aug 29, 2024 • 12

upvoted an article 4 months ago

Article

Scaling robotics datasets with video encoding

Aug 27, 2024

• 34

upvoted a paper 5 months ago

Tora: Trajectory-oriented Diffusion Transformer for Video Generation

Paper • 2407.21705 • Published Jul 31, 2024 • 27

upvoted a collection 5 months ago

Llama 3.1

Collection

This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated 28 days ago • 637

upvoted an article 6 months ago

Article

SmolLM - blazingly fast and remarkably powerful

Jul 16, 2024

• 294

upvoted a collection 6 months ago

DCLM

Collection

DCLM Models + Datasets • 7 items • Updated Jul 22, 2024 • 42