22 221

Shivam Kumar

shivamkumar

AI & ML interests

None yet

Recent Activity

liked a Space about 2 months ago

facebook/MusicGen

liked a Space about 2 months ago

facebook/MelodyFlow

liked a Space about 2 months ago

discord-community/LevelBot

shivamkumar's activity

upvoted a collection 2 months ago

Dolphin 3.0

Dolphin 3.0 is the next generation of the Dolphin series of instruct-tuned models. Designed to be the ultimate general purpose local model. • 9 items • Updated Feb 7 • 106

upvoted a paper 7 months ago

Transformer Explainer: Interactive Learning of Text-Generative Models

Paper • 2408.04619 • Published Aug 8, 2024 • 159

upvoted a collection 9 months ago

LLM Compiler

Meta LLM Compiler is a state-of-the-art LLM that builds upon Code Llama with improved performance for code optimization and compiler reasoning. • 4 items • Updated Jun 27, 2024 • 149

upvoted an article 10 months ago

Design choices for Vision Language Models in 2024

Article • Apr 16, 2024 • 27

upvoted a collection 10 months ago

DeepSeek-Coder

DeepSeek Coder series • 9 items • Updated Aug 16, 2024 • 50

upvoted an article 11 months ago

StarCoder2-Instruct: Fully Transparent and Permissive Self-Alignment for Code Generation

Article • Apr 29, 2024 • 76

upvoted a collection 12 months ago

DBRX

DBRX is a mixture-of-experts (MoE) large language model trained from scratch by Databricks. • 3 items • Updated Mar 27, 2024 • 94

upvoted a paper 12 months ago

Design2Code: How Far Are We From Automating Front-End Engineering?

Paper • 2403.03163 • Published Mar 5, 2024 • 95

upvoted a paper about 1 year ago

EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions

Paper • 2402.17485 • Published Feb 27, 2024 • 191

upvoted 2 collections about 1 year ago

OpenCodeInterpreter

18 items • Updated Mar 3, 2024 • 84

Sora Reference Papers

A collection of all papers referenced in OpenAI's "Video generation models as world simulators" technical report • openai.com/sora • 30 items • Updated Oct 3, 2024 • 52

upvoted 9 papers about 1 year ago

InstantID: Zero-shot Identity-Preserving Generation in Seconds

Paper • 2401.07519 • Published Jan 15, 2024 • 57

FlowVid: Taming Imperfect Optical Flows for Consistent Video-to-Video Synthesis

Paper • 2312.17681 • Published Dec 29, 2023 • 19

SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-Scaling

Paper • 2312.15166 • Published Dec 23, 2023 • 58

Principled Instructions Are All You Need for Questioning LLaMA-1/2, GPT-3.5/4

Paper • 2312.16171 • Published Dec 26, 2023 • 35

Parrot Captions Teach CLIP to Spot Text

Paper • 2312.14232 • Published Dec 21, 2023 • 12

Zero-Shot Metric Depth with a Field-of-View Conditioned Diffusion Model

Paper • 2312.13252 • Published Dec 20, 2023 • 28

AppAgent: Multimodal Agents as Smartphone Users

Paper • 2312.13771 • Published Dec 21, 2023 • 54

Fairy: Fast Parallelized Instruction-Guided Video-to-Video Synthesis

Paper • 2312.13834 • Published Dec 20, 2023 • 27

Amphion: An Open-Source Audio, Music and Speech Generation Toolkit

Paper • 2312.09911 • Published Dec 15, 2023 • 55