Shubham Toshniwal's picture

Shubham Toshniwal

stoshniwal

·

https://shtoshni.github.io/

shtoshni

AI & ML interests

NLP, LLM

Recent Activity

new activity about 2 months ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-32B:Tokenizer config is wrong

new activity about 2 months ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-32B:Tokenizer config is wrong

liked a model 3 months ago

Qwen/Qwen2.5-Math-7B-Instruct

View all activity

Organizations

stoshniwal's activity

upvoted a paper 3 months ago

Star Attention: Efficient LLM Inference over Long Sequences

Paper • 2411.17116 • Published Nov 26, 2024 • 52

upvoted a collection 4 months ago

Qwen2.5-Math

Math-specific model series based on Qwen2.5 • 11 items • Updated Jan 14 • 77

upvoted an article 5 months ago

Article

Fixing Gradient Accumulation

Oct 16, 2024

• 50

upvoted 2 collections 5 months ago

Llama-3.1-Nemotron-70B

SOTA models on Arena Hard and RewardBench as of 1 Oct 2024. • 6 items • Updated Jan 17 • 153

OpenMath-2

A collection of models and datasets introduced in "OpenMathInstruct-2: Accelerating AI for Math with Massive Open-Source Instruction Data" • 7 items • Updated Jan 17 • 13

upvoted 3 papers 5 months ago

OpenMathInstruct-2: Accelerating AI for Math with Massive Open-Source Instruction Data

Paper • 2410.01560 • Published Oct 2, 2024 • 4

Training Language Models on Synthetic Edit Sequences Improves Code Synthesis

Paper • 2410.02749 • Published Oct 3, 2024 • 12

HelpSteer2-Preference: Complementing Ratings with Preferences

Paper • 2410.01257 • Published Oct 2, 2024 • 23

upvoted a paper about 1 year ago

OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset

Paper • 2402.10176 • Published Feb 15, 2024 • 36

upvoted a collection about 1 year ago

OpenMath

A collection of models and datasets introduced in "OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset" • 15 items • Updated Jan 17 • 42