Arunkumar Venkataramanan

ArunkumarVR

https://arunkumarramanan.github.io

AI & ML interests

AGI Research: Reasoning, Safety & Alignment (Superalignment), Generative AI (GenAI), Multi-Modal Foundation Models (FMs), Large Language Models (LLMs), Transformers & Diffusion Models, Open LLM Training, Optimization & Finetuning, Serving & Inference

Recent Activity

liked a model about 12 hours ago

google/gemma-3-4b-it

upvoted a collection about 12 hours ago

Google's Gemma models family

liked a model about 12 hours ago

google/gemma-3-27b-it

ArunkumarVR's activity

upvoted a collection about 12 hours ago

Google's Gemma models family

264 items • Updated 1 day ago • 106

upvoted an article 1 day ago

Article

Open R1: Update #3

By [...] and 9 others • 2 days ago • 192

upvoted a collection 1 day ago

Gemma 3 Release

9 items • Updated 1 day ago • 210

upvoted a collection 5 days ago

QwQ

Qwen with Questions • 6 items • Updated 7 days ago • 80

upvoted a collection 16 days ago

Model Optimizer

A collection of generative models quantized and optimized with TensorRT Model Optimizer. • 10 items • Updated 21 days ago • 9

upvoted a paper 26 days ago

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published Feb 7 • 124

upvoted 3 collections about 1 month ago

RLHFlow MATH Process Reward Model

This is a collection of datasets and models of process reward modeling. • 15 items • Updated Nov 9, 2024 • 10

Skywork-o1-Open

Skywork o1 open model collections • 3 items • Updated Nov 27, 2024 • 21

Qwen2.5-Math

Math-specific model series based on Qwen2.5 • 11 items • Updated Jan 14 • 78

upvoted an article about 1 month ago

Article

Open R1: Update #2

By [...] and 6 others • Feb 10 • 202

upvoted 2 collections about 1 month ago

LLM Reasoning Papers

Papers to improve reasoning capabilities of LLMs • 20 items • Updated Jan 15 • 120

Reasoning Datasets

Distilled synthetic Reasoning datasets • 7 items • Updated Feb 2 • 56

upvoted 2 articles about 1 month ago

Article

Open-R1: Update #1

By [...] and 7 others • Feb 2 • 295

Article

How to deploy and fine-tune DeepSeek models on AWS

Jan 30 • 51

upvoted a paper about 1 month ago

Streaming DiLoCo with overlapping communication: Towards a Distributed Free Lunch

Paper • 2501.18512 • Published Jan 30 • 27

upvoted 3 collections about 1 month ago

IndicBERT v2

IndicBERT v2 is a multilingual BERT model pretrained on IndicCorp v2, an Indic monolingual corpus of 20.9 billion tokens, covering 24 constitutionally • 4 items • Updated Oct 15, 2024 • 3

IndicLLMSuite

Largest Collections of Pretraining and Instruction Finetuning datasets for 22 Indic languages. • 4 items • Updated Nov 5, 2024 • 15

🧠 Reasoning datasets

Datasets with reasoning traces for math and code released by the community • 14 items • Updated 2 days ago • 100

upvoted an article about 1 month ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

Jan 28 • 803

upvoted a collection about 2 months ago

DeepSeek-V2.5

2 items • Updated Dec 10, 2024 • 40