117 10 14

Omkar Pangarkar

omkarenator

AI & ML interests

None yet

Recent Activity

liked a Space about 8 hours ago

nanotron/ultrascale-playbook

new activity 5 days ago

LLM360/TxT360:fix-deps

updated a Space 5 days ago

LLM360/TxT360

View all activity

Organizations

omkarenator's activity

liked a Space about 8 hours ago

371

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

New activity in LLM360/TxT360 5 days ago

fix-deps

#7 opened 5 days ago by

omkarenator

updated a Space 5 days ago

100

TxT360: Trillion Extracted Text

📖

Create a large, deduplicated dataset for LLM pre-training

New activity in LLM360/TxT360 7 days ago

code-formatting

#6 opened 7 days ago by

omkarenator

liked a Space 15 days ago

Scaling FineWeb to 1000+ languages: Step 1: finding signal in 100s of evaluation tasks

📝

Evaluate multilingual models using FineTasks

upvoted an article 23 days ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

23 days ago

• 762

New activity in LLM360/TxT360 4 months ago

Add citations and other fixes

#4 opened 4 months ago by

omkarenator

liked a dataset 4 months ago

LLM360/TxT360

Preview • Updated Nov 8, 2024 • 417k • 223

New activity in LLM360/TxT360 4 months ago

Update arxiv examples

#3 opened 4 months ago by

zhoujun

upvoted an article 4 months ago

Article

Scaling AI-based Data Processing with Hugging Face + Dask

Oct 9, 2024

• 28

updated a Space 5 months ago

Fh New

📊

updated a Space 6 months ago

FineWeb: decanting the web for the finest text data at scale

🍷

Generate high-quality web text data for LLM training

upvoted a paper 6 months ago

Power Scheduler: A Batch Size and Token Number Agnostic Learning Rate Scheduler

Paper • 2408.13359 • Published Aug 23, 2024 • 24

liked a dataset 6 months ago

Trelis/touch-rugby-rules-memorisation

Viewer • Updated Feb 28, 2024 • 363 • 40 • 2

liked a dataset 8 months ago

commoncrawl/statistics

Viewer • Updated Oct 20, 2024 • 531k • 220 • 22

upvoted 4 papers about 1 year ago

Neural Circuit Diagrams: Robust Diagrams for the Communication, Implementation, and Analysis of Deep Learning Architectures

Paper • 2402.05424 • Published Feb 8, 2024 • 16

Omkar Pangarkar

AI & ML interests

Recent Activity

Organizations

omkarenator's activity

The Ultra-Scale Playbook

fix-deps

TxT360: Trillion Extracted Text

code-formatting

Scaling FineWeb to 1000+ languages: Step 1: finding signal in 100s of evaluation tasks

Open-R1: a fully open reproduction of DeepSeek-R1

Add citations and other fixes

Update arxiv examples

Scaling AI-based Data Processing with Hugging Face + Dask

Fh New

Toc

FineWeb: decanting the web for the finest text data at scale