Dennis PRO

doubledsbv

ddickmann

Recent Activity

reacted to lewtun's post with ❤️ 5 days ago

Introducing OpenR1-Math-220k! https://huggingface.co/datasets/open-r1/OpenR1-Math-220k

The community has been busy distilling DeepSeek-R1 from inference providers, but we decided to have a go at doing it ourselves from scratch 💪

What’s new compared to existing reasoning datasets?

♾ Based on https://huggingface.co/datasets/AI-MO/NuminaMath-1.5: we focus on math reasoning traces and generate answers for problems in NuminaMath 1.5, an improved version of the popular NuminaMath-CoT dataset.

🐳 800k R1 reasoning traces: We generate two answers for 400k problems using DeepSeek R1. The filtered dataset contains 220k problems with correct reasoning traces.

📀 512 H100s running locally: Instead of relying on an API, we leverage vLLM and SGLang to run generations locally on our science cluster, generating 180k reasoning traces per day.

⏳ Automated filtering: We apply Math Verify to retain only problems with at least one correct answer. We also leverage Llama3.3-70B-Instruct as a judge to retrieve more correct examples (e.g. for cases with malformed answers that can’t be verified with a rules-based parser).

📊 We match the performance of DeepSeek-Distill-Qwen-7B by finetuning Qwen-7B-Math-Instruct on our dataset.

🔎 Read our blog post for all the nitty-gritty details: https://huggingface.co/blog/open-r1/update-2

updated a model about 1 month ago

seedboxai/sparse_llama_3.1-8b-sft-distilled

models 18

doubledsbv/Llama-3.1-8B-2of4-NO-KL

Updated Dec 9, 2024

doubledsbv/Llama-3.1-70B-2of4-NO-KL

Updated Dec 2, 2024

doubledsbv/Llama-3.1-405B-16of64

Updated Dec 2, 2024

doubledsbv/Llama-3.1-70B-2of4

Updated Dec 2, 2024

doubledsbv/LLM-Router-Llama-3.1-8B-Instruct

Text Generation • Updated Sep 24, 2024 • 12

doubledsbv/Meta-Llama-3.1-405B-Instruct-FP8

Updated Aug 15, 2024

doubledsbv/Meta-Llama-3.1-70B-Instruct-FP8

Updated Aug 2, 2024

doubledsbv/gemma_2_2B_kafka_experimental

Text Generation • Updated Aug 2, 2024 • 98

doubledsbv/KafkaLM-Mixtral-8x7B-V0.2_DPO-AWQ

Text Generation • Updated May 28, 2024 • 9

doubledsbv/KafkaLM-8x7B-German-V0.1-AWQ

Text Generation • Updated Mar 5, 2024 • 8

datasets 4

doubledsbv/kd_sample

Viewer • Updated Dec 4, 2024 • 10k • 46

doubledsbv/kafka_dpo_v2

Viewer • Updated May 14, 2024 • 59.9k • 65

doubledsbv/kafka_sft_refined

Viewer • Updated May 13, 2024 • 612k • 73

doubledsbv/llama-3-dataset

Viewer • Updated Apr 19, 2024 • 550k • 43