Hugging Face
Dennis
PRO
doubledsbv
13 followers · 9 following
ddickmann
AI & ML interests
None yet
Recent Activity
reacted to lewtun's post with ❤️ 5 days ago
Introducing OpenR1-Math-220k! https://huggingface.co/datasets/open-r1/OpenR1-Math-220k

The community has been busy distilling DeepSeek-R1 from inference providers, but we decided to have a go at doing it ourselves from scratch 💪

What’s new compared to existing reasoning datasets?

♾ Based on https://huggingface.co/datasets/AI-MO/NuminaMath-1.5: we focus on math reasoning traces and generate answers for problems in NuminaMath 1.5, an improved version of the popular NuminaMath-CoT dataset.
🐳 800k R1 reasoning traces: We generate two answers for 400k problems using DeepSeek R1. The filtered dataset contains 220k problems with correct reasoning traces.
📀 512 H100s running locally: Instead of relying on an API, we leverage vLLM and SGLang to run generations locally on our science cluster, generating 180k reasoning traces per day.
⏳ Automated filtering: We apply Math Verify to only retain problems with at least one correct answer. We also leverage Llama3.3-70B-Instruct as a judge to retrieve more correct examples (e.g., for cases with malformed answers that can’t be verified with a rules-based parser).
📊 We match the performance of DeepSeek-Distill-Qwen-7B by finetuning Qwen-7B-Math-Instruct on our dataset.
🔎 Read our blog post for all the nitty gritty details: https://huggingface.co/blog/open-r1/update-2
updated a model about 1 month ago: seedboxai/sparse_llama_3.1-8b-sft-distilled
Organizations
models (18)
doubledsbv/Llama-3.1-8B-2of4-NO-KL • Updated Dec 9, 2024
doubledsbv/Llama-3.1-70B-2of4-NO-KL • Updated Dec 2, 2024
doubledsbv/Llama-3.1-405B-16of64 • Updated Dec 2, 2024
doubledsbv/Llama-3.1-70B-2of4 • Updated Dec 2, 2024
doubledsbv/LLM-Router-Llama-3.1-8B-Instruct • Text Generation • Updated Sep 24, 2024 • 12
doubledsbv/Meta-Llama-3.1-405B-Instruct-FP8 • Updated Aug 15, 2024
doubledsbv/Meta-Llama-3.1-70B-Instruct-FP8 • Updated Aug 2, 2024
doubledsbv/gemma_2_2B_kafka_experimental • Text Generation • Updated Aug 2, 2024 • 98
doubledsbv/KafkaLM-Mixtral-8x7B-V0.2_DPO-AWQ • Text Generation • Updated May 28, 2024 • 9
doubledsbv/KafkaLM-8x7B-German-V0.1-AWQ • Text Generation • Updated Mar 5, 2024 • 8
datasets (4)
doubledsbv/kd_sample • Viewer • Updated Dec 4, 2024 • 10k • 46
doubledsbv/kafka_dpo_v2 • Viewer • Updated May 14, 2024 • 59.9k • 65
doubledsbv/kafka_sft_refined • Viewer • Updated May 13, 2024 • 612k • 73
doubledsbv/llama-3-dataset • Viewer • Updated Apr 19, 2024 • 550k • 43