Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
1
Rintaro Enomoto
ununtrium
Follow
AI & ML interests
None yet
Recent Activity
updated
a model
22 days ago
ununtrium/Qwen2.5-1.5B-Open-R1-GRPO-2rewards
published
a model
22 days ago
ununtrium/Qwen2.5-1.5B-Open-R1-GRPO-2rewards
upvoted
an
article
23 days ago
Open-R1: a fully open reproduction of DeepSeek-R1
View all activity
Organizations
None yet
Papers
1
arxiv:
2407.03963
models
6
Sort: Recently updated
ununtrium/Qwen2.5-1.5B-Open-R1-GRPO-2rewards
Text Generation
•
Updated
22 days ago
•
3
ununtrium/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
•
Updated
30 days ago
•
10
ununtrium/Qwen2.5-1.5B-Instruct-Open-R1-GRPO-gsm8k2
Text Generation
•
Updated
Feb 9
•
8
ununtrium/Qwen2.5-1.5B-Instruct-Open-R1-GRPO-gsm8k
Text Generation
•
Updated
Feb 8
•
8
ununtrium/Llama-3.2-1B-Instruct-Open-R1-GRPO-gsm8k
Updated
Feb 8
ununtrium/Llama-3.2-1B-Instruct-Open-R1-GRPO-1k
Updated
Feb 4
datasets
None public yet