Rintaro Enomoto

ununtrium

AI & ML interests

None yet

Recent Activity

updated a model 22 days ago

ununtrium/Qwen2.5-1.5B-Open-R1-GRPO-2rewards

published a model 22 days ago

ununtrium/Qwen2.5-1.5B-Open-R1-GRPO-2rewards

upvoted an article 23 days ago

Open-R1: a fully open reproduction of DeepSeek-R1

Organizations

None yet

Papers 1

arxiv:2407.03963

Models 6

ununtrium/Qwen2.5-1.5B-Open-R1-GRPO-2rewards

Text Generation • Updated 22 days ago • 3

ununtrium/Qwen2.5-1.5B-Open-R1-GRPO

Text Generation • Updated 30 days ago • 10

ununtrium/Qwen2.5-1.5B-Instruct-Open-R1-GRPO-gsm8k2

Text Generation • Updated Feb 9 • 8

ununtrium/Qwen2.5-1.5B-Instruct-Open-R1-GRPO-gsm8k

Text Generation • Updated Feb 8 • 8

ununtrium/Llama-3.2-1B-Instruct-Open-R1-GRPO-gsm8k

ununtrium/Llama-3.2-1B-Instruct-Open-R1-GRPO-1k

Datasets

None public yet