free126
AI & ML interests
None yet
Recent Activity
updated a model 28 days ago: free126/Qwen2-0.5B-GRPO-test
published a model 28 days ago: free126/Qwen2-0.5B-GRPO-test
commented on an article 29 days ago: From Zero to Reasoning Hero: How DeepSeek-R1 Leverages Reinforcement Learning to Master Complex Reasoning
Organizations
None yet
free126's activity
This model seems to perform poorly in Chinese
#5 opened about 1 year ago by free126