***

free126

AI & ML interests

None yet

Recent Activity

updated a model 28 days ago

free126/Qwen2-0.5B-GRPO-test

published a model 28 days ago

free126/Qwen2-0.5B-GRPO-test

commented on an article 29 days ago

From Zero to Reasoning Hero: How DeepSeek-R1 Leverages Reinforcement Learning to Master Complex Reasoning

Organizations

None yet

free126's activity

New activity in timpal0l/mdeberta-v3-base-squad2 about 1 year ago

This model seems to perform poorly in Chinese

#5 opened about 1 year ago by