Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
jiachenjiang
/
Qwen2-0.5B-GRPO-test
like
0
Transformers
TensorBoard
Safetensors
AI-MO/NuminaMath-TIR
Generated from Trainer
trl
grpo
Inference Endpoints
arxiv:
2402.03300
Model card
Files
Files and versions
Metrics
Training metrics
Community
Train
Deploy
Use this model
main
Qwen2-0.5B-GRPO-test
Commit History
End of training
c0a44ee
verified
jiachenjiang
commited on
15 days ago
Model save
c02b738
verified
jiachenjiang
commited on
15 days ago
Training in progress, step 226
3a94c24
verified
jiachenjiang
commited on
15 days ago
Training in progress, step 220
3375fcd
verified
jiachenjiang
commited on
15 days ago
Training in progress, step 210
beb8d66
verified
jiachenjiang
commited on
15 days ago
Training in progress, step 200
e512da9
verified
jiachenjiang
commited on
15 days ago
Training in progress, step 190
a764fbe
verified
jiachenjiang
commited on
15 days ago
Training in progress, step 180
cf1c9f4
verified
jiachenjiang
commited on
15 days ago
Training in progress, step 170
f759ed4
verified
jiachenjiang
commited on
15 days ago
Training in progress, step 160
69bf5c4
verified
jiachenjiang
commited on
15 days ago
Training in progress, step 150
85e31ef
verified
jiachenjiang
commited on
15 days ago
Training in progress, step 140
d47ce44
verified
jiachenjiang
commited on
15 days ago
Training in progress, step 130
c33ab9c
verified
jiachenjiang
commited on
15 days ago
Training in progress, step 120
ded0891
verified
jiachenjiang
commited on
15 days ago
Training in progress, step 110
40c2989
verified
jiachenjiang
commited on
15 days ago
Training in progress, step 100
fea940c
verified
jiachenjiang
commited on
15 days ago
Training in progress, step 90
cf39264
verified
jiachenjiang
commited on
15 days ago
Training in progress, step 80
4ef4e7f
verified
jiachenjiang
commited on
15 days ago
Training in progress, step 70
21a317e
verified
jiachenjiang
commited on
15 days ago
Training in progress, step 60
340c14d
verified
jiachenjiang
commited on
15 days ago
Training in progress, step 50
206e0a6
verified
jiachenjiang
commited on
15 days ago
Training in progress, step 40
6739801
verified
jiachenjiang
commited on
15 days ago
Training in progress, step 30
f03fc95
verified
jiachenjiang
commited on
15 days ago
Training in progress, step 20
fa87ed7
verified
jiachenjiang
commited on
15 days ago
Training in progress, step 10
d921903
verified
jiachenjiang
commited on
15 days ago
initial commit
7820bb5
verified
jiachenjiang
commited on
15 days ago