alexshengzhili/Qwen2.5-3B-Open-R1-Code-GRPO-r2
Text Generation
Transformers
Safetensors
open-r1/verifiable-coding-problems-python-10k_decontaminated
qwen2
Generated from Trainer
open-r1
trl
grpo
conversational
text-generation-inference
Inference Endpoints
arxiv:2402.03300
main
Commit History
Training in progress, step 850
35ae8e3
verified
alexshengzhili
committed on
4 minutes ago
Training in progress, step 800
801aee7
verified
alexshengzhili
committed on
12 minutes ago
Training in progress, step 750
7e3ebc6
verified
alexshengzhili
committed on
19 minutes ago
Training in progress, step 700
9cac50a
verified
alexshengzhili
committed on
26 minutes ago
Training in progress, step 650
2237e36
verified
alexshengzhili
committed on
33 minutes ago
Training in progress, step 600
819474a
verified
alexshengzhili
committed on
41 minutes ago
Training in progress, step 550
56bac6b
verified
alexshengzhili
committed on
about 1 hour ago
End of training
34d35b9
verified
alexshengzhili
committed on
about 3 hours ago
Model save
5c8106f
verified
alexshengzhili
committed on
about 3 hours ago
Training in progress, step 502
452be8e
verified
alexshengzhili
committed on
about 3 hours ago
End of training
fa0d38b
verified
alexshengzhili
committed on
about 17 hours ago
Model save
3731129
verified
alexshengzhili
committed on
about 17 hours ago
Training in progress, step 501
8505ff7
verified
alexshengzhili
committed on
about 17 hours ago
End of training
ccff262
verified
alexshengzhili
committed on
about 19 hours ago
Model save
bdd0804
verified
alexshengzhili
committed on
about 19 hours ago
Training in progress, step 500
4e6e67f
verified
alexshengzhili
committed on
about 19 hours ago
Training in progress, step 450
421b716
verified
alexshengzhili
committed on
about 20 hours ago
Training in progress, step 400
61287b6
verified
alexshengzhili
committed on
about 20 hours ago
Training in progress, step 350
d78978d
verified
alexshengzhili
committed on
about 21 hours ago
Training in progress, step 300
8b752f2
verified
alexshengzhili
committed on
about 21 hours ago
Training in progress, step 250
8fad07c
verified
alexshengzhili
committed on
about 22 hours ago
Training in progress, step 200
a509c55
verified
alexshengzhili
committed on
about 22 hours ago
Training in progress, step 150
516aaa9
verified
alexshengzhili
committed on
about 23 hours ago
Training in progress, step 100
82a2c07
verified
alexshengzhili
committed on
about 23 hours ago
Training in progress, step 50
0272d84
verified
alexshengzhili
committed on
about 24 hours ago
initial commit
d7a6eca
verified
alexshengzhili
committed on
1 day ago