Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
qgallouedec
/
Qwen2-0.5B-Reward-Math-Sheperd-KN-fix-cast
like
0
Token Classification
Transformers
TensorBoard
Safetensors
trl-lib/math_shepherd
qwen2
Generated from Trainer
trl
stepwise-reward-trainer
text-generation-inference
Inference Endpoints
arxiv:
2211.14275
Model card
Files
Files and versions
Metrics
Training metrics
Community
Train
Deploy
Use this model
main
Qwen2-0.5B-Reward-Math-Sheperd-KN-fix-cast
/
model.safetensors
Commit History
Training in progress, step 6601
54b3dd5
verified
qgallouedec
HF staff
commited on
14 days ago
Training in progress, step 6500
fd8b207
verified
qgallouedec
HF staff
commited on
14 days ago
Training in progress, step 6000
c007a05
verified
qgallouedec
HF staff
commited on
14 days ago
Training in progress, step 5500
d58d543
verified
qgallouedec
HF staff
commited on
14 days ago
Training in progress, step 5000
83254a0
verified
qgallouedec
HF staff
commited on
14 days ago
Training in progress, step 4500
058acc7
verified
qgallouedec
HF staff
commited on
14 days ago
Training in progress, step 4000
e0509e2
verified
qgallouedec
HF staff
commited on
14 days ago
Training in progress, step 3500
c117903
verified
qgallouedec
HF staff
commited on
14 days ago
Training in progress, step 3000
b9db1b8
verified
qgallouedec
HF staff
commited on
14 days ago
Training in progress, step 2500
b1ee406
verified
qgallouedec
HF staff
commited on
14 days ago
Training in progress, step 2000
dc4682f
verified
qgallouedec
HF staff
commited on
14 days ago
Training in progress, step 1500
c051892
verified
qgallouedec
HF staff
commited on
14 days ago
Training in progress, step 1000
7c38f0f
verified
qgallouedec
HF staff
commited on
14 days ago
Training in progress, step 500
b276db4
verified
qgallouedec
HF staff
commited on
14 days ago