trl-lib
/

Qwen2-0.5B-Reward-Math-Sheperd

Token Classification

Generated from Trainer

stepwise-reward-trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

Qwen2-0.5B-Reward-Math-Sheperd / vocab.json

qgallouedec's picture

qgallouedec HF staff

Training in progress, step 500

812ef89 verified 27 days ago

history contribute delete

2.78 MB

File too large to display, you can check the raw version instead.