rasdani
/

qwen2-math-1_5b-step-dpo

Text Generation

alignment-handbook

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

qwen2-math-1_5b-step-dpo / vocab.json

rasdani's picture

Training in progress, step 400

0679085 verified 4 months ago

2.78 MB

File too large to display, you can check the raw version instead.