kawchar85/qwen2-math-7b-step-dpo-Q4_K_M-GGUF

alignment-handbook

Generated from Trainer

Inference Endpoints

qwen2-math-7b-step-dpo-Q4_K_M-GGUF / README.md

Commit History

Upload README.md with huggingface_hub

f7f9e6a
verified

kawchar85 committed on Oct 15