Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
kawchar85
/
qwen2-math-7b-step-dpo-Q4_K_M-GGUF
like
0
Transformers
GGUF
xinlai/Math-Step-DPO-10K
alignment-handbook
trl
dpo
Generated from Trainer
llama-cpp
gguf-my-repo
Inference Endpoints
conversational
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
qwen2-math-7b-step-dpo-Q4_K_M-GGUF
/
README.md
Commit History
Upload README.md with huggingface_hub
f7f9e6a
verified
kawchar85
commited on
Oct 15