DeepSeek-R1-Distill-Qwen-32B-bnb-4bit-DPO-tuned / model.safetensors.index.json

Commit History

Trained with Unsloth
7ad734a
verified

imhmdf commited on