phi-4-GRPO / model-00004-of-00006.safetensors

Commit History

Trained with Unsloth
2fa212a
verified

vedrano committed on