phi4-14b-grpo-reasoning-merged_16bit / model.safetensors.index.json

Commit History

Trained with Unsloth
41445d5
verified

adrianoamalfi commited on