test_grpo_2 / adapter_model.safetensors

Commit History

Trained with Unsloth
856a55c
verified

Erland committed on