qwen_orpo_entropy_0_01 / trainer_state.json

Commit History

Model save
256623b
verified

yakazimir commited on