RLHF-PPO-PPOModel-LLama3-1B-v1.0 / generation_config.json

Commit History

End of training
b60abfe
verified

bikalnetomi committed on