https://github.com/vwxyzjn/lm-human-preference-details
-
lm-human-preference-details/train_policy_accelerate__sentiment_offline_5k.json__seed1
Text Generation • Updated • 7 -
lm-human-preference-details/train_policy_accelerate_tf_adam_gpt2__sentiment_offline_5k.json__seed5
Text Generation • Updated • 7 -
lm-human-preference-details/train_policy_accelerate_tf_adam_gpt2__sentiment_offline_5k.json__seed3
Text Generation • Updated • 6 -
lm-human-preference-details/train_policy_accelerate_tf_adam_gpt2__sentiment_offline_5k.json__seed2
Text Generation • Updated • 6