ppo-LunarLander-v2 / baseline_1k

Commit History

Baseline of PPO @ 512k iterations
10b1b7d

lysukhin committed on