ppo_lunar / replay.mp4

Commit History

lunar lander V0 trained for 500k, n_steps=2048, batch_size=128
0897aec

exploiter345 commited on