PPO-LunarLander-v2 / results.json
castejon777
1st try of training PPO in LunarLander
e6dc5e9
{"mean_reward": 263.77789763061935, "std_reward": 16.217427539937812, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2023-03-05T20:02:08.297863"}
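The fields in results.json can be read programmatically. The following is a minimal sketch, not the code that produced this file: it parses the record shown above and derives a conservative score as mean_reward minus std_reward. That summary statistic and the helper name `summarize` are assumptions made for illustration and are not stated anywhere in the file.

```python
import json
from datetime import datetime

# The exact contents of results.json as shown above.
RESULTS_JSON = (
    '{"mean_reward": 263.77789763061935, "std_reward": 16.217427539937812, '
    '"is_deterministic": true, "n_eval_episodes": 10, '
    '"eval_datetime": "2023-03-05T20:02:08.297863"}'
)


def summarize(raw: str) -> dict:
    """Parse an evaluation record and add a derived lower-bound score.

    Hypothetical helper: "lower_bound" (mean minus one standard deviation)
    is an assumed summary, not a field defined by the file itself.
    """
    data = json.loads(raw)
    return {
        "lower_bound": data["mean_reward"] - data["std_reward"],
        "evaluated_at": datetime.fromisoformat(data["eval_datetime"]),
        "n_eval_episodes": data["n_eval_episodes"],
        "is_deterministic": data["is_deterministic"],
    }


summary = summarize(RESULTS_JSON)
print(f"lower bound over {summary['n_eval_episodes']} episodes: "
      f"{summary['lower_bound']:.2f}")
```

With only 10 evaluation episodes, the standard deviation is itself a rough estimate, so the derived score is best treated as indicative.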