LunarLander-PPO / results.json
ashrek's picture
third upload of PPO for lunar lander with 1500000 timesteps training
b9bb177
raw
history blame
164 Bytes
{"mean_reward": 276.3210535463337, "std_reward": 13.291447802263376, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2023-01-15T18:17:19.001869"}