ppo-LunarLander-v2 / results.json
draziert's picture
Upload PPO-MlpPolicy trained model
c047c63
raw
history blame contribute delete
165 Bytes
{"mean_reward": 267.07873570000004, "std_reward": 20.299278150506513, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2023-06-11T07:59:54.846863"}