ppo-LunarLander-v2 / results.json
pmgautam's picture
model commit from HF RL course
810f81f
{"mean_reward": 279.9190402617945, "std_reward": 12.535997686125004, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2023-03-26T06:58:18.305486"}