Upload PPO based LunarLander-v2 Agent trained with MLP Policy for 100M steps
563fb61
{"mean_reward": 302.9949164746263, "std_reward": 20.230862536543793, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2023-03-25T03:18:11.576964"} |