Reinforce Agent playing Pixelcopter-PLE-v0

This is a trained model of a Reinforce agent playing Pixelcopter-PLE-v0 . To learn to use this model and train yours check Unit 4 of the Deep Reinforcement Learning Course: https://huggingface.co/deep-rl-course/unit4/introduction

Training Time

Trained on 50 000 timesteps for 4 hours and 20 minutes.

Hyperparameters

pixelcopter_hyperparameters = {
    "h_size": 64,
    "n_training_episodes": 50000,
    "n_evaluation_episodes": 10,
    "max_t": 10000,
    "gamma": 0.99,
    "lr": 1e-4,
    "env_id": env_id,
    "state_space": s_size,
    "action_space": a_size,
}

chirbard
/

Reinforce-Pixelcopter-PLE-v0

Reinforce Agent playing Pixelcopter-PLE-v0

Training Time

Hyperparameters

Evaluation results