---
tags:
- Pixelcopter-PLE-v0
- reinforce
- reinforcement-learning
- custom-implementation
- deep-rl-class
model-index:
- name: Reinforce-Pixelcopter-PLE-v0
  results:
  - metrics:
    - type: mean_reward
      value: 13.30 +/- 9.12
      name: mean_reward
    task:
      type: reinforcement-learning
      name: reinforcement-learning
    dataset:
      name: Pixelcopter-PLE-v0
      type: Pixelcopter-PLE-v0
---

      # 使用**Reinforce**智能体来玩**Pixelcopter-PLE-v0**
      这是一个使用**Reinforce**训练有素的模型玩**Pixelcopter-PLE-v0**.
      要学习使用这个模型并训练你的模型, 请查阅深度强化学习课程第5单元: https://github.com/huggingface/deep-rl-class/tree/main/unit5