This model serves as the baseline for the Aerial Wildfire Suppression environment, trained and tested on task 3 with difficulty 10 using the Proximal Policy Optimization (PPO) algorithm.

Environment: Aerial Wildfire Suppression
Task: 3
Difficulty: 10
Algorithm: PPO
Episode Length: 3000
Training max_steps: 1800000
Testing max_steps: 180000
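
As a rough illustration of what these settings imply, the sketch below collects them in a plain Python dictionary and counts how many full-length episodes fit into each step budget. The dictionary layout and the helper `full_length_episodes` are hypothetical and not part of the training scripts. The count assumes that every episode runs for its full 3000 steps and that max_steps is counted for a single environment instance; the actual number of episodes may differ.

```python
# Values copied from the settings listed above.
# The dictionary layout is illustrative, not the format used by the real scripts.
config = {
    "environment": "Aerial Wildfire Suppression",
    "task": 3,
    "difficulty": 10,
    "algorithm": "PPO",
    "episode_length": 3000,
    "train_max_steps": 1_800_000,
    "test_max_steps": 180_000,
}


def full_length_episodes(max_steps: int, episode_length: int) -> int:
    """Number of complete episodes that fit into a step budget.

    Assumes no early termination and a single environment instance.
    """
    return max_steps // episode_length


train_episodes = full_length_episodes(config["train_max_steps"], config["episode_length"])
test_episodes = full_length_episodes(config["test_max_steps"], config["episode_length"])

print(f"training: {train_episodes} full-length episodes")  # 600
print(f"testing:  {test_episodes} full-length episodes")   # 60
```

Under these assumptions, the testing budget is one tenth of the training budget.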

Train & Test Scripts
Download the Environment

Evaluation results (self-reported, on hivex-aerial-wildfire-suppression)

  • Crash Count: 0.21666667088866234 +/- 0.25989201752505897
  • Extinguishing Trees: 23.541666555404664 +/- 25.76001879059616
  • Extinguishing Trees Reward: 117.70833463668824 +/- 128.80009751865373
  • Fire Out: 0.25000000596046446 +/- 0.3176117073446609
  • Fire too Close to City: 0.95 +/- 0.22360679774997894
  • Preparing Trees: 842.3416595458984 +/- 719.565955429358
  • Preparing Trees Reward: 842.3416595458984 +/- 719.565955429358
  • Water Drop: 56.20000038146973 +/- 35.51066867416029
  • Water Pickup: 55.85000023841858 +/- 35.4871245294335
  • Cumulative Reward: 1050.1918411254883 +/- 490.0267879628168
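
The reported values can be read programmatically to check how some metrics relate to each other. The following is a minimal sketch, assuming each entry has the form "center +/- spread"; the model card does not say whether the spread is a standard deviation, so the code treats it only as an unspecified spread. The helper `parse_metric` is hypothetical. Only figures reported above are used.

```python
def parse_metric(text: str) -> tuple[float, float]:
    """Split a 'center +/- spread' string into two floats."""
    center, spread = text.split("+/-")
    return float(center), float(spread)


# Values copied verbatim from the evaluation results above.
extinguish_count = parse_metric("23.541666555404664 +/- 25.76001879059616")
extinguish_reward = parse_metric("117.70833463668824 +/- 128.80009751865373")
water_drop = parse_metric("56.20000038146973 +/- 35.51066867416029")
water_pickup = parse_metric("55.85000023841858 +/- 35.4871245294335")

# Ratio of reported reward to reported tree count for extinguishing.
reward_per_tree = extinguish_reward[0] / extinguish_count[0]

# Difference between the reported water drop and water pickup values.
drop_minus_pickup = water_drop[0] - water_pickup[0]

print(f"extinguishing reward per tree: {reward_per_tree:.3f}")
print(f"water drops minus pickups:     {drop_minus_pickup:.3f}")
```

In the reported numbers, the extinguishing reward is very close to five times the extinguishing count, while the Preparing Trees count and its reward are identical. This suggests, but does not confirm, a per-tree reward of about 5 for extinguishing and 1 for preparing.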