This model serves as the baseline for the Aerial Wildfire Suppression environment, trained and tested on task 3
with difficulty 10
using the Proximal Policy Optimization (PPO) algorithm.
Environment: Aerial Wildfire Suppression
Task: 3
Difficulty: 10
Algorithm: PPO
Episode Length: 3000
Training max_steps
: 1800000
Testing max_steps
: 180000
Train & Test Scripts
Download the Environment
Evaluation results
- Crash Count on hivex-aerial-wildfire-suppressionself-reported0.21666667088866234 +/- 0.25989201752505897
- Extinguishing Trees on hivex-aerial-wildfire-suppressionself-reported23.541666555404664 +/- 25.76001879059616
- Extinguishing Trees Reward on hivex-aerial-wildfire-suppressionself-reported117.70833463668824 +/- 128.80009751865373
- Fire Out on hivex-aerial-wildfire-suppressionself-reported0.25000000596046446 +/- 0.3176117073446609
- Fire too Close to City on hivex-aerial-wildfire-suppressionself-reported0.95 +/- 0.22360679774997894
- Preparing Trees on hivex-aerial-wildfire-suppressionself-reported842.3416595458984 +/- 719.565955429358
- Preparing Trees Reward on hivex-aerial-wildfire-suppressionself-reported842.3416595458984 +/- 719.565955429358
- Water Drop on hivex-aerial-wildfire-suppressionself-reported56.20000038146973 +/- 35.51066867416029
- Water Pickup on hivex-aerial-wildfire-suppressionself-reported55.85000023841858 +/- 35.4871245294335
- Cumulative Reward on hivex-aerial-wildfire-suppressionself-reported1050.1918411254883 +/- 490.0267879628168