This model serves as the baseline for the Aerial Wildfire Suppression environment, trained and tested on task 3 with difficulty 10 using the Proximal Policy Optimization (PPO) algorithm.

Environment: Aerial Wildfire Suppression
Task: 3
Difficulty: 10
Algorithm: PPO
Episode Length: 3000
Training max_steps: 1800000
Testing max_steps: 180000

Train & Test Scripts
Download the Environment

Downloads last month: -; Downloads are not tracked for this model. How to track

Video Preview

Reinforcement Learning

Collection including hivex-research/hivex-AWS-PPO-baseline-task-3-difficulty-10

Aerial Wildfire Suppression

Collection

89 items • Updated 23 days ago

Evaluation results

Crash Count on hivex-aerial-wildfire-suppression
self-reported

0.21666667088866234 +/- 0.25989201752505897
Extinguishing Trees on hivex-aerial-wildfire-suppression
self-reported

23.541666555404664 +/- 25.76001879059616
Extinguishing Trees Reward on hivex-aerial-wildfire-suppression
self-reported

117.70833463668824 +/- 128.80009751865373
Fire Out on hivex-aerial-wildfire-suppression
self-reported

0.25000000596046446 +/- 0.3176117073446609
Fire too Close to City on hivex-aerial-wildfire-suppression
self-reported

0.95 +/- 0.22360679774997894
Preparing Trees on hivex-aerial-wildfire-suppression
self-reported

842.3416595458984 +/- 719.565955429358
Preparing Trees Reward on hivex-aerial-wildfire-suppression
self-reported

842.3416595458984 +/- 719.565955429358
Water Drop on hivex-aerial-wildfire-suppression
self-reported

56.20000038146973 +/- 35.51066867416029
Water Pickup on hivex-aerial-wildfire-suppression
self-reported

55.85000023841858 +/- 35.4871245294335
Cumulative Reward on hivex-aerial-wildfire-suppression
self-reported

1050.1918411254883 +/- 490.0267879628168

View on Papers With Code