alexandersoare
commited on
Commit
•
39bad5d
1
Parent(s):
577cd14
Upload folder using huggingface_hub
Browse files
README.md
CHANGED
@@ -30,7 +30,7 @@ The model was evaluated on the `PushT` environment from [gym-pusht](https://gith
|
|
30 |
- Maximum overlap with target (seen as `eval/avg_max_reward` in the charts above). This ranges in [0, 1].
|
31 |
- Success: whether or not the maximum overlap is at least 95%.
|
32 |
|
33 |
-
Here are the metrics for 500 episodes worth of evaluation. For the succes rate we add
|
34 |
|
35 |
<blank>|Ours|Theirs
|
36 |
-|-|-
|
|
|
30 |
- Maximum overlap with target (seen as `eval/avg_max_reward` in the charts above). This ranges in [0, 1].
|
31 |
- Success: whether or not the maximum overlap is at least 95%.
|
32 |
|
33 |
+
Here are the metrics for 500 episodes worth of evaluation. For the succes rate we add an extra row with confidence bounds. This assumes a uniform prior over success probability and computes the beta posterior, then calculates the mean and lower/upper confidence bounds (with a 68.2% confidence interval centered on the mean).
|
34 |
|
35 |
<blank>|Ours|Theirs
|
36 |
-|-|-
|