cyberosa
commited on
Commit
Β·
2a62f64
1
Parent(s):
3c08161
Adding timeline info of the autocast dataset
Browse files- images/autocast_dataset_timeline.png +0 -0
- tabs/faq.py +3 -0
images/autocast_dataset_timeline.png
ADDED
tabs/faq.py
CHANGED
@@ -4,6 +4,9 @@ How good are LLMs at making predictions about events in the future? This is a to
|
|
4 |
This is a leaderboard showing the performance of LLM tools for making predictions (event forecasting) on a dataset, refined from Autocast.\
|
5 |
The leaderboard shows tool performance in terms of accuracy and cost. \
|
6 |
|
|
|
|
|
|
|
7 |
π€ Pick a tool and run it on the benchmark using the "π₯ Run the Benchmark" page!
|
8 |
"""
|
9 |
|
|
|
4 |
This is a leaderboard showing the performance of LLM tools for making predictions (event forecasting) on a dataset, refined from Autocast.\
|
5 |
The leaderboard shows tool performance in terms of accuracy and cost. \
|
6 |
|
7 |
+
π π§ The autocast dataset resolved-questions are from a timeline ending in 2022. Thus the current reported accuracy measure might be an in-sample forecasting one. We are working
|
8 |
+
to incorporate soon an out-of-sample one using another dataset with unseen data.\
|
9 |
+
|
10 |
π€ Pick a tool and run it on the benchmark using the "π₯ Run the Benchmark" page!
|
11 |
"""
|
12 |
|