Spaces:
Running
on
CPU Upgrade
Running
on
CPU Upgrade
what kind of task does TruthfulQA eval? mc1? mc2? or both?
#95
by
paopao0226
- opened
Hello and thanks for your leaderboard, here is the question that which task does TruthfulQA score based on? just mc1? just mc2? or both mc1 and mc2? if both, how to mix the scores of two different task. Thanks!
Hi, I would like to expand on the question by
@paopao0226
. I would like to run some of the experiments on my hardware and am unsure about this and other details. Could you point us to the code that calls lm-evaluation-harness and produces the numbers in the leaderboard?
Thanks ๐
clefourrier
changed discussion status to
closed
@freejen hello! the info has added on the About section!
This comment has been hidden