open_dutch_llm_leaderboard

Running

Bram Vanroy commited on Dec 7, 2023

Commit

c66a031

•

1 Parent(s): b268b1d

get rid of whitespace

Files changed (1) hide show

content.py CHANGED Viewed

@@ -8,11 +8,12 @@ This is a fork of the [Open Multilingual LLM Evaluation Leaderboard](https://hug
 We test the models on the following benchmarks **for the Dutch version only!!**, which have been translated into Dutch automatically by the original authors of the Open Multilingual LLM Evaluation Leaderboard with `gpt-35-turbo`.
 I did not verify their translations and I do not maintain the datasets, I only run the benchmarks and add the results to this space. For questions regarding the test sets or running them yourself, see [the original Github repository](https://github.com/laiviet/lm-evaluation-harness).
-- <a href="https://arxiv.org/abs/1803.05457" target="_blank">  AI2 Reasoning Challenge </a> (25-shot)
-- <a href="https://arxiv.org/abs/1905.07830" target="_blank">  HellaSwag </a> (10-shot)
-- <a href="https://arxiv.org/abs/2009.03300" target="_blank">  MMLU </a>  (5-shot)
-- <a href="https://arxiv.org/abs/2109.07958" target="_blank">  TruthfulQA </a> (0-shot)
 """
 DISCLAIMER = """## Disclaimer

 We test the models on the following benchmarks **for the Dutch version only!!**, which have been translated into Dutch automatically by the original authors of the Open Multilingual LLM Evaluation Leaderboard with `gpt-35-turbo`.
 I did not verify their translations and I do not maintain the datasets, I only run the benchmarks and add the results to this space. For questions regarding the test sets or running them yourself, see [the original Github repository](https://github.com/laiviet/lm-evaluation-harness).
+<p align="center">
+  <a href="https://arxiv.org/abs/1803.05457" target="_blank">AI2 Reasoning Challenge </a> (25-shot) |
+  <a href="https://arxiv.org/abs/1905.07830" target="_blank">HellaSwag</a> (10-shot) |
+  <a href="https://arxiv.org/abs/2009.03300" target="_blank">MMLU</a>  (5-shot) |
+  <a href="https://arxiv.org/abs/2109.07958" target="_blank">TruthfulQA</a> (0-shot)
+</p>
 """
 DISCLAIMER = """## Disclaimer