Update README.md
Browse files
README.md
CHANGED
@@ -12,7 +12,16 @@ pipeline_tag: text-generation
|
|
12 |
|
13 |
### Open LLM Leaderboard
|
14 |
|
15 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
16 |
|
17 |
## 💻 Usage
|
18 |
|
|
|
12 |
|
13 |
### Open LLM Leaderboard
|
14 |
|
15 |
+
| Model |Average|ARC|HellaSwag|MMLU|TruthfulQA|Winogrande|GSM8K
|
16 |
+
|------------------------------------------------------------|------:|------:|---------:|-------:|------:|------:|------:|
|
17 |
+
|[**Cyrax-7B**](https://huggingface.co/touqir/Cyrax-7B)| **75.98**| **72.95**| 88.19| 64.6| **77.01**| 83.9| **69.22** |
|
18 |
+
|[Qwen-72B](https://huggingface.co/Qwen/Qwen-72B)| 73.6| 65.19| 85.94| **77.37**| 60.19| 82.48| 70.43|
|
19 |
+
|[Mixtral-8x7B-Instruct-v0.1-DPO](https://huggingface.co/cloudyu/Mixtral-8x7B-Instruct-v0.1-DPO)| 73.44| 69.8| 87.83| 71.05| 69.18| 81.37| 61.41|
|
20 |
+
|[Mixtral-8x7B-Instruct-v0.1](https://huggingface.co/mistralai/Mixtral-8x7B-Instruct-v0.1)| 70.14| 72.7| 87.55| 71.4| 64.98| 81.06| 61.11 |
|
21 |
+
|[llama2_70b_mmlu](https://huggingface.co/itsliupeng/llama2_70b_mmlu)| 65.61| 68.24| 87.37| 71.89| 49.15| 82.4| 52.99 |
|
22 |
+
|[falcon-180B](https://huggingface.co/tiiuae/falcon-180B)| 67.85| 69.45| **88.86**| 70.5| 45.47| **86.9**| 45.94|
|
23 |
+
|
24 |
+
See the complete evaluation [here](https://gist.github.com/mlabonne/cd03d60f7428450a87ca270b5c467324).
|
25 |
|
26 |
## 💻 Usage
|
27 |
|