Update README.md
Browse files
README.md
CHANGED
@@ -21,20 +21,20 @@ Window context = 4k tokens
|
|
21 |
|
22 |
### Benchmarks
|
23 |
|
24 |
-
Chocolatine is the best-performing
|
25 |
|
26 |
![image/png](https://github.com/jpacifico/Chocolatine-LLM/blob/main/Assets/benchmark_14B_V1.png?raw=false)
|
27 |
|
28 |
|
29 |
| Metric |Value|
|
30 |
|-------------------|----:|
|
31 |
-
|
32 |
-
|IFEval (0-Shot) |
|
33 |
-
|BBH (3-Shot) |
|
34 |
-
|MATH Lvl 5 (4-Shot)|14.
|
35 |
-
|GPQA (0-shot) |
|
36 |
-
|MuSR (0-shot) |15.
|
37 |
-
|
38 |
|
39 |
|
40 |
### MT-Bench-French
|
|
|
21 |
|
22 |
### Benchmarks
|
23 |
|
24 |
+
Chocolatine is the best-performing 3B model on the [OpenLLM Leaderboard](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard) (august 2024)
|
25 |
|
26 |
![image/png](https://github.com/jpacifico/Chocolatine-LLM/blob/main/Assets/benchmark_14B_V1.png?raw=false)
|
27 |
|
28 |
|
29 |
| Metric |Value|
|
30 |
|-------------------|----:|
|
31 |
+
|**Avg.** |**27.63**|
|
32 |
+
|IFEval (0-Shot) |56.23|
|
33 |
+
|BBH (3-Shot) |37.16|
|
34 |
+
|MATH Lvl 5 (4-Shot)|14.5|
|
35 |
+
|GPQA (0-shot) |9.62|
|
36 |
+
|MuSR (0-shot) |15.1|
|
37 |
+
|MMLU-PRO (5-shot) |33.21|
|
38 |
|
39 |
|
40 |
### MT-Bench-French
|