jpacifico commited on
Commit
c13ba57
1 Parent(s): 671df79

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +8 -8
README.md CHANGED
@@ -21,20 +21,20 @@ Window context = 4k tokens
21
 
22
  ### Benchmarks
23
 
24
- Chocolatine is the best-performing < 50B model in terms of MMLU-PRO on the [OpenLLM Leaderboard](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard) (august 2024)
25
 
26
  ![image/png](https://github.com/jpacifico/Chocolatine-LLM/blob/main/Assets/benchmark_14B_V1.png?raw=false)
27
 
28
 
29
  | Metric |Value|
30
  |-------------------|----:|
31
- |Avg. |29.83|
32
- |IFEval (0-Shot) |46.89|
33
- |BBH (3-Shot) |48.02|
34
- |MATH Lvl 5 (4-Shot)|14.88|
35
- |GPQA (0-shot) |12.19|
36
- |MuSR (0-shot) |15.15|
37
- |**MMLU-PRO (5-shot)** |**41.82**|
38
 
39
 
40
  ### MT-Bench-French
 
21
 
22
  ### Benchmarks
23
 
24
+ Chocolatine is the best-performing 3B model on the [OpenLLM Leaderboard](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard) (august 2024)
25
 
26
  ![image/png](https://github.com/jpacifico/Chocolatine-LLM/blob/main/Assets/benchmark_14B_V1.png?raw=false)
27
 
28
 
29
  | Metric |Value|
30
  |-------------------|----:|
31
+ |**Avg.** |**27.63**|
32
+ |IFEval (0-Shot) |56.23|
33
+ |BBH (3-Shot) |37.16|
34
+ |MATH Lvl 5 (4-Shot)|14.5|
35
+ |GPQA (0-shot) |9.62|
36
+ |MuSR (0-shot) |15.1|
37
+ |MMLU-PRO (5-shot) |33.21|
38
 
39
 
40
  ### MT-Bench-French