jpacifico
/

Chocolatine-3B-Instruct-DPO-Revised

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

jpacifico commited on Aug 10, 2024

Commit

c13ba57

•

1 Parent(s): 671df79

Update README.md

Files changed (1) hide show

README.md +8 -8

README.md CHANGED Viewed

@@ -21,20 +21,20 @@ Window context = 4k tokens
 ### Benchmarks
-Chocolatine is the best-performing < 50B model in terms of MMLU-PRO on the [OpenLLM Leaderboard](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard) (august 2024)
 ![image/png](https://github.com/jpacifico/Chocolatine-LLM/blob/main/Assets/benchmark_14B_V1.png?raw=false)
 |      Metric       |Value|
 |-------------------|----:|
-|Avg.               |29.83|
-|IFEval (0-Shot)    |46.89|
-|BBH (3-Shot)       |48.02|
-|MATH Lvl 5 (4-Shot)|14.88|
-|GPQA (0-shot)      |12.19|
-|MuSR (0-shot)      |15.15|
-|**MMLU-PRO (5-shot)**  |**41.82**|
 ### MT-Bench-French

 ### Benchmarks
+Chocolatine is the best-performing 3B model on the [OpenLLM Leaderboard](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard) (august 2024)
 ![image/png](https://github.com/jpacifico/Chocolatine-LLM/blob/main/Assets/benchmark_14B_V1.png?raw=false)
 |      Metric       |Value|
 |-------------------|----:|
+|**Avg.**               |**27.63**|
+|IFEval (0-Shot)    |56.23|
+|BBH (3-Shot)       |37.16|
+|MATH Lvl 5 (4-Shot)|14.5|
+|GPQA (0-shot)      |9.62|
+|MuSR (0-shot)      |15.1|
+|MMLU-PRO (5-shot)  |33.21|
 ### MT-Bench-French