ssmits commited on
Commit
d7bfe10
·
verified ·
1 Parent(s): a0c2d0e

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +13 -0
README.md CHANGED
@@ -194,3 +194,16 @@ pipeline = transformers.pipeline(
194
 
195
  outputs = pipeline(prompt, max_new_tokens=256, do_sample=True, temperature=0.7, top_k=50, top_p=0.95)
196
  print(outputs[0]["generated_text"])
 
 
 
 
 
 
 
 
 
 
 
 
 
 
194
 
195
  outputs = pipeline(prompt, max_new_tokens=256, do_sample=True, temperature=0.7, top_k=50, top_p=0.95)
196
  print(outputs[0]["generated_text"])
197
+
198
+ # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
199
+ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_ssmits__Qwen2.5-95B-Instruct)
200
+
201
+ | Metric |Value|
202
+ |-------------------|----:|
203
+ |Avg. |37.43|
204
+ |IFEval (0-Shot) |84.31|
205
+ |BBH (3-Shot) |58.53|
206
+ |MATH Lvl 5 (4-Shot)| 6.04|
207
+ |GPQA (0-shot) |15.21|
208
+ |MuSR (0-shot) |13.61|
209
+ |MMLU-PRO (5-shot) |46.85|