leaderboard-pr-bot committed
Commit 0347bce · verified · 1 Parent(s): 658350b

Adding Evaluation Results

This is an automated PR created with https://huggingface.co/spaces/Weyaxi/open-llm-leaderboard-results-pr

The purpose of this PR is to add evaluation results from the Open LLM Leaderboard to your model card.

If you encounter any issues, please report them to https://huggingface.co/spaces/Weyaxi/open-llm-leaderboard-results-pr/discussions

Files changed (1)
  1. README.md +14 -0
README.md CHANGED
@@ -200,6 +200,20 @@ Model evaluation on OpenLLM LeaderBoard
 
 
 
+# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
+Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_manishiitg__open-aditi-hi-v4)
+
+|             Metric              |Value|
+|---------------------------------|----:|
+|Avg.                             |64.23|
+|AI2 Reasoning Challenge (25-Shot)|60.15|
+|HellaSwag (10-Shot)              |81.84|
+|MMLU (5-Shot)                    |61.32|
+|TruthfulQA (0-shot)              |44.89|
+|Winogrande (5-shot)              |79.95|
+|GSM8k (5-shot)                   |57.24|
+
+
 # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
 Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_manishiitg__open-aditi-hi-v4)
 
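
The `Avg.` row in the added table appears to be the unweighted mean of the six benchmark scores. A minimal Python sketch to check that, assuming a plain arithmetic mean rounded to two decimals; the names `scores` and `leaderboard_average` are illustrative and not part of this PR or of the leaderboard's code:

```python
# Benchmark scores copied from the table added in this PR.
scores = {
    "AI2 Reasoning Challenge (25-Shot)": 60.15,
    "HellaSwag (10-Shot)": 81.84,
    "MMLU (5-Shot)": 61.32,
    "TruthfulQA (0-shot)": 44.89,
    "Winogrande (5-shot)": 79.95,
    "GSM8k (5-shot)": 57.24,
}


def leaderboard_average(values):
    """Unweighted mean rounded to two decimals (assumed convention)."""
    values = list(values)
    return round(sum(values) / len(values), 2)


average = leaderboard_average(scores.values())
print(f"Avg. = {average}")  # compare against the 64.23 reported in the table
```

Under that assumption the computed value matches the reported 64.23.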