leaderboard-pr-bot committed on
Commit 46025ea
1 Parent(s): e5e4e5b

Adding Evaluation Results

This is an automated PR created with https://huggingface.co/spaces/Weyaxi/open-llm-leaderboard-results-pr

The purpose of this PR is to add evaluation results from the Open LLM Leaderboard to your model card.

If you encounter any issues, please report them to https://huggingface.co/spaces/Weyaxi/open-llm-leaderboard-results-pr/discussions

Files changed (1)
  1. README.md +21 -8
README.md CHANGED
@@ -1,20 +1,20 @@
 ---
+language:
+- code
 tags:
 - generated_from_trainer
 - code
 - coding
 - gemma
-model-index:
-- name: gemma-2b-coder
-  results: []
+datasets:
+- HuggingFaceH4/CodeAlpaca_20K
 license_name: gemma-terms-of-use
 license_link: https://ai.google.dev/gemma/terms
-language:
-- code
 thumbnail: https://huggingface.co/mrm8488/gemma-2b-coder/resolve/main/logo.png
-datasets:
-- HuggingFaceH4/CodeAlpaca_20K
 pipeline_tag: text-generation
+model-index:
+- name: gemma-2b-coder
+  results: []
 ---
 
 <div style="text-align:center;width:250px;height:250px;">
@@ -142,4 +142,17 @@ print(generate(instruction))
 
 ### Citation
 
-WIP
+WIP
+# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
+Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_MAISAAI__gemma-2b-coder)
+
+| Metric                          |Value|
+|---------------------------------|----:|
+|Avg.                             |45.65|
+|AI2 Reasoning Challenge (25-Shot)|48.98|
+|HellaSwag (10-Shot)              |71.43|
+|MMLU (5-Shot)                    |37.02|
+|TruthfulQA (0-shot)              |33.54|
+|Winogrande (5-shot)              |66.85|
+|GSM8k (5-shot)                   |16.07|
+
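
The PR does not say how the `Avg.` row in the added table is computed. A quick sanity check, assuming it is the unweighted arithmetic mean of the six benchmark scores listed (that assumption is mine, not stated in the PR; the dictionary name and keys are illustrative only):

```python
# Scores copied from the table this PR adds to README.md.
# Assumption: "Avg." is the plain arithmetic mean of these six values.
benchmark_scores = {
    "AI2 Reasoning Challenge (25-Shot)": 48.98,
    "HellaSwag (10-Shot)": 71.43,
    "MMLU (5-Shot)": 37.02,
    "TruthfulQA (0-shot)": 33.54,
    "Winogrande (5-shot)": 66.85,
    "GSM8k (5-shot)": 16.07,
}

# Mean of the six scores, rounded to two decimals like the table.
average = round(sum(benchmark_scores.values()) / len(benchmark_scores), 2)
print(average)  # 45.65
```

Under that assumption the result agrees with the reported `Avg.` value of 45.65.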