neuralmagic/granite-3.1-8b-instruct-quantized.w8a8

nm-research committed 15 days ago

Commit b50cd3a · verified · 1 parent: 133efb6

Update README.md

Files changed (1)

README.md +13 -0

@@ -206,6 +206,19 @@ evalplus.evaluate \
 | **Average Score**                       | **70.30**                        | **70.26**                                   |
 | **Recovery**                            | **100.00**                       | **99.95**                                   |
+#### OpenLLM Leaderboard V2 evaluation scores
+| Metric                                  | ibm-granite/granite-3.1-8b-instruct             | neuralmagic-ent/granite-3.1-8b-instruct-quantized.w8a8 |
+|-----------------------------------------|:---------------------------------:|:-------------------------------------------:|
+| IFEval (Inst Level Strict Acc, 0-shot)| 74.01                           | 73.50                                         |
+| BBH (Acc-Norm, 3-shot)            | 53.19                             | 52.59                                        |
+| Math-Hard (Exact-Match, 4-shot)   | 14.77                            | 15.73                                        |
+| GPQA (Acc-Norm, 0-shot)           | 31.76                             | 30.62                                        |
+| MUSR (Acc-Norm, 0-shot)           | 46.01                             | 44.30                                        |
+| MMLU-Pro (Acc, 5-shot)            | 35.81                             | 35.41                                        |
+| **Average Score**                 | **42.61**                         | **42.03**                                    |
+| **Recovery**                      | **100.00**                         | **98.64**                                    |
 #### HumanEval pass@1 scores
 | Metric                                  | ibm-granite/granite-3.1-8b-instruct             | neuralmagic-ent/granite-3.1-8b-instruct-quantized.w8a8 |
 |-----------------------------------------|:---------------------------------:|:-------------------------------------------:|
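
The **Recovery** row in the added table appears to be the quantized model's average score expressed as a percentage of the baseline model's average score. The README does not state the formula, so the sketch below is an assumption, and the `recovery` helper is a hypothetical name used only for illustration. Because the published averages are themselves rounded, recomputing from the table may differ in the last digit from a published value.

```python
def recovery(quantized_score: float, baseline_score: float) -> float:
    """Return the quantized score as a percentage of the baseline score.

    Assumed definition: 100 * quantized / baseline. This is inferred from
    the table values, not taken from the README.
    """
    if baseline_score == 0:
        raise ValueError("baseline_score must be non-zero")
    return 100.0 * quantized_score / baseline_score


# Average scores copied from the OpenLLM Leaderboard V2 table in this diff.
baseline_avg = 42.61   # ibm-granite/granite-3.1-8b-instruct
quantized_avg = 42.03  # quantized w8a8 model

print(f"Recovery: {recovery(quantized_avg, baseline_avg):.2f}")  # Recovery: 98.64

# The same helper applies per task; a value above 100 means the quantized
# model scored higher than the baseline on that task (e.g. Math-Hard).
math_hard_recovery = recovery(15.73, 14.77)
print(f"Math-Hard recovery: {math_hard_recovery:.2f}")
```

Under this assumed definition, the baseline column is always 100.00 by construction, which matches the table.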