File size: 4,102 Bytes
315ecb4
 
 
b55e6fc
 
1c48c1e
 
da3ef5f
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
---
license: apache-2.0
---
# GreenBit LLMs

This is GreenBitAI's pretrained **low-bit** LLMs with extreme compression yet still strong performance.

Please refer to our [Github page](https://github.com/GreenBitAI/green-bit-llm) for the code to run the model and more information.

### Zero-shot Evaluation

We evaluate the zero-shot ability of low-bit quantized Qwen1.5 models using the `llm_eval` library and list the results below: 

| **Repository (Qwen Family)**      | **Avg Acc.** |  **OpenBQ**  |  **ARC-E**  |  **Winogr.**  |  **HellaS.**  |  **ARC-C**  |  **PIQA**  |  **BoolQ**  |  **RACE**   |  **ANLI-R1**  |  **ANLI-R2**  |  **ANLI-R3**  |  **WiC**  |
|:----------------------------------|:------------:|:------------:|:-----------:|:-------------:|:-------------:|:-----------:|:----------:|:-----------:|:-----------:|:-------------:|:-------------:|:-------------:|:---------:|
| `Qwen-1.5-0.5B-layer-mix-bpw-2.2` |    0.398     |    0.170     |    0.443    |     0.527     |     0.332     |    0.238    |   0.634    |    0.620    |    0.318    |     0.332     |     0.338     |     0.330     |   0.500   | 
| `Qwen-1.5-0.5B-layer-mix-bpw-2.5` |    0.394     |    0.170     |    0.514    |     0.541     |     0.337     |    0.232    |   0.637    |    0.496    |    0.318    |     0.316     |     0.358     |     0.326     |   0.490   |
| `Qwen-1.5-0.5B-layer-mix-bpw-3.0` |    0.407     |    0.198     |    0.533    |     0.536     |     0.348     |    0.234    |   0.671    |    0.552    |    0.323    |     0.330     |     0.333     |     0.335     |   0.495   |
| `Qwen-1.5-1.8B-layer-mix-bpw-2.2` |    0.415     |    0.218     |    0.539    |     0.586     |     0.392     |    0.260    |   0.678    |    0.622    |    0.333    |     0.333     |     0.333     |     0.336     |   0.464   |
| `Qwen-1.5-1.8B-layer-mix-bpw-2.5` |    0.423     |    0.222     |    0.592    |     0.585     |     0.406     |    0.267    |   0.695    |    0.629    |    0.336    |     0.314     |     0.339     |     0.361     |   0.507   |
| `Qwen-1.5-1.8B-layer-mix-bpw-3.0` |    0.438     |    0.246     |    0.576    |     0.563     |     0.413     |    0.277    |   0.694    |    0.645    |    0.352    |     0.323     |     0.336     |     0.343     |   0.492   |
| `Qwen-1.5-4B-layer-mix-bpw-2.2`   |    0.480     |    0.254     |    0.663    |     0.623     |     0.463     |    0.339    |   0.712    |    0.718    |    0.349    |     0.326     |     0.355     |     0.384     |   0.513   |
| `Qwen-1.5-4B-layer-mix-bpw-2.5`   |    0.490     |    0.266     |    0.677    |     0.629     |     0.473     |    0.365    |   0.732    |    0.717    |    0.351    |     0.372     |     0.352     |     0.360     |   0.502   |
| `Qwen-1.5-4B-layer-mix-bpw-3.0`   |    0.502     |    0.268     |    0.678    |     0.642     |     0.494     |    0.358    |   0.755    |    0.757    |    0.380    |     0.395     |     0.395     |     0.392     |   0.519   |
| `Qwen-1.5-7B-layer-mix-bpw-2.2`   |    0.513     |    0.278     |    0.669    |     0.654     |     0.504     |    0.389    |   0.741    |    0.759    |    0.376    |     0.383     |     0.410     |     0.403     |   0.517   |
| `Qwen-1.5-7B-layer-mix-bpw-2.5`   |    0.520     |    0.294     |    0.705    |     0.650     |     0.520     |    0.387    |   0.750    |    0.769    |    0.371    |     0.445     |     0.424     |     0.398     |   0.564   |
| `Qwen-1.5-7B-layer-mix-bpw-3.0`   |    0.531     |    0.292     |    0.713    |     0.654     |     0.545     |    0.405    |   0.764    |    0.807    |    0.383    |     0.424     |     0.393     |     0.414     |   0.627   |
| `Qwen-1.5-14B-layer-mix-bpw-2.5`  |    0.553     |    0.318     |    0.727    |     0.682     |     0.564     |    0.413    |   0.775    |    0.792    |    0.390    |     0.472     |     0.434     |     0.446     |   0.623   |
| `Qwen-1.5-32B-layer-mix-bpw-3.0`  |    0.599     |    0.346     |    0.775    |     0.722     |     0.620     |    0.492    |   0.807    |    0.853    |    0.444    |     0.515     |     0.494     |     0.478     |   0.642   |