---
license: apache-2.0
license_link: https://huggingface.co/Qwen/QWQ-32B/blob/main/LICENSE
language:
  - en
pipeline_tag: text-generation
base_model: Qwen/QwQ-32B
tags:
  - chat
  - exl2
library_name: transformers
---

QwQ-32B - EXL2 8.0bpw

This is an 8.0bpw (bits per weight) EXL2 quant of Qwen/QwQ-32B.

Details about the model can be found on the Qwen/QwQ-32B model page.

Perplexity Scoring

Below are the perplexity scores for the EXL2 quants of this model at each bitrate. A lower score is better.

| Quant Level (bpw) | Perplexity Score |
|-------------------|------------------|
| 8.0               | 6.4393           |
| 7.0               | 6.4452           |
| 6.0               | 6.4693           |
| 5.0               | 6.4732           |
| 4.5               | 6.5417           |
| 4.0               | 6.6190           |
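
To help read the scores above, here is a minimal Python sketch. It assumes the standard definition of perplexity (the exponential of the mean negative log-likelihood per token); this section does not say which script or dataset produced the scores, so the `perplexity` helper is illustrative only. The `relative_increase` helper, also a hypothetical name, compares each quant against the 8.0bpw score using the values copied from the table.

```python
import math


def perplexity(token_log_probs):
    """Standard perplexity: exp(-mean natural-log probability per token).

    Assumption: this is the textbook definition, not necessarily the exact
    procedure used to produce the scores in the table.
    """
    if not token_log_probs:
        raise ValueError("need at least one token log-probability")
    return math.exp(-sum(token_log_probs) / len(token_log_probs))


# Scores copied from the table above (bits per weight -> perplexity).
SCORES = {
    8.0: 6.4393,
    7.0: 6.4452,
    6.0: 6.4693,
    5.0: 6.4732,
    4.5: 6.5417,
    4.0: 6.6190,
}


def relative_increase(scores, baseline_bpw=8.0):
    """Percent perplexity increase of each quant relative to the baseline quant."""
    base = scores[baseline_bpw]
    return {bpw: 100.0 * (ppl - base) / base for bpw, ppl in scores.items()}


if __name__ == "__main__":
    increases = relative_increase(SCORES)
    # Print from the highest bitrate down to the lowest.
    for bpw in sorted(SCORES, reverse=True):
        print(f"{bpw:>4} bpw: {SCORES[bpw]:.4f} (+{increases[bpw]:.2f}% vs 8.0 bpw)")
```

By this comparison, the 4.0bpw quant scores about 2.8% higher perplexity than the 8.0bpw quant, while the quants from 5.0bpw to 7.0bpw stay within about 0.5% of it.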