
GPTQ Quantization

#1
by CyberTimon - opened

Hello there!

Thank you for this amazing model. Can you maybe provide a quantized model with GPTQ?
Thank you very much.

Kind regards,
Timon Käch

Glad you like it. I already tried GPTQ on a model merged with the adapter, but it reported something like LLaMA expecting an lm_head layer and then instantiated that layer itself. The same happened with AWQ.
The model that came out of that only generated rubbish; currently I don't know how to circumvent this issue.

I asked @TheBloke to help out.

KnutJaegersberg changed discussion status to closed
