3 bits or 4 bits GPTQ in this project?
#13
by wyklq - opened
`quantize_config.json` contains `"bits": 3`, but the model is 4-bit, judging by the file name "gptq_model-4bit". This causes automatic loading to fail with the example code in the model card. It looks like `"bits"` should be 4.
Sorry! Yes, this is the 4-bit model; quantize_config.json was wrong. I have corrected it now.
There is a separate 3-bit version as well, linked in the README.
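For anyone hitting the same error before the fix landed, here is a minimal sketch of how to check that `quantize_config.json` agrees with the checkpoint before loading it with AutoGPTQ. The local path, the `use_safetensors` flag, and the device string are illustrative assumptions, not values taken from this repository; only the `gptq_model-4bit` basename comes from the discussion above.

```python
# Sketch: verify the "bits" value in quantize_config.json matches the
# bit width implied by the checkpoint file name, then load the model.
import json
from pathlib import Path

from auto_gptq import AutoGPTQForCausalLM

model_dir = Path("./my-local-gptq-checkout")  # hypothetical local path

# AutoGPTQ reads quantize_config.json automatically; if "bits" disagrees
# with how the weights were actually packed, loading fails.
config = json.loads((model_dir / "quantize_config.json").read_text())
assert config["bits"] == 4, f'expected 4-bit config, got {config["bits"]}'

model = AutoGPTQForCausalLM.from_quantized(
    str(model_dir),
    model_basename="gptq_model-4bit",  # file name cited in this thread
    use_safetensors=True,              # assumption: safetensors checkpoint
    device="cuda:0",
)
```

Re-downloading the corrected `quantize_config.json` (or editing `"bits"` to 4 locally) resolves the original failure.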
TheBloke changed discussion status to closed