vllm (installed from main branch) doesn't like this model

by Gershwin69
ValueError: torch.bfloat16 is not supported for quantization method awq. Supported dtypes: [torch.float16]
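
For reference, this is roughly the call that triggers it (the model path is a placeholder for this repo; vLLM reads the dtype from the model's config.json):

```python
from vllm import LLM

# Loading an AWQ-quantized model whose config.json declares
# "torch_dtype": "bfloat16". With dtype left at "auto", vLLM picks up
# bfloat16 from the config, and the AWQ path rejects it with the
# ValueError quoted above.
llm = LLM(model="path/to/this-awq-model", quantization="awq")
```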

Ah OK. This likely needs to be reported to the vLLM team.

Can you try editing the config.json locally to change torch_dtype to float16 and see if it loads OK then?
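
Something like this should do it (the path is a placeholder for wherever you downloaded the model):

```python
import json

# Local path to the downloaded model directory (placeholder).
config_path = "path/to/this-awq-model/config.json"

with open(config_path) as f:
    config = json.load(f)

# Override the declared dtype so vLLM loads the AWQ weights in float16.
config["torch_dtype"] = "float16"

with open(config_path, "w") as f:
    json.dump(config, f, indent=2)
```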

That worked, and I've mentioned it on their Discord server.
