Qwen
/

Qwen2-VL-72B-Instruct-GPTQ-Int4

Image-Text-to-Text

4-bit precision

Model card Files Files and versions Community

Qwen2-VL-72B-Instruct-GPTQ-Int4 / model-00008-of-00011.safetensors

Commit History

fix(pad zero) pad intermediate_size to 29696 to make sure quantized model can use 8 tensor-parallel in vllm

288c348

可亲 commited on Sep 24, 2024

Upload folder using huggingface_hub

1a46b59
verified

clonefy commited on Sep 17, 2024