Qwen2-VL-72B-Instruct-GPTQ-Int4 / model-00008-of-00011.safetensors

Commit History

fix(pad zero) pad intermediate_size to 29696 to make sure quantized model can use 8 tensor-parallel in vllm
288c348

可亲 committed on
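
The arithmetic behind this commit can be sketched as follows. The snippet is a minimal illustration, not the script used for the commit. It assumes an unpadded intermediate_size of 29568 and a GPTQ group size of 128, neither of which is stated in the commit message. The helper names are hypothetical. Under those assumptions, each of the 8 tensor-parallel shards has to hold a whole number of quantization groups, so the dimension is rounded up to the next multiple of 8 * 128.

```python
# Assumed values (not stated in the commit message): the unpadded
# intermediate_size and the GPTQ quantization group size.
ORIGINAL_INTERMEDIATE_SIZE = 29568  # assumption
GROUP_SIZE = 128                    # assumption
TENSOR_PARALLEL = 8                 # from the commit message


def shards_cleanly(size: int, tp: int, group_size: int) -> bool:
    """True if each of the tp shards holds a whole number of quant groups."""
    return size % (tp * group_size) == 0


def pad_to_multiple(size: int, multiple: int) -> int:
    """Round size up to the nearest multiple (ceiling division)."""
    return -(-size // multiple) * multiple


padded = pad_to_multiple(ORIGINAL_INTERMEDIATE_SIZE, TENSOR_PARALLEL * GROUP_SIZE)

print(shards_cleanly(ORIGINAL_INTERMEDIATE_SIZE, TENSOR_PARALLEL, GROUP_SIZE))
print(padded, shards_cleanly(padded, TENSOR_PARALLEL, GROUP_SIZE))
```

The extra rows and columns are filled with zeros, as the "pad zero" tag in the commit message indicates, so the padded part of the MLP contributes nothing to the output.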

Upload folder using huggingface_hub
1a46b59
verified

clonefy committed on