Edit model card
YAML Metadata Warning: empty or missing yaml metadata in repo card (https://huggingface.co/docs/hub/model-cards#model-card-metadata)

World's first gptq 4bit quant of glm-4-9b-chat model.

Autogptq PR: https://github.com/AutoGPTQ/AutoGPTQ/pull/683

Please note ChatGLM has tendency to switch from English to Chinese in mid-reply or in direct reply to English prompt. This issue happens in both native and quantized model and needs further investigation.

Downloads last month
22
Safetensors
Model size
2.33B params
Tensor type
I32
·
FP16
·
Inference API
Unable to determine this model’s pipeline type. Check the docs .