chatglm2-6b-int4 / quantization.py
duzx16
Update quantized gemm kernel
5579a9f
File too large to display, you can check the raw version instead.