Weight output_partition_size = 576 is not divisible by weight quantization block_n = 128
#18 opened 1 day ago
by
yuwanpeng
Optimal `weight_block_size` for Intel AMX `amx_int8` `amx_tile`?
1
#17 opened 3 days ago
by
ubergarm
what about `ollama`?
#16 opened 3 days ago
by
ice6
是否有明确的sglang镜像版本推荐:)
1
#14 opened 6 days ago
by
wangkkk956