Can you add an fp8 or int4 quantization loader?

#3
by win10 - opened

Can you add an fp8 or int4 quantization loader?
Please, please let 3050 generate a picture in at least 30 seconds.

Alpha-VLLM org

We will released int8 or int4 model in the future. please stay tunned.

Sign up or log in to comment