Just a q8_0 and q4_0 version. If you need versions with other quantization parameters, please let me know
- Downloads last month
- 179
Inference API (serverless) does not yet support model repos that contain custom code.
Just a q8_0 and q4_0 version. If you need versions with other quantization parameters, please let me know