Text Generation
Transformers
PyTorch
gpt2
text-generation-inference
Inference Endpoints
4-bit precision
gptq
gpt-sw3-6.7b-v2-instruct-4bit-gptq / quantize_config.json

Commit History

Create quantize_config.json
6f72481

Ekgren commited on