## Guanaco 65B GPTQ

Original model by timdettmers: https://huggingface.co/timdettmers/guanaco-65b

### Folders

**ggml:** q4_0 and q4_1 quantizations

**gptq:** compatible with both the Triton and CUDA branches
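
Since the two formats live in separate folders, you probably only want to download one of them. Below is a minimal sketch of doing that with `snapshot_download` from `huggingface_hub`, assuming the folders are named `ggml` and `gptq` as listed above. The helper names `folder_patterns` and `download_folder` are made up for this example, and `repo_id` is a placeholder you need to fill in with this repository's ID.

```python
from typing import List

# Folder names taken from the list above.
KNOWN_FOLDERS = ("ggml", "gptq")


def folder_patterns(folder: str) -> List[str]:
    """Return glob patterns matching every file under one top-level folder."""
    if folder not in KNOWN_FOLDERS:
        raise ValueError(f"unknown folder {folder!r}, expected one of {KNOWN_FOLDERS}")
    return [f"{folder}/*"]


def download_folder(repo_id: str, folder: str) -> str:
    """Download only the files under `folder` and return the local snapshot path.

    `repo_id` is a placeholder: pass the ID of this repository.
    """
    # Imported here so the helper above works without huggingface_hub installed.
    from huggingface_hub import snapshot_download

    return snapshot_download(repo_id=repo_id, allow_patterns=folder_patterns(folder))
```

For example, `download_folder("<this-repo-id>", "gptq")` would fetch only the GPTQ files and skip the ggml ones.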