seonglae
/

llama-2-13b-chat-hf-gptq

Text Generation

text-generation-inference

Model card Files Files and versions Community

llama-2-13b-chat-hf-gptq

1 contributor

History: 4 commits

seonglae's picture

Create README.md

ff3c993 about 1 year ago

.gitattributes

1.52 kB

initial commit about 1 year ago
README.md

931 Bytes

Create README.md about 1 year ago
config.json

625 Bytes

build: AutoGPTQ for meta-llama/Llama-2-13b-chat-hf: 4bits, gr128, desc_act=False about 1 year ago
generation_config.json

170 Bytes

build: AutoGPTQ for meta-llama/Llama-2-13b-chat-hf: 4bits, gr128, desc_act=False about 1 year ago
gptq_model-4bit-128g.safetensors

7.26 GB
LFS

build: AutoGPTQ for meta-llama/Llama-2-13b-chat-hf: 4bits, gr128, desc_act=False about 1 year ago
quantize_config.json

225 Bytes

build: AutoGPTQ for meta-llama/Llama-2-13b-chat-hf: 4bits, gr128, desc_act=False about 1 year ago