Update README.md
README.md
CHANGED
@@ -17,7 +17,7 @@ tags:
 
 These files are GPTQ model files for [Meta's Llama 2 70B](https://huggingface.co/meta-llama/Llama-2-70b-hf/tree/main) but with new FP16 files, made with the last transformers version. (transformers-4.32.0.dev0)
 
-
+GQA Works with exllama, but not GPTQ for LLaMA/AutoGPTQ.
 
 ## Quant parameters
 