Update README.md
README.md
```diff
@@ -18,6 +18,14 @@ This repo contains 4bit GPTQ models for GPU inference, quantised using [GPTQ-for
 * [4bit GGML models for CPU inference](https://huggingface.co/TheBloke/wizardLM-7B-GGML)
 * [Unquantised model in HF format](https://huggingface.co/TheBloke/wizardLM-7B-HF)
 
+## PERFORMANCE ISSUES
+
+I am currently working on re-creating these GPTQs due to performance issues reported by many people.
+
+If you've not yet downloaded the models, you might want to wait an hour to see if the new files I'm making now will fix this problem.
+
+This message will disappear once the problem is resolved.
+
 ## GIBBERISH OUTPUT IN `text-generation-webui`?
 
 Please read the Provided Files section below. You should use `wizardLM-7B-GPTQ-4bit-128g.no-act-order.safetensors` unless you are able to use the latest GPTQ-for-LLaMa code.
```
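The file-selection guidance in the diff above can be sketched as a small helper. This is only an illustration of the README's rule, not code from the repo; the act-order filename is an assumption (only the no-act-order name appears in the source):

```python
def pick_gptq_file(has_latest_gptq_for_llama: bool) -> str:
    """Mirror the README guidance: the no-act-order file works with older
    GPTQ-for-LLaMa code; an act-order variant (hypothetical name here)
    would require the latest GPTQ-for-LLaMa."""
    if has_latest_gptq_for_llama:
        # Hypothetical filename, assumed by analogy with the no-act-order file.
        return "wizardLM-7B-GPTQ-4bit-128g.act-order.safetensors"
    # The file the README explicitly recommends for most users.
    return "wizardLM-7B-GPTQ-4bit-128g.no-act-order.safetensors"


print(pick_gptq_file(False))
```

In other words: unless you know your GPTQ-for-LLaMa checkout is current, default to the `no-act-order` file.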