TheBloke
/

wizardLM-7B-GPTQ

Text Generation

text-generation-inference

4-bit precision

Model card Files Files and versions Community

TheBloke commited on Apr 29, 2023

Commit

0091d6b

·

1 Parent(s): 2180bec

Update README.md

Files changed (1) hide show

README.md +9 -7

README.md CHANGED Viewed

@@ -20,17 +20,19 @@ This repo contains 4bit GPTQ models for GPU inference, quantised using [GPTQ-for
 ## How to easily download and use this model in text-generation-webui
-Load text-generation-webui as you normally do.
 1. Click the **Model tab**.
-2. Under **Download custom model or LoRA**, enter this repo name: `TheBloke/wizardLM-7B-GPTQ`.
 3. Click **Download**.
 4. Wait until it says it's finished downloading.
-5. As this is a GPTQ model, fill in the `GPTQ parameters` on the right: `Bits = 4`, `Groupsize = 128`, `model_type = Llama`
-6. Now click the **Refresh** icon next to **Model** in the top left.
-7. In the **Model drop-down**: choose this model: `wizardLM-7B-GPTQ`.
-8. Click **Reload the Model** in the top right.
-9. Once it says it's loaded, click the **Text Generation tab** and enter a prompt!
 ## GIBBERISH OUTPUT IN `text-generation-webui`?

 ## How to easily download and use this model in text-generation-webui
+Open the text-generation-webui UI as normal.
 1. Click the **Model tab**.
+2. Under **Download custom model or LoRA**, enter `TheBloke/wizardLM-7B-GPTQ`.
 3. Click **Download**.
 4. Wait until it says it's finished downloading.
+5. Click the **Refresh** icon next to **Model** in the top left.
+6. In the **Model drop-down**: choose the model you just downloaded,`wizardLM-7B-GPTQg`.
+7. If you see an error in the bottom right, ignore it - it's temporary.
+8. Fill out the `GPTQ parameters` on the right: `Bits = 4`, `Groupsize = 128`, `model_type = Llama`
+9. Click **Save settings for this model** in the top right.
+10. Click **Reload the Model** in the top right.
+11. Once it says it's loaded, click the **Text Generation tab** and enter a prompt!
 ## GIBBERISH OUTPUT IN `text-generation-webui`?