Update README.md
README.md
@@ -3,7 +3,7 @@ license: other
 inference: false
 ---
 
-# Alpaca LoRA GPTQ 4bit
+# Alpaca LoRA 65B GPTQ 4bit
 
 This is a [GPTQ-for-LLaMa](https://github.com/qwopqwop200/GPTQ-for-LLaMa) 4bit quantisation of [changsung's alpaca-lora-65B](https://huggingface.co/chansung/alpaca-lora-65b).
 
@@ -15,6 +15,8 @@ I can't guarantee that the two 128g files will work in only 40GB of VRAM.
 
 I haven't specifically tested VRAM requirements yet but will aim to do so at some point. If you have any experiences to share, please do so in the comments.
 
+If you want to try CPU inference, you can use my GGML repo instead: [TheBloke/alpaca-lora-65B-GGML](https://huggingface.co/TheBloke/alpaca-lora-65B-GGML).
+
 ## GIBBERISH OUTPUT IN `text-generation-webui`?
 
 Please read the Provided Files section below. You should use `alpaca-lora-65B-GPTQ-4bit-128g.no-act-order.safetensors` unless you are able to use the latest Triton branch of GPTQ-for-LLaMa.
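As a minimal sketch of how the recommended file could be fetched, the snippet below uses `huggingface_hub`. The `repo_id` is an assumption inferred from the model name in this README; the filename is the `no-act-order` file recommended above.

```python
# Minimal sketch: download the recommended no-act-order GPTQ file.
# NOTE: repo_id is an assumption based on the model name in this README;
# the filename is the one recommended in the text above.
from huggingface_hub import hf_hub_download

model_path = hf_hub_download(
    repo_id="TheBloke/alpaca-lora-65B-GPTQ-4bit",  # assumed repository id
    filename="alpaca-lora-65B-GPTQ-4bit-128g.no-act-order.safetensors",
)
print(model_path)  # local cache path of the downloaded .safetensors file
```

The resulting path can then be pointed at by `text-generation-webui` or whichever GPTQ-for-LLaMa loader you use.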