Updating model files
---
license: other
library_name: transformers
pipeline_tag: text-generation
datasets:
- RyokoAI/ShareGPT52K
- Hello-SimpleAI/HC3
tags:
- gptq
inference: false
---
<div style="width: 100%;">
    <img src="https://i.imgur.com/EBdldam.jpg" alt="TheBlokeAI" style="width: 100%; min-width: 400px; display: block; margin: auto;">
</div>
<div style="display: flex; justify-content: space-between; width: 100%;">
    <div style="display: flex; flex-direction: column; align-items: flex-start;">
        <p><a href="https://discord.gg/UBgz4VXf">Chat & support: my new Discord server</a></p>
    </div>
    <div style="display: flex; flex-direction: column; align-items: flex-end;">
        <p><a href="https://www.patreon.com/TheBlokeAI">Want to contribute? Patreon coming soon!</a></p>
    </div>
</div>
# Koala: A Dialogue Model for Academic Research

This repo contains the weights of the Koala 7B model produced at Berkeley. It is the result of combining the diffs from https://huggingface.co/young-geng/koala with the original Llama 7B model.
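Conceptually, the diff-based release means each Koala parameter tensor is recovered by adding the published diff tensor to the corresponding base Llama tensor. A minimal sketch of that idea, using made-up tensors and a hypothetical `merge_checkpoints` helper (this is an illustration of the general delta-weight technique, not the actual EasyLM merge code):

```python
import numpy as np

# Hypothetical stand-ins for one parameter tensor from each checkpoint.
base_weight = np.array([[0.10, -0.20], [0.30, 0.40]])    # original Llama weight
released_diff = np.array([[0.01, 0.02], [-0.03, 0.00]])  # published Koala diff

# Recovering the fine-tuned weight is elementwise addition,
# applied tensor-by-tensor across the whole checkpoint.
merged_weight = base_weight + released_diff

def merge_checkpoints(base: dict, diffs: dict) -> dict:
    """Apply a diff checkpoint to a base checkpoint, key by key."""
    return {name: base[name] + diffs[name] for name in base}

merged = merge_checkpoints({"w": base_weight}, {"w": released_diff})
```

This is why the diffs alone are useless without the original Llama weights: they only encode the change introduced by fine-tuning.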
I have the following Koala model repositories available:

* [Unquantized 13B model in HF format](https://huggingface.co/TheBloke/koala-13B-HF)
* [GPTQ quantized 4bit 13B model in `pt` and `safetensors` formats](https://huggingface.co/TheBloke/koala-13B-GPTQ-4bit-128g)
* [4-bit, 5-bit and 8-bit GGML models for `llama.cpp`](https://huggingface.co/TheBloke/koala-13B-GGML)

**7B models:**
* [Unquantized 7B model in HF format](https://huggingface.co/TheBloke/koala-7B-HF)
* [Unquantized 7B model in GGML format for llama.cpp](https://huggingface.co/TheBloke/koala-7b-ggml-unquantized)
Details of the files provided:

* The older GPTQ code does not support all the latest features, so the quality may be fractionally lower.
* Command to create:
  * `python3 llama.py koala-7B-HF c4 --wbits 4 --true-sequential --groupsize 128 --save koala-7B-4bit-128g.no-act-order.ooba.pt`
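In that command, `--wbits 4 --groupsize 128` means each weight is stored in 4 bits, with one scale shared by every group of 128 consecutive weights. GPTQ itself picks the rounded values using second-order error compensation; the sketch below is illustrative only (plain round-to-nearest with hypothetical helper names), showing just what those two knobs control:

```python
import numpy as np

def quantize_groupwise(weights: np.ndarray, wbits: int = 4, groupsize: int = 128):
    """Round-to-nearest group-wise quantization: each group of `groupsize`
    consecutive weights shares one float scale. (GPTQ additionally corrects
    rounding error using second-order information; omitted here.)"""
    qmax = 2 ** (wbits - 1) - 1             # 7 for symmetric 4-bit
    flat = weights.reshape(-1, groupsize)   # one row per quantization group
    scales = np.abs(flat).max(axis=1, keepdims=True) / qmax
    q = np.clip(np.round(flat / scales), -qmax - 1, qmax).astype(np.int8)
    return q, scales

def dequantize(q: np.ndarray, scales: np.ndarray, shape) -> np.ndarray:
    """Reverse the mapping: scale the 4-bit codes back to floats."""
    return (q * scales).reshape(shape)

w = np.random.default_rng(0).normal(size=(4, 256)).astype(np.float32)
q, s = quantize_groupwise(w, wbits=4, groupsize=128)
w_hat = dequantize(q, s, w.shape)
max_err = np.abs(w - w_hat).max()  # bounded by half a scale step per group
```

Smaller group sizes mean more scales (more overhead) but lower quantization error, which is the trade-off the 128-group files make.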
## How to run in `text-generation-webui`

File `koala-7B-4bit-128g.no-act-order.ooba.pt` can be loaded the same as any other GPTQ file, without requiring any updates to [oobabooga's text-generation-webui](https://github.com/oobabooga/text-generation-webui).
```
PYTHONPATH="${PWD}:$PYTHONPATH" python \
  ... \
  --tokenizer_path=/content/LLaMA-7B/tokenizer.model
```
## Want to support my work?

I've had a lot of people ask if they can contribute. I love providing models and helping people, but it is starting to rack up pretty big cloud computing bills.

So if you're able and willing to contribute, it'd be most gratefully received, and will help me keep providing models and working on various AI projects.

Donors will get priority support on any and all AI/LLM/model questions, and I'll gladly quantise any model you'd like to try.

* Patreon: coming soon! (just awaiting approval)
* Ko-Fi: https://ko-fi.com/TheBlokeAI
* Discord: https://discord.gg/UBgz4VXf
## Further info

Check out the following links to learn more about the Berkeley Koala model.