TheBloke
/

Airoboros-L2-70b-2.2-GGUF

Transformers

GGUF

llama

text-generation-inference

Model card Files Files and versions Community

TheBloke commited on Sep 15, 2023

Commit

6a15066

•

1 Parent(s): bc17c31

Upload README.md

Browse files

Files changed (1) hide show

README.md +59 -2

README.md CHANGED Viewed

@@ -36,8 +36,6 @@ quantized_by: TheBloke
 This repo contains GGUF format model files for [Jon Durbin's Airoboros L2 70B 2.2](https://huggingface.co/jondurbin/airoboros-l2-70b-2.2).
-Note: these GGUF models were re-created on 15th September, as Jon has re-uploaded the original source weights. The first source upload was based on a new method for merging qLoRA weights. This has proved to cause problems, and therefore Jon has re-uploaded the weights in the usual way, and I have re-done all my GGUF and GPTQ models.
 <!-- description end -->
 <!-- README_GGUF.md-about-gguf start -->
 ### About GGUF
@@ -72,10 +70,12 @@ Here is an incomplate list of clients and libraries that are known to support GG
 A chat.
 USER: {prompt}
 ASSISTANT:
 ```
 <!-- prompt-template end -->
 <!-- compatibility_gguf start -->
 ## Compatibility
@@ -154,6 +154,63 @@ del airoboros-l2-70b-2.2.Q8_0.gguf-split-a airoboros-l2-70b-2.2.Q8_0.gguf-split-
 </details>
 <!-- README_GGUF.md-provided-files end -->
 <!-- README_GGUF.md-how-to-run start -->
 ## Example `llama.cpp` command

 This repo contains GGUF format model files for [Jon Durbin's Airoboros L2 70B 2.2](https://huggingface.co/jondurbin/airoboros-l2-70b-2.2).
 <!-- description end -->
 <!-- README_GGUF.md-about-gguf start -->
 ### About GGUF
 A chat.
 USER: {prompt}
 ASSISTANT:
 ```
 <!-- prompt-template end -->
 <!-- compatibility_gguf start -->
 ## Compatibility
 </details>
 <!-- README_GGUF.md-provided-files end -->
+<!-- README_GGUF.md-how-to-download start -->
+## How to download GGUF files
+**Note for manual downloaders:** You almost never want to clone the entire repo! Multiple different quantisation formats are provided, and most users only want to pick and download a single file.
+The following clients/libraries will automatically download models for you, providing a list of available models to choose from:
+- LM Studio
+- LoLLMS Web UI
+- Faraday.dev
+### In `text-generation-webui`
+Under Download Model, you can enter the model repo: TheBloke/Airoboros-L2-70b-2.2-GGUF and below it, a specific filename to download, such as: airoboros-l2-70b-2.2.q4_K_M.gguf.
+Then click Download.
+### On the command line, including multiple files at once
+I recommend using the `huggingface-hub` Python library:
+```shell
+pip3 install huggingface-hub>=0.17.1
+```
+Then you can download any individual model file to the current directory, at high speed, with a command like this:
+```shell
+huggingface-cli download TheBloke/Airoboros-L2-70b-2.2-GGUF airoboros-l2-70b-2.2.q4_K_M.gguf --local-dir . --local-dir-use-symlinks False
+```
+<details>
+  <summary>More advanced huggingface-cli download usage</summary>
+You can also download multiple files at once with a pattern:
+```shell
+huggingface-cli download TheBloke/Airoboros-L2-70b-2.2-GGUF --local-dir . --local-dir-use-symlinks False --include='*Q4_K*gguf'
+```
+For more documentation on downloading with `huggingface-cli`, please see: [HF -> Hub Python Library -> Download files -> Download from the CLI](https://huggingface.co/docs/huggingface_hub/guides/download#download-from-the-cli).
+To accelerate downloads on fast connections (1Gbit/s or higher), install `hf_transfer`:
+```shell
+pip3 install hf_transfer
+```
+And set environment variable `HF_HUB_ENABLE_HF_TRANSFER` to `1`:
+```shell
+HUGGINGFACE_HUB_ENABLE_HF_TRANSFER=1 huggingface-cli download TheBloke/Airoboros-L2-70b-2.2-GGUF airoboros-l2-70b-2.2.q4_K_M.gguf --local-dir . --local-dir-use-symlinks False
+```
+Windows CLI users: Use `set HUGGINGFACE_HUB_ENABLE_HF_TRANSFER=1` before running the download command.
+</details>
+<!-- README_GGUF.md-how-to-download end -->
 <!-- README_GGUF.md-how-to-run start -->
 ## Example `llama.cpp` command