TheBloke
/

Llama-2-70B-Chat-GGML

Text Generation

Model card Files Files and versions Community

TheBloke commited on Jul 23, 2023

Commit

ca3c949

•

1 Parent(s): 5f8a081

Update README.md

Files changed (1) hide show

README.md +4 -2

README.md CHANGED Viewed

@@ -1,7 +1,7 @@
 This model is still uploading. README will be here shortly.
 If you're too impatient to wait for that (of course you are), to run these files you need:
-1. llama.cpp as of this commit: https://github.com/ggerganov/llama.cpp/commit/e76d630df17e235e6b9ef416c45996765d2e36fb
 2. To add new command line parameter `-gqa 8`
 Example command:
@@ -9,4 +9,6 @@ Example command:
 /workspace/git/llama.cpp/main -m llama-2-70b-chat/ggml/llama-2-70b-chat.ggmlv3.q4_0.bin -gqa 8 -t 13 -p "[INST] <<SYS>>You are a helpful assistant<</SYS>>Write a story about llamas[/INST]"
 ```
-There is no CUDA support at this time, but it should hopefully be coming soon.

 This model is still uploading. README will be here shortly.
 If you're too impatient to wait for that (of course you are), to run these files you need:
+1. llama.cpp as of [this commit or later](https://github.com/ggerganov/llama.cpp/commit/e76d630df17e235e6b9ef416c45996765d2e36fb)
 2. To add new command line parameter `-gqa 8`
 Example command:
 /workspace/git/llama.cpp/main -m llama-2-70b-chat/ggml/llama-2-70b-chat.ggmlv3.q4_0.bin -gqa 8 -t 13 -p "[INST] <<SYS>>You are a helpful assistant<</SYS>>Write a story about llamas[/INST]"
 ```
+There is no CUDA support at this time, but it should hopefully be coming soon.
+There is no support in third-party UIs or Python libraries (llama-cpp-python, ctransformers) yet. That will come in due course.