shaowenchen/chinese-llama-2-7b-16k-gguf

Text Generation

shaowenchen committed on Sep 12, 2023

Commit 50090a2 • 1 Parent(s): 3bbf3f3

Update README.md

Files changed (1)

README.md +8 -0

@@ -39,6 +39,14 @@ tags:
 | chinese-llama-2-7b-16k.Q8_0.gguf   | Q8_0         | 6.9 GB |
 | chinese-llama-2-7b-16k.gguf        | full         | 13 GB  |
+Usage:
+```
+docker run --rm -it -p 8000:8000 -v /path/to/models:/models -e MODEL=/models/gguf-model-name.gguf hubimage/llama-cpp-python:latest
+```
+and you can view http://localhost:8000/docs to see the swagger UI.
 ## Provided images
 | Name                                             | Quant method | Size    |
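
The usage line added in this commit starts a server on port 8000 and points to its Swagger UI at `/docs`. As a rough sketch of calling that server from a script, the snippet below assumes the image runs the llama-cpp-python web server with its OpenAI-compatible `/v1/completions` route; the commit itself only mentions `/docs`, so the route, the helper names `build_completion_request` and `complete`, and the sample prompt are illustrative assumptions.

```python
import json
import urllib.request

# Assumption: the container from the usage line is listening on localhost:8000
# and exposes an OpenAI-compatible /v1/completions route.
BASE_URL = "http://localhost:8000"


def build_completion_request(prompt, max_tokens=64, base_url=BASE_URL):
    """Build (but do not send) a POST request for a text completion."""
    payload = {"prompt": prompt, "max_tokens": max_tokens}
    return urllib.request.Request(
        url=f"{base_url}/v1/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def complete(prompt, **kwargs):
    """Send the request and return the text of the first choice."""
    req = build_completion_request(prompt, **kwargs)
    with urllib.request.urlopen(req, timeout=120) as resp:
        body = json.load(resp)
    return body["choices"][0]["text"]


if __name__ == "__main__":
    # Requires the docker container to be running.
    print(complete("你好，请介绍一下你自己。"))
```

Building the request separately from sending it makes the payload easy to inspect before the container is up. If the route differs in a given image version, the Swagger UI at `/docs` should list the endpoints that are actually available.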