shaowenchen
commited on
Commit
•
50090a2
1
Parent(s):
3bbf3f3
Update README.md
Browse files
README.md
CHANGED
@@ -39,6 +39,14 @@ tags:
|
|
39 |
| chinese-llama-2-7b-16k.Q8_0.gguf | Q8_0 | 6.9 GB |
|
40 |
| chinese-llama-2-7b-16k.gguf | full | 13 GB |
|
41 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
42 |
## Provided images
|
43 |
|
44 |
| Name | Quant method | Size |
|
|
|
39 |
| chinese-llama-2-7b-16k.Q8_0.gguf | Q8_0 | 6.9 GB |
|
40 |
| chinese-llama-2-7b-16k.gguf | full | 13 GB |
|
41 |
|
42 |
+
Usage:
|
43 |
+
|
44 |
+
```
|
45 |
+
docker run --rm -it -p 8000:8000 -v /path/to/models:/models -e MODEL=/models/gguf-model-name.gguf hubimage/llama-cpp-python:latest
|
46 |
+
```
|
47 |
+
|
48 |
+
and you can view http://localhost:8000/docs to see the swagger UI.
|
49 |
+
|
50 |
## Provided images
|
51 |
|
52 |
| Name | Quant method | Size |
|