shaowenchen committed on
Commit
50090a2
1 Parent(s): 3bbf3f3

Update README.md
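
The usage note added in this commit starts a container that publishes port 8000 and points at `/docs` for the Swagger UI. As a rough illustration of how a client might talk to that server, here is a minimal sketch. It assumes the `hubimage/llama-cpp-python` image runs the stock llama-cpp-python server with its OpenAI-compatible `/v1/completions` route and the usual `choices[0].text` response shape; the commit itself only mentions `/docs`. The helper names `build_completion_request` and `complete` are hypothetical and not part of the README.

```python
import json
import urllib.request

# Port published by `-p 8000:8000` in the README's docker command.
BASE_URL = "http://localhost:8000"


def build_completion_request(prompt, max_tokens=64, base_url=BASE_URL):
    """Build (but do not send) a POST request for the completions route.

    Assumption: the server exposes the OpenAI-style /v1/completions path.
    """
    payload = {"prompt": prompt, "max_tokens": max_tokens}
    return urllib.request.Request(
        url=f"{base_url}/v1/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def complete(prompt, max_tokens=64, base_url=BASE_URL, timeout=120):
    """Send the request and return the text of the first choice.

    Assumption: the response follows the OpenAI completions shape,
    i.e. {"choices": [{"text": ...}, ...]}.
    """
    request = build_completion_request(prompt, max_tokens, base_url)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        body = json.load(response)
    return body["choices"][0]["text"]
```

With the container running, something like `print(complete("The capital of France is"))` should print the generated continuation. Building the request separately from sending it keeps the payload easy to inspect without a live server.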

Files changed (1)
  1. README.md +8 -0
README.md CHANGED
@@ -39,6 +39,14 @@ tags:
  | chinese-llama-2-7b-16k.Q8_0.gguf | Q8_0 | 6.9 GB |
  | chinese-llama-2-7b-16k.gguf | full | 13 GB |
 
+ Usage:
+
+ ```
+ docker run --rm -it -p 8000:8000 -v /path/to/models:/models -e MODEL=/models/gguf-model-name.gguf hubimage/llama-cpp-python:latest
+ ```
+
+ and you can view http://localhost:8000/docs to see the swagger UI.
+
  ## Provided images
 
  | Name | Quant method | Size |