sahilsuneja
committed on
Update README.md
README.md
CHANGED
@@ -106,7 +106,7 @@ _Note: first prompt may be slower as there is a slight warmup time_
 #### start the server
 
 ```bash
-model=ibm-fms/
+model=ibm-fms/llama2-70b-accelerator
 volume=$PWD/data # share a volume with the Docker container to avoid downloading weights every run
 
 docker run --gpus all --shm-size 1g -p 8080:80 -v $volume:/data ghcr.io/huggingface/text-generation-inference:latest --model-id $model