Update README.md
Browse files
README.md
CHANGED
@@ -14,7 +14,7 @@ This repo contains model files for [llama2.c 110M tinystories](https://huggingfa
|
|
14 |
This model was pruned with [SparseGPT](https://arxiv.org/abs/2301.00774), using [SparseML](https://github.com/neuralmagic/sparseml).
|
15 |
|
16 |
## Inference
|
17 |
-
Install [NM-vLLM](https://github.com/neuralmagic/nm-vllm) for fast inference:
|
18 |
```bash
|
19 |
pip install nm-vllm[sparse]
|
20 |
```
|
|
|
14 |
This model was pruned with [SparseGPT](https://arxiv.org/abs/2301.00774), using [SparseML](https://github.com/neuralmagic/sparseml).
|
15 |
|
16 |
## Inference
|
17 |
+
Install [NM-vLLM](https://github.com/neuralmagic/nm-vllm) for fast inference and low memory-usage:
|
18 |
```bash
|
19 |
pip install nm-vllm[sparse]
|
20 |
```
|