mgoin commited on
Commit
ca4faba
1 Parent(s): 2ca503f

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +7 -1
README.md CHANGED
@@ -24,4 +24,10 @@ base_model: mistralai/Mistral-7B-Instruct-v0.3
24
 
25
  This model is ready for optimized inference using the Marlin mixed-precision kernels in vLLM: https://github.com/vllm-project/vllm
26
 
27
- ![image/png](https://cdn-uploads.huggingface.co/production/uploads/60466e4b4f40b01b66151416/3bX2Hqj4LaJxFhPHRucAn.png)
 
 
 
 
 
 
 
24
 
25
  This model is ready for optimized inference using the Marlin mixed-precision kernels in vLLM: https://github.com/vllm-project/vllm
26
 
27
+ Simply start this model as an inference server with:
28
+ ```bash
29
+ python -m vllm.entrypoints.openai.api_server --model neuralmagic/Mistral-7B-Instruct-v0.3-GPTQ-4bit
30
+ ```
31
+
32
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/60466e4b4f40b01b66151416/SC_tYXjoS3yIoOYtfqZ2E.png)
33
+