JustinLin610
commited on
Commit
•
8dfcd82
1
Parent(s):
22d9ef9
Update README.md
Browse files
README.md
CHANGED
@@ -19,7 +19,7 @@ Qwen2-7B-Instruct-GPTQ-Int8 supports a context length of up to 131,072 tokens, e
|
|
19 |
|
20 |
For more details, please refer to our [blog](https://qwenlm.github.io/blog/qwen2/), [GitHub](https://github.com/QwenLM/Qwen2), and [Documentation](https://qwen.readthedocs.io/en/latest/).
|
21 |
|
22 |
-
**Note**: If you encounter ``RuntimeError: probability tensor contains either `inf`, `nan` or element < 0`` during inference with ``
|
23 |
<br>
|
24 |
|
25 |
## Model Details
|
|
|
19 |
|
20 |
For more details, please refer to our [blog](https://qwenlm.github.io/blog/qwen2/), [GitHub](https://github.com/QwenLM/Qwen2), and [Documentation](https://qwen.readthedocs.io/en/latest/).
|
21 |
|
22 |
+
**Note**: If you encounter ``RuntimeError: probability tensor contains either `inf`, `nan` or element < 0`` during inference with ``transformers``, we recommand [deploying this model with vLLM](https://qwen.readthedocs.io/en/latest/deployment/vllm.html).
|
23 |
<br>
|
24 |
|
25 |
## Model Details
|