Update README.md
Browse files
README.md
CHANGED
@@ -26,10 +26,10 @@ Among its main features are:
|
|
26 |
- device: Nvidia A100 40G
|
27 |
- batch size: 8
|
28 |
|
29 |
-
**Since early chatGLM version
|
30 |
|
31 |
|
32 |
-
**According to [this discussion](https://huggingface.co/TMElyralab/lyraChatGLM/discussions/6), this bug has been fixed and the speed on batch_size=8 reachs up to 137 tokens/s
|
33 |
|
34 |
|version|speed|
|
35 |
|:-:|:-:|
|
|
|
26 |
- device: Nvidia A100 40G
|
27 |
- batch size: 8
|
28 |
|
29 |
+
**Since early chatGLM version didn't suport batch inference, `original` in below table was measured on batch_size=1**
|
30 |
|
31 |
|
32 |
+
**According to [this discussion](https://huggingface.co/TMElyralab/lyraChatGLM/discussions/6), this bug has been fixed and the speed on batch_size=8 reachs up to 137 tokens/s. We will evaluate and update the latest performance.**
|
33 |
|
34 |
|version|speed|
|
35 |
|:-:|:-:|
|