TMElyralab/lyraChatGLM

benleader committed on May 17, 2023

Commit b98f261 · 1 Parent(s): 77bcf50

Update README.md

Files changed (1)

README.md +2 -2

@@ -26,10 +26,10 @@ Among its main features are:
 - device: Nvidia A100 40G
 - batch size: 8
-**Since early chatGLM version dosen't suport batch inference, `original` in below table is measured on batch_size=1**
-**According to [this discussion](https://huggingface.co/TMElyralab/lyraChatGLM/discussions/6), this bug has been fixed and the speed on batch_size=8 reachs up to 137 tokens/s**
+**Since early chatGLM version didn't suport batch inference, `original` in below table was measured on batch_size=1**
+**According to [this discussion](https://huggingface.co/TMElyralab/lyraChatGLM/discussions/6), this bug has been fixed and the speed on batch_size=8 reachs up to 137 tokens/s. We will evaluate and update the latest performance.**
 |version|speed|
 |:-:|:-:|