benleader commited on
Commit
b98f261
1 Parent(s): 77bcf50

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -26,10 +26,10 @@ Among its main features are:
26
  - device: Nvidia A100 40G
27
  - batch size: 8
28
 
29
- **Since early chatGLM version dosen't suport batch inference, `original` in below table is measured on batch_size=1**
30
 
31
 
32
- **According to [this discussion](https://huggingface.co/TMElyralab/lyraChatGLM/discussions/6), this bug has been fixed and the speed on batch_size=8 reachs up to 137 tokens/s**
33
 
34
  |version|speed|
35
  |:-:|:-:|
 
26
  - device: Nvidia A100 40G
27
  - batch size: 8
28
 
29
+ **Since early chatGLM version didn't suport batch inference, `original` in below table was measured on batch_size=1**
30
 
31
 
32
+ **According to [this discussion](https://huggingface.co/TMElyralab/lyraChatGLM/discussions/6), this bug has been fixed and the speed on batch_size=8 reachs up to 137 tokens/s. We will evaluate and update the latest performance.**
33
 
34
  |version|speed|
35
  |:-:|:-:|