markoarnauto
committed on
Upload README.md with huggingface_hub
README.md
CHANGED
@@ -57,3 +57,4 @@ curl http://localhost:8000/v1/completions -H "Content-Type: application/json
 "prompt": "San Francisco is a"
 } '
 ```
+⚡ This model is optimized to handle heavy workloads, providing a total throughput of **4623 tokens per second** using one NVIDIA L40S ⚡
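
For readers checking the added claim: a total-throughput figure of this kind is commonly computed as the sum of tokens processed across all concurrent requests, divided by wall-clock time. The sketch below only illustrates that arithmetic. The function name `total_throughput` and the sample numbers are hypothetical; they are not taken from the README or from any benchmark run, and the commit does not say how the 4623 tokens per second figure was measured.

```python
def total_throughput(tokens_per_request, elapsed_seconds):
    """Aggregate tokens per second across concurrent requests.

    Assumption: "total throughput" means all tokens counted over the
    run divided by wall-clock time; the README does not define it.
    """
    if elapsed_seconds <= 0:
        raise ValueError("elapsed_seconds must be positive")
    return sum(tokens_per_request) / elapsed_seconds


# Hypothetical run: four concurrent requests finishing within 10 seconds.
sample_token_counts = [256, 256, 128, 360]
print(total_throughput(sample_token_counts, 10.0))
```

A per-request rate would divide each count by that request's own latency instead, so the two numbers are not interchangeable when comparing benchmarks.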