imone commited on
Commit
65238d2
1 Parent(s): cfeba57

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +8 -7
README.md CHANGED
@@ -82,7 +82,7 @@ If you want to deploy the server as an online service, you can use `--api-keys s
82
 
83
  | Model | Size | Context | Weights | Serving |
84
  |-------------------|------|---------|------------------------------------------------------------------|------------------------------------------------------------------------------------------------------------------|
85
- | OpenChat 3.5 0106 | 7B | 8192 | [Huggingface](https://huggingface.co/openchat/openchat-3.5-0106) | `python -m ochat.serving.openai_api_server --model openchat/openchat-3.5-0106 --engine-use-ray --worker-use-ray` |
86
 
87
  <details>
88
  <summary>Example request (click to expand)</summary>
@@ -217,12 +217,13 @@ All models are evaluated in chat mode (e.g. with the respective conversation tem
217
 
218
  🔥 OpenChat-3.5 0106 (7B) now outperforms Grok-0 (33B) on **all 4 benchmarks** and Grok-1 (???B) on average and **3/4 benchmarks**.
219
 
220
- | | License | # Param | Average | MMLU | HumanEval | MATH | GSM8k |
221
- |-------------------|-------------|---------|----------|------|-----------|----------|----------|
222
- | OpenChat 3.5 1210 | Apache-2.0 | **7B** | **60.1** | 65.3 | **68.9** | **28.9** | **77.3** |
223
- | OpenChat 3.5 | Apache-2.0 | **7B** | 56.4 | 64.3 | 55.5 | 28.6 | **77.3** |
224
- | Grok-0 | Proprietary | 33B | 44.5 | 65.7 | 39.7 | 15.7 | 56.8 |
225
- | Grok-1 | Proprietary | ???B | 55.8 | 73 | 63.2 | 23.9 | 62.9 |
 
226
 
227
  *: Grok results are reported by [X.AI](https://x.ai/).
228
 
 
82
 
83
  | Model | Size | Context | Weights | Serving |
84
  |-------------------|------|---------|------------------------------------------------------------------|------------------------------------------------------------------------------------------------------------------|
85
+ | OpenChat-3.5-0106 | 7B | 8192 | [Huggingface](https://huggingface.co/openchat/openchat-3.5-0106) | `python -m ochat.serving.openai_api_server --model openchat/openchat-3.5-0106 --engine-use-ray --worker-use-ray` |
86
 
87
  <details>
88
  <summary>Example request (click to expand)</summary>
 
217
 
218
  🔥 OpenChat-3.5 0106 (7B) now outperforms Grok-0 (33B) on **all 4 benchmarks** and Grok-1 (???B) on average and **3/4 benchmarks**.
219
 
220
+ | | License | # Param | Average | MMLU | HumanEval | MATH | GSM8k |
221
+ |-----------------------|-------------|---------|----------|--------|-----------|----------|----------|
222
+ | **OpenChat-3.5-0106** | Apache-2.0 | **7B** | **70.0** | 65.8 | **71.3** | **29.3** | **77.4** |
223
+ | OpenChat-3.5-1210 | Apache-2.0 | **7B** | 60.1 | 65.3 | 68.9 | 28.9 | 77.3 |
224
+ | OpenChat-3.5 | Apache-2.0 | **7B** | 56.4 | 64.3 | 55.5 | 28.6 | 77.3 |
225
+ | Grok-0 | Proprietary | 33B | 44.5 | 65.7 | 39.7 | 15.7 | 56.8 |
226
+ | Grok-1 | Proprietary | ???B | 55.8 | **73** | 63.2 | 23.9 | 62.9 |
227
 
228
  *: Grok results are reported by [X.AI](https://x.ai/).
229