Update README.md
Browse files
README.md
CHANGED
@@ -40,12 +40,13 @@ The models are evaluated using open-ended and multiple-choice math problems from
|
|
40 |
|---------------------------|---------------|-----------|-----------|-----------|
|
41 |
| MAmmoTH-7B | **Hybrid** | 53.6 | 31.5 | 44.5 |
|
42 |
| MAmmoTH-Coder-7B | **Hybrid** | 59.4 | 33.4 | 47.2 |
|
43 |
-
| MetaMath-7B-Mistral | **CoT** |
|
44 |
| OpenChat-3.5-7B | **CoT** | 77.3 | 28.6 | 49.6 |
|
|
|
45 |
| DeepSeek-Coder-34B | **PoT** | 58.2 | 35.3 | 46.5 |
|
46 |
| Grok-1 | **CoT** | 62.9 | 15.7 | - |
|
47 |
| QWen-72B | **CoT** | 78.9 | 35.2 | - |
|
48 |
-
|
|
49 |
| MAmmoTH-7B-Mistral | **Hybrid** | 75.0 | **40.0** | **52.5** |
|
50 |
|
51 |
## Usage
|
|
|
40 |
|---------------------------|---------------|-----------|-----------|-----------|
|
41 |
| MAmmoTH-7B | **Hybrid** | 53.6 | 31.5 | 44.5 |
|
42 |
| MAmmoTH-Coder-7B | **Hybrid** | 59.4 | 33.4 | 47.2 |
|
43 |
+
| MetaMath-7B-Mistral | **CoT** | 77.7 | 28.2 | 49.3 |
|
44 |
| OpenChat-3.5-7B | **CoT** | 77.3 | 28.6 | 49.6 |
|
45 |
+
| ChatGLM-3-6B | **CoT** | 72.3 | 25.7 | 45.6 |
|
46 |
| DeepSeek-Coder-34B | **PoT** | 58.2 | 35.3 | 46.5 |
|
47 |
| Grok-1 | **CoT** | 62.9 | 15.7 | - |
|
48 |
| QWen-72B | **CoT** | 78.9 | 35.2 | - |
|
49 |
+
| DeepSeek-67B-Chat | **CoT** | **84.1** | 32.6 | - |
|
50 |
| MAmmoTH-7B-Mistral | **Hybrid** | 75.0 | **40.0** | **52.5** |
|
51 |
|
52 |
## Usage
|