Update README.md
Browse files
README.md
CHANGED
@@ -41,4 +41,4 @@ print(output_text)
|
|
41 |
Zero-shot performance. Evaluated using select datasets from the [LM Evaluation Harness](https://github.com/EleutherAI/lm-evaluation-harness/tree/main) with additions:
|
42 |
| PIQA | HellaSwag | OpenbookQA | ARC-e | ARC-c | MMLU | CMMLU | WinoGrande |
|
43 |
| ----------- | ----------- | ----------- | ----------- | ----------- | ----------- | ----------- | ----------- |
|
44 |
-
| 0.7383
|
|
|
41 |
Zero-shot performance. Evaluated using select datasets from the [LM Evaluation Harness](https://github.com/EleutherAI/lm-evaluation-harness/tree/main) with additions:
|
42 |
| PIQA | HellaSwag | OpenbookQA | ARC-e | ARC-c | MMLU | CMMLU | WinoGrande |
|
43 |
| ----------- | ----------- | ----------- | ----------- | ----------- | ----------- | ----------- | ----------- |
|
44 |
+
| 0.7383±0.0103 | 0.5323±0.0050 | 0.3080±0.0207 | 0.7260±0.0092 | 0.4684±0.0146 | 0.6567±0.0038 | 0.5515±0.0045 | 0.6646±0.0133 |
|