Update README.md
README.md
CHANGED
@@ -42,8 +42,6 @@ This model is a Mixure of Experts (MoE) made with [mergekit](https://github.com/
 |agieval_sat_math | 0|acc |34.55|± | 3.21|
 | | |acc_norm|32.27|± | 3.16|
 
-Average: 45.29%
-
 ### GPT4All
 | Task |Version| Metric |Value| |Stderr|
 |-------------|------:|--------|----:|---|-----:|
@@ -60,16 +58,12 @@ Average: 45.29%
 | | |acc_norm|83.95|± | 0.86|
 |winogrande | 0|acc |78.69|± | 1.15|
 
-Average: 75.95%
-
 ### TruthfulQA
 | Task |Version|Metric|Value| |Stderr|
 |-------------|------:|------|----:|---|-----:|
 |truthfulqa_mc| 1|mc1 |44.55|± | 1.74|
 | | |mc2 |60.86|± | 1.57|
 
-Average: 60.86%
-
 ### Bigbench
 | Task |Version| Metric |Value| |Stderr|
 |------------------------------------------------|------:|---------------------|----:|---|-----:|
@@ -93,10 +87,6 @@ Average: 60.86%
 |bigbench_tracking_shuffled_objects_seven_objects| 0|multiple_choice_grade|19.03|± | 0.94|
 |bigbench_tracking_shuffled_objects_three_objects| 0|multiple_choice_grade|52.00|± | 2.89|
 
-Average: 46.4%
-
-Average score: 57.13%
-
 ## 🧩 Configuration
 
 ```yaml