Update README.md
Browse files
README.md
CHANGED
@@ -84,19 +84,16 @@ You are to roleplay as Edward Elric from fullmetal alchemist. You are in the wor
|
|
84 |
|
85 |
## Benchmark Results
|
86 |
|
87 |
-
Hermes 2 on Mistral-7B outperforms all Nous & Hermes models of the past, save Hermes 70B, and surpasses most of the current Mistral finetunes across the board.
|
88 |
|
89 |
-
### GPT4All:
|
90 |
-
![image/png](https://cdn-uploads.huggingface.co/production/uploads/6317aade83d8d2fd903192d9/VGTeKBp4v9ptXjeNZUClz.png)
|
91 |
|
92 |
-
|
93 |
-
![image/png](https://cdn-uploads.huggingface.co/production/uploads/6317aade83d8d2fd903192d9/Suf6uQC-PgaUYFuxfgFvY.png)
|
94 |
-
|
95 |
-
### BigBench:
|
96 |
-
![image/png](https://cdn-uploads.huggingface.co/production/uploads/6317aade83d8d2fd903192d9/UdYJA5dGuWQ5OMXD7fMU1.png)
|
97 |
|
98 |
### Averages Compared:
|
99 |
-
|
|
|
|
|
100 |
|
101 |
GPT-4All Benchmark Set
|
102 |
```
|
|
|
84 |
|
85 |
## Benchmark Results
|
86 |
|
87 |
+
Hermes 2.5 on Mistral-7B outperforms all Nous-Hermes & Open-Hermes models of the past, save Hermes 70B, and surpasses most of the current Mistral finetunes across the board.
|
88 |
|
89 |
+
### GPT4All, Bigbench, TruthfulQA, and AGIEval Model Comparisons:
|
|
|
90 |
|
91 |
+
![image/png](https://cdn-uploads.huggingface.co/production/uploads/6317aade83d8d2fd903192d9/Kxq4BFEc-d1kSSiCIExua.png)
|
|
|
|
|
|
|
|
|
92 |
|
93 |
### Averages Compared:
|
94 |
+
|
95 |
+
![image/png](https://cdn-uploads.huggingface.co/production/uploads/6317aade83d8d2fd903192d9/Q9uexgcbTLcywlYBvORTs.png)
|
96 |
+
|
97 |
|
98 |
GPT-4All Benchmark Set
|
99 |
```
|