HiroseKoichi committed
Commit 069cf1d
1 Parent(s): 2d08961
Update README.md
README.md
CHANGED
@@ -9,6 +9,14 @@ tags:
 ---
 
 # Llama-Salad-4x8B-V2
+Changes in V2:
+- Swapped Tess-2.0-Llama-3-8B for Llama-3-8B-Synthia-v3.5
+- Swapped L3-8B-Stheno-v3.1 for Llama-3-Soliloquy-8B-v2
+- Removed Llama3-OpenBioLLM-8B and added opus-v1.2-llama-3-8b-instruct-run3.5-epoch2.5
+
+V2 improves on V1 in all areas; it's not a massive improvement, but I can confidently say it's a direct upgrade. Llama-3-8B-Synthia-v3.5 is better than Tess-2.0-Llama-3-8B in every way; Llama-3-Soliloquy-8B-v2 is more intelligent than L3-8B-Stheno-v3.1 and has less bias towards NSFW content; and the inclusion of opus-v1.2-llama-3-8b-instruct-run3.5-epoch2.5 has greatly improved its storytelling and narration abilities.
+
+I really like the model selection in this one, so I don't know how much more I could improve it with another 4x8B merge. If I were to make a V3, swapping Meta-Llama-3-8B-Instruct would likely be the only change. I will try my hand at making an 8x8B merge in the future, but I still need to find some models to fill the gaps; making sure there are no routing conflicts between 8 different models at once will be the biggest challenge.
 
 
 # Details