macadeliccc committed
Commit: 543c7da · Parent: 74f4a14

Update README.md
README.md CHANGED
@@ -10,6 +10,10 @@ Credit to Fernando Fernandes and Eric Hartford for their project [laserRMT](http
 
 This model is a medium-sized MoE implementation based on [cognitivecomputations/dolphin-2.6-mistral-7b-dpo-laser](https://huggingface.co/cognitivecomputations/dolphin-2.6-mistral-7b-dpo-laser)
 
+A 2x7b configuration offers better performance than a standard 7b model, even when loaded in 4-bit.
+
+Loaded in 4-bit, this 2x7b model scores 0.8260 on HellaSwag, higher than the base model achieves on its own at full precision.
+
 ## Prompt Format
 
 This model follows the same prompt format as the aforementioned model.