SteelStorage
/

Lumosia-MoE-4x10.7

Text Generation

Mixture of Experts

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Steelskull commited on Jan 8

Commit

07e18df

•

1 Parent(s): 897667b

Create README.md

Files changed (1) hide show

README.md +37 -0

README.md ADDED Viewed

	@@ -0,0 +1,37 @@

+---
+# For reference on model card metadata, see the spec: https://github.com/huggingface/hub-docs/blob/main/modelcard.md?plain=1
+# Doc / guide: https://huggingface.co/docs/hub/model-cards
+{}
+---
+# Model Card for Model ID
+<!-- Provide a quick summary of what the model is/does. -->
+This model is an extreme experiment, I wanted to test making an MoE out of multiple High Performing Solar models. Let me know what you think
+## Model Details
+### Model Description
+This MoE model is an extreme experiment, I wanted to test making an MoE out of multiple High Performing Solar models. Let me know what you think.
+Thinking about finetuning on a RP dataset later on to direct the model more
+- **Model type:** [More Information Needed]
+### Model Sources [optional]
+model_name: Lumosia-MoE-4x10.7
+base_model: DopeorNope/SOLARC-M-10.7B
+gate_mode: hidden
+dtype: bfloat16
+experts:
+  - source_model: DopeorNope/SOLARC-M-10.7B
+    positive_prompts: [""]
+  - source_model: maywell/PiVoT-10.7B-Mistral-v0.2-RP
+    positive_prompts: [""]
+  - source_model: kyujinpy/Sakura-SOLAR-Instruct
+    positive_prompts: [""]
+  - source_model: jeonsworld/CarbonVillain-en-10.7B-v1
+    positive_prompts: [""]