Lumosia-MoE-4x10.7 / README.md

Steelskull

Update README.md

a737a29 verified 8 months ago

preview code

raw

history blame

No virus

4.38 kB

	---
	license: apache-2.0
	tags:
	- moe
	- merge
	- mergekit
	- lazymergekit
	- DopeorNope/SOLARC-M-10.7B
	- maywell/PiVoT-10.7B-Mistral-v0.2-RP
	- kyujinpy/Sakura-SOLAR-Instruct
	- jeonsworld/CarbonVillain-en-10.7B-v1
	---

	![image/png](https://cdn-uploads.huggingface.co/production/uploads/64545af5ec40bbbd01242ca6/Qb88YeudOf7MYuGKTirXC.png)

	# Lumosia-MoE-4x10.7

	"Lumosia" was selected as its a MoE of Multiple SOLAR Merges so it really "Lights the way".... its 3am.

	This is a very experimantal model. its a MoE of all good performing Solar models (based off of personal experiance not open leaderboard),

	Why? Dunno whated to see what would happen

	context is maybe 16k?

	Chat-instruct breaks the model at the moment, not really sure why, even tho it will follow instructions.

	Quants by @thebloke (thank you)

	https://huggingface.co/TheBloke/Lumosia-MoE-4x10.7-GGUF

	https://huggingface.co/TheBloke/Lumosia-MoE-4x10.7-GPTQ

	Update: (In Progress)
	Lumosia v1.5 will have the the same MoE models but with a fully trained head and gates. Sould increase performance alot.


	Template: ChatML
	```
	### System:

	### USER:{prompt}

	### Assistant:
	```


	Settings:
	```
	Temp: 1.0
	min-p: 0.02-0.1
	```

	Lumosia-MoE-4x10.7 is a Mixure of Experts (MoE) made with the following models:
	* [DopeorNope/SOLARC-M-10.7B](https://huggingface.co/DopeorNope/SOLARC-M-10.7B)
	* [maywell/PiVoT-10.7B-Mistral-v0.2-RP](https://huggingface.co/maywell/PiVoT-10.7B-Mistral-v0.2-RP)
	* [kyujinpy/Sakura-SOLAR-Instruct](https://huggingface.co/kyujinpy/Sakura-SOLAR-Instruct)
	* [jeonsworld/CarbonVillain-en-10.7B-v1](https://huggingface.co/jeonsworld/CarbonVillain-en-10.7B-v1)

	## Evals:

	* Avg: 69.61
	* ARC: 68.34
	* HellaSwag: 87.13
	* MMLU: 64.38
	* T-QA: 63.81
	* Winogrande: 82.95
	* GSM8K: 51.02

	## Examples:
	```
	Example 1:

	User:
	Hello lumi, what is the meaning of life
	in around 100 words.

	Lumosia:
	The meaning of life can vary greatly
	depending on individual beliefs,
	values, and experiences. One common
	perspective suggests that the primary
	purpose of life is to seek happiness,
	fulfillment, and personal growth
	through relationships
	accomplishments, and spiritual
	awareness. Living with integrity.
	compassion, and gratitude while
	making a positive impact on others
	and the world around us can also
	contribute to finding meaning in life
	UItimately, the meaning of life is
	subjective and unique to each
	individual, who must define it for
	themselves through exploration
	reflection, and personal choice.

	```
	```
	Example 2:

	User:
	Explain a MoE (Mixture of Experts) in
	around 100 words

	Lumosia:
	A MoE, short for Mixture of Experts, is
	a machine learning technique utilized
	in ensemble learning. It involves
	combining multiple specialist models
	(experts) to optimize predictive
	performance. Each expert focuses on
	a subset of the input features or data
	domains, making them specialized in
	particular areas. Through a weighting
	mechanism, like Gaussian Mixture
	Model, the predictions of experts are
	blended to produce a final result. MoE
	models demonstrate effectiveness in
	handling complex or ambiguous
	inputs where a single model might
	struggle. They are commonly used in
	natural language processing.
	computer vision, and speech synthesis.
	```

	## 🧩 Configuration

	```
	yamlbase_model: DopeorNope/SOLARC-M-10.7B
	gate_mode: hidden
	dtype: bfloat16
	experts:
	- source_model: DopeorNope/SOLARC-M-10.7B
	positive_prompts: [""]
	- source_model: maywell/PiVoT-10.7B-Mistral-v0.2-RP
	positive_prompts: [""]
	- source_model: kyujinpy/Sakura-SOLAR-Instruct
	positive_prompts: [""]
	- source_model: jeonsworld/CarbonVillain-en-10.7B-v1
	positive_prompts: [""]
	```

	## 💻 Usage

	```
	python
	!pip install -qU transformers bitsandbytes accelerate

	from transformers import AutoTokenizer
	import transformers
	import torch

	model = "Steelskull/Lumosia-MoE-4x10.7"

	tokenizer = AutoTokenizer.from_pretrained(model)
	pipeline = transformers.pipeline(
	"text-generation",
	model=model,
	model_kwargs={"torch_dtype": torch.float16, "load_in_4bit": True},
	)

	messages = [{"role": "user", "content": "Explain what a Mixture of Experts is in less than 100 words."}]
	prompt = pipeline.tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
	outputs = pipeline(prompt, max_new_tokens=256, do_sample=True, temperature=0.7, top_k=50, top_p=0.95)
	print(outputs[0]["generated_text"])
	```