flammenai
/

Mahou-1.3a-mistral-7B

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Mahou-1.3a-mistral-7B / README.md

nbeerbower's picture

Update README.md

63c062d verified 6 months ago

|

history blame contribute delete

1.45 kB

	---
	library_name: transformers
	license: apache-2.0
	base_model:
	- nbeerbower/Mahou-1.3-M1-mistral-7B
	datasets:
	- flammenai/MahouMix-v1
	---
	![image/png](https://huggingface.co/flammenai/Mahou-1.0-mistral-7B/resolve/main/mahou1.png)

	# Mahou-1.3a-mistral-7B

	Mahou is designed to provide short messages in a conversational context. It is capable of casual conversation and character roleplay.

	### Chat Format

	This model has been trained to use ChatML format. Note the additional tokens in [tokenizer_config.json](tokenizer_config.json).

	```
	<\|im_start\|>system
	{{system}}<\|im_end\|>
	<\|im_start\|>{{char}}
	{{message}}<\|im_end\|>
	<\|im_start\|>{{user}}
	{{message}}<\|im_end\|>
	```

	### Roleplay Format

	- Speech without quotes.
	- Actions in `asterisks`

	```
	leans against wall cooly so like, i just casted a super strong spell at magician academy today, not gonna lie, felt badass.
	```

	### SillyTavern Settings

	1. Use ChatML for the Context Template.
	2. Enable Instruct Mode.
	3. Use the [Mahou preset](https://huggingface.co/datasets/flammenai/Mahou-ST-ChatML-Instruct/raw/main/Mahou.json).
	4. Recommended Additonal stopping strings: `["\n", "<\|", "</"]`

	### Method

	DPO finetuned for 6 epochs using an A100 on Google Colab.

	[Fine-tune a Mistral-7b model with Direct Preference Optimization](https://towardsdatascience.com/fine-tune-a-mistral-7b-model-with-direct-preference-optimization-708042745aac) - [Maxime Labonne](https://huggingface.co/mlabonne)