|
--- |
|
license: apache-2.0 |
|
tags: |
|
- merge |
|
- mergekit |
|
- lazymergekit |
|
--- |
|
|
|
# LLama3-Mix-8B |
|
|
|
LLama3-Mix-8B is a merge of the following models using [mergekit](https://github.com/cg123/mergekit): |
|
|
|
For this model, I think ChatML Preset Templete. |
|
|
|
``` |
|
<|im_start|>system |
|
{system_message}<|im_end|> |
|
<|im_start|>user |
|
Hello, who are you?<|im_end|> |
|
<|im_start|>assistant |
|
{prompt}<|im_end|> |
|
|
|
``` |
|
|
|
## 🧩 Configuration |
|
|
|
```yaml |
|
models: |
|
- model: PJMixers/LLaMa-3-CursedStock-v2.0-8B |
|
- model: NousResearch/Hermes-2-Theta-Llama-3-8B |
|
- model: failspy/Meta-Llama-3-8B-Instruct-abliterated-v3 |
|
merge_method: model_stock |
|
base_model: failspy/Meta-Llama-3-8B-Instruct-abliterated-v3 |
|
dtype: bfloat16 |
|
|
|
``` |