---
base_model:
- meta-llama/Meta-Llama-3-70B-Instruct
- NousResearch/Hermes-2-Pro-Llama-3-70B
library_name: transformers
tags:
- mergekit
- merge
license: llama3
---
# llamagnific-3-87b

***Yoloing this 99-layer llamastein straight to the leaderboard, wish me luck.***
![image/png](https://cdn-uploads.huggingface.co/production/uploads/6379683a81c1783a4a2ddba8/K-ZX8HE_ph5eRlieFbEQj.png)
* Base model: meta-llama/Meta-Llama-3-70B-Instruct
* Merge model: NousResearch/Hermes-2-Pro-Llama-3-70B
This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
## Merge Details
### Merge Method
This model was merged using the passthrough merge method, which concatenates layer ranges from the source models without blending any weights. Because the slices below overlap (layer indices 10-14 and 55-68 are taken from both parents), the result has 99 transformer layers instead of the 80 in each parent model.
### Models Merged
The following models were included in the merge:
* /home/admin/meta70 (a local copy of meta-llama/Meta-Llama-3-70B-Instruct)
* /home/admin/hermes2pro (a local copy of NousResearch/Hermes-2-Pro-Llama-3-70B)
### Configuration
The following YAML configuration was used to produce this model:
```yaml
dtype: bfloat16
merge_method: passthrough
slices:
- sources:
  - layer_range: [0, 15]
    model: /home/admin/meta70
- sources:
  - layer_range: [10, 69]
    model: /home/admin/hermes2pro
- sources:
  - layer_range: [55, 80]
    model: /home/admin/meta70
```