KaraKaraWitch/Matsutei-Qwen2.5-72b

KaraKaraWitch committed 4 days ago

Commit 4bafe8f • 1 Parent(s): 34f072c

Update README.md

Files changed (1)

README.md +69 -1

README.md CHANGED

	@@ -1 +1,69 @@
1	- ~~![image/png](https://cdn-uploads.huggingface.co/production/uploads/633e85093a17ab61de8d9073/IPyNLo-Yd1X0Kw9Csn3dx.png)~~

+---
+base_model:
+- KaraKaraWitch/SteyrCannon-Qwen2.5-72b
+- EVA-UNIT-01/EVA-Qwen2.5-72B-v0.1
+- ZeusLabs/Chronos-Platinum-72B
+- m8than/banana-2-b-72b
+library_name: transformers
+tags:
+- mergekit
+- merge
+---
+# Matsutei-Qwen2.5-72b
+This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
+<div>
+  <a href="https://www.youtube.com/watch?v=Bv1u_e_91Ow"><img src="https://cdn-uploads.huggingface.co/production/uploads/633e85093a17ab61de8d9073/IPyNLo-Yd1X0Kw9Csn3dx.png" style="margin-left:auto;margin-right:auto"></a>
+</div>
+SteyrCannon had a weird vibe issue when it came to world book lore info, so I've been falling back to my EurobeatVARemix merge. This merge should address that initial issue, but I think there are other quirks to this one.
+## Quants & Hosts
+<TBC>
+## Merge Details
+### Merge Method
+This model was merged using the [TIES](https://arxiv.org/abs/2306.01708) merge method, with [EVA-UNIT-01/EVA-Qwen2.5-72B-v0.1](https://huggingface.co/EVA-UNIT-01/EVA-Qwen2.5-72B-v0.1) as the base.
+### Models Merged
+The following models were included in the merge:
+* [KaraKaraWitch/SteyrCannon-Qwen2.5-72b](https://huggingface.co/KaraKaraWitch/SteyrCannon-Qwen2.5-72b)
+* [EVA-UNIT-01/EVA-Qwen2.5-72B-v0.1](https://huggingface.co/EVA-UNIT-01/EVA-Qwen2.5-72B-v0.1)
+* [ZeusLabs/Chronos-Platinum-72B](https://huggingface.co/ZeusLabs/Chronos-Platinum-72B)
+* [m8than/banana-2-b-72b](https://huggingface.co/m8than/banana-2-b-72b)
+### Configuration
+The following YAML configuration was used to produce this model:
+```yaml
+models:
+  - model: KaraKaraWitch/SteyrCannon-Qwen2.5-72b
+    parameters:
+      density: 0.25
+      weight: 0.5
+  - model: EVA-UNIT-01/EVA-Qwen2.5-72B-v0.1
+    parameters:
+      density: 0.5
+      weight: 0.75
+  - model: ZeusLabs/Chronos-Platinum-72B
+    parameters:
+      density: 0.5
+      weight: 0.5
+  - model: m8than/banana-2-b-72b
+    parameters:
+      density: 0.65
+      weight: 0.40
+merge_method: ties
+base_model: EVA-UNIT-01/EVA-Qwen2.5-72B-v0.1
+parameters:
+  normalize: true
+dtype: bfloat16
+```
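
To give a feel for what `density`, `weight`, and `normalize` do in the configuration above, here is a minimal pure-Python sketch of the TIES idea (trim, elect sign, disjoint merge) on toy parameter lists. It is an illustration under stated assumptions, not mergekit's implementation: the helper names `trim`, `sign`, and `ties_merge` are hypothetical, the toy values are made up, and details such as how signs are elected or how trimming is applied per tensor may differ in the real tool.

```python
def trim(delta, density):
    """Keep the top `density` fraction of entries by magnitude; zero the rest."""
    k = int(round(density * len(delta)))
    if k <= 0:
        return [0.0] * len(delta)
    # Indices of the k largest-magnitude entries.
    order = sorted(range(len(delta)), key=lambda i: abs(delta[i]), reverse=True)
    keep = set(order[:k])
    return [d if i in keep else 0.0 for i, d in enumerate(delta)]


def sign(x):
    """Return -1, 0, or 1."""
    return (x > 0) - (x < 0)


def ties_merge(base, models, normalize=True):
    """Toy TIES merge. `models` is a list of (params, density, weight) tuples."""
    weighted = []
    for params, density, weight in models:
        # Task vector: difference between the fine-tuned model and the base.
        delta = [p - b for p, b in zip(params, base)]
        trimmed = trim(delta, density)
        weighted.append(([weight * d for d in trimmed], weight))

    merged = []
    for i in range(len(base)):
        # Elect a sign per parameter from the weighted sum of trimmed deltas.
        elected = sign(sum(vec[i] for vec, _ in weighted))
        # Disjoint merge: only entries agreeing with the elected sign contribute.
        agree = [(vec[i], w) for vec, w in weighted
                 if vec[i] != 0 and sign(vec[i]) == elected]
        total = sum(v for v, _ in agree)
        if normalize and agree:
            # Assumed meaning of `normalize: true`: divide by the sum of
            # the weights of the contributing models.
            total /= sum(w for _, w in agree)
        merged.append(base[i] + total)
    return merged


# Hypothetical toy inputs, not values from any real model.
base = [0.0, 0.0, 0.0, 0.0]
model_a = ([1.0, -2.0, 0.1, 0.0], 0.5, 0.5)   # (params, density, weight)
model_b = ([2.0, 1.0, 0.0, -0.2], 0.5, 0.5)

result = ties_merge(base, [model_a, model_b])
print(result)
```

In this sketch a lower `density` discards more of a model's small changes before merging, while `weight` scales how strongly the surviving changes pull on the result. With normalization on, the weights in the configuration above (0.5, 0.75, 0.5, 0.40) would act as relative proportions rather than absolute scales.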