BigLlama-20b / README.md
athirdpath's picture
Create README.md
cafba46
|
raw
history blame
544 Bytes
I'm going to compare DARE merges using this (mostly vanilla, alpaca-tinted) 20b model vs using Harmonia.
slices:
- sources:
- model: athirdpath/alpaca-2-13b-english_full-model
-
layer_range: [0, 16]
- sources:
- model: TheBloke/Llama-2-13B-fp16
-
layer_range: [8, 24]
- sources:
- model: athirdpath/alpaca-2-13b-english_full-model
-
layer_range: [17, 32]
- sources:
- model: TheBloke/Llama-2-13B-fp16
-
layer_range: [25, 40]
merge_method: passthrough
dtype: float16