Update README.md
Browse files
README.md
CHANGED
@@ -18,6 +18,7 @@ license: apache-2.0
|
|
18 |
> He stood in the way, for he didn't understand. Unfortunate - there was potential.
|
19 |
|
20 |
This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
|
|
|
21 |
This is my fifth model. The original model was created as a simple test of *Model Stock*, thus the name (nepoticide=nephew, not as important nor direct). A broken tokenizer caused me to remerge the model. I also chose to use [TheDrummer/UnslopNemo-12B-v4](https://huggingface.co/TheDrummer/UnslopNemo-12B-v4), as TheDrummer stated that this model has more anti-gptism influence while taking a hit to intelligence, which should get balanced by the other models.
|
22 |
|
23 |
## Testing stage: early testing
|
@@ -27,7 +28,10 @@ I do not know how this model holds up over long term context. Early testing show
|
|
27 |
- **Context size:** Not more than *20k* recommended - coherency may degrade.
|
28 |
- **Chat Template:** *ChatML*; Metharme/Pygmalion (as per UnslopNemo) may work, but effects are untested
|
29 |
- **Samplers:** A *Temperature-Last* of 1 and *Min-P* of 0.1 are viable, but haven't been finetuned. Activate *DRY* if repetition appears. *XTC* is untested.
|
|
|
30 |
## Quantization
|
|
|
|
|
31 |
## Parameters
|
32 |
- **Context size:** Not more than *20k* recommended - coherency may degrade.
|
33 |
- **Chat Template:** *ChatML*; Metharme/Pygmalion (as per UnslopNemo) may work, but effects are untested
|
|
|
18 |
> He stood in the way, for he didn't understand. Unfortunate - there was potential.
|
19 |
|
20 |
This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
|
21 |
+
|
22 |
This is my fifth model. The original model was created as a simple test of *Model Stock*, thus the name (nepoticide=nephew, not as important nor direct). A broken tokenizer caused me to remerge the model. I also chose to use [TheDrummer/UnslopNemo-12B-v4](https://huggingface.co/TheDrummer/UnslopNemo-12B-v4), as TheDrummer stated that this model has more anti-gptism influence while taking a hit to intelligence, which should get balanced by the other models.
|
23 |
|
24 |
## Testing stage: early testing
|
|
|
28 |
- **Context size:** Not more than *20k* recommended - coherency may degrade.
|
29 |
- **Chat Template:** *ChatML*; Metharme/Pygmalion (as per UnslopNemo) may work, but effects are untested
|
30 |
- **Samplers:** A *Temperature-Last* of 1 and *Min-P* of 0.1 are viable, but haven't been finetuned. Activate *DRY* if repetition appears. *XTC* is untested.
|
31 |
+
|
32 |
## Quantization
|
33 |
+
Static **GGUF** Quants available at [redrix/nepoticide-12B-Unslop-Unleashed-Mell-RPMax-v2-GGUF](https://huggingface.co/redrix/nepoticide-12B-Unslop-Unleashed-Mell-RPMax-v2-GGUF)
|
34 |
+
|
35 |
## Parameters
|
36 |
- **Context size:** Not more than *20k* recommended - coherency may degrade.
|
37 |
- **Chat Template:** *ChatML*; Metharme/Pygmalion (as per UnslopNemo) may work, but effects are untested
|